Formerly: CTO @codecov.bsky.social
More Formerly: Co-Founder/CTO @GameWisp
Real talk: I cheesed the Grymforge fight by lowering the platform and just had my barbarian yeet carafes of water at the construct from the upper staircase. Gravity's a bitch, Grymforge.
What I'm driving at: is there a measurable, repeatable way to show things are improving? If so, can that metric be reported on?
(Thanks for engaging with this btw, it's interesting work)
So is the conclusion basically binary? The LLM output is either good enough or it isn't?
1. Is there a metric for how effective these tests are? Does such a question even make sense?
2. If so, maybe someday @codecov.bsky.social can support it 😁?
For everything else: xml
I'm not even a wrestling fan and his Vince McMahon episodes were really interesting.
Every Christmas he covers a "non bastard"; of these, his Aaron Swartz episodes were superb.