Mehil Shah
shahmehil.bsky.social
Mehil Shah
@shahmehil.bsky.social
PhD Student in CS and SE @ Dalhousie University
Reposted by Mehil Shah
I wrote some thoughts on how to build good LM benchmarks: ofir.io/How-to-Build...
How to Build Good Language Modeling Benchmarks
Building benchmarks is important because they shine a spotlight on the weaknesses of existing language models and so can guide the community on how to improve them.
ofir.io
November 25, 2024 at 9:54 PM