One idea, many ways to say it – but does your brain track those options while you speak?
Using LLMs, we put this to the test.
www.biorxiv.org/content/10.1...
We show for the 1st time that the brain represents multiple alternatives simultaneously in both listening and speaking.
🧵
One idea, many ways to say it – but does your brain track those options while you speak?
Using LLMs, we put this to the test.
www.biorxiv.org/content/10.1...
We show for the 1st time that the brain represents multiple alternatives simultaneously in both listening and speaking.
🧵
In our new paper, we survey major reg efforts & find they rely on benchmarking, which we know to be problematic. How did this happen & what can we do about it?
arxiv.org/pdf/2501.15693
In our new paper, we survey major reg efforts & find they rely on benchmarking, which we know to be problematic. How did this happen & what can we do about it?
arxiv.org/pdf/2501.15693
Interested in LLM-as-a-Judge?
Want to get the best judge for ranking your system?
our new work is just for you:
"JuStRank: Benchmarking LLM Judges for System Ranking"
🕺💃
arxiv.org/abs/2412.09569
Interested in LLM-as-a-Judge?
Want to get the best judge for ranking your system?
our new work is just for you:
"JuStRank: Benchmarking LLM Judges for System Ranking"
🕺💃
arxiv.org/abs/2412.09569