We're looking for someone to join the research agent evaluation team, starting Fall 2025. Application link to be available soon, but feel free to send us your CV and/or come talk to us at #ACL2025. 🧵
We're looking for someone to join the research agent evaluation team, starting Fall 2025. Application link to be available soon, but feel free to send us your CV and/or come talk to us at #ACL2025. 🧵
TL;DR: linguistic diversity, writing systems (and pretty scripts), classifiers, and a school for Newar kids in Kathmandu.
sites.bu.edu/lislab/2025/...
TL;DR: linguistic diversity, writing systems (and pretty scripts), classifiers, and a school for Newar kids in Kathmandu.
sites.bu.edu/lislab/2025/...
It complements recent evals (eg PaperBench from OpenAI
) on replication! See 👇 for details
We introduce RExBench, a benchmark that tests if a coding agent can implement a novel experiment based on existing research and code.
Finding: Most agents we tested had a low success rate, but there is promise!
It complements recent evals (eg PaperBench from OpenAI
) on replication! See 👇 for details
The New England Mechanistic Interpretability (NEMI) Workshop is happening Aug 22nd 2025 at Northeastern University!
A chance for the mech interp community to nerd out on how models really work 🧠🤖
🌐 Info: nemiconf.github.io/summer25/
📝 Register: forms.gle/v4kJCweE3UUH...
The New England Mechanistic Interpretability (NEMI) Workshop is happening Aug 22nd 2025 at Northeastern University!
A chance for the mech interp community to nerd out on how models really work 🧠🤖
🌐 Info: nemiconf.github.io/summer25/
📝 Register: forms.gle/v4kJCweE3UUH...
cphnlp.github.io
cphnlp.github.io
- Invited talks by @loubnabnl.hf.co (HF) @mziizm.bsky.social (Cohere) @najoung.bsky.social (BU) @kylelo.bsky.social (AI2) Yohei Oseki (UTokyo)
- Exciting posters by other participants
Register to attend and/or present your poster at cphnlp.github.io /1
- Invited talks by @loubnabnl.hf.co (HF) @mziizm.bsky.social (Cohere) @najoung.bsky.social (BU) @kylelo.bsky.social (AI2) Yohei Oseki (UTokyo)
- Exciting posters by other participants
Register to attend and/or present your poster at cphnlp.github.io /1
@bucds.bsky.social in 2026! BU has SCHEMES for LM interpretability & analysis, I couldn't be more pumped to join a burgeoning supergroup w/ @najoung.bsky.social @amuuueller.bsky.social. Looking for my first students, so apply and reach out!
ACL 2025 Ling theory & Cognitive modeling track is looking for emergency reviewers. The emergency review period is between 3/18-26, and these reviewers will be excluded from the ARR cycle. If you're interested, please sign up here! docs.google.com/forms/d/1fH7...
ACL 2025 Ling theory & Cognitive modeling track is looking for emergency reviewers. The emergency review period is between 3/18-26, and these reviewers will be excluded from the ARR cycle. If you're interested, please sign up here! docs.google.com/forms/d/1fH7...
Internally, I’ve been developing and using a library that makes this extremely easy, and I decided to open-source it
Meet the decoding library: github.com/benlipkin/de...
1/7
Internally, I’ve been developing and using a library that makes this extremely easy, and I decided to open-source it
Meet the decoding library: github.com/benlipkin/de...
1/7
arxiv.org/abs/2410.17482
arxiv.org/abs/2410.17482
* Presentations from Aditya Yedetore and Hayley Ross on neural network generalizations!
* I'm giving a keynote at GenBench & organizing BlackboxNLP
* Ask me about our faculty hiring & PhD/postdoc positions at Boston University!
* Presentations from Aditya Yedetore and Hayley Ross on neural network generalizations!
* I'm giving a keynote at GenBench & organizing BlackboxNLP
* Ask me about our faculty hiring & PhD/postdoc positions at Boston University!
* Presentations from Aditya Yedetore and Hayley Ross on neural network generalizations!
* I'm giving a keynote at GenBench & organizing BlackboxNLP
* Ask me about our faculty hiring & PhD/postdoc positions at Boston University!
* Presentations from Aditya Yedetore and Hayley Ross on neural network generalizations!
* I'm giving a keynote at GenBench & organizing BlackboxNLP
* Ask me about our faculty hiring & PhD/postdoc positions at Boston University!