Evals, metrics, multilinguality, multiculturality, multimodality, and (dabbling in) reasoning
100% Product of public schools
https://saxon.me/
Interestingly, only for some multilingual models is this true. Aya knows China best in Chinese, but LLaMA's best in English always.
Well, not sure how easy it is to merge both technologies, but it surely takes out a competitor on the increasingly vital serving side
In particular I'm interested in stuff covering AI/ML/NLP/CV, which does any mix of diagnosing problems with peer review and publication or proposes fixes.
In particular I'm interested in stuff covering AI/ML/NLP/CV, which does any mix of diagnosing problems with peer review and publication or proposes fixes.
Some medium-small nonanonymous MLposter posted a screenshot of one of our colleague's hyphenated names with a "wow you see interesting names in research" caption
it was drawing strange racial remarks from the anons 1/3
Some medium-small nonanonymous MLposter posted a screenshot of one of our colleague's hyphenated names with a "wow you see interesting names in research" caption
it was drawing strange racial remarks from the anons 1/3
Your four short multiple choice answers will help a lot :)
Your four short multiple choice answers will help a lot :)
Claim here is "no" for reasons like something about mammals making them more likely to develop intelligence, or that the post asteroid predator-free environment bred competition, favoring intelligence
youtu.be/8Gh2gycaavI
Claim here is "no" for reasons like something about mammals making them more likely to develop intelligence, or that the post asteroid predator-free environment bred competition, favoring intelligence
youtu.be/8Gh2gycaavI
www.youtube.com/watch?v=mDhB...
www.youtube.com/watch?v=mDhB...
Featuring @ofirpress.bsky.social @saining.bsky.social @idavidrein.bsky.social @efleisig.bsky.social and Wenda Xu
www.youtube.com/watch?v=d_zX...
Featuring @ofirpress.bsky.social @saining.bsky.social @idavidrein.bsky.social @efleisig.bsky.social and Wenda Xu
www.youtube.com/watch?v=d_zX...
"S-risk is the risk that AGI doesn’t kill us all, but instead enslaves and tortures us for eternity (the ‘S’ stands for suffering). It was awesome to learn about it."
directing.attention.to/p/ill-never-...
"S-risk is the risk that AGI doesn’t kill us all, but instead enslaves and tortures us for eternity (the ‘S’ stands for suffering). It was awesome to learn about it."
directing.attention.to/p/ill-never-...
/1
www.youtube.com/watch?v=B9hG...
www.youtube.com/watch?v=B9hG...
This lecture is open to the public. If you're interested in joining, here's a GCal invite link: calendar.google.com/calendar/eve...
This lecture is open to the public. If you're interested in joining, here's a GCal invite link: calendar.google.com/calendar/eve...
Check out my, @marstin.bsky.social and @xiangyue96.bsky.social's tutorial, "The Science of Benchmarking: What's Measured, What's Missing, What's Next" on December 2 from 1:30 to 4:00pm.
benchmarking.science
What will we cover?
1/3
Check out my, @marstin.bsky.social and @xiangyue96.bsky.social's tutorial, "The Science of Benchmarking: What's Measured, What's Missing, What's Next" on December 2 from 1:30 to 4:00pm.
benchmarking.science
What will we cover?
1/3
Are the equations supporting an argument or are they just a fancy way to express something simple? Do introduced terms do anything or get referenced anywhere?
I find the answer is usually no in the kinds of papers I review
Are the equations supporting an argument or are they just a fancy way to express something simple? Do introduced terms do anything or get referenced anywhere?
I find the answer is usually no in the kinds of papers I review
a 🧵 1/n
Drain: arxiv.org/abs/2511.04820
Strain: direct.mit.edu/qss/article/...
Oligopoly: direct.mit.edu/qss/article/...