Solim LeGris
solimlegris.bsky.social
Solim LeGris
@solimlegris.bsky.social
PhD student at NYU. Interested in making machines insightful.
Reposted by Solim LeGris
AGI is just astrology for smart computer boys
November 21, 2025 at 3:12 AM
Reposted by Solim LeGris
Today in Nature Machine Intelligence, Kazuki Irie & I discuss 4 classic challenges for neural nets — systematic generalization, catastrophic forgetting, few-shot learning, & reasoning. We argue there is a unifying fix: the right incentives & practice. rdcu.be/eLRmg
October 20, 2025 at 1:18 PM
Reposted by Solim LeGris
🚨 New preprint: "Decision rule inference limits social escape from learning traps" (with Rheza Budiono and Cate Hartley of the @hartleylabnyu.bsky.social ✨). Read here: osf.io/preprints/ps.... This is more work on a very curious phenomena!
OSF
osf.io
September 26, 2025 at 3:30 AM
Reposted by Solim LeGris
Today we open-sourced a new project for developing behavioral experiments online. It is called Smile. Announcement of v0.1.0: todd.gureckislab.org/2025/07/22/s... Smile has been used internally in my lab for several years and has substantially increased our productivity.
July 21, 2025 at 10:47 PM
Reposted by Solim LeGris
I don't know what world hassabis is living in but the reality is the reverse.
AI is creating a world whereby there's less trust (by making it difficult to differentiate real from ai generated), ever wider inequity gap, and ever more intrusive surveillance
June 4, 2025 at 10:40 AM
Reposted by Solim LeGris
Fantastic new work by @johnchen6.bsky.social (with @brendenlake.bsky.social and me trying not to cause too much trouble).

We study systematic generalization in a safety setting and find LLMs struggle to consistently respond safely when we vary how we ask naive questions. More analyses in the paper!
Do LLMs show systematic generalization of safety facts to novel scenarios?

Introducing our work SAGE-Eval, a benchmark consisting of 100+ safety facts and 10k+ scenarios to test this!

- Claude-3.7-Sonnet passes only 57% of facts evaluated
- o1 and o3-mini passed <45%! 🧵
May 30, 2025 at 5:32 PM
Reposted by Solim LeGris
New preprint alert! We often prompt ICL tasks using either demonstrations or instructions. How much does the form of the prompt matter to the task representation formed by a language model? Stick around to find out 1/N
May 23, 2025 at 5:38 PM
Reposted by Solim LeGris
my god
March 24, 2025 at 6:29 PM
Reposted by Solim LeGris
Out today in Nature Machine Intelligence!

From childhood on, people can create novel, playful, and creative goals. Models have yet to capture this ability. We propose a new way to represent goals and report a model that can generate human-like goals in a playful setting... 1/N
February 21, 2025 at 4:29 PM
Reposted by Solim LeGris
The part of George Orwell’s 1984 that everyone forgets is how the music and publishing industries have been replaced by a machine that spits out songs and bad novels “without any human intervention.” The goal is to keep you from ever having to think.
February 8, 2025 at 8:06 PM
Reposted by Solim LeGris
I wrote about the concept of agency (both human and artificial) in the year 2025. gracewlindsay.com/2025/01/24/2...
2025: Agency gained and lost
If you’ve had even a passing glance at tech journalism over the past few months, you know the top buzzword for AI in 2025 is agentic. Agentic AI (according to many think pieces and press releases a…
gracewlindsay.com
January 24, 2025 at 8:41 PM
Reposted by Solim LeGris
Our paper on if you can incentivize rule induction in humans with money is finally out (answer is: it appears to be a very weak/0-ish effect in contrast to the huge effect of financial incentives on rote, repetitive tasks). credit to pamop, ben newell & dan bartels psycnet.apa.org/fulltext/202...
APA PsycNet
psycnet.apa.org
January 21, 2025 at 5:06 PM
Reposted by Solim LeGris
the rapid transition of academics off x (despite temporarily reducing reach/followers) makes you wonder what’s stopping us from ending the for-profit, closed access publishing industry. it’s, like…. we can just do it? or if not, interesting to consider what the inertial differences are.
November 30, 2024 at 12:56 PM