Dhruv Batra
@dhruvbatra.bsky.social
Co-founder & Chief Scientist at Yutori. Prev: Senior Director leading FAIR Embodied AI at Meta, and Professor at Georgia Tech.
Solved: robustness to paraphrasing and false premises, OCR, world-knowledge based reasoning.
Open: spatial reasoning, data-efficiency, learning compatible representations.
October 23, 2025 at 5:18 PM
Lots to be done. Thank you to all our collaborators and the research community for this recognition!
October 21, 2025 at 7:27 PM
Fun fact: the T-shirt I'm wearing is an inside joke about the quality of 2015 models.
However, every few years we rediscover the lesson that on difficult tasks, VLMs silently regress to being nearly blind.
x.com/DhruvBatra_/...
October 21, 2025 at 7:27 PM
Anything by Ted Chiang
October 20, 2025 at 3:52 AM
I dunno man, Dagger is cool.
October 20, 2025 at 3:51 AM
Somebody is a fan of Abundance
June 10, 2025 at 5:33 AM
Good. Autonomous interface locomotion is the fundamental robotics problem of our time. The more the merrier.
April 1, 2025 at 5:12 PM
The web is the ultimate boss-level for agents — dynamic, non-deterministic, and noisy; some mistakes are inevitable and so far, every agent fails eventually.
Yutori is building superhuman agents for this ultimate digital environment.
Join our waitlist for early access to our product!
yutori.com
Yutori
We’re building AI agents that can reliably do everyday digital tasks for you on the web, towards an AI chief-of-staff for everyone.
yutori.com
March 27, 2025 at 2:31 PM
𝐈𝐦𝐚𝐠𝐢𝐧𝐞 𝐚 𝐰𝐨𝐫𝐥𝐝 𝐰𝐡𝐞𝐫𝐞 𝐧𝐨 𝐡𝐮𝐦𝐚𝐧 𝐡𝐚𝐬 𝐭𝐨 𝐝𝐢𝐫𝐞𝐜𝐭𝐥𝐲 𝐢𝐧𝐭𝐞𝐫𝐚𝐜𝐭 𝐰𝐢𝐭𝐡 𝐭𝐡𝐞 𝐰𝐞𝐛 𝐚𝐠𝐚𝐢𝐧.
Where teams of AI assistants coordinate to book flights, manage budgets, or file paperwork—proactively surfacing insights and correcting errors.
Only problem — no one knows how to build AI agents that actually work.
March 27, 2025 at 2:31 PM
Ah, understood. No idea about the tracing of that meme.
March 23, 2025 at 3:27 PM
Seems like the ultimate thing to rally around, no? To the extent there is any purpose, what's the alternative?
March 23, 2025 at 2:28 AM
I'm already there for low-stakes queries.
March 23, 2025 at 1:02 AM
Where's the skepticism coming from? Now that web search and citations are in there, isn't it easy to verify and thus become more confident?
March 23, 2025 at 12:59 AM
The way it's always been done isn't handling the current scale well (as evidenced by the feedback from the authors). Yes, outsource to a company, pay for creation of new tools, start new companies, all of the standard ways of addressing a growing market.
February 26, 2025 at 3:52 PM
Why is it volunteer work? Why doesn't an organization that takes in millions in sponsorship professionalize?
February 26, 2025 at 3:46 PM
Some of us did :)
December 21, 2024 at 4:51 PM