It's extra special because ICCV25 marks the 10-year anniversary of the VQA paper.
When we started, the idea of answering any question about any image seemed outlandish.
People act like AI is the issue, when it’s actually part of the fix.
If we're honest: most of what we make, most of the time, is slop by our own standards.
That’s the generator–discriminator gap in creative work that Ira Glass talks about.
Because the laws of physics do not prohibit X and the forces of biology gave us curiosity.
Today, we’re telling our story — show before you talk!
𝘞𝘦 𝘢𝘳𝘦 𝘳𝘦-𝘪𝘮𝘢𝘨𝘪𝘯𝘪𝘯𝘨 𝘩𝘰𝘸 𝘱𝘦𝘰𝘱𝘭𝘦 𝘪𝘯𝘵𝘦𝘳𝘢𝘤𝘵 𝘸𝘪𝘵𝘩 𝘵𝘩𝘦 𝘸𝘦𝘣, one of humanity’s greatest inventions and a mess overdue for an overhaul.
yutori.com
🌐 sites.google.com/view/vlms4all
Why? Whom does this possibly harm?
Brilliant talk by Ilya, but he's wrong on one point.
We are NOT running out of data. We are running out of human-written text.
We have more videos than we know what to do with. We just haven't solved pre-training in vision.
Just go out and sense the world. Data is easy.
See, model naming isn't that hard.
If you work in digital or physical AI agents, I'm scheduling chats (Dec 9-12). DMs open.
— a language model in the technical sense
— a "modern" AI system
— an auto-regressive symbol-sequence model, built with transformers, trained with SGD and self-supervised learning
— something else?
dhruvbatra.substack.com/p/the-term-l...