Knowing where someone looks is key to a Theory of Mind. We test 111 VLMs and 65 humans to compare their inferences.
Project page: grow-ai-like-a-child.github.io/gaze/
🧵1/11
Proof: different articles present at the specified journal/volume/page number, and their titles exist nowhere on any searchable repository.
Take this as a warning to not use LMs to generate your references!
Proof: different articles present at the specified journal/volume/page number, and their titles exist nowhere on any searchable repository.
Take this as a warning to not use LMs to generate your references!
A thread (1/n) - #ICML2025 ✅
A thread (1/n) - #ICML2025 ✅
However, humans, since an extremely age 🧒, are extremely sensitive to other people's gaze 🙄 👀
No mentors, no labs, only pre-doc students, 111 VLMs, and we did it 😎
However, humans, since an extremely age 🧒, are extremely sensitive to other people's gaze 🙄 👀
No mentors, no labs, only pre-doc students, 111 VLMs, and we did it 😎
However, humans, since an extremely age 🧒, are extremely sensitive to other people's gaze 🙄 👀
No mentors, no labs, only pre-doc students, 111 VLMs, and we did it 😎
Knowing where someone looks is key to a Theory of Mind. We test 111 VLMs and 65 humans to compare their inferences.
Project page: grow-ai-like-a-child.github.io/gaze/
🧵1/11
Knowing where someone looks is key to a Theory of Mind. We test 111 VLMs and 65 humans to compare their inferences.
Project page: grow-ai-like-a-child.github.io/gaze/
🧵1/11
We spent 2 years to systematically to examine and show the lack of such in MLLMs: arxiv.org/abs/2410.10855
We spent 2 years to systematically to examine and show the lack of such in MLLMs: arxiv.org/abs/2410.10855