https://cs.stanford.edu/~shaoyj/
While AI R&D races to automate everything, we took a different approach: auditing what workers want vs. what AI can deliver across the US workforce.🧵
While AI R&D races to automate everything, we took a different approach: auditing what workers want vs. what AI can deliver across the US workforce.🧵
With EgoNormia, a 1.8k ego-centric video 🥽 QA benchmark, we show that this is surprisingly challenging!
With EgoNormia, a 1.8k ego-centric video 🥽 QA benchmark, we show that this is surprisingly challenging!
A group of agents is eager to work with you. By providing feedback, you will see the agent's identity and its feedback to you!
A group of agents is eager to work with you. By providing feedback, you will see the agent's identity and its feedback to you!
Introducing Collaborative Gym (Co-Gym), a framework for enabling & evaluating human-agent collaboration! I now get used to agents proactively seeking confirmations or my deep thinking.(🧵 with video)
Introducing Collaborative Gym (Co-Gym), a framework for enabling & evaluating human-agent collaboration! I now get used to agents proactively seeking confirmations or my deep thinking.(🧵 with video)
Introducing Collaborative Gym (Co-Gym), a framework for enabling & evaluating human-agent collaboration! I now get used to agents proactively seeking confirmations or my deep thinking.(🧵 with video)
Introducing Collaborative Gym (Co-Gym), a framework for enabling & evaluating human-agent collaboration! I now get used to agents proactively seeking confirmations or my deep thinking.(🧵 with video)
Introducing talkarena.org — an open platform where users speak to LAMs and receive text responses. Through open interaction, we focus on rankings based on user preferences rather than static benchmarks.
🧵 (1/5)
Yijia Shao, Tianshi Li, Weiyan Shi, Yanchen Liu, Diyi Yang
Th, Dec 12, 11:00 PST - Poster Session 3 West
Yijia Shao, Tianshi Li, Weiyan Shi, Yanchen Liu, Diyi Yang
Th, Dec 12, 11:00 PST - Poster Session 3 West
Oral: Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
Manling Li · Shiyu Zhao · Qineng Wang · Kangrui Wang · … · Weiyu Liu · Percy Liang · Li Fei-Fei · Jiayuan Mao · Jiajun Wu
Wed 11 Dec 11:50 PM UTC [East Ballroom A, B]
Oral: Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
Manling Li · Shiyu Zhao · Qineng Wang · Kangrui Wang · … · Weiyu Liu · Percy Liang · Li Fei-Fei · Jiayuan Mao · Jiajun Wu
Wed 11 Dec 11:50 PM UTC [East Ballroom A, B]
I am working on developing LM agents as collaborative research partners, learning aids, personal assistants, and more. Let's connect and chat!!
I am working on developing LM agents as collaborative research partners, learning aids, personal assistants, and more. Let's connect and chat!!
It's not too late to catch up using this handy list from the Stanford AI Lab blog:
ai.stanford.edu/blog/emnlp-2...
It's not too late to catch up using this handy list from the Stanford AI Lab blog:
ai.stanford.edu/blog/emnlp-2...