Computer Vision, MultiModal AI Agents, Video AI
Research Director at salesforceairesearch.com
Adjunct Professor at cs.stanford.edu & svl.stanford.edu
🔗 www.niebles.net
Our paper, "Exploring Diffusion Transformer Designs via Grafting," has been accepted as an Oral at #NeurIPS2025, with only 77 out of 21k submissions receiving this honor.
📄Paper: arxiv.org/abs/2506.05340
🌎Website: grafting.stanford.edu
🧑🏻💻Code: github.com/keshik6/graf...
Missed my talk on AI Agents: from Language to Multimodal Reasoning?
Summary and slides are here:
www.niebles.net/blog/2025/mm...
Shared our work on Multimodal AI Agents at the #ICCV2025 Workshop on Multi-Modal Reasoning. 🤖
All the slides, key papers, and the research journey are consolidated in this new blog post:
📄https://www.niebles.net/blog/2025/mmagents/
@iccv.bsky.social
strefer.github.io
arxiv.org/abs/2509.03501
arxiv.org/abs/2509.03501
We will see you at #ICCV2025 🏖️
✅ Auto & scalable
✅ Fine-grained, space–time–grounded queries
✅ Effective
📄: arxiv.org/abs/2509.03501
🌐: strefer.github.io
Had a blast creating this with the @salesforce.com team!
youtu.be/r98jGdLtO6Q
Check out details of our work:
arxiv.org/abs/2504.12513
🕐Come check it out right now until 13:00
“AdaVid: Adaptive Video-Language Pretraining”
🪧ExHall D Poster # 203
📝 arxiv.org/abs/2504.12513
Had the chance to present our ViUnit poster to fellow ACs. If you missed it, come to our Sunday poster session.
See details in the 🧵⬇️
🗓️ Sun Jun 15, 10:30AM-12:30PM
📍 ExHall D Poster #346
🔗 Paper: arxiv.org/abs/2412.08859
📝 Blog: www.niebles.net/blog/2025/vi...
#VisualProgramming #RobustAI
🗓️ Fri Jun 13, 4PM-6PM
📍 ExHall D Poster #306
🔗 Paper: arxiv.org/abs/2504.02259
🌐 Website: longvideohaystack.github.io
💻 Code: github.com/LongVideoHay...
📊 Data: huggingface.co/datasets/LVH...
#VideoUnderstanding
🗓️ Thu Jun 12, 12:00-13:00PM
📍 ExHall D Poster #202
🔗 Paper: arxiv.org/abs/2504.12513
🌐 Website: chaitanya100100.github.io/AdaVid/
#VideoLanguage #Pretraining
@cvprconference.bsky.social
blog: www.niebles.net/blog/2025/vl...
arxiv: arxiv.org/abs/2505.03181
Work with Jake Grigsby, Michael Ryoo and Yuke Zhu
#AI #MachineLearning #DeepLearning
blog: www.niebles.net/blog/2025/vl...
arxiv: arxiv.org/abs/2505.03181
#multimodalAI #agents #VLM
By @salesforce.com AI Research’s Shelby Heinecke.
See video here:
youtube.com/watch?v=vlvv...
Reposted by Juan Carlos Niebles
🎬 "The AI Research Lab - Explained" debuts with our groundbreaking work on Large Action Models! Sr. Mgr Shelby Heinecke reveals how we're training these specialized models to generate precise, executable actions. t.co/XLhlN2EZyk
Reposted by Czech Republic, Juan Carlos Niebles
cvpr.thecvf.com/Conferences/...
📝 My latest blog explores this fascinating question!
Read more here: www.niebles.net/blog/2025/cr...
#AI #creativity #artificialintelligence