✅ New scenes not in existing web data
✅ Runs in ~15 min on one GPU
Work led by Candace Ross in collaboration with @afeinstein20.bsky.social , Florian Bordes, and @polkirichenko.bsky.social
Check it out on HuggingFace, ArXiv & NeurIPS! huggingface.co/datasets/fac...
✅ New scenes not in existing web data
✅ Runs in ~15 min on one GPU
Work led by Candace Ross in collaboration with @afeinstein20.bsky.social , Florian Bordes, and @polkirichenko.bsky.social
Check it out on HuggingFace, ArXiv & NeurIPS! huggingface.co/datasets/fac...
🧵2/3
🧵2/3
- Closed models, GPT-4o, are also brittle to the choice of delimiter.
🧵
- Closed models, GPT-4o, are also brittle to the choice of delimiter.
🧵
w/ @polkirichenko.bsky.social Sam Bell Kamalika Chaudhuri
Paper: arxiv.org/abs/2506.09038
Code: github.com/facebookrese...
bsky.app/profile/polk...
🧵2/2
w/ @polkirichenko.bsky.social Sam Bell Kamalika Chaudhuri
Paper: arxiv.org/abs/2506.09038
Code: github.com/facebookrese...
bsky.app/profile/polk...
🧵2/2
Learn more on our site and code at facebookresearch.github.io/maze_navigat...
Learn more on our site and code at facebookresearch.github.io/maze_navigat...
Come by our NeurIPS poster Exhibit Halls A-C #3204 11am PST Thursday to learn more.
Come by our NeurIPS poster Exhibit Halls A-C #3204 11am PST Thursday to learn more.