Saining Xie
saining.bsky.social
http://www.sainingxie.com
researcher in #deeplearning #computervision | assistant professor at @NYU_Courant @nyuniversity | previous: research scientist @metaai (FAIR) @UCSanDiego
Check out this new paper from Willis on inference time scaling of diffusion models!
Inference-time scaling for LLMs improves the model's abilities in many respects, but what about diffusion models?
In our latest study—Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps—we reframe inference-time scaling as a search problem over sampling noises. 🧵[1/n]
January 17, 2025 at 7:02 PM
two exciting directions for diffusion models in 2025: either going (extremely) small or going (extremely) big with your steps
January 17, 2025 at 2:36 PM
Reposted by Saining Xie
Visual-spatial intelligence–we rely on it to perceive, interact, and navigate our everyday spaces. To what capacity do MLLMs possess it? Do they mirror how humans think and reason about space?

Presenting “Thinking in Space: How Multimodal Models See, Remember, and Recall Spaces”! [1/n]
December 23, 2024 at 10:45 PM