Yash Kant
yashkant.bsky.social
Yash Kant
@yashkant.bsky.social
ai phd at university of toronto //
prev at meta, snap research and georgia tech //
web: https://yashkant.github.io/
I will be at hashtag#CVPR25 in Nashville! ✨

Please come chat with me and Ethan Weber - during our poster session on Pippo, on Sat 5-7pm (Hall D)! 😊 👋

Web: yashkant.github.io/pippo

CC: @ethanjohnweber.bsky.social, @igilitschenski.bsky.social
June 10, 2025 at 2:18 AM
Reposted by Yash Kant
🧑Pippo: High-Resolution Multi-View Humans from a Single Image
@yashkant.bsky.social, Ethan Weber, Jin Kyu Kim, Rawal Khirodkar, Su Zhaoen, Julieta M., Igor Gilitschenski, Shunsuke Saito, Timur Bagautdinov 3/🧵
arxiv.org/abs/2502.07785
Pippo: High-Resolution Multi-View Humans from a Single Image
We present Pippo, a generative model capable of producing 1K resolution dense turnaround videos of a person from a single casually clicked photo. Pippo is a multi-view diffusion transformer and does n...
arxiv.org
March 3, 2025 at 7:47 PM
Reposted by Yash Kant
I am excited to share that my students @kai-he.bsky.social, @yashkant.bsky.social, Ziyi Wu, and Toshiya Yura, our previous research visitor from Sony, will present papers at #CVPR2025. 🎉 Check out their amazing work! 1/🧵
March 3, 2025 at 7:47 PM
Reposted by Yash Kant
Pippo : High-Resolution Multi-View Humans from a Single Image

TL;DR: 1K Multiview Diffusion Transformer pre-trained on 3B Human images without captions; post-trained on 2.5K studio captures with pixel-aligned control via ControlMLP; generates > 5x views at inference
February 18, 2025 at 10:16 AM