Assistant Professor of the Generative Intelligence Lab at Carnegie Mellon University. Understanding and creating pixels. All the code and models are available at http://github.com/junyanz.
Reposted by Zhu -
Code: github.com/AvaLovelace1...
Website: avalovelace1.github.io/LegoGPT/
Reposted by Zhu -
Reposted by Zhu -
cs.cmu.edu/~syncd-proje...
w/ Xi Yin, @junyanz.bsky.social, Ishan Misra, and Samaneh Azadi
Reposted by Zhu -
We welcome submissions in 2 tracks:
1) unpublished work up to 4 pages
2) papers published within last 2 years
Submit by Mar 28 & join us with amazing speakers in Nashville:
www.cv4animals.com
🦒🪼🐬🐿️🦩🐢🦘🦜🦥🦋
@cvprconference.bsky.social
Reposted by Zhu -
Here's a blog post on why we often miss what's right in front of us. #visionscience
aaronhertzmann.com/2024/05/09/i...
Reposted by Zhu -
We exploit tactile sensing to enhance geometric details for text- and image-to-3D generation.
Check out our #NeurIPS2024 work on Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation: ruihangao.github.io/TactileDream...
1/3
Reposted by Zhu -
The two-year fellowship supports Wang’s work in data attribution for text-to-image models.
Read about his achievement in our news site! www.ri.cmu.edu/robotics-ins...
Reposted by Zhu -
huggingface.co/spaces/pairc...
I demoed this at #SIGGRAPHASIA2024 and it went great! :)
3/3
Reposted by Zhu -
A method for decomposing a video into complete layers, including objects and their associated effects (e.g., shadows, reflections).
It enables a wide range of cool applications, such as video stylization, compositions, moment retiming, and object removal.
Reposted by Zhu -
sites.google.com/ttic.edu/ope...