Come work with us to advance foundational technologies that enable AI systems to model and interact meaningfully with the world!
Topics on our homepage: research.nvidia.com/labs/sil/
Application link below
Come work with us to advance foundational technologies that enable AI systems to model and interact meaningfully with the world!
Topics on our homepage: research.nvidia.com/labs/sil/
Application link below
Wed, Jun 11, 8am-noon, or join in at 10:20 after the break. tinyurl.com/nv-kaolin-cv...
Wed, Jun 11, 8am-noon, or join in at 10:20 after the break. tinyurl.com/nv-kaolin-cv...
We repurpose Score Distillation Sampling (SDS) for audio, turning any pretrained audio diffusion model into a tool for diverse tasks, including source separation, impact synthesis & more.
🎧 Demos, audio examples, paper: research.nvidia.com/labs/toronto...
🧵below
We repurpose Score Distillation Sampling (SDS) for audio, turning any pretrained audio diffusion model into a tool for diverse tasks, including source separation, impact synthesis & more.
🎧 Demos, audio examples, paper: research.nvidia.com/labs/toronto...
🧵below
Meet WeatherWeaver, a video model for controllable synthesis and removal of diverse weather effects — such as 🌧️ rain, ☃️ snow, 🌁 fog, and ☁️ clouds — for any input video.
Meet WeatherWeaver, a video model for controllable synthesis and removal of diverse weather effects — such as 🌧️ rain, ☃️ snow, 🌁 fog, and ☁️ clouds — for any input video.
🤗 Now supports remote backend via litellm, e.g. Hugging Face, ollama
🎨 UI/UX overhaul
This lays the foundation for Blender agents and more advanced 3D AI models (coming this year)
🤗 Now supports remote backend via litellm, e.g. Hugging Face, ollama
🎨 UI/UX overhaul
This lays the foundation for Blender agents and more advanced 3D AI models (coming this year)
🎢 objaverse subsampled to <500 poly models, converted to untextured objs
🔧 suitable for training autoregressive transformer-based 3D models, which have limited context length, such as LLaMA-Mesh
🎢 objaverse subsampled to <500 poly models, converted to untextured objs
🔧 suitable for training autoregressive transformer-based 3D models, which have limited context length, such as LLaMA-Mesh
We enable LLMs to generate 3D meshes by representing them as plain text and fine-tuning, unifying 3D and text modalities in a single model.
🔎 Webpage research.nvidia.com/labs/toronto...
🕹️ Interactive Demo huggingface.co/spaces/Zheng...
💾 Model checkpoint available
We enable LLMs to generate 3D meshes by representing them as plain text and fine-tuning, unifying 3D and text modalities in a single model.
🔎 Webpage research.nvidia.com/labs/toronto...
🕹️ Interactive Demo huggingface.co/spaces/Zheng...
💾 Model checkpoint available
We make single-step distilled generators better and faster using our new method, multi-student distillation (MSD)!
Explore the project page to learn more: research.nvidia.com/labs/toronto...
We make single-step distilled generators better and faster using our new method, multi-student distillation (MSD)!
Explore the project page to learn more: research.nvidia.com/labs/toronto...
Introducing SOURCE: A method to understand how individual training examples influence neural net behavior, allowing us to make AI models more transparent and trustworthy!
📄 Full paper: openreview.net/pdf?id=3NaqG...
Introducing SOURCE: A method to understand how individual training examples influence neural net behavior, allowing us to make AI models more transparent and trustworthy!
📄 Full paper: openreview.net/pdf?id=3NaqG...