UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
Qwen Image Edit — Camera Angle Control
Qwen Image Edit — Camera Angle Control
ProcGen3D: Learning Neural Procedural Graphs for Image-to-3D Reconstruction
ProcGen3D: Learning Neural Procedural Graphs for Image-to-3D Reconstruction
github.com/adambarbato/...
ByteDance SA2VA segmentation model built on the Qwen 3 VL 4B vision-language architecture.
github.com/adambarbato/...
ByteDance SA2VA segmentation model built on the Qwen 3 VL 4B vision-language architecture.
AI compositing workflow built on Qwen Image Edit 2509 with Fusion LoRA.
AI compositing workflow built on Qwen Image Edit 2509 with Fusion LoRA.
Chrono Edit Upscaler Lora
Chrono Edit Upscaler Lora
Qwen-Image-Edit lighting controls
Qwen-Image-Edit lighting controls
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback (Shot Matching basically).
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback (Shot Matching basically).
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
Human Mesh Modeling for Anny Body
Human Mesh Modeling for Anny Body
BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration
BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration
arxiv.org/abs/2511.01266
arxiv.org/abs/2511.01266
dx8152/Qwen-Edit-2509-Multiple-angles
dx8152/Qwen-Edit-2509-Multiple-angles
github.com/Tencent-Huny...
WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting
github.com/Tencent-Huny...
WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting
SplatMASK - Manual Animated MASKS for ComfyUI workflows
SplatMASK - Manual Animated MASKS for ComfyUI workflows
github.com/vita-epfl/St...
stable-video-infinity.github.io/homepage/
youtu.be/43lh-3kV3bs?...
github.com/vita-epfl/St...
stable-video-infinity.github.io/homepage/
youtu.be/43lh-3kV3bs?...
Fibo: Text to image trained exclusively on licensed data.
Fibo: Text to image trained exclusively on licensed data.
Foley Control: Video Guided Sound Effect Generation with a Frozen Latent Audio Model.
Foley Control: Video Guided Sound Effect Generation with a Frozen Latent Audio Model.
iGGT: end-to-end large unified Transformer that integrates spatial reconstruction with instance-level contextual understanding.
iGGT: end-to-end large unified Transformer that integrates spatial reconstruction with instance-level contextual understanding.
MoCha: End-to-End Video Character Replacement without Structural Guidance
MoCha: End-to-End Video Character Replacement without Structural Guidance
ChronoEdit: image editing (Nvidia research)
Nano banana like but open source.
ChronoEdit: image editing (Nvidia research)
Nano banana like but open source.
next-scene-qwen-image-lora-2509 is a LoRA adapter fine-tuned on Qwen-Image-Edit (build 2509), purpose-built to generate cinematic image sequences with natural visual progression from frame to frame.
next-scene-qwen-image-lora-2509 is a LoRA adapter fine-tuned on Qwen-Image-Edit (build 2509), purpose-built to generate cinematic image sequences with natural visual progression from frame to frame.
WorldGrow — a generative method that creates infinite EXPLICIT 3D worlds
WorldGrow — a generative method that creates infinite EXPLICIT 3D worlds
Long cat AI video generation
Long cat AI video generation