Zhenjun Zhao
ericzzj.bsky.social
Zhenjun Zhao
@ericzzj.bsky.social
ericzzj1989.github.io
PhD from CUHK. 3D vision, SLAM, SfM, Image Matching (https://github.com/ericzzj1989/Awesome-Image-Matching).
Pinned
🎉 Thrilled to share our CVPR 2025 Award Candidate & Oral paper:

🔹 GlobustVP
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World

🧱 Global optimality
💥 Tolerates up to 70% outliers
⚡ Fast runtime

📄 Paper: arxiv.org/abs/2505.04788

💻 Code: github.com/WU-CVGL/GlobustVP

1/
4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos

Mengqi Guo, Bo Xu, Yanyan Li, Gim Hee Lee

tl;dr: joint optimization of motion mask and scene reconstruction

arxiv.org/abs/2511.05229
November 10, 2025 at 2:39 PM
FastGS: Training 3D Gaussian Splatting in 100 Seconds

Shiwei Ren, Tianci Wen, Yongchun Fang, Biao Lu

tl;dr: multi-view consistency->densification and pruning

arxiv.org/abs/2511.04283
November 7, 2025 at 1:51 PM
Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization

Shaohan Li, Yunpeng Shi, Gilad Lerman

tl;dr: translation averaging->Welsch loss->Message-Passing Least Squares with distance-based cycle-consistency

arxiv.org/abs/2511.02329
November 5, 2025 at 1:00 PM
TurboMap: GPU-Accelerated Local Mapping for Visual SLAM

Parsa Hosseininejad, Kimia Khabiri, Shishir Gopinath, Soudabeh Mohammadhashemi, Karthik Dantu, Steven Y. Ko

tl;dr: GPU->triangulation & map point fusion & local BA; CPU->keyframe culling

arxiv.org/abs/2511.02036
November 5, 2025 at 12:59 PM
WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond

Zhicong Sun, Jacqueline Lo, Jinxing Hu

tl;dr: synthetic dataset

arxiv.org/abs/2510.27133
November 4, 2025 at 4:33 PM
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process

Jiayi Chen, Wenxuan Song, Pengxiang Ding, Ziyang Zhou, Han Zhao, Feilong Tang, Donglin Wang, Haoang Li

tl;dr: multiple modalities->single synchronous denoising trajectory

arxiv.org/abs/2511.01718
November 4, 2025 at 4:32 PM
JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting

Yuxuan Li, Tao Wang, Xianben Yang

tl;dr: Lucas-Kanade 3D optical flow+reprojection errors->poses; standard differentiable rendering3DGS parameters

arxiv.org/abs/2510.26117
October 31, 2025 at 8:39 AM
PointSt3R: Point Tracking through 3D Grounded Correspondence

Rhodri Guerrier, Adam W. Harley, @dimadamen.bsky.social

tl;dr: fine-tune MASt3R with point matching loss and visibility head to handle dynamic scenes

arxiv.org/abs/2510.26443
October 31, 2025 at 8:39 AM
The Impact and Outlook of 3D Gaussian Splatting

Bernhard Kerbl

tl;dr: in title

arxiv.org/abs/2510.26694
October 31, 2025 at 8:36 AM
STG-Avatar: Animatable Human Avatars via Spacetime Gaussian

Guangan Jiang, Tianzi Zhang, Dong Li, @ericzzj.bsky.social, Haoang Li, Mingrui Li, Hongyu Wang

tl;dr: 3DGS-based framework for high-fidelity animatable human avatar reconstruction

arxiv.org/abs/2510.22140
October 30, 2025 at 8:34 AM
Epipolar Geometry Improves Video Generation Models

Orest Kupyn, Fabian Manhardt, Federico Tombari, Christian Rupprecht

tl;dr: Wan->diverse videos->Sampson epipolar error->relative reward signals->Flow-DPO->video generation rankings->3D-consistent videos

arxiv.org/abs/2510.21615
October 30, 2025 at 8:31 AM
PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors

Xirui Jin, Renbiao Jin, Boying Li, Danping Zou, Wenxian Yu

tl;dr: depth & normal priors from DUSt3R+semantic planar priors from Grounded SAM & geometric cues & cross-view fusion

arxiv.org/abs/2510.23930
October 30, 2025 at 8:30 AM
Kineo: Calibration-Free Metric Motion Capture From Sparse RGB Cameras

Charles Javerliat, Pierre Raimbaud, Guillaume Lavoué

tl;dr: confidence-driven reliable correspondences+graph-based global optimization

arxiv.org/abs/2510.24464
October 30, 2025 at 8:29 AM
AtlasGS: Atlanta-world Guided Surface Reconstruction with Implicit Structured Gaussians

Xiyu Zhang, Chong Bao, Yipeng Chen, Hongjia Zhai, Yitong Dong, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

tl;dr: implicit-structured+semantic+Atlanta-world guided planar regularized GS

arxiv.org/abs/2510.25129
October 30, 2025 at 8:29 AM
RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience

Huilin Yin, Zhaolin Yang, Linchuan Zhang, Gerhard Rigoll, Johannes Betz

tl;dr: 3DGS rendering->low-pass filtering->fusion+adaptive tracking+CLIP-based enhancement

arxiv.org/abs/2510.22600
October 30, 2025 at 8:28 AM
TWC-SLAM: Multi-Agent Cooperative SLAM with Text Semantics and WiFi Features Integration for Similar Indoor Environments

Chunyu Li, Shoubin Chen, Dong Li, Weixing Xue, Qingquan Li

tl;dr: text semantics & WiFi features->multi-agent cooperative SLAM

arxiv.org/abs/2510.22754
October 30, 2025 at 8:27 AM
PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splatting

Changkun Liu, Bin Tan, Zeran Ke, Shangzhan Zhang, Jiachen Liu, Ming Qian, Nan Xue, Yujun Shen, Tristan Braud

arxiv.org/abs/2510.18714
October 23, 2025 at 7:03 PM
Advances in 4D Representation: Geometry, Motion, and Interaction

Mingrui Zhao, Sauradip Nag, Kai Wang, Aditya Vora, Guangda Ji, Peter Chun, Ali Mahdavi-Amiri, Hao Zhang

tl;dr: in title

arxiv.org/abs/2510.19255
October 23, 2025 at 7:01 PM
PoseCrafter: Extreme Pose Estimation with Hybrid Video Synthesis

Qing Mao, Tianxin Huang, Yu Zhu, Jinqiu Sun, Yanning Zhang, Gim Hee Lee

tl;dr: DynamiCrafter+ViewCrafter->intermediate frames->ORB+RANSAC->top k frames

arxiv.org/abs/2510.19527
October 23, 2025 at 7:00 PM
LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching

Aidyn Ubingazhibov, Rémi Pautrat, Iago Suárez, Shaohui Liu, @marcpollefeys.bsky.social, Viktor Larsson

tl;dr: LightGlue+GlueStick+line message

arxiv.org/abs/2510.16438
October 23, 2025 at 6:59 PM
GSPlane: Concise and Accurate Planar Reconstruction via Structured Representation

Ruitong Gan, Junran Peng, Yang Liu, Chuanchen Luo, Qing Li, Zhaoxiang Zhang

tl;dr: normals from Metric3D v2+planar regions from SAM->planar Gaussians

arxiv.org/abs/2510.17095
October 23, 2025 at 6:57 PM
Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS

Feng Zhou, Wenkai Guo, Pu Cao, Zhicheng Zhang, Jianqin Yin

tl;dr: initialization matters in sparse-view 3DGS

arxiv.org/abs/2510.17479
October 23, 2025 at 6:56 PM
PAGE-4D: Disentangled Pose and Geometry Estimation for 4D Perception

Kaichen Zhou, Yuhan Wang, Grace Chen, Xinhai Chang, Gaspard Beaudouin, Fangneng Zhan, Paul Pu Liang, Mengyu Wang

tl;dr: dynamic VGGT

arxiv.org/abs/2510.17568
October 23, 2025 at 6:55 PM
VAR-SLAM: Visual Adaptive and Robust SLAM for Dynamic Environments

João Carlos Virgolino Soares, Gabriel Fischer Abati, Claudio Semini

tl;dr: semantics->known dynamic objects; adaptive robust kernels->unknown dynamic objects

arxiv.org/abs/2510.16205
October 23, 2025 at 6:54 PM
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery

Jie-Ying Lee, Yi-Ruei Liu, Shr-Ruei Tsai, Wei-Cheng Chang, Chung-Ho Wu, Jiewen Chan, @ericzzj.bsky.social, Chieh Hubert Lin, Yu-Lun Liu

amazing project page & demo!
skyfall-gs.jayinnn.dev
arxiv.org/abs/2510.15869
October 20, 2025 at 5:06 PM