Tarek Bouamer
tarekbouamer.bsky.social
Tarek Bouamer
@tarekbouamer.bsky.social
Research Specialist @ATRC, 3D Computer Vision, Machine Learning & Robotics. Previously ICG @TU_Graz, Paris-Sud & CentraleSupélec 🎓.

Looking for innovative research opportunities 🔍 in AI, robotics, and 3D vision.
Reposted by Tarek Bouamer
Last week we launched IMC2025-Ongoing on
@kaggle.com

The dataset is exactly as in IMC2025, but the competition is on-going for a year, making it better for academic leaderboard and persistency.
kaggle.com/competitions...
1/2
Dmytro Mishkin 🇺🇦 on X: "Last week we launched IMC2025-Ongoing on @kaggle The dataset is exactly as in IMC2025, but the competition is on-going for a year, making it better for academic leaderboard and persistency. https://t.co/ejmeN3Gh5B 1/2" / X
Last week we launched IMC2025-Ongoing on @kaggle The dataset is exactly as in IMC2025, but the competition is on-going for a year, making it better for academic leaderboard and persistency. https://t.co/ejmeN3Gh5B 1/2
x.com
November 10, 2025 at 8:32 PM
Reposted by Tarek Bouamer
MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM

Yuxuan Zhou, Xingxing Li, Shengyu Li, Zhuohao Yan, Chunxi Xia, Shaoquan Feng

tl;dr: MASt3R-SLAM+IMU+GNSS

arxiv.org/abs/2509.20757
September 26, 2025 at 1:09 PM
Reposted by Tarek Bouamer
LongSplat a robust unposed 3D Gaussian Splatting for Casual Long Videos
web linjohnss.github.io/longsplat/
code github.com/NVlabs/LongS...
September 25, 2025 at 8:40 PM
Reposted by Tarek Bouamer
MapAnything, a simple, end-to-end trained transformer model that directly regresses the factored metric 3D geometry of a scene given various types of inputs (images, calibration, poses, or depth).
code: github.com/facebookrese...
web: map-anything.github.io
September 17, 2025 at 6:00 PM
Reposted by Tarek Bouamer
3D and 4D World Modeling: A Survey

tl;dr: in title

arxiv.org/abs/2509.07996
September 11, 2025 at 1:54 PM
Reposted by Tarek Bouamer
OmniMap: A General Mapping Framework Integrating Optics, Geometry, and Semantics

Yinan Deng, Yufeng Yue, Jianyu Dou, Jingyu Zhao, Jiahui Wang, Yujie Tang, Yi Yang, Mengyin Fu

tl;dr: optics, geometry, and semantics->3DGS-Voxel hybrid representation

arxiv.org/abs/2509.07500
September 10, 2025 at 8:04 PM
Reposted by Tarek Bouamer
Faster VGGT with Block-Sparse Global Attention

Chung-Shien Brian Wang, Christian Schmidt, Jens Piekenbrinck, Bastian Leibe

tl;dr: block-sparse attention replaces global attention

another work to improve scalability of VGGT

arxiv.org/abs/2509.07120
September 10, 2025 at 8:06 PM
Life is hard without the fast internet we’re used to.

I could not even join my Google Meet this morning. 😞
September 9, 2025 at 5:19 PM
Reposted by Tarek Bouamer
CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis

Xin Kong, Daniel Watson, Yannick Strümpler, @miniemeyer.bsky.social, Federico Tombari

tl;dr: a framewise attention layer with causal masking on top of a pretrained 2D diffusion backbone

arxiv.org/abs/2509.06579
September 9, 2025 at 11:01 AM
Stages of the eclipse, captured by a friend.
#mooneclipse
September 8, 2025 at 9:13 AM
Lunar eclipse over Abu Dhabi tonight at 22:20 PM 🌑🌕✨
September 7, 2025 at 8:16 PM
Reposted by Tarek Bouamer
Apply for the AITHYRA-CeMM International PhD Program!

15-20 fully funded PhD fellowships available in Vienna, AT
in AI/ML and Life Sciences

Deadline for applications:
10 September 2025 apply.cemm.at
July 25, 2025 at 10:27 AM
Reposted by Tarek Bouamer
Franca official code and pretrained models are up on github and pytorch hub! github.com/valeoai/franca
Eager to learn how will it be used.
July 28, 2025 at 7:20 PM
Reposted by Tarek Bouamer
Reconstruct, Inpaint, Finetune: Dynamic Novel-view Synthesis from Monocular Videos

Kaihua Chen, @tarashakhurana.bsky.social, Deva Ramanan

tl;dr: in title; fine-tune CogVideoX->train 2D video-inpainter

arxiv.org/abs/2507.12646
July 18, 2025 at 10:22 AM
Reposted by Tarek Bouamer
St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World

TL;DR: a feed-forward; (reconstructs+tracks dynamic video content); dust3r-like pointmaps for a pair of frames captured at different moments (1/2)
April 22, 2025 at 4:30 PM
July 15, 2025 at 11:49 PM
Reposted by Tarek Bouamer
🌌🛰️🔭Want to explore universal visual features? Check out our interactive demo of concepts learned from our #ICML2025 paper "Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment".

Come see our poster at 4pm on Tuesday in East Exhibition hall A-B, E-1208!
July 15, 2025 at 2:36 AM
Reposted by Tarek Bouamer
Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching

Yuhan Liu, Jingwen Fu, Yang Wu, Kangyi Wu, Pengna Li, Jiayi Wu, Sanping Zhou, Jingmin Xin

tl;dr: Stable Diffusion+attention-based prompt in LoFTR-type framework

no eval. on IMC

arxiv.org/abs/2507.10318
July 15, 2025 at 10:47 AM
Reposted by Tarek Bouamer
Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding

Yuchen Rao, Stefan Ainetter, Sinisa Stekovic, @vincentlepetit.bsky.social , Friedrich Fraundorfer

tl;dr: in title
arxiv.org/abs/2504.13580
April 28, 2025 at 9:13 AM
Reposted by Tarek Bouamer
A Guide to Structureless Visual Localization

Vojtech Panek, Qunjie Zhou, Yaqing Ding, Sérgio Agostinho, Zuzana Kukelova @sattlertorsten.bsky.social @lealtaixe.bsky.social

tl;dr: RoMa>MAST3r outdoors with 5pt solver, indoors MAST3r is king. M3Dv2 depth comparable to MAST3r
arxiv.org/abs/2504.17636
April 28, 2025 at 8:02 AM
Reposted by Tarek Bouamer
🚀 Never miss a beat in science again!

📬 Scholar Inbox is your personal assistant for staying up to date with your literature. It includes: visual summaries, collections, search and a conference planner.

Check out our white paper: arxiv.org/abs/2504.08385
#OpenScience #AI #RecommenderSystems
April 14, 2025 at 11:04 AM
Reposted by Tarek Bouamer
Super excited to share Visual Chronicles! Huge kudos to @boyangdeng.bsky.social on his fantastic internship work with us at Google DeepMind. It was one of the coolest and most fun projects I've ever been a part of!

Tell us what trends we discovered surprise you: boyangdeng.com/visual-chron...
April 14, 2025 at 3:40 PM
#EidMubarak to all my friends and colleagues celebrating! 🌙✨

May these blessed days bring joy, peace, and prosperity to you and your families. 🕌🤲
March 30, 2025 at 5:52 AM
March 9, 2025 at 8:50 PM
Reposted by Tarek Bouamer
(1/3) Happy to share LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes, that uplifts visual features from models such as DINOv2 (left) & CLIP (mid) to 3DGS scenes. Joint work w. @dlarlus.bsky.social @jmairal.bsky.social
Webpage & code: juliettemarrie.github.io/ludvig
January 31, 2025 at 9:59 AM