Lightnews — Scholar-powered news

Reposted by Dávid Komorowicz

Joe Kider

@hepcatjk.bsky.social

Choosing the right colormap is tricky: too often, colormaps hide subtle details or distort the data. Our new method transforms colormaps to boost local contrast and reveal just-noticeable differences, all while keeping the visualization perceptually accurate and accessible.

dl.acm.org/doi/10.1145/...

August 15, 2025 at 3:44 PM

Reposted by Dávid Komorowicz

Andrei Bursuc

@abursuc.bsky.social

1/ Can open-data models beat DINOv2? Today we release Franca, a fully open-source vision foundation model. Franca with a ViT-G backbone matches (and often beats) proprietary models like SigLIPv2, CLIP, and DINOv2 on various benchmarks, setting a new standard for open-source research.

July 21, 2025 at 2:47 PM

Reposted by Dávid Komorowicz

Wenzel Jakob

@wjakob.bsky.social

How can one reconstruct the complete 3D interior of a wood block using only photos of its surfaces? 🪵
At SIGGRAPH'25 (Thursday!), Maria Larsson will present *Mokume*: a dataset of 190 diverse wood samples and a pipeline that solves this inverse texturing challenge. 🧵👇

August 8, 2025 at 11:53 AM

Reposted by Dávid Komorowicz

Dmytro Mishkin

@ducha-aiki.bsky.social

VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization

Sania Waheed, Na Min An, Michael Milford, Sarvapali D. Ramchurn, Shoaib Ehsan

tl;dr: in title
arxiv.org/abs/2507.17455

August 5, 2025 at 8:11 AM

Reposted by Dávid Komorowicz

Zhenjun Zhao

@ericzzj.bsky.social

Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping

Chong Cheng, Zijian Wang, Sicheng Yu, Yu Hu, Nanjie Yao, Hao Wang

tl;dr: submap alignment->point cloud registration->robust Umeyama algorithm->global point cloud and camera trajectory

arxiv.org/abs/2507.18541

July 25, 2025 at 12:43 PM
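
For readers unfamiliar with the Umeyama step named in the tl;dr above, here is a minimal sketch of the classic (non-robust) closed-form Umeyama alignment between two point sets with known correspondences. It is not the paper's probabilistic or robust variant; the function name `umeyama` and the toy data are assumptions made for illustration only.

```python
import numpy as np

def umeyama(src, dst):
    """Least-squares similarity transform (s, R, t) such that dst ~= s * R @ src + t.

    src, dst: (n, dim) arrays of corresponding points.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    n, dim = src.shape

    mu_src, mu_dst = src.mean(axis=0), dst.mean(axis=0)
    src_c, dst_c = src - mu_src, dst - mu_dst

    # Cross-covariance between the centered point sets.
    cov = dst_c.T @ src_c / n
    U, D, Vt = np.linalg.svd(cov)

    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    S = np.eye(dim)
    if np.linalg.det(U) * np.linalg.det(Vt) < 0:
        S[-1, -1] = -1.0

    R = U @ S @ Vt
    var_src = (src_c ** 2).sum() / n
    s = (D * np.diag(S)).sum() / var_src
    t = mu_dst - s * R @ mu_src
    return s, R, t

# Toy check (hypothetical data): recover a known similarity transform.
rng = np.random.default_rng(0)
src = rng.normal(size=(50, 3))
angle = np.deg2rad(30.0)
rot_true = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                     [np.sin(angle),  np.cos(angle), 0.0],
                     [0.0,            0.0,           1.0]])
scale_true = 2.5
trans_true = np.array([1.0, -2.0, 0.5])
dst = scale_true * src @ rot_true.T + trans_true

scale_est, rot_est, trans_est = umeyama(src, dst)
```

On noise-free correspondences the estimate should match the ground-truth transform up to floating-point error. A robust variant would typically wrap this solver in outlier rejection or reweighting, which is left out here.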

Reposted by Dávid Komorowicz

Johan Edstedt

@parskatt.bsky.social

New 3D foundation model dropped.

Note: Seems they might have messed up their image matching metrics (seems like acc rather than auc), but should be at least as good as mast3r.

July 24, 2025 at 10:50 PM

Dávid Komorowicz

@dawars.me

Turns out that by default Hugging Face models run on the CPU...

July 20, 2025 at 12:10 PM

Reposted by Dávid Komorowicz

Guillaume Dalle

@gdalle.bsky.social

Awesome initiative 🎉
This leaves me wondering though: how come authors attending #EurIPS still have to register for the main #NeurIPS (in the Americas) for their paper to be considered accepted?
You stopped so short of actually allowing ML researchers to fly less!

A meme where Gru explains his evil plan.
1. Organize NeurIPS event in Europe
2. Reduce carbon footprint and barriers to entry
3. Force accepted authors to present in the US

July 17, 2025 at 2:12 PM

Reposted by Dávid Komorowicz

Guillaume Dalle

@gdalle.bsky.social

So far it doesn’t look good: neurips.cc/FAQ/AuthorRe...

“At least one author of each accepted paper must register for the main conference. A ‘Virtual Only Pass’ is not sufficient.”

A meme where Anakin and Padme discuss the logic of allowing a NeurIPS event in Europe while forcing authors to also present in the US for publication

July 17, 2025 at 7:32 AM

Reposted by Dávid Komorowicz

Ashley Lynch ✂️🎞️

@ashleylynch.bsky.social

WeTransfer just changed their TOS, giving themselves permission to train AI on any content you transfer and to produce derivative works from it, which they are allowed to monetize and you are not allowed payment for.

Stop using WeTransfer.

July 14, 2025 at 11:05 PM

Reposted by Dávid Komorowicz

Linus Härenstam-Nielsen

@linushn.bsky.social

The code for our #CVPR2025 paper, PRaDA: Projective Radial Distortion Averaging, is now out!

Turns out distortion calibration from multiview 2D correspondences can be fully decoupled from 3D reconstruction, greatly simplifying the problem

arxiv.org/abs/2504.16499
github.com/DaniilSinits...

July 9, 2025 at 1:54 PM

Reposted by Dávid Komorowicz

Christoph Reich

@christophreich.bsky.social

🦖 We present “Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion”. #ICCV2025
🌍: visinf.github.io/scenedino/
📃: arxiv.org/abs/2507.06230
🤗: huggingface.co/spaces/jev-a...
@jev-aleks.bsky.social @fwimbauer.bsky.social @olvrhhn.bsky.social @stefanroth.bsky.social @dcremers.bsky.social

July 9, 2025 at 1:18 PM

Reposted by Dávid Komorowicz

Paul-Edouard Sarlin

@pesarlin.bsky.social

We just released COLMAP v3.12, which adds long-awaited, end-to-end support for multi-camera rigs and 360° panoramas 👀 COLMAP just got better at handling your robotics, AR/VR, or 360 data - try it yourself and let us know! github.com/colmap/colma... Kudos to Johannes & team for this great work 🚀

July 1, 2025 at 4:33 PM

Reposted by Dávid Komorowicz

Dmytro Mishkin

@ducha-aiki.bsky.social

Dense Match Summarization for Faster Two-view Estimation

Jonathan Astermark, Anders Heyden, Viktor Larsson
tl;dr: use clustering to reduce RANSAC time when using dense methods like RoMa.
Kudos for eval on WxBS.
P.S. now the same, but for BA?

arxiv.org/abs/2506.028...

June 24, 2025 at 12:22 PM

Reposted by Dávid Komorowicz

Lu Sang

@lu-sang.bsky.social

🤗 I’m excited to share our recent work: TwoSquared: 4D Reconstruction from 2D Image Pairs.
🔥 Our method produces geometry-consistent, texture-consistent, and physically plausible 4D reconstructions
📰 Check our project page sangluisme.github.io/TwoSquared/
❤️ @ricmarin.bsky.social @dcremers.bsky.social

April 23, 2025 at 4:48 PM

Reposted by Dávid Komorowicz

Dominik Schnaus

@schnaus.bsky.social

Can we match vision and language representations without any supervision or paired data?

Surprisingly, yes!

Our #CVPR2025 paper with @neekans.bsky.social and @dcremers.bsky.social shows that the pairwise distances in both modalities are often enough to find correspondences.

⬇️ 1/4

June 3, 2025 at 9:27 AM
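
As a toy illustration of the idea in the post above (matching two sets of items using only their pairwise distances), the sketch below brute-forces the permutation that best aligns two distance matrices. It is an assumption-laden miniature and not the paper's method: the second "modality" is just a rotated, shuffled copy of the first, and names such as `match_by_distances` and `hidden` are hypothetical.

```python
import itertools
import math

def pairwise(points):
    """Matrix of Euclidean distances between all pairs of points."""
    return [[math.dist(p, q) for q in points] for p in points]

def match_by_distances(dist_a, dist_b):
    """Find perm minimizing sum_ij (dist_a[i][j] - dist_b[perm[i]][perm[j]])^2.

    Exhaustive search, so only feasible for a handful of items.
    """
    n = len(dist_a)
    best_perm, best_cost = None, float("inf")
    for perm in itertools.permutations(range(n)):
        cost = sum(
            (dist_a[i][j] - dist_b[perm[i]][perm[j]]) ** 2
            for i in range(n)
            for j in range(n)
        )
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    return best_perm, best_cost

# Space A: five 2D embeddings (hypothetical data).
space_a = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (3.0, 1.0), (2.0, 4.0)]

# Space B: the same items, rotated by 40 degrees and stored in shuffled order.
# hidden[i] is the index in space B of item i from space A.
hidden = [3, 0, 4, 1, 2]
angle = math.radians(40.0)
space_b = [None] * len(space_a)
for i, (x, y) in enumerate(space_a):
    space_b[hidden[i]] = (
        x * math.cos(angle) - y * math.sin(angle),
        x * math.sin(angle) + y * math.cos(angle),
    )

perm, cost = match_by_distances(pairwise(space_a), pairwise(space_b))
```

Because rotation preserves distances and these five points have no symmetric arrangement, the recovered `perm` equals `hidden` with a cost near zero. Real embeddings from two modalities only approximately share distance structure, and the exhaustive search grows factorially, so a practical system would need an approximate solver.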

Reposted by Dávid Komorowicz

Felix Wimbauer

@fwimbauer.bsky.social

Can you train a model for pose estimation directly on casual videos without supervision?

Turns out you can!

In our #CVPR2025 paper AnyCam, we directly train on YouTube videos and achieve SOTA results by using an uncertainty-based flow loss and monocular priors!

⬇️

May 13, 2025 at 8:11 AM

Reposted by Dávid Komorowicz

hardmaru

@hardmaru.bsky.social

We also found that this allows the CTM to decide to spend less time thinking on simpler images, thus saving energy. When identifying a gorilla, for example, the CTM’s attention moves from eyes to nose to mouth in a pattern remarkably similar to human visual attention.

May 12, 2025 at 2:42 AM

Reposted by Dávid Komorowicz

Zhenjun Zhao

@ericzzj.bsky.social

High Dynamic Range Novel View Synthesis with Single Exposure

Kaixuan Zhang, Hu Wang, Minxian Li, Mingwu Ren, Mao Ye, Xiatian Zhu

tl;dr: single exposure LDR images in training; LDR image->model+lift->HDR colors; HDR image->LDR image->additional supervision

arxiv.org/abs/2505.01212

May 5, 2025 at 8:52 PM

Reposted by Dávid Komorowicz

Stefano Esposito

@s-esposito.bsky.social

📢 New paper at CVPR 25!
Can meshes capture fuzzy geometry? Volumetric Surfaces uses adaptive textured shells to model hair and fur without the splatting / volume overhead. It’s fast, looks great, and runs in real time even on budget phones.
🔗 autonomousvision.github.io/volsurfs/
📄 arxiv.org/pdf/2409.02482

May 5, 2025 at 1:00 PM

Reposted by Dávid Komorowicz

Nando Metzger

@nandometzger.bsky.social

8th ZurichCV is on the 29th of April. We have two fantastic speakers: Linus Scheibenreif (ETH Zurich) will talk about self-supervised learning for satellite imagery, and Pascal Chang (Disney Research) will give us a preview of his soon-to-be-published work.

RSVP: www.zurichai.ch/events/zuric...

ZurichCV #9 | ZurichAI

Linus Scheibenreif (ETH Zurich) will talk about self-supervised learning for satellite imagery, and Pascal Chang (ETH Zurich/Disney Research) will present his recent work (topic to be announced).

www.zurichai.ch

April 20, 2025 at 6:31 AM

Reposted by Dávid Komorowicz

Richard Meredith

@rtm223.me

No meal has ever sustained me for more than a few hours, a mere blip on the timeline of my life, 0.001% of my expected lifespan. So therefore I'll no longer be paying at restaurants

Hypervisible @hypervisible.blacksky.app · Apr 17

Nasty work. www.vanityfair.com/news/story/m...

But their defense also hinges on the argument that the individual books themselves are, essentially, worthless—one expert witness for Meta describes that the influence of a single book in LLM pretraining “adjusted its performance by less than 0.06% on industry standard benchmarks, a meaningless change no different from noise.” Furthermore, Meta says, that while the company “has invested hundreds of millions of dollars in LLM development,” they see no market in paying authors to license their books because “for there to be a market, there must be something of value to exchange, but none of Plaintiffs works has economic value, individually, as training data.” (An argument essential to fair use, but that also sounds like a scaled up version of a scenario in which the New York Philharmonic board argues against paying individual members of the orchestra because the organization spent a lot of money on the upkeep of David Geffen Hall, and also, a solo bassoon cannot play every part in “The Rite of Spring.”)

April 17, 2025 at 11:53 AM

Reposted by Dávid Komorowicz

Giorgos Tolias

@gtolias.bsky.social

The Visual Recognition Group at CTU in Prague organizes the 49th Pattern Recognition and Computer Vision Colloquium with D. Karatzas, M. Masana, T. Tommasi, P. Mettes @pascalmettes.bsky.social, E. Brachmann @ericbrachmann.bsky.social and V. Stojnic @stojnicv.xyz

cmp.felk.cvut.cz/colloquium/#...

April 7, 2025 at 1:57 PM

Reposted by Dávid Komorowicz

Delio Vicini

@deliovicini.bsky.social

3D Gaussian splatting relies on depth-sorting of splats, which is costly and prone to artifacts (e.g., "popping"). In our latest work, "StochasticSplats", we replace sorted alpha blending by stochastic transparency, an unbiased Monte Carlo estimator from the real-time rendering literature.

April 7, 2025 at 7:57 AM
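
To make the contrast in the post above concrete, here is a minimal single-pixel sketch comparing sorted alpha blending with a stochastic-transparency estimate. It is a toy under stated assumptions (scalar grayscale colors, a handful of hand-picked fragments, hypothetical function names) and not the StochasticSplats implementation.

```python
import random

def sorted_alpha_blend(fragments, background):
    """Reference: front-to-back 'over' compositing; requires sorting by depth."""
    color, transmittance = 0.0, 1.0
    for _depth, alpha, frag_color in sorted(fragments):
        color += transmittance * alpha * frag_color
        transmittance *= 1.0 - alpha
    return color + transmittance * background

def stochastic_transparency(fragments, background, num_samples, rng):
    """Monte Carlo estimate that never sorts the fragments.

    Per sample, each fragment survives with probability alpha, and the nearest
    survivor wins a plain depth test. Averaging the samples gives an unbiased
    estimate of the sorted result.
    """
    total = 0.0
    for _ in range(num_samples):
        nearest_depth, sample_color = float("inf"), background
        for depth, alpha, frag_color in fragments:  # arbitrary order
            if rng.random() < alpha and depth < nearest_depth:
                nearest_depth, sample_color = depth, frag_color
        total += sample_color
    return total / num_samples

# (depth, alpha, color) fragments covering one pixel, deliberately unsorted.
fragments = [(2.0, 0.5, 1.0), (1.0, 0.3, 0.2), (3.0, 0.8, 0.6)]
background = 0.0

reference = sorted_alpha_blend(fragments, background)
estimate = stochastic_transparency(fragments, background, 20000, random.Random(0))
```

The estimate converges to the reference as the sample count grows, at the price of Monte Carlo noise. That trades sorting cost and popping artifacts for variance.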

Reposted by Dávid Komorowicz

Munich Center for Machine Learning

@munichcenterml.bsky.social

𝗠𝗖𝗠𝗟 𝗕𝗹𝗼𝗴: Robots & self-driving cars rely on scene understanding, but AI models for understanding these scenes need costly human annotations. Daniel Cremers & his team introduce 🥤🥤 CUPS: a scene-centric unsupervised panoptic segmentation approach to reduce this dependency. 🔗 mcml.ai/news/2025-04...

April 3, 2025 at 9:45 AM
