Lily Goli
banner
lilygoli.bsky.social
Lily Goli
@lilygoli.bsky.social
PhD student at University of Toronto, Research Intern at Waabi, ex. Google DeepMind, 3D Vision Enthusiast
That words my frustration perfectly! Tho the other side of the coin is that you’ll probably become an expert in that subject which can be useful.
December 7, 2024 at 4:13 PM
Disclaimer: these are personalized conclusions i.e. your personality has a huge effect on how accurate these thoughts are in your case.

Note: picture is from my breakfast at GDM.
December 6, 2024 at 9:46 PM
Thought 3:
Developing your own code is a great educational process but always be aware of the open-source code out there. Probably a polished released code from months of effort is more useful/better than your code that was written in half a day and was not tested extensively.
December 6, 2024 at 9:46 PM
Thought 2: collaboration
Good collaborators can have so much impact on your progress, however you shouldn’t necessarily start a collaboration with every brilliant person out there. Only do so if you have a clear idea of how you all fit into the project. Or find a project that fits the collaboration
December 6, 2024 at 9:46 PM
Is that exciting tho? For me, not as much as diving into something new.

Is it wise to switch focus often? Probably not. There’s a balance—and with the field moving so fast, it often leans toward sticking with what you know best.
December 6, 2024 at 9:46 PM
Thought1: speed
As PhD students we usually want to maximize publications which might lead to writing quick papers. Can a paper written relatively fast be a good paper? I would argue: only if it’s building on your previous good work, i.e. you already know a lot about that subject. /
December 6, 2024 at 9:46 PM
I have not used it in a loss before, but I do agree that if the epipolar error is not too high or too low compared to your overall stats, you probably can't rely on it to infer anything.
December 2, 2024 at 10:57 PM
This was my last work at Google DeepMind. An amazing experience. I am very happy that I had the chance to work with my great collaborators on this project: Sara Sabour, Mark Matthews, @marcusabrubaker.bsky.social, Dmitry Lagun, Alec Jacobson, David Fleet, Saurabh Saxena and @taiyasaki.bsky.social
December 2, 2024 at 3:14 AM
…and with that we get good quality masks, without human annotation or synthetic supervision!
Look at our website for more results!
romosfm.github.io
December 2, 2024 at 3:14 AM
We train a tiny MLP that classifies SAMv2 features as moving or static given the weak supervisory signal from high and low error masks. These features help complete the motion masks over the video effectively!
December 2, 2024 at 3:14 AM
so...how does it work?

We find the Fundamental matrix between each two adjacent frames in the video with RANSAC. We then identify parts of the frame that have a very low or a very high epipolar error, as weak supervision signals to find the moving objects. Now, how do we complete these signals?
December 2, 2024 at 3:14 AM
Okay but why care about motion masks?
We show that good motion masks improve SfM performance, making COLMAP+our masks the SOTA on synthetic benchmarks. We also collect a real evaluation dataset with GT camera pose using a robotic arm, to evaluate our method in real casual captures.
December 2, 2024 at 3:14 AM
Our masks are robust to slow or fast camera movements and can find multiple moving objects even when they are in the background. Look at the pedestrian! 🚶
December 2, 2024 at 3:14 AM
First and foremost: some results!!
In RoMo an optimization process disentangles camera ego motion from scene motion, yielding masks for moving objects 🛵
December 2, 2024 at 3:14 AM