Cyril Diagne
cyrildiagne.bsky.social
Cyril Diagne
@cyrildiagne.bsky.social
AI indie hacker. Prev: founded Clipdrop (YCW21, acq. Stability AI), resident at Google A&C Lab, Prof. & head of MID at ECAL
How does one keep track? The monodepth/tracking field these days:
December 6, 2024 at 10:05 AM
Align3R: estimates camera poses and consistent depth maps from monocular videos.

Combining it with trackers like Cotracker3 or SAM2 could unlock many fun applications! (cf: VideoDoodles by Yu & al)

Project page (with demo): igl-hkust.github.io/Align3R.gith...
Code: github.com/jiah-cloud/A...
December 6, 2024 at 9:48 AM
Resemble Enhance seems pretty good: github.com/resemble-ai/...
November 22, 2024 at 8:39 AM
SAMURAI: improve the tracking robustness of SAM2 with 2 main contributions:
- adding motion information to the mask selection
- curating the memory bank based on motion cues

Project: yangchris11.github.io/samurai
Code: github.com/yangchris11/...
Paper: arxiv.org/abs/2411.11922
November 21, 2024 at 8:17 AM
Pyramid Flow is quite impressive for img2video, given than it was only trained on public datasets. Clearly not as dynamic and stable as commercial solutions, but the gap seems to be closing github.com/jy0205/Pyram...
November 20, 2024 at 9:35 AM
Very cool idea for working with videos and transformers: remove duplicate patches and add length embeddings instead. They get a 40% speedup finetuning Vit-L using example packing.

Run-Length Tokenization (NeurIPS 2024 spotlight):
Project: rccchoudhury.github.io/rlt
Code: github.com/rccchoudhury...
November 18, 2024 at 9:42 AM