Cyril Diagne
cyrildiagne.bsky.social
Cyril Diagne
@cyrildiagne.bsky.social
AI indie hacker. Prev: founded Clipdrop (YCW21, acq. Stability AI), resident at Google A&C Lab, Prof. & head of MID at ECAL
Reposted by Cyril Diagne
Excited to share our new research at Jasper Research! 🚀

LBM: Latent Bridge Matching for Fast Image-to-Image Translation

Try out our @hf.co space for object relighting!

🤗 @gradio-hf.bsky.social demo: huggingface.co/spaces/jaspe...
👉 Paper: arxiv.org/abs/2503.07535
💻 Repo: github.com/gojasper/LBM
March 13, 2025 at 4:00 PM
Alright who’s making this
March 7, 2025 at 5:08 PM
Align3R: estimates camera poses and consistent depth maps from monocular videos.

Combining it with trackers like Cotracker3 or SAM2 could unlock many fun applications! (cf: VideoDoodles by Yu & al)

Project page (with demo): igl-hkust.github.io/Align3R.gith...
Code: github.com/jiah-cloud/A...
December 6, 2024 at 9:48 AM
Am I the only one amazed that this is what 2*4TB (with thermal case) looks like now?
December 4, 2024 at 9:41 AM
Oh no 😂 I’m torn between rofl for this troll and a fear to see this little drama escalating
November 28, 2024 at 10:38 AM
The past few months have been... intense! There's still quite some work to do before the finish line, but excited to launch in the coming weeks 💪⚡️
November 27, 2024 at 4:42 PM
Adobe Podcast V2 is a really impressive audio enhancer.
Is there any open-source tech close to it?

bsky.app/profile/pins...
Putting Adobe Podcast V2 to the test 🤯
November 22, 2024 at 8:38 AM
SAMURAI: improve the tracking robustness of SAM2 with 2 main contributions:
- adding motion information to the mask selection
- curating the memory bank based on motion cues

Project: yangchris11.github.io/samurai
Code: github.com/yangchris11/...
Paper: arxiv.org/abs/2411.11922
November 21, 2024 at 8:17 AM
Pyramid Flow is quite impressive for img2video, given than it was only trained on public datasets. Clearly not as dynamic and stable as commercial solutions, but the gap seems to be closing github.com/jy0205/Pyram...
November 20, 2024 at 9:35 AM
A bit surprised with this data from Clerk on sign-in methods preferences: From a sample of 2.5M sign-in, <2% of users chose to use magic links.
November 20, 2024 at 9:31 AM
"A nicely maintained and over-spec’d server just has a smell to it" - great writeup by @kcimc.bsky.social benchmarking various cloud GPU providers for a realtime diffusion installation: kcimc.medium.com/realtime-dif...
Realtime diffusion in the cloud
In December 2023, I implemented a realtime diffusion toolkit with Daito Manabe and Rhizomatiks. The toolkit is based on SDXL Turbo running…
kcimc.medium.com
November 19, 2024 at 9:18 AM
Very cool idea for working with videos and transformers: remove duplicate patches and add length embeddings instead. They get a 40% speedup finetuning Vit-L using example packing.

Run-Length Tokenization (NeurIPS 2024 spotlight):
Project: rccchoudhury.github.io/rlt
Code: github.com/rccchoudhury...
November 18, 2024 at 9:42 AM
Hello there! I’m an AI indie hacker based in Paris.
I love everything related to generative AI, from research papers and techniques to cool AI content, frontend dev…etc.
I’m currently working on generative video models, and working hard to do a public release in the coming weeks.
Cheers ✌️
November 15, 2024 at 10:27 PM