Lightnews — Scholar-powered news

Luis

@lusxvr.bsky.social

34 followers 12 following 12 posts

Research @HuggingFace | CS @ TUM

Luis

@lusxvr.bsky.social

Today, we are open-sourcing our pipeline to deduplicate large-scale image datasets.

On one GPU, we can deduplicate 10k images against 1M indexed test images in ~60 seconds. But how?

July 2, 2025 at 2:08 PM

Luis

@lusxvr.bsky.social

New blog post for nanoVLM is live!

“nanoVLM – The simplest way to train a VLM in pure PyTorch”

We break down the full stack: architecture (SigLIP + SmolLM2), pixel shuffle, training pipeline, and inference.

With Colab + HF Space to try it out.

May 21, 2025 at 1:19 PM

Luis

@lusxvr.bsky.social

We just dropped full DDP support in native PyTorch for nanoVLM! Huge thanks to @geronimo-ai.bsky.social for the majority of this PR. Check out his blog post, which explains how to do it: medium.com/@geronimo7/g...

May 19, 2025 at 10:07 AM

Reposted by Luis

Andi

@andimara.bsky.social

Real-time SmolVLM in a web browser with transformers.js.

All running locally with no installs. Just open the website.

May 14, 2025 at 3:39 PM
