Lightnews — Scholar-powered news

Sergio Paniego

@sergiopaniego.bsky.social

64 followers 71 following 18 posts

AI PhD. Technology enables us to be more human. 🏳️‍🌈


Sergio Paniego

@sergiopaniego.bsky.social

🧠 Following Hugging Face's blog on scaling test-time compute with open models—letting models "think longer," inspired by OpenAI & DeepMind—I created a recipe to extend inference time for Instruct LLMs, tackling harder tasks like complex math problems.

Links below 👇

[Image: Scaling test-time compute with open models diagram]

January 7, 2025 at 10:34 AM

Sergio Paniego

@sergiopaniego.bsky.social

I’m a big fan of smol models—compact, efficient, and perfect for inference/training on limited resources. Even better when they’re multimodal! 🤏✨

I explored fine-tuning SmolVLM, a multimodal smol model, using TRL with SFT and DPO, creating 2 hands-on projects!

🔗Links below👇

December 18, 2024 at 8:23 AM

Sergio Paniego

@sergiopaniego.bsky.social

💡I've been exploring how to go smol with multimodal RAG.

I've built a project using SmolVLM and ColSmolVLM: a multimodal RAG pipeline that can run on Colab's free tier.

Featuring:
🤏👀 SmolVLM (VLM)
🤏📚 ColSmolVLM (Doc Retrieval)
⚙️ Runs in Colab's free-tier GPU

Link below

December 16, 2024 at 5:23 PM

Sergio Paniego

@sergiopaniego.bsky.social

💡 New Multimodal RAG Recipe with Re-Ranking 💡

I explored how to enhance a multimodal RAG pipeline by integrating a re-ranker!

Featuring:
✨ Qwen2-VL-7B (VLM)
📚 ColQwen2 (Doc Retrieval)
🔍 MonoQwen2 (Re-ranking)
🔥 Optimized for consumer GPUs with quantized VLMs.

Link below:

December 12, 2024 at 5:29 PM

Sergio Paniego

@sergiopaniego.bsky.social

✨ Gave a talk on autonomous driving today to undergrad students! We covered everything from definitions to real-world examples, plus cutting-edge concepts like Generative World Models and Vision-Language Models (VLMs). Exciting future ahead! 🚗💡

December 3, 2024 at 5:12 PM

Sergio Paniego

@sergiopaniego.bsky.social

I've been exploring the latest Llama 3.2 releases and working on a couple of projects you may find interesting:

1️⃣ Understanding tool calling with Llama 3.2 🔧
2️⃣ Using Text Generation Inference (TGI) with Llama models 🦙

(links in the next post)

November 29, 2024 at 10:10 AM

Sergio Paniego

@sergiopaniego.bsky.social

💡 A few days ago, I came across a fascinating post about Agentic RAG by Erika Cardenas and Leonie Monigatti, and it inspired me to dive into the concept and bring it to life in code!

November 27, 2024 at 5:26 PM
