The paper introduces an interesting pre-training pipeline for handling long context, and the model was trained on 4.4T tokens: arxiv.org/pdf/2504.07491
• 256M delivers 80% of the performance of our 2.2B model.
• 500M hits 90%.
Both beat our SOTA 80B model from 17 months ago! 🎉
Efficiency 🤝 Performance
Explore the collection here: huggingface.co/collections/...
Blog: huggingface.co/blog/smolervlm
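If you want to try one of the small checkpoints yourself, here is a minimal sketch using the standard transformers vision-to-sequence interface. The repo id `HuggingFaceTB/SmolVLM-256M-Instruct` and the example image URL are assumptions for illustration; check the linked collection for the actual model names.

```python
import torch
from transformers import AutoProcessor, AutoModelForVision2Seq
from transformers.image_utils import load_image

# Assumed repo id for the 256M instruct checkpoint; see the collection for current names.
model_id = "HuggingFaceTB/SmolVLM-256M-Instruct"

processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# Any image URL or local path works here; this one is a placeholder.
image = load_image("https://example.com/sample.jpg")

# Build a chat-style prompt with one image and one text turn.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "Describe this image."},
        ],
    }
]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)

inputs = processor(text=prompt, images=[image], return_tensors="pt")
generated_ids = model.generate(**inputs, max_new_tokens=100)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0])
```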