Martin Jaggi
@mjaggi.bsky.social
Prof at EPFL
AI • Climbing
Our method results in >6x faster pre-training of LLMs across many languages. The approach is applicable to both high- and low-resource languages, and also improves upon its English-speaking DCLM cousin, which inspired this work.
April 23, 2025 at 5:06 AM
Using the 'right' data can hugely speed up LLM training, but how do you find the best training data in the vast sea of a whole web crawl?

We propose a simple classifier-based selection method, enabling multilingual LLMs 🧵
April 23, 2025 at 5:06 AM
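As a rough illustration of the classifier-based selection idea described in the thread above (not the authors' released code), the sketch below scores each web document with a quality classifier and keeps only the top-scoring fraction for pre-training. The fastText model, the label name __label__hq, the keep_fraction value, and the select_documents helper are all illustrative assumptions; in practice this would run per language over an entire web crawl.

# Minimal sketch of classifier-based pre-training data selection.
# Assumptions (not from the post): a fastText quality classifier stored at
# `model_path`, a "high quality" label named __label__hq, and a keep_fraction
# hyperparameter controlling how much of the crawl is retained.
import fasttext

def select_documents(docs, model_path, keep_fraction=0.1):
    """Score each document with the classifier and keep the top-scoring fraction."""
    model = fasttext.load_model(model_path)
    scored = []
    for doc in docs:
        # fastText's predict() cannot handle newlines in the input text.
        labels, probs = model.predict(doc.replace("\n", " "))
        # Use the probability of the high-quality label as the document's score.
        score = probs[0] if labels[0] == "__label__hq" else 1.0 - probs[0]
        scored.append((score, doc))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    cutoff = max(1, int(len(scored) * keep_fraction))
    return [doc for _, doc in scored[:cutoff]]

Keeping only a small top-scoring fraction per language is what makes the selected corpus both higher quality and still multilingual; the exact classifier and threshold here are placeholders.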
The Swiss AI Initiative has launched open calls for disruptive ideas, democratizing large-scale AI for the benefit of society.

Send your idea by the end of March 🏃‍♂️‍➡️, and run it on one of the largest public AI clusters globally. Everyone is eligible to apply!

swiss-ai.org
March 4, 2025 at 11:13 PM
new open-weights 24B model, with performance comparable to Llama 3.3 70B 😮. congrats Mistral team!
mistral.ai/news/mistral...
January 30, 2025 at 7:01 PM