antix5.bsky.social
@antix5.bsky.social
Reposted
Ever wonder how LLM developers choose their pretraining data? It’s not guesswork— all AI labs create small-scale models as experiments, but the models and their data are rarely shared.
DataDecide opens up the process: 1,050 models, 30k checkpoints, 25 datasets & 10 benchmarks 🧵
April 15, 2025 at 1:01 PM
Reposted
This is *extremely* cool

I'm increasingly excited about using the OLMo based apps for daily use - I find the playground genuinely better than the commercial apps whenever I need some originality, and the transparency/privacy guarantees are just so much stronger
For years it’s been an open question — how much is a language model learning and synthesizing information, and how much is it just memorizing and reciting?

Introducing OLMoTrace, a new feature in the Ai2 Playground that begins to shed some light. 🔦
April 9, 2025 at 4:43 PM
Reposted
An assembly of 18 European companies, labs, and universities have banded together to launch 🇪🇺 EuroBERT!

It's a state-of-the-art multilingual encoder for 15 European languages, designed to be finetuned for retrieval, classification, etc.

Details in 🧵
March 10, 2025 at 9:43 AM
Reposted
I shared a controversial take the other day at an event and I decided to write it down in a longer format: I’m afraid AI won't give us a "compressed 21st century"

Here: thomwolf.io/blog/scienti...

It's an extension of this interview discussion from the AI summit: youtu.be/AxBd3G0lFLs?...
March 6, 2025 at 1:03 PM
Reposted
We've just released the new Spaces search and it's quite mind blowing

Explore over 400k AI Apps in the most intuitive way

background removal, image-to-3D, comic factory, sound transcription, image editing, clothes virtual try-on, etc

All made by AI builders for AI builders

huggingface.co/spaces
February 5, 2025 at 9:45 PM
Reposted
🏎️ Today I'm introducing a method to train static embedding models that run 100x to 400x faster on CPU than common embedding models, while retaining 85%+ of the quality!

Including 2 models with training scripts, datasets, metrics, evals, ideation, all public.

Details in 🧵
January 15, 2025 at 3:26 PM
Reposted
Everything that was released passed week in open AI 🤠

> Link to all models, datasets, demos huggingface.co/collections/...
> Text-readable version is here huggingface.co/posts/merve/...
January 17, 2025 at 3:28 PM