Jonas Hübotter
@jonhue.bsky.social
PhD student at ETH Zurich
jonhue.github.io
Paper: arxiv.org/pdf/2410.05026
Joint work with the amazing @marbaga.bsky.social, @gmartius.bsky.social, @arkrause.bsky.social
July 14, 2025 at 7:38 PM
We propose an algorithm that does this by actively maximizing expected information gain of the demonstrations, with a couple of tricks to estimate this quantity and mitigate forgetting.
Interestingly, this solution is viable even without any information about pre-training!
July 14, 2025 at 7:35 PM
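Here's a rough sketch of the core idea: greedy selection by expected information gain under a Gaussian surrogate over normalized embeddings. All names and parameters below are illustrative, not the paper's implementation.

import numpy as np

def posterior_variances(K, selected, noise=1e-2):
    """Posterior variance of every candidate given the already selected set."""
    if not selected:
        return np.diag(K).copy()
    S = list(selected)
    K_SS = K[np.ix_(S, S)] + noise * np.eye(len(S))
    K_xS = K[:, S]
    # var(x | S) = k(x, x) - k(x, S) K_SS^{-1} k(S, x)
    reduction = np.einsum("ij,jk,ik->i", K_xS, np.linalg.inv(K_SS), K_xS)
    return np.maximum(np.diag(K) - reduction, 0.0)

def select_by_information_gain(embeddings, budget, noise=1e-2):
    """Greedily pick the demonstration with the largest expected information gain."""
    X = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    K = X @ X.T  # dot-product kernel on normalized embeddings
    selected = []
    for _ in range(budget):
        var = posterior_variances(K, selected, noise)
        gain = 0.5 * np.log1p(var / noise)  # info gained by observing one more point
        gain[selected] = -np.inf            # never pick the same point twice
        selected.append(int(np.argmax(gain)))
    return selected

Each pick maximizes the information gained given everything selected so far, so near-duplicates of already chosen demonstrations score low automatically.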
Our method significantly improves predictive performance (measured by perplexity) for large language models and achieves a new state of the art on the Pile benchmark.
If you're interested in test-time training or active learning, come chat with me at our poster session!
April 21, 2025 at 2:40 PM
We introduce SIFT, a novel data selection algorithm for test-time training of language models. Unlike traditional nearest neighbor methods, SIFT uses uncertainty estimates to select maximally informative data, balancing relevance & diversity.
April 21, 2025 at 2:40 PM
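Roughly, you can think of the selection as picking the data whose observation most reduces uncertainty about the test prompt. Below is a minimal sketch of that idea (an illustrative reimplementation under a dot-product kernel on unit-normalized embeddings, not the released code).

import numpy as np

def sift_style_select(prompt_emb, candidate_embs, budget, noise=1e-2):
    """Greedily pick data that most reduces posterior uncertainty about the prompt."""
    q = prompt_emb / np.linalg.norm(prompt_emb)
    X = candidate_embs / np.linalg.norm(candidate_embs, axis=1, keepdims=True)
    k_q = X @ q            # relevance: similarity of each candidate to the prompt
    K = X @ X.T            # similarity between candidates
    selected = []
    for _ in range(min(budget, len(X))):
        best, best_var = None, np.inf
        for i in range(len(X)):
            if i in selected:
                continue
            S = selected + [i]
            K_SS = K[np.ix_(S, S)] + noise * np.eye(len(S))
            k_qS = k_q[S]
            # posterior variance of the prompt after observing S
            var = 1.0 - k_qS @ np.linalg.solve(K_SS, k_qS)
            if var < best_var:
                best, best_var = i, var
        selected.append(best)
    return selected

Relevance enters through the prompt similarities k_q, and diversity falls out for free: a candidate that is nearly redundant with already selected data barely reduces the posterior variance any further.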
Unfortunately not as of now. We may also release Jupyter notebooks in the future, but this may take some time.
February 12, 2025 at 10:25 PM
I'm glad you find this resource useful, Maximilian!
February 11, 2025 at 3:26 PM
Noted. Thanks for the suggestion!
February 11, 2025 at 9:01 AM
Very glad to hear that they’ve been useful to you! :)
February 11, 2025 at 8:37 AM
table of contents:
February 11, 2025 at 8:35 AM
Huge thanks to the countless people who helped in the process of bringing this resource together!
February 11, 2025 at 8:20 AM
Preprint: arxiv.org/pdf/2410.08020
December 13, 2024 at 6:33 PM
December 11, 2024 at 11:14 PM