Lightnews — Scholar-powered news

Aditi Mavalankar

@aditimavalankar.bsky.social

1K followers 120 following 14 posts

Research Scientist at DeepMind working on Gemini Thinking

Posts Replies Media Videos

Aditi Mavalankar

@aditimavalankar.bsky.social

This was a really fun collaboration with my brilliant collaborators Hassan Mansoor, Zita Marinho, Masha Samsikova, and @schaul.bsky.social!

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

In addition to this, AuPair has been shown to work better across CodeForces difficulty levels and preserve coverage of problem categories from the training data distribution (see paper for more details).

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

4) the responses produced by the model have high diversity for the more performant models.

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

3) our approach exhibits strong scaling with inference-time compute, and even after 100+ LLM calls, we do not see plateauing in the scaling curve;

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

2) we observe strong generalisation across datasets and models, implying that the process of curating these examples can be performed once and the benefits in performance can be reaped multiple times;

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

Injecting different examples into the prompt has several benefits: 1) we see significant gains in performance compared to best-of-N and self-repair baselines on multiple model families: Gemini, Gemma, and GPT;

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

Fun fact: the title “AuPair” has multiple interpretations: at a higher level, it guides LLMs to better behaviour with a predefined set of examples; it is also a conjunction of Au, the chemical symbol for gold, and pair, i.e. golden pairs!

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

For the coding domain, a golden example pair, or AuPair, contains the problem description, an incorrect guess, and a fix that improves the solution.

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

Our submodular approach yields a fixed ordered set of complementary and useful AuPairs. For a budget of N LLM calls, the model is given N different prompts to answer the same question, where each prompt contains a different golden example.

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

The key idea underlying our approach is simple: our approach curates a fixed set of golden examples (AuPairs) that are provided as 1-shot in-context examples during inference. We show that using AuPairs significantly improves code repair performance and scales well with inference compute!

March 17, 2025 at 11:16 AM

Aditi Mavalankar

@aditimavalankar.bsky.social

😃😃😃

November 23, 2024 at 12:05 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news