Lightnews — Scholar-powered news

Allen Nie

@allenanie.bsky.social

2.3K followers 450 following 33 posts

Stanford CS PhD working on RL and LLMs with Emma Brunskill and Chris Piech. Co-creator of Trace. Prev @GoogleDeepMind @MicrosoftResearch

Specifically
- Offline RL
- In-context RL
- Causality

https://anie.me/about
Unverified hot takes go to this account


Allen Nie

@allenanie.bsky.social

People say Ching-an and I are indistinguishable…is that true 🤣

December 10, 2024 at 11:15 PM

Allen Nie

@allenanie.bsky.social

Come check us out near the Tesla Booth in West Exhibition Hall A 3-5pm! Come and claim your mug 🤣 we have an identity crisis — people keep thinking we are from IBM for some reason…

December 10, 2024 at 11:05 PM

Allen Nie

@allenanie.bsky.social

Once you build an agent with Trace, you can use ANY LLM optimizer you want. With the release of Trace 0.1.3, we introduce TextGrad (github.com/microsoft/Tr...) as an optimizer for the RL agent, along with OPRO and OptoPrime.

December 10, 2024 at 7:52 PM

Allen Nie

@allenanie.bsky.social

What enables Trace to be an RL-style agentic library? We use **Generative Optimization** techniques (LLM as an optimizer) to derive an analog to RL's policy gradient algorithm. The agent makes a move, receives feedback/reward, and updates its parameters.

December 10, 2024 at 7:52 PM
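
The act / feedback / update loop described in the post above can be sketched in a few lines of plain Python. This is only a toy illustration and does not use Trace's API. The names `Agent`, `environment`, `stub_optimizer`, and `train` are hypothetical, and a hand-written rule stands in for the LLM optimizer so the example runs without any model access.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Agent:
    # The "parameter" is a single integer guess. In an LLM agent it would
    # be a prompt or a piece of code instead. (Hypothetical toy class.)
    guess: int = 0

    def act(self) -> int:
        return self.guess


def environment(action: int, target: int = 7) -> str:
    """Return textual feedback instead of a numeric gradient."""
    if action < target:
        return "too low"
    if action > target:
        return "too high"
    return "correct"


def stub_optimizer(guess: int, feedback: str) -> int:
    """Stand-in for an LLM optimizer (assumption): maps the current
    parameter and the feedback text to a proposed new parameter."""
    if feedback == "too low":
        return guess + 1
    if feedback == "too high":
        return guess - 1
    return guess


def train(agent: Agent, steps: int = 20) -> List[str]:
    """Run the loop: act, receive feedback, update the parameter."""
    history: List[str] = []
    for _ in range(steps):
        action = agent.act()                # the agent makes a move
        feedback = environment(action)      # it receives feedback
        history.append(feedback)
        if feedback == "correct":
            break
        agent.guess = stub_optimizer(agent.guess, feedback)  # update
    return history


agent = Agent()
history = train(agent)
```

The point of the sketch is the shape of the loop: the update step consumes feedback text rather than a gradient, which is where an LLM-based optimizer would plug in.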

Allen Nie

@allenanie.bsky.social

In Trace, you define an Agent with declarative Python functions using Trace primitives. Trace provides flexible ways to mark what you want to change -- for example, we mark two prompts and two functions below as trainable.
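
The idea of marking some pieces of an agent as trainable can be sketched as follows. This is a plain-Python toy under stated assumptions, not Trace's actual primitives: `Param`, `ToyAgent`, `register`, and the prompt and function names are all hypothetical and chosen only to mirror the "two prompts and two functions" described in the post.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Param:
    # A value (prompt string or function) plus a flag saying whether
    # an optimizer is allowed to change it. (Hypothetical toy class.)
    value: Any
    trainable: bool = False


@dataclass
class ToyAgent:
    params: Dict[str, Param] = field(default_factory=dict)

    def register(self, name: str, value: Any, trainable: bool = False) -> None:
        self.params[name] = Param(value, trainable)

    def trainable_names(self) -> List[str]:
        # Only these entries would be handed to an optimizer.
        return sorted(n for n, p in self.params.items() if p.trainable)


def plan(observation: str) -> str:
    return f"plan for {observation}"


def act(plan_text: str) -> str:
    return f"act on {plan_text}"


def log(message: str) -> str:
    return message


agent = ToyAgent()
# Two prompts and two functions are marked trainable.
agent.register("system_prompt", "You are a helpful agent.", trainable=True)
agent.register("task_prompt", "Solve the task step by step.", trainable=True)
agent.register("plan", plan, trainable=True)
agent.register("act", act, trainable=True)
# A fixed helper stays outside the optimizer's reach.
agent.register("log", log)

print(agent.trainable_names())
```

Keeping the trainable flag next to each value makes it explicit which parts of the agent an optimizer may rewrite and which stay fixed.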