Allen Nie
allenanie.bsky.social
Stanford CS PhD working on RL and LLMs with Emma Brunskill and Chris Piech. Co-creator of Trace. Prev @GoogleDeepMind @MicrosoftResearch

Specifically
- Offline RL
- In-context RL
- Causality

https://anie.me/about
Unverified hot takes go to this account
People say Ching-an and I are indistinguishable… is that true? 🤣
December 10, 2024 at 11:15 PM
Come check us out near the Tesla Booth in West Exhibition Hall A, 3-5pm! Come and claim your mug 🤣 We have an identity crisis: people keep thinking we are from IBM for some reason…
December 10, 2024 at 11:05 PM
Once you build an agent with Trace, you can use ANY LLM optimizer you want. With the release of Trace 0.1.3, we introduce TextGrad (github.com/microsoft/Tr...) as an optimizer for the RL agent, along with OPRO and OptoPrime.
December 10, 2024 at 7:52 PM
What enables Trace to be an RL-style agentic library? We use **Generative Optimization** techniques (LLM as an optimizer) to derive an analog to RL's policy gradient algorithm. The agent makes a move, receives feedback/reward, and updates its parameters.
December 10, 2024 at 7:52 PM
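A rough sketch of the loop described in the post above (act, receive feedback, update parameters), in plain Python. This is not the Trace API: `Agent`, `environment`, `toy_optimizer`, and `train` are hypothetical names made up for illustration, and a simple rule-based function stands in for the LLM optimizer. The optimizer is passed in as an argument, so a different one could be swapped in without touching the agent.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Agent:
    # Hypothetical trainable parameter. In Trace this role is played by
    # whatever is marked trainable (e.g. a prompt or a function).
    guess: int = 0

    def act(self) -> int:
        """Make a move using the current parameter."""
        return self.guess


def environment(action: int, target: int = 7) -> str:
    """Return text feedback, playing the role of the reward signal."""
    if action < target:
        return "too low"
    if action > target:
        return "too high"
    return "correct"


def toy_optimizer(param: int, feedback: str) -> int:
    """Stand-in for an LLM optimizer: turn feedback into a new parameter."""
    if feedback == "too low":
        return param + 1
    if feedback == "too high":
        return param - 1
    return param


def train(agent: Agent,
          optimizer: Callable[[int, str], int],
          steps: int = 20) -> List[str]:
    """Run the act -> feedback -> update loop and record the feedback."""
    history = []
    for _ in range(steps):
        action = agent.act()                 # the agent makes a move
        feedback = environment(action)       # it receives feedback
        history.append(feedback)
        if feedback == "correct":
            break
        agent.guess = optimizer(agent.guess, feedback)  # parameters update
    return history


agent = Agent()
history = train(agent, toy_optimizer)
print(agent.guess, history[-1])
```

Any callable with the same `(param, feedback) -> param` shape can be passed to `train` in place of `toy_optimizer`.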
In Trace, you define an Agent with declarative Python functions using Trace primitives. Trace provides flexible ways to mark what you want to change -- for example, we mark two prompts and two functions below as trainable.
December 10, 2024 at 7:52 PM
This talk is just fascinating: “o1 has an effective way to scale compute at inference time”, but you just can’t tell us exactly what it is 🤣
November 19, 2024 at 12:29 AM
Noam Brown giving a talk on o1 at Stanford right now 🔥
November 19, 2024 at 12:06 AM