Prithviraj "Raj" Ammanabrolu
rajammanabrolu.bsky.social
AI, RL, NLP, Games
Asst Prof at UCSD
Research Scientist at Nvidia
Lab: http://pearls.ucsd.edu
Personal: prithvirajva.com
Reposted by Prithviraj "Raj" Ammanabrolu
I am extremely honored and humbled to have been awarded a Test-of-Time award for my 2005 paper "From Linear Story Generation to Branching Story Graphs" with R. Michael Young
November 12, 2025 at 4:08 PM
I've done a few versions of this talk but this is the first that's been recorded publicly, thanks to IVADO Montreal

A good overview of things my lab has been up to in the last year or so at least in balancing safety/capabilities of (embodied) AI Agents

www.youtube.com/watch?v=S-kV...
Navigating the Safety-Capability Spectrum when Teaching Agents with Feedback -Prithviraj Ammanabrolu
YouTube video by IVADO
November 3, 2025 at 11:04 PM
Reposted by Prithviraj "Raj" Ammanabrolu
🔥Excited to share our new work: "A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning"!

We study what actually works for agentic multi-turn RL with varying 🌎Environment, 🤖Policy, and ⭐Reward.

We conduct various ablations and empirical analysis on 🧩TextWorld, 🧙ALFWorld, and 🧑‍💻SWE-Gym.
October 26, 2025 at 9:36 PM
I'll be at #CoLM2025 and the IVADO agents workshop right before in Montreal. My students will be presenting two papers in the main conf. I'll also do a ws keynote where I'll talk about some of our latest. Come by and say hi next week!
October 1, 2025 at 5:21 PM
I'm probably mostly going to stop posting on this site. There's close to no engagement and it's not worth the effort to cross post for the amount of time that takes. Find me elsewhere / email me
June 29, 2025 at 8:54 PM
I recently left Mosaic/Databricks Research. It's been a ride building out the RL team from <4 ppl to 20+ across two companies & an acquisition + figuring out RL as a Service in prod. Mosaic had insane talent density

Some "relaxation" while I put out Prof fires for a smol bit then new adventures!
June 19, 2025 at 3:48 PM
Reposted by Prithviraj "Raj" Ammanabrolu
If you work in the intersection of NLP and games/narrative, then this workshop is for you! wordplay-workshop.github.io/cfp/

Organized by the amazing @laramartin.net and @rajammanabrolu.bsky.social (among others)
Official website for the Wordplay Workshop at EMNLP 2025. Exploring interactive narratives, text-adventure games, and AI agents in language-based environments. Join us in Suzhou, China, November 5th-9...
June 17, 2025 at 4:47 PM
The thing that feels so off about the core tech world is that every convo is very transactional. Maybe true elsewhere too. "Oh you're an expert in RL, can you answer questions about my new startup?"

Every single (Bay) party. No I do not want to consult. I just wanna hang out.
June 16, 2025 at 5:54 AM
Of all the labeling startups out there to acquihire, this was... an interesting choice. Says a lot actually
June 10, 2025 at 5:34 PM
. @bosungkim.bsky.social will be at #CVPR2025 in Nashville this week to present this and just generally talk about scaling memory for embodied agents!

Catch her at the poster sessions and also the Foundation Models meets Embodied Agents Workshop on Wed
"Foundation" models for embodied agents are all the rage but how to actually do complex looong context reasoning? Can we scale Beyond Needle(s) in the (Embodied) Haystack?

∞-THOR is an infinite len sim framework + guide on (new) architectures/training methods for VLA models
June 9, 2025 at 4:56 PM
I've heard this personally from multiple PMs at AI companies. Students are one of the biggest demographics and they need to "break in" and have even more usage to improve their metrics. Classic corporate economic incentives
AI companies in the US gave access to their systems to students for free during college exams

China disabled access to AI systems during nationwide college exams www.theverge.com/news/682737/...

Feel free to draw your own conclusions
China shuts down AI tools during nationwide college exams
New age problems require new age solutions.
June 9, 2025 at 3:24 PM
Tis the era of bringing back every AI benchmark ever but this time by the LLM people and for the LLMs
June 6, 2025 at 10:57 PM
Had a fun little visit to Cambridge LTL where I talked about a bunch of my lab's latest papers including some still not public with the key takeaway that "RL can absolutely learn new things and is not just resurfacing knowledge"
talks.cam.ac.uk/show/archive...
talks.cam : Language Technology Lab Seminars
June 5, 2025 at 5:51 PM
Interesting tidbit from UCSD's Victor Shih on a podcast talking about Chinese AGI efforts is that Deepseek is good at Chinese govt doc understanding cause that's what affects stock prices most and DS is a hedge fund.
www.youtube.com/watch?v=b1Te...
Xi Jinping’s paranoid approach to AGI, debt crisis, & Politburo politics — Victor Shih
YouTube video by Dwarkesh Patel
June 3, 2025 at 4:05 AM
Looks like Gemini gets AIR 6 in #JEE2025 with a score of 323

Only 5 high schoolers in all India do better than an LLM in the single most important exam of their lives to get into the IITs

The legacy edu selection systems are now worse than useless
June 2, 2025 at 6:54 AM
I get prepping for worst case scenarios but a lot of AI Safety debates I somehow end up in these days boil down to "assume you have Machine God in a box, now tell me how to align it"

I could rant for hours but seriously y'all this isn't productive
June 1, 2025 at 1:47 AM
"Foundation" models for embodied agents are all the rage but how to actually do complex looong context reasoning? Can we scale Beyond Needle(s) in the (Embodied) Haystack?

∞-THOR is an infinite len sim framework + guide on (new) architectures/training methods for VLA models
May 23, 2025 at 4:08 PM
Reposted by Prithviraj "Raj" Ammanabrolu
I'm presenting today on AI Agents vs Agency Law at the 12th Governance of Emerging Technologies and Science Conference events.asucollegeoflaw.com/gets/

If agency law were to be applied to AI Agents (mostly in ecommerce settings), where does current AI align with the laws and where does it not?
May 19, 2025 at 6:54 PM
I like the Ultra Scale Playbook from Huggingface and give it to my MS/first year PhD students to read as a prereq huggingface.co/spaces/nanot...

Is there a "RLSys" version of this on scaling RL+LLM training? If not + there's OSS community interest, I'll prob write one?
May 15, 2025 at 4:39 PM
that's what's hot
May 14, 2025 at 12:31 AM
We now have a whole YouTube video explaining our MINDcraft paper, check it out!
youtu.be/MeEcxh9St24
May 10, 2025 at 8:08 PM
The part of the Prof job I'm enjoying by far the most right now is teaching actually
May 8, 2025 at 9:47 PM
This is reasonably written and echoes many of my own fears. The upside of AI is too huge to pass up but also there's a high chance that the vast majority of humanity is on track to becoming economically obsolete without really any transition plan www.theguardian.com/books/2025/m...
May 7, 2025 at 7:35 PM
What's with these arguments over whether X or Y or whatever was the first LLM RL library? These all came in the last 3 months

We wrote multiturn RL4LMs like 3+ years ago github.com/allenai/RL4LMs

There were other simple versions even before. ML ppl approaching goldfish memory
GitHub - allenai/RL4LMs: A modular RL library to fine-tune language models to human preferences
May 7, 2025 at 3:20 PM
Cool looking paper from @markriedl.bsky.social's lab arxiv.org/abs/2505.03547 making significant improvements over our arxiv.org/abs/2001.10161 from 5+ years ago
May 7, 2025 at 5:10 AM