Lightnews — Scholar-powered news

Reposted by Prithviraj "Raj" Ammanabrolu

Mark Riedl

@markriedl.bsky.social

My former MS student Chris Cui (now PhD student with @rajammanabrolu.bsky.social)motivates Text Adventure Games as testbeds for reasoning. Provides a new benchmark suite of text games. Observes that Zork still kicks LLM’s butts despite training on walkthroughs arxiv.org/abs/2504.14128

TALES: Text Adventure Learning Environment Suite

Reasoning is an essential skill to enable Large Language Models (LLMs) to interact with the world. As tasks become more complex, they demand increasingly sophisticated and diverse reasoning capabiliti...

arxiv.org

December 3, 2025 at 12:42 AM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

My entire PEARLS Lab, and many NVIDIA colleagues, will be at #neurips2025 to chat about their latest. Some papers accepted to the conf are already outdated so just reach out to. Thread 🧵

November 24, 2025 at 10:10 PM

Reposted by Prithviraj "Raj" Ammanabrolu

Mark Riedl

@markriedl.bsky.social

I am extremely honored and humbled to have been awarded a Test-of-Time award for my 2005 paper "From Linear Story Generation to Branching Story Graphs" with R. Michael Young

November 12, 2025 at 4:08 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

I've done a few versions of this talk but this is the first that's been recorded publicly, thanks to IVADO Montreal

A good overview of things my lab has been up to in the last year or so at least in balancing safety/capabilities of (embodied) AI Agents

www.youtube.com/watch?v=S-kV...

Navigating the Safety-Capability Spectrum when Teaching Agents with Feedback -Prithviraj Ammanabrolu

YouTube video by IVADO

www.youtube.com

November 3, 2025 at 11:04 PM

Reposted by Prithviraj "Raj" Ammanabrolu

Ruiyi Wang

@ruiyiwang.bsky.social

🔥Excited to share our new work: "A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning"!

We study what actually works for agentic multi-turn RL with varying 🌎Environment, 🤖Policy, and ⭐Reward.

We conduct various ablations and empirical analysis on 🧩TextWorld, 🧙ALFWorld, and 🧑‍💻SWE-Gym.

October 26, 2025 at 9:36 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

I'll be at #CoLM2025 and the IVADO agents workshop right before in Montreal. My students will be presenting two papers in the main conf. I'll also do a ws keynote where I'll talk about some of our latest. Come by and say hi next week!

October 1, 2025 at 5:21 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

I'm probably mostly going to stop posting on this site. There's close to no engagement and it's not worth the effort to cross post for the amount of time that takes. Find me elsewhere / email me

June 29, 2025 at 8:54 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

I recently left Mosaic/Databricks Research. It's been a ride building out the RL team from <4 ppl to 20+ across two companies & acquisition +figuring out RL as a Service in prod. Mosaic had insane talent density

Some "relaxation" while I put out Prof fires for a smol bit then new adventures!

June 19, 2025 at 3:48 PM

Reposted by Prithviraj "Raj" Ammanabrolu

Mark Riedl

@markriedl.bsky.social

If you work in the intersection of NLP and games/narrative, then this workshop is for you! wordplay-workshop.github.io/cfp/

Organized by the amazing @laramartin.net and @rajammanabrolu.bsky.social (among others)

/call_for_papers

Official website for the Wordplay Workshop at EMNLP 2025. Exploring interactive narratives, text-adventure games, and AI agents in language-based environments. Join us in Suzhou, China, November 5th-9...

wordplay-workshop.github.io

June 17, 2025 at 4:47 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

The thing that feels so off about the core tech world is that every convo is very transactional. Maybe true elsewhere too. "Oh you're an expert in RL, can you answer questions about my new startup?"

Every single (Bay) party. No I do not want to consult. I just wanna hang out.

June 16, 2025 at 5:54 AM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

Of all the labeling startups out there to acquihire, this was... an interesting choice. Says a lot actually

June 10, 2025 at 5:34 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

. @bosungkim.bsky.social will be at #CVPR2025 in Nashville this week to present this and just generally talk about scaling memory for embodied agents!

Catch her at the poster sessions and also the Foundation Models meets Embodied Agents Workshop on Wed

Prithviraj "Raj" Ammanabrolu @rajammanabrolu.bsky.social · May 23

"Foundation" models for embodied agents are all the rage but how to actually do complex looong context reasoning? Can we scale Beyond Needle(s) in the (Embodied) Haystack?

∞-THOR is an infinite len sim framework + guide on (new) architectures/training methods for VLA models

June 9, 2025 at 4:56 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

I've heard this personally from multiple PMs at AI companies. Students are one of the biggest demographics and they need to "break in" and have even more usage to improve their metrics. Classic corporate economic incentives

Mark Riedl @markriedl.bsky.social · Jun 9

AI companies in the US gave access to their systems to students for free during college exams

China disabled access to AI systems during nationwide college exams www.theverge.com/news/682737/...

Feel free to draw your own conclusions

China shuts down AI tools during nationwide college exams

New age problems require new age solutions.

www.theverge.com

June 9, 2025 at 3:24 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

Tis the era of bringing back every AI benchmark ever but this time by the LLM people and for the LLMs

June 6, 2025 at 10:57 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

Had a fun little visit to Cambridge LTL where I talked about a bunch of my lab's latest papers including some still not public with the key takeaway that "RL can absolutely learn new things and is not just resurfacing knowledge"
talks.cam.ac.uk/show/archive...

talks.cam : Language Technology Lab Seminars

talks.cam.ac.uk

June 5, 2025 at 5:51 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

Interesting tidbit from UCSD's Victor Shih on a podcast talking about Chinese AGI efforts is that Deepseek is good at Chinese govt doc understanding cause that's what affects stock prices most and DS is a hedge fund.
www.youtube.com/watch?v=b1Te...

Xi Jinping’s paranoid approach to AGI, debt crisis, & Politburo politics — Victor Shih

YouTube video by Dwarkesh Patel

youtu.be

June 3, 2025 at 4:05 AM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

Looks like Gemini gets AIR 6 in #JEE2025 with a score of 323

Only 5 highschoolers in all India do better than an LLM in the single most important exam of their to get into the IITs

The legacy edu selection systems are now worse than useless

June 2, 2025 at 6:54 AM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

I get prepping for worst case scenarios but a lot of AI Safety debates I somehow end up these days in boil down to "assume you have Machine God in a box, now tell me how to align it"

I could rant for hours but seriously y'all this isn't productive

June 1, 2025 at 1:47 AM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

"Foundation" models for embodied agents are all the rage but how to actually do complex looong context reasoning? Can we scale Beyond Needle(s) in the (Embodied) Haystack?

∞-THOR is an infinite len sim framework + guide on (new) architectures/training methods for VLA models

May 23, 2025 at 4:08 PM

Reposted by Prithviraj "Raj" Ammanabrolu

Mark Riedl

@markriedl.bsky.social

I'm presenting today on AI Agents vs Agency Law at the 12th Governance of Emerging Technologies and Science Conference events.asucollegeoflaw.com/gets/

If agency law were to be applied to AI Agents (mostly in ecommerce settings), where does current AI align with the laws and where does it not?

May 19, 2025 at 6:54 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

I like the Ultra Scale Playbook from Huggingface and give it to my MS/first year PhD students to read as a prereq huggingface.co/spaces/nanot...

Is there a "RLSys" version of this on scaling RL+LLM training? If not + there's OSS community interest, I'll prob write one?

May 15, 2025 at 4:39 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

that's what's hot

May 14, 2025 at 12:31 AM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

We now have a whole YouTube video explaining our MINDcraft paper, check it out!
youtu.be/MeEcxh9St24

May 10, 2025 at 8:08 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

The part of the Prof job I'm enjoying by far the most right now is teaching actually

May 8, 2025 at 9:47 PM

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

This is reasonably written and echoes many of my own fears. The upside of AI is too huge to pass up but also there's a high chance that the vast majority of humanity is on track to becoming economically obsolete without really any transition plan www.theguardian.com/books/2025/m...

May 7, 2025 at 7:35 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news