Research Scientist at Nvidia
Lab: http://pearls.ucsd.edu
Personal: prithvirajva.com
A good overview of things my lab has been up to in the last year or so at least in balancing safety/capabilities of (embodied) AI Agents
www.youtube.com/watch?v=S-kV...
We study what actually works for agentic multi-turn RL with varying 🌎Environment, 🤖Policy, and ⭐Reward.
We conduct various ablations and empirical analysis on 🧩TextWorld, 🧙ALFWorld, and 🧑💻SWE-Gym.
Some "relaxation" while I put out Prof fires for a smol bit then new adventures!
Organized by the amazing @laramartin.net and @rajammanabrolu.bsky.social (among others)
Every single (Bay) party. No I do not want to consult. I just wanna hang out.
Catch her at the poster sessions and also the Foundation Models meets Embodied Agents Workshop on Wed
∞-THOR is an infinite len sim framework + guide on (new) architectures/training methods for VLA models
China disabled access to AI systems during nationwide college exams www.theverge.com/news/682737/...
Feel free to draw your own conclusions
talks.cam.ac.uk/show/archive...
www.youtube.com/watch?v=b1Te...
Only 5 high schoolers in all India do better than an LLM in the single most important exam of their lives to get into the IITs
The legacy edu selection systems are now worse than useless
I could rant for hours but seriously y'all this isn't productive
If agency law were to be applied to AI Agents (mostly in ecommerce settings), where does current AI align with the laws and where does it not?
Is there a "RLSys" version of this on scaling RL+LLM training? If not + there's OSS community interest, I'll prob write one?
youtu.be/MeEcxh9St24
We wrote multiturn RL4LMs like 3+ years ago github.com/allenai/RL4LMs
There were other simple versions even before. ML ppl approaching goldfish memory