Jenny Shen
banner
jennyshen056.bsky.social
Jenny Shen
@jennyshen056.bsky.social
1st year CS PhD student @UCSD
🕒 PO’clock continues: meet IRPO! We rethink RLHF for retrieval—an NDCG-weighted DPO objective that teaches LLMs to use long doc lists faithfully & efficiently. Dive in 🚀 arxiv.org/abs/2504.15477
It's *PO'clock, this time IRPO In-Context Ranking Policy Optimization!

An RL algorithm inspired by trad retrieval that trains agents to more effectively use lists of documents in context for better multi-hop {QA, agentic tasks, and more}!
April 23, 2025 at 4:39 PM
Reposted by Jenny Shen
Introducing TALES - Text Adventure Learning Environment Suite

A benchmark of a few hundred text envs: science experiments and embodied cooking to solving murder mysteries. We test over 30 of the best LLM agents and pinpoint failure modes +how to improve

👨‍💻pip install tale-suite
April 22, 2025 at 6:43 PM