Yijia Shao
echoshao8899.bsky.social
CS PhD student @StanfordNLP
https://cs.stanford.edu/~shaoyj/
It’s also my honor to have economists from @stanforddel.bsky.social join this project. As headlines declare 2025 the year of agents, we believe AI agent development is not solely a technical matter. Thanks Humishka, Yucheng, Jiaxin, David, @erikbryn.bsky.social, and @diyiyang.bsky.social!
June 12, 2025 at 4:41 PM
This project would not have been possible without the thoughtful participation of the 1,500+ domain workers. Many of those we contacted cold on LinkedIn thanked us for amplifying their voices—but truly, the honor is ours.
June 12, 2025 at 4:39 PM
🚀 We’re making the WORKBank database public and building an interactive data explorer!
👇 To get notified when it’s live or request an occupation we missed (see Appendix D.1 in our paper), drop a comment below.

forms.gle/ocDWGhRDS8y6...
WORKBank Database: Feedback & Interest Form
In our paper, we develop a novel auditing framework to assess which occupational tasks workers want AI agents to automate or augment, and how those desires align with the current technological capabil...
forms.gle
June 12, 2025 at 4:39 PM
Mapping tasks to skills, and comparing currently high-paid skills with required human agency as AI agents enter the workforce, we see that core human strengths move from data processing toward interpersonal and organizational skills.

Read our blog post: futureofwork.saltlab.stanford.edu
June 12, 2025 at 4:39 PM
The study also reveals insights on the future of HUMAN work.

Mapping the Human Agency Scale across jobs shows which roles AI can’t replace. Currently, Mathematicians and Aerospace Engineers are the only occupations where most AI expert ratings fall into H5 (Human Involvement Essential).
June 12, 2025 at 4:39 PM
Despite the buzz around "AI software engineers," "AI journalists," etc., our Human Agency Scale uncovers task-level nuances within every occupation.

We suggest that AI agent R&D and products account for these nuances to support more responsible, higher-quality adoption.
June 12, 2025 at 4:36 PM
Workers generally prefer higher levels of human agency, hinting at friction as AI capabilities advance.

From transcript analysis, the top collaboration model envisioned by workers is “role-based” AI support (23.1%) - utilizing AI systems that embody specific roles.
June 12, 2025 at 4:35 PM
The impact of AI agents on work isn’t just a binary “automate or not.”

We introduce the Human Agency Scale: a 5-level scale to capture the spectrum between automation and augmentation, where technology complements and enhances human capabilities.
June 12, 2025 at 4:35 PM
Jointly considering worker desire and technological capability allows us to classify tasks into four zones to guide AI agent deployment and development.

Alarmingly, 41.0% of YC companies are mapped to the Low Priority and Automation “Red Light” Zones.
June 12, 2025 at 4:35 PM
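
The four-zone idea in the post above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's implementation: the post names only the Low Priority and Automation “Red Light” zones, so the other two zone labels, the 1-5 rating scale for capability, the midpoint cut-off of 3, and the `TaskRating` and `classify_zone` names are all hypothetical choices made here.

```python
from dataclasses import dataclass

# Assumed cut-off: ratings above the midpoint of a 1-5 scale count as "high".
MIDPOINT = 3.0


@dataclass(frozen=True)
class TaskRating:
    """Hypothetical container for one task's averaged ratings."""
    name: str
    worker_desire: float   # how much workers want the task automated (1-5)
    ai_capability: float   # how capable experts judge current AI to be (1-5)


def classify_zone(task: TaskRating, midpoint: float = MIDPOINT) -> str:
    """Place a task in one of four zones from desire and capability."""
    wants = task.worker_desire > midpoint
    capable = task.ai_capability > midpoint
    if wants and capable:
        return 'Automation "Green Light" Zone'  # assumed label
    if capable:
        return 'Automation "Red Light" Zone'    # capable, but workers do not want it
    if wants:
        return "R&D Opportunity Zone"           # assumed label
    return "Low Priority Zone"


if __name__ == "__main__":
    example = TaskRating("schedule appointments", worker_desire=4.2, ai_capability=2.5)
    print(example.name, "->", classify_zone(example))
```

With these assumptions, a task that workers want automated but that current agents handle poorly lands in the high-desire, low-capability quadrant, which is where further R&D would matter most.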
We rank tasks by worker desire for automation. 46.1% of tasks receive a positive attitude (>3/5), with notable variation across sectors.

Transcript analysis reveals top concerns: (1) lack of trust (45%), (2) fear of job replacement (23%), (3) loss of human touch (16.3%)
June 12, 2025 at 4:34 PM
In our new paper: arxiv.org/abs/2506.06576

We collaborate with economists to develop an audio-enhanced auditing framework.

- 1500 domain workers from 104 occupations shared their desires.
- 52 AI agent researchers & developers evaluated today’s technological capabilities.
June 12, 2025 at 4:34 PM
Hi, I found your work very interesting and hope to have a chance to reach out. Is there a way to contact you? I tried DMs on this site and Reddit, but both failed. Thank you so much for your consideration!
cs.stanford.edu
February 14, 2025 at 8:51 PM
Thanks Vinay, Yucheng, John & @diyiyang.bsky.social for the amazing collaboration, and to all the friends—met or yet to be met—who shared suggestions for the platform release!

The release would not have been possible without the generous support from US Navy Research, NSF, Google, and Microsoft Azure!
February 12, 2025 at 7:27 PM
Try it out today at cogym.saltlab.stanford.edu!
Read our preprint to learn more details: arxiv.org/abs/2412.15701
February 12, 2025 at 7:25 PM
You can request official support for a new task or vote on existing task requests through our GitHub repository!
github.com/SALT-NLP/col...
February 12, 2025 at 7:25 PM
We welcome contributions of new task environments and agents.

Contributed agents will be deployed on our platform to study their interaction dynamics with real users. A great chance to distribute your agent in the wild!
February 12, 2025 at 7:25 PM
Collaborative Gym is now released at github.com/SALT-NLP/col....

Besides backend primitives, we also open-source our UI to facilitate human-agent interaction research. The UI echoes the design of OpenAI Canvas, with a side-by-side chat panel and a shared workspace for the human and the agent, but can do more!
GitHub - SALT-NLP/collaborative-gym: Framework and toolkits for building and evaluating collaborative agents that can work together with humans.
github.com
February 12, 2025 at 7:25 PM
Hi @narphorium.bsky.social, thank you! I can finally reply: our team first wanted to check whether the taxonomy could be used to examine other agentic systems (e.g., coding agents). It's indeed very useful. You can check out my recent blog post if interested: cs.stanford.edu/people/shaoy...
Hands-on Experience with Devin: Reflections from a Person Building and Evaluating Agentic Systems
Why I’m interested in making agentic systems collaborative.
cs.stanford.edu
January 26, 2025 at 11:26 PM