PhD at MIT CSAIL '23, Harvard '16, former Google APM. Dog mom to NSDTR Ducki.
Lots of progress in RL research over last 10 years, but too much performance-driven => overfitting to benchmarks (like the ALE).
1⃣ Let's advance science of RL
2⃣ Let's be explicit about how benchmarks map to formalism
1/X
With @thakurdhanaraj.bsky.social @alicetiara.bsky.social and @mkgerchick.bsky.social
To register:
cdt.org/event/advoca...
This work won an ✨outstanding paper✨ award at RLC :D
Link: cohere.com/events/coher...
We must celebrate and share our wins, too, to build and maintain momentum.
appropriations.house.gov/sites/evo-su...
I've followed Lisa Cook for a while since she has developed the leading perspective on AI over at the Fed: www.federalreserve.gov/newsevents/s...
(please reshare)
We seek applicants with experience in language modeling who are excited about high-impact applications in the health and social sciences!
More info in thread
1/3
Healthcare, Customer Service, & Logistics bosses are buying AI products that set compensation structures & wages using real time data. @wilneida.bsky.social & I did an audit & here are our findings
equitablegrowth.org/how-artifici...
“Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners”, led by Callie Muslimani (Alberta)
&
“Goals vs. Rewards: Towards a Comparative Study of Objective Specification Mechanisms”, led by Septia Rani (CSU).
Come to our posters today!
Applications received by September 15 will receive full consideration.
My great hope for the nation is that we can find shared priorities and get all parties to work in the interests of the people. And maybe we can fix that ridiculous $2000 asset limit sometime???
It's especially dangerous for consequential domains like medicine! arxiv.org/pdf/2502.14898
Is everything broken rn? Yes. Will it stay broken? That's on us.
www.nature.com/articles/d41...
arxiv.org/abs/2506.07962
But is there any evidence for that?
In our latest work w/ David Danks & @berkustun, we show explanations fail to help people, even under optimal conditions.
PDF shorturl.at/yaRua