Daniel Jiang
danielrjiang.bsky.social
Daniel Jiang
@danielrjiang.bsky.social
Research Scientist Meta, Adjunct Professor at University of Pittsburgh. Interested in reinforcement learning, approximate DP, adaptive experimentation, Bayesian optimization, & operations research. http://danielrjiang.github.io

📍Chicago, IL
Topics of interest include offline RL, post-training large language models with RLHF, and long-term recommendation systems. If you’re interested, please email me and/or apply here: www.metacareers.com/jobs/1142270...
Postdoctoral Researcher, Monetization (PhD)
Meta's mission is to build the future of human connection and the technology that makes it possible.
www.metacareers.com
March 17, 2025 at 1:59 PM
There’s one from ASOS.com that provides A/B test data over time (across many experiments, each with several arms).

Dataset: osf.io/64jsb/

Paper: arxiv.org/abs/2111.10198

We used it in a paper to benchmark an AE method. But I’d also love to know of other alternatives out there.
ASOS Digital Experiments Dataset
A novel dataset that can support the end-to-end design and running of Online Controlled Experiments (OCE) with adaptive stopping. Hosted on the Open Science Framework
osf.io
February 21, 2025 at 5:27 AM
I know one of the organizers is @eugenevinitsky.bsky.social. They did a great job and organized a very enjoyable conference.
December 10, 2024 at 8:18 AM
Reposted by Daniel Jiang
I collected some folk knowledge for RL and stuck them in my lecture slides a couple weeks back: web.mit.edu/6.7920/www/l... See Appendix B... sorry, I know, appendix of a lecture slide deck is not the best for discovery. Suggestions very welcome.
web.mit.edu
November 27, 2024 at 1:36 PM