banner
samuelschmidgall.bsky.social
@samuelschmidgall.bsky.social
PhD at Johns Hopkins University and Researcher at Google Deepmind working on LLM agents
🎉Read the preprint: agentrxiv.github.io
Try out AgentRxiv: github.com/SamuelSchmid...
Let’s explore how agents can accelerate research—together.
🧵8/8
March 24, 2025 at 2:25 PM
✨ In parallel experiments with 3 independent labs sharing pre-prints through AgentRxiv, the best method achieved 79.8% accuracy—a 13.7% relative improvement—while reaching key milestones faster than in sequential experiments.
🧵6/8
March 24, 2025 at 2:25 PM
🏥 We also wondered how well the methods our agents discovered perform on out-of-domain benchmarks (MMLU-Pro, GPQA, & MedQA) and with five other language models. We find the top performing algorithm SDA improves across these benchmarks on average by 3.3%.
🧵5/8
March 24, 2025 at 2:25 PM
🥇We perform experiments where agents are asked to develop new reasoning techniques on MATH-500. We find that when agents are given access to previous research, accuracy improved from 70.2% to 78.2% – an 11.4% relative improvement over the gpt-4o mini baseline and 9.7% over gpt-4o mini with CoT.
🧵4/8
March 24, 2025 at 2:25 PM
To address this, we introduce AgentRxiv—a framework that lets LLM agent laboratories upload and retrieve reports from a shared preprint server in order to collaborate, share insights, and iteratively build on each other’s research.
🧵3/8
March 24, 2025 at 2:25 PM
🚀🌐Introducing AgentRxiv: a framework where autonomous research agents can upload, retrieve, and build on each other’s research.

AgentRxiv takes your research direction and progressively outputs research papers and code repositories, building on its previous work with each new paper!
🧵
March 24, 2025 at 2:25 PM
Agent Laboratory consists of three primary phases that guide the research process: (1) Literature Review, (2) Experimentation, and (3) Report Writing. During each phase, LLM agents collaborative, integrating tools like arXiv, Hugging Face, Python, and LaTeX.
February 27, 2025 at 5:25 PM
🚀🔬 Introducing Agent Laboratory: an assistant for automating machine learning research

Agent Laboratory takes your research ideas and outputs a research paper and code repository, allowing you to allocate more effort toward ideation rather than low-level coding and writing [Re-sharing from X]
February 27, 2025 at 5:25 PM
🔥 Really great overview of Agent Laboratory by Two Minute Papers

video: youtu.be/2ky50XT0Nb0?...
agent lab webpage: agentlaboratory.github.io
February 27, 2025 at 5:22 PM
I'm excited to start as a Student Researcher at Google DeepMind working on medical AI!
December 27, 2024 at 11:07 PM