Now: @jhuclsp @jhucompsci
Past: @allen_ai @uwnlp @Penn @cogcomp @Illinois_Alma @MSFTResearch
x.com/jackjingyuz...
x.com/jackjingyuz...
We studied agentic tool recovery—when your LLM selects a set of tools to execute, but one turns out to be unavailable or incorrect.
We studied agentic tool recovery—when your LLM selects a set of tools to execute, but one turns out to be unavailable or incorrect.
to help shape the future of biomedical AI.
⚔️ Check it out: biomedarena.ai
to help shape the future of biomedical AI.
⚔️ Check it out: biomedarena.ai
Niyati Bafna @niyatibafna.bsky.social 's recent work introduces the **translation barrier hypothesis**, a framework for understanding multilingual model behavior.
Paper: huggingface.co/papers/2506...
Niyati Bafna @niyatibafna.bsky.social 's recent work introduces the **translation barrier hypothesis**, a framework for understanding multilingual model behavior.
Paper: huggingface.co/papers/2506...
📄 Preprint: arxiv.org/pdf/2505.20321
📄 Preprint: arxiv.org/pdf/2505.20321
Apparently, ChatGPT has a better grasp than @nyuniversity.
x.com/nebedaay/st...
Apparently, ChatGPT has a better grasp than @nyuniversity.
x.com/nebedaay/st...
TL;DR: We propose BloomScrub a framework to certifiably remove long verbatim quotes to reduce the risk of copyright violations.
TL;DR: We propose BloomScrub a framework to certifiably remove long verbatim quotes to reduce the risk of copyright violations.
Answering this requires evaluating *evaluate* whether LLMs can provide critiques that are *grounded* in the context of science papers.
See @JiefuOu's dataset which has a collection of paper claims and their critiques: arxiv.org/pdf/2503.21717
Answering this requires evaluating *evaluate* whether LLMs can provide critiques that are *grounded* in the context of science papers.
See @JiefuOu's dataset which has a collection of paper claims and their critiques: arxiv.org/pdf/2503.21717
self-supervised.cs.jhu.edu/sp2025/
These resources may be helpful if you're:
(1) looking for slides to teach about LLMs, or
(2) interested in diving deeper into the field.
self-supervised.cs.jhu.edu/sp2025/
These resources may be helpful if you're:
(1) looking for slides to teach about LLMs, or
(2) interested in diving deeper into the field.
I'd love to chat about RL and its interpretability, data influence for post-training, CogSci for LLM. Feel free to reach out and let's have some coffee together ☕ !
TLDR— Proposed a framework for benchmarking LLMs' 𝒄𝒓𝒆𝒂𝒕𝒊𝒗𝒊𝒕𝒚.
x.com/Yining__Lu/...
I'd love to chat about RL and its interpretability, data influence for post-training, CogSci for LLM. Feel free to reach out and let's have some coffee together ☕ !
But what if you want a bird’s-eye view of science, or to identify over- and under-explored areas?
We introduce 🔺Science Hierarchography🔺, the goal of organizing science papers into conceptual hierarchies.
arxiv.org/abs/2504.13834
But what if you want a bird’s-eye view of science, or to identify over- and under-explored areas?
We introduce 🔺Science Hierarchography🔺, the goal of organizing science papers into conceptual hierarchies.
arxiv.org/abs/2504.13834
(1) "GenEx: Generating an Explorable World"
openreview.net/pdf?id=8NlU...
TLDR— Physical exploration can be expensive, and even impossible. Our proposed policy mitigates this by enabling agents to form an imaginative model of the 3D world.
(1) "GenEx: Generating an Explorable World"
openreview.net/pdf?id=8NlU...
TLDR— Physical exploration can be expensive, and even impossible. Our proposed policy mitigates this by enabling agents to form an imaginative model of the 3D world.
Excited that students from my lab are off to top PhD programs!
Muhan Gao @muhan_gao→ Texas A&M
Zhouxiang Feng @FocusV857→ Rice
Abe Hou @abe_hou→ Stanford
Taiming Lu @TaiMingLu→ Princeton
Dongwei Jiang @Dongwei__Jiang→ USC
Excited that students from my lab are off to top PhD programs!
Muhan Gao @muhan_gao→ Texas A&M
Zhouxiang Feng @FocusV857→ Rice
Abe Hou @abe_hou→ Stanford
Taiming Lu @TaiMingLu→ Princeton
Dongwei Jiang @Dongwei__Jiang→ USC
Took the team out for some badminton fun today—amazing energy, lots of laughs, and a reminder of how lucky I am to work with this crew!
Took the team out for some badminton fun today—amazing energy, lots of laughs, and a reminder of how lucky I am to work with this crew!
See Abe Hou @abe_hou 's study in the context of "vaccine hesitancy" where we can use historical data for comparison and validation.
arxiv.org/abs/2503.09639
See Abe Hou @abe_hou 's study in the context of "vaccine hesitancy" where we can use historical data for comparison and validation.
arxiv.org/abs/2503.09639