Now: @jhuclsp @jhucompsci
Past: @allen_ai @uwnlp @Penn @cogcomp @Illinois_Alma @MSFTResearch
We studied agentic tool recovery—when your LLM selects a set of tools to execute, but one turns out to be unavailable or incorrect.
📄 Preprint: arxiv.org/pdf/2505.20321
Apparently, ChatGPT has a better grasp than @nyuniversity.
x.com/nebedaay/st...
* It's simple: Rewrite content by targeting and transforming the few longest quotes.
Answering this requires evaluating whether LLMs can provide critiques that are *grounded* in the context of science papers.
See @JiefuOu's dataset, which has a collection of paper claims and their critiques: arxiv.org/pdf/2503.21717
But what if you want a bird’s-eye view of science, or to identify over- and under-explored areas?
We introduce 🔺Science Hierarchography🔺, the goal of organizing science papers into conceptual hierarchies.
arxiv.org/abs/2504.13834
Took the team out for some badminton fun today—amazing energy, lots of laughs, and a reminder of how lucky I am to work with this crew!