Findings from a large-scale survey of 800 researchers on how they use LMs in their research #colm2025
Meet Ai2 ScholarQA, an experimental solution that allows you to ask questions that require multiple scientific papers to answer. It gives more in-depth and contextual answers with table comparisons and expandable sections 💡
Try it now: scholarqa.allen.ai
🍫Introducing Cocoa, our new interaction paradigm for balancing human & AI agency in complex human-AI workflows. 🧵
We conducted a large-scale survey of verified authors across different fields, races, genders, and seniority levels to find out - results 🧵
See results in comments!
🔗 Arxiv link: arxiv.org/abs/2411.05025
Boulder is a lovely college town 30 minutes from Denver and 1 hour from Rocky Mountain National Park 😎
Apply by December 15th!
@uwnlp.bsky.social & Ai2
With open models & 45M-paper datastores, it outperforms proprietary systems & matches human experts.
Try out our demo!
openscholar.allen.ai
Benchmarks like Chatbot Arena contain underspecified queries, which can lead to arbitrary eval judgments. What happens if we provide evaluators with context (e.g., who's the user, what's their intent) when judging LM outputs? 🧵↓
@benn9.bsky.social & Yoonjoo Lee's #EMNLP paper explored ways to create such tables using LLMs, and how to evaluate them against a large set of lit review tables we extracted from arXiv.
Have you ever constructed a table to organize your literature review process? Can we use LMs to generate these automatically?
We are excited to present ArxivDIGESTables 🍽️ a study of collecting, generating, and evaluating 🎓 scientific literature review tables 📃!
We introduce #meronymity, a novel design paradigm to mitigate social barriers in public social interactions by revealing aspects of identity to balance credibility & privacy. @axz.bsky.social @jbragg.bsky.social @josephc.bsky.social @karger.bsky.social