But we are still welcoming submissions through ACL Rolling Review 🎉
ARR Commitment Deadline: June 6th
Acceptance notifications will be on June 20th, and then SIGdial will be held in Avignon, France: August 25th - 27th
But we are still welcoming submissions through ACL Rolling Review 🎉
ARR Commitment Deadline: June 6th
Acceptance notifications will be on June 20th, and then SIGdial will be held in Avignon, France: August 25th - 27th
From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮
And this isn’t a one-off. The pattern holds across RL algorithms and models.
🧵A Deep Dive
From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮
And this isn’t a one-off. The pattern holds across RL algorithms and models.
🧵A Deep Dive
To discuss these problems, we will host the first ORIGen workshop at @colmweb.org! Submissions welcome from NLP, HCI, CogSci, and anything human-centered, due June 20 :)
origen-workshop.github.io
To discuss these problems, we will host the first ORIGen workshop at @colmweb.org! Submissions welcome from NLP, HCI, CogSci, and anything human-centered, due June 20 :)
origen-workshop.github.io
📄Arxiv: arxiv.org/pdf/2505.07775
💻 GitHub: github.com/beyzabozdag/...
📄Arxiv: arxiv.org/pdf/2505.07775
💻 GitHub: github.com/beyzabozdag/...
By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified.
🧵[1/n]
By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified.
🧵[1/n]
To find out, we introduce SafeArena (safearena.github.io), a benchmark to assess the capabilities of web agents to complete harmful web tasks. A thread 👇
To find out, we introduce SafeArena (safearena.github.io), a benchmark to assess the capabilities of web agents to complete harmful web tasks. A thread 👇
Introducing Persuade Me If You Can (PMIYC)—a new framework to evaluate (1) how persuasive LLMs are and (2) how easily they can be persuaded! 🚀
📄Arxiv: arxiv.org/abs/2503.01829
🌐Project Page: beyzabozdag.github.io/PMIYC/
NN-CIFT slashes data valuation costs by 99% using tiny neural nets (205k params, just 0.0027% of 8B LLMs) while maintaining top-tier performance!
🌐 shuhaibm.github.io/refed/
🧵 [1/n]
@convai-uiuc.bsky.social @gokhantur.bsky.social
In our latest work, "Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems", we delve into how strategic delays can enhance dialogue systems.
Paper Website: merterm.github.io/positive-fri...
@convai-uiuc.bsky.social @gokhantur.bsky.social
Catch our work, "Simulating User Agents for Embodied Conversational AI," at the Open World Agents Workshop on Dec 15.
Poster: tinyurl.com/yc42h4ud
Audio: tinyurl.com/mr2hd35a
@dilekh.bsky.social @gokhantur.bsky.social
Catch our work, "Simulating User Agents for Embodied Conversational AI," at the Open World Agents Workshop on Dec 15.
Poster: tinyurl.com/yc42h4ud
Audio: tinyurl.com/mr2hd35a
@dilekh.bsky.social @gokhantur.bsky.social