Asher Zheng
@asher-zheng.bsky.social
PhD @ UT Linguistics
Semantics/Pragmatics/NLP
https://asherz720.github.io/
Prev. @UoEdinburgh @Hanyang
By analyzing model reasoning, we find that extra reasoning introduces overcomplication (img left), misunderstanding, and internal inconsistency (img right). This shows that current LLMs still lack sophisticated pragmatic understanding in many ways.
June 3, 2025 at 11:56 AM
We evaluate a range of LLMs on how well they perceive strategic language. Models struggle on our metrics despite showing an overall good grasp of Gricean principles. Model size tends to have a positive effect, while reasoning does not help.
June 3, 2025 at 11:56 AM
(2) BaT and PaT are valid measures of strategic gains/losses that can, to some extent, predict conversational outcomes. In addition, our metrics are more objective: when conditioned on cases where the outcome rests on logical arguments, their predictive power rises.
June 3, 2025 at 11:56 AM
We also introduce CHARM, an annotated dataset of real legal cross-examination dialogues. By applying our framework, we show that (1) cooperative and non-cooperative discourse are distinct over the identified properties (img left), and (2) BaT and PaT show such a distributional distinction (img right).
June 3, 2025 at 11:56 AM
Based on the components above, we introduce three metrics—Benefit at Turn (BaT), Penalty at Turn (PaT), and Normalized Relative Benefit at Turn (NRBaT)—to measure the strategic gains, losses, and cumulative benefits at a turn.
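As an illustrative sketch only (the thread does not give the formulas), the relationship between the three metrics might be operationalized as follows. The assumption that BaT and PaT are non-negative per-turn scores, the function name, and the particular normalization are all hypothetical, not the paper's definitions:

```python
def nrbat(bat: float, pat: float) -> float:
    """Hypothetical Normalized Relative Benefit at Turn.

    Assumes BaT (gain) and PaT (loss) are non-negative per-turn
    scores, and scales their difference to the range [-1, 1].
    This normalization is an assumption, not the paper's formula.
    """
    if bat + pat == 0:
        return 0.0  # no strategic activity at this turn
    return (bat - pat) / (bat + pat)

# A turn with more strategic gain than loss yields a positive score.
print(nrbat(3.0, 1.0))  # 0.5
```

Under this sketch, a score near +1 marks a turn dominated by strategic gains, and a score near -1 a turn dominated by losses.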
June 3, 2025 at 11:56 AM
Language is often strategic, but LLMs tend to play nice. How strategic are they really? Probing into that is key for future safety alignment.

👉Introducing CoBRA🐍, a framework that assesses strategic language.

Work with my amazing advisors @jessyjli.bsky.social and @David I. Beaver!
June 3, 2025 at 11:56 AM