Lightnews — Scholar-powered news

Mingqian Zheng

@mingqian-zheng.bsky.social

56 followers 210 following 9 posts

PhD @CMU LTI
https://eeelisa.github.io/

Posts Replies Media Videos

Mingqian Zheng

@mingqian-zheng.bsky.social

How and when should LLM guardrails be deployed to balance safety and user experience?

Our #EMNLP2025 paper reveals that crafting thoughtful refusals rather than detecting intent is the key to human-centered AI safety.

📄 arxiv.org/abs/2506.00195
🧵[1/9]

October 20, 2025 at 8:04 PM

Reposted by Mingqian Zheng

David Jurgens

@davidjurgens.bsky.social

Why do some emails get a reply and not others? Does it have more to do with how you write it or who you are—or maybe both? In our new #NAACL2025 paper we looked at 11M emails to causally test what factors will help you get a reply. 📬

The first page of the NAACL 2025 paper Causally Modeling the Linguistic and Social Factors that Predict Email Response

May 1, 2025 at 3:15 AM

Reposted by Mingqian Zheng

Xuhui Zhou

@nlpxuhui.bsky.social

When interacting with ChatGPT, have you wondered if they would ever "lie" to you? We found that under pressure, LLMs often choose deception. Our new #NAACL2025 paper, "AI-LIEDAR ," reveals models were truthful less than 50% of the time when faced with utility-truthfulness conflicts! 🤯 1/

April 28, 2025 at 8:36 PM

Reposted by Mingqian Zheng

Akhila Yerukola

@akhilayerukola.bsky.social

Did you know? Gestures used to express universal concepts—like wishing for luck—vary DRAMATICALLY across cultures?
🤞means luck in US but deeply offensive in Vietnam 🚨

📣 We introduce MC-SIGNS, a test bed to evaluate how LLMs/VLMs/T2I handle such nonverbal behavior!

📜: arxiv.org/abs/2502.17710

Figure showing that interpretations of gestures vary dramatically across regions and cultures. ‘Crossing your fingers,’ commonly used in the US to wish for good luck, can be deeply offensive to female audiences in parts of Vietnam. Similarly, the 'fig gesture,' a playful 'got your nose' game with children in the US, carries strong sexual connotations in Japan and can be highly offensive.

February 26, 2025 at 4:23 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news