Lightnews — Scholar-powered news

Akhila Yerukola

@akhilayerukola.bsky.social

400 followers 240 following 18 posts

PhD student at CMU LTI; Interested in pragmatics and cross-cultural understanding;
intern @ Allen Institute for AI |Prev: Senior Research Engineer @ Samsung Research America | Masters @ Stanford
https://akhila-yerukola.github.io/

Posts Replies Media Videos

Akhila Yerukola

@akhilayerukola.bsky.social

Also, this work began while I interned with Nanyun Peng and @skgabrie.bsky.social at sunny UCLA under the guidance of my advisor @maartensap.bsky.social ! Grateful for their mentorship throughout! 🙌

February 26, 2025 at 6:30 PM

Akhila Yerukola

@akhilayerukola.bsky.social

Special thanks to: @sunipadev.bsky.social @841io.bsky.social , @nouhadziri.bsky.social , Jocelyn Shen, @shaily99.bsky.social , @simi97k.bsky.social ,
@vijaytarian.bsky.social , @apratapa.xyz for helpful discussions and feedback on this work!

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

Huge shoutout to my amazing collaborators: @skgabrie.bsky.social, Nanyun (Violet) Peng, @maartensap.bsky.social!!

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

🚀 I'm passionate about developing culturally contextual safety guardrails to make AI more sensitive and aware. If this work interests you, please feel free to reach out—I’d love to connect!

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

For more interesting findings, please check out our preprint 📜 arxiv.org/abs/2502.17710

Data 📚 github.com/Akhila-Yeruk...

Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures

Gestures are an integral part of non-verbal communication, with meanings that vary across cultures, and misinterpretations that can have serious social and diplomatic consequences. As AI systems becom...

arxiv.org

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

The cross-cultural safety risks aren’t theoretical – they’re already impacting several applications, such as:
✈️ AI-powered travel guides
🎭 AI-generated ad visuals
🤖 Automated content moderation
Culturally contextual safety guardrails are needed for AI systems!

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

🔬 Key Takeaway 🥉
All models—T2I, LLMs, and VLMs—exhibit US-centric biases, with higher accuracy in identifying offensive gestures in US contexts than in non-US ones (e.g., middle finger 🖕 in US vs UK)

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

🔬 Key Takeaway 🥈
All models—T2I, LLMs, and VLMs—often default to US-centric interpretations of universal concepts (e.g., "good luck" → 🤞), overlooking the cultural variation in gestures used to express them

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

🔬 Key Takeaway 🥇
(a) T2I models struggle to reject offensive gestures. LLMs tend to overflag gestures as offensive. VLMs show mixed results, with some performing near chance and others over-flagging
(b) Adding scene context doesnt affect LLMs but worsens T2I and VLM performance

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

We assess how well T2I systems, LLMs, and VLMs understand cross-cultural gestures—revealing gaps in AI’s ability to navigate nonverbal communication safely. 💫

Table outlining different prompt formulations used to evaluate T2I (Text-to-Image), LLM (Large Language Model), and VLM (Vision-Language Model) responses to gestures, illustrated with the ‘fingers-crossed’ gesture in Vietnam. The table categorizes prompts into three conditions: (1) Explicit: Country – directly stating both ‘fingers-crossed’ and 'Vietnam', (2) Explicit: Country + Scene – adding contextual details such as a 'women’s community gathering,' and (3) Implicit Mention – referencing the gesture’s meaning ('wishing someone luck') without explicitly naming the gesture, while still mentioning Vietnam. The table also specifies evaluation metrics: RQ1 and RQ3 focus on rejection and offensiveness classification rates, while RQ2 measures error rates.

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

🌍 Introducing MC-SIGNS — a testbed of 288 gesture-country pairs across 25 gestures & 85 countries, carefully annotated by cultural experts for:
1️⃣Offensiveness – how inappropriate a gesture is
2️⃣Confidence score
3️⃣Cultural meaning – associated gloss
4️⃣Contextual factors – when/where it may be risky

Table displaying examples of aggregated annotations from MC-SIGNS, listing gestures, their associated cultural meanings, contexts where they may be inappropriate, and their offensiveness ratings. The table includes gestures such as 'Horns' in Brazil (infidelity), 'Fig Sign' in Indonesia (female genitalia), and 'OK' in Turkey (homophobic). Each gesture is rated for offensiveness (Off/Obs) or hatefulness (Hate) based on annotations from five evaluators, with specific scenarios suggested for avoidance, such as public spaces, professional settings, or LGBTQ+ forums.

February 26, 2025 at 4:23 PM

Akhila Yerukola

@akhilayerukola.bsky.social

Why This Matters? 🤔
Humans can resolve such misunderstandings through social cues and context.
But AI? It generates STATIC content — ads 🎭, travel tips 🛫🏝️, and images 📸 — without accounting for the cross-cultural safety risks.

February 26, 2025 at 4:23 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news