Lightnews — Scholar-powered news

Srishti

@srishtiy.bsky.social

300 followers 420 following 14 posts

Posts Replies Media Videos

Srishti

@srishtiy.bsky.social

We then propose 5 frameworks to evaluate cultures in VLMs:
1️⃣ Processual Grounding - who defines culture?
2️⃣ Material Culture - what is represented?
3️⃣ Symbolic Encoding - how is meaning layered?
4️⃣ Contextual Interpretation - who understands and frames meaning?
5️⃣ Temporality -when is culture situated?

June 2, 2025 at 10:36 AM

Srishti

@srishtiy.bsky.social

Modern Vision-Language Models (VLMs) often fail at cultural understanding. But culture isn’t just recognizing things like food, clothes, rituals etc. It's how meaning is made and understood; it also about symbolism, context, and how these things evolve over time.

June 2, 2025 at 10:36 AM

Srishti

@srishtiy.bsky.social

I am excited to announce our latest work 🎉 "Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory". We review recent works on culture in VLMs and argue for deeper grounding in cultural theory to enable more inclusive evaluations.

Paper 🔗: arxiv.org/pdf/2505.22793

Paper title "Cultural Evaluations of Vision-Language Models
Have a Lot to Learn from Cultural Theory"

June 2, 2025 at 10:36 AM

Srishti

@srishtiy.bsky.social

Thanks! In one result we do compare image-text frames, focusing on if highlighted text frames align with image frames in the article - e.g. political in text often aligns with policy/public-op in images (Diagonal shows there is often
an alignment).
Your work sounds interesting—keen to chat more :) !

April 7, 2025 at 6:26 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news