Lightnews — Scholar-powered news

Shivani Kumar

@shivanikumar.bsky.social

Work done at #UMSI with the amazing @davidjurgens.bsky.social! Read more in our preprint! 🔗
📄 Paper: arxiv.org/abs/2502.14083
📂 Dataset: huggingface.co/datasets/shi...

@umichresearch.bsky.social #umichresearch #umich
(n/n)

March 1, 2025 at 12:56 AM

Shivani Kumar

@shivanikumar.bsky.social

🏁 Final verdict? Across languages & contexts, models struggle to exceed chance in moral reasoning, highlighting gaps, especially in data-scarce languages.
UniMoral supports studies on cross-cultural moral generalization, bias detection, & value quantification to enhance ethics in AI! (8/n)

March 1, 2025 at 12:56 AM

Shivani Kumar

@shivanikumar.bsky.social

Are models better at psychological vs. real-world dilemmas?

👍 Yes, models perform better on psychological scenarios than Reddit dilemmas.
The gap is larger in predicting ethics & decision factors.
Why? Structured scenarios align with values, while Reddit dilemmas add noise and ambiguity. (7/n)

March 1, 2025 at 12:56 AM

Shivani Kumar

@shivanikumar.bsky.social

Do the responder's values improve predictions?

👍 Yes, context matters!
Values aid action prediction, but models rely on surface patterns. Surprisingly, a short self-authored persona works as well as values in personalizing predictions. Examples also help in identifying decision factors. (6/n)

March 1, 2025 at 12:56 AM

Shivani Kumar

@shivanikumar.bsky.social

Can models reason equally well in different languages?

👎 No! Moral reasoning varies.
English, Spanish & Russian outperform. Arabic & Hindi show lower confidence due to limited data & complex morphology.
➕ Identifying decision factors lags behind action prediction. (5/n)

March 1, 2025 at 12:56 AM

Shivani Kumar

@shivanikumar.bsky.social

Can AI reason morally?

We tested LLMs with UniMoral to:
⚖️ Make action choices
🏛️ Identify ethical preferences
✅ Recognize influences
🔮 Predict consequences
Insights: LLMs excel at action & consequence but lag in ethics & factors. But, how well do they generalize across languages and contexts? (4/n)

March 1, 2025 at 12:56 AM

Shivani Kumar

@shivanikumar.bsky.social

What’s inside?

💭 Multilingual Hypothetical + Reddit based dilemmas
🌐 Action choices of people across 46 countries!
🔎 Ethical principles preferences
📊 Cultural & moral profiles of annotators
🔁 Consequence modeling
Think of it as a "CT scan" of human moral judgment! (3/n)

March 1, 2025 at 12:56 AM

Shivani Kumar

@shivanikumar.bsky.social

Why care?🤔

AI thrives on decision-making, yet most NLP research in moral reasoning relies on fragmented, western-centric data. What’s missing? A dataset capturing the full cycle: actions ⚖️, ethics 🏛️, consequences 🔄, and cultural nuance 🌏.
That’s where UniMoral comes in. (2/n)

March 1, 2025 at 12:56 AM

Shivani Kumar

@shivanikumar.bsky.social

Are models better at psychological vs. real-world dilemmas?

👍 Yes, models perform better on psychological scenarios than Reddit dilemmas.
The gap is larger in predicting ethics & decision factors.
Why? Structured scenarios align with values, while Reddit dilemmas add noise and ambiguity. (7/n)

March 1, 2025 at 12:43 AM

Shivani Kumar

@shivanikumar.bsky.social

Do the responder's values improve predictions?

👍 Yes, context matters!
Values aid action prediction, but models rely on surface patterns. Surprisingly, a short self-authored persona works as well as values in personalizing predictions. Examples also help in identifying decision factors. (6/n)

March 1, 2025 at 12:43 AM

Shivani Kumar

@shivanikumar.bsky.social

Can models reason equally well in different languages?

👎 No! Moral reasoning varies.
English, Spanish & Russian outperform. Arabic & Hindi show lower confidence due to limited data & complex morphology.
➕ Identifying decision factors lags behind action prediction. (5/n)

March 1, 2025 at 12:43 AM

Shivani Kumar

@shivanikumar.bsky.social

Can AI reason morally?

We tested LLMs with UniMoral to:
⚖️ Make action choices
🏛️ Identify ethical preferences
✅ Recognize influences
🔮 Predict consequences
Insights: LLMs excel at action & consequence but lag in ethics & factors. But, how well do they generalize across languages and contexts? (4/n)

March 1, 2025 at 12:43 AM

Shivani Kumar

@shivanikumar.bsky.social

What’s inside?

💭 Multilingual Hypothetical + Reddit based dilemmas
🌐 Action choices of people across 46 countries!
🔎 Ethical principles preferences
📊 Cultural & moral profiles of annotators
🔁 Consequence modeling
Think of it as a "CT scan" of human moral judgment! (3/n)

March 1, 2025 at 12:43 AM

Shivani Kumar

@shivanikumar.bsky.social

Why care?🤔

AI thrives on decision-making, yet most NLP research in moral reasoning relies on fragmented, western-centric data. What’s missing? A dataset capturing the full cycle: actions ⚖️, ethics 🏛️, consequences 🔄, and cultural nuance 🌏.
That’s where UniMoral comes in. (2/n)

March 1, 2025 at 12:43 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news