Lightnews — Scholar-powered news

Simon Munzert

@simonsaysnothin.bsky.social

410 followers 440 following 28 posts

Professor of Data Science and Public Policy | Hertie School Data Science Lab | Elections, Public Opinion, Data

Posts Replies Media Videos

Simon Munzert

@simonsaysnothin.bsky.social

This week, the 2025 edition of the Lancet Countdown Report on Health and Climate Change was released — and I was able to contribute a small part again (look for indicator 5.2 in our version of the #ShowYourStripes chart, that’s me!). 1/4

Chart 1 of 2 from Lancet Countdown 2025 providing time-series information on climate and health indicators reported in the paper. Lots of unfavorable, a few favorable developments.

Chart 2 of 2 from Lancet Countdown 2025 providing time-series information on climate and health indicators reported in the paper. Lots of unfavorable, a few favorable developments.

October 31, 2025 at 8:43 AM

Simon Munzert

@simonsaysnothin.bsky.social

Main finding #2: Group markers drive over-moderation. Words like "muslim", "gay", or "jews" make mis-classifying non-hate speech as hate speech more likely.

Table providing results from a SHAP Values analysis, showing which speech tokens contribute to misclassification decisions.

May 12, 2025 at 8:25 PM

Simon Munzert

@simonsaysnothin.bsky.social

Main finding #1: Performance varies wildly across moderation services, datasets, and metrics, but some of the failure rates are astonishing (FPR & FNR > 75% on balanced samples for some implicit and explicit speech!).

Table reporting some key findings from the paper - hate speech classification performance statistics across moderation services and datasets.

May 12, 2025 at 8:25 PM

Simon Munzert

@simonsaysnothin.bsky.social

Our paper investigates how commercial content moderation APIs handle group-targeted hate speech. This is a black-box audit (we have no idea how the models look exactly) of 5 major APIs with five million queries based on four datasets.

Info chart describing the design of the study. Figure title: Our black-box audit framework to evaluate commercial content moderation APIs.

May 12, 2025 at 8:25 PM

Simon Munzert

@simonsaysnothin.bsky.social

Have you ever used a content moderation API, such as Perspective API or OpenAI's Moderation API, for your research or to inform moderation decisions? Well, they might not have given you what you think they would.

Bill Murray sitting on a bed, somehow lost. Scene from movie Lost in Translation. Title says "Lost in Moderation"

May 12, 2025 at 8:25 PM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news