Lightnews — Scholar-powered news

Sujay Nagaraj

@snagaraj.bsky.social

60 followers 98 following 20 posts

MD/PhD student | University of Toronto | Machine Learning for Health

Posts Replies Media Videos

Sujay Nagaraj

@snagaraj.bsky.social

Our algorithm can improve safety and performance by flagging regretful predictions for abstention or data cleaning.

For example, we demonstrate that, by abstaining from prediction using our algorithm, we can reduce mistakes compared to standard approaches:

April 19, 2025 at 11:04 PM

Sujay Nagaraj

@snagaraj.bsky.social

We develop a method that trains models over plausible clean datasets to anticipate regretful predictions, helping us spot when a model is unreliable at the individual-level.

April 19, 2025 at 11:04 PM

Sujay Nagaraj

@snagaraj.bsky.social

We capture this effect with a simple measure: regret.

Regret is inevitable with label noise, but it can tell us where models silently fail, and how we can guide safer predictions

April 19, 2025 at 11:04 PM

Sujay Nagaraj

@snagaraj.bsky.social

We can frame this problem as learning from noisy labels.

Plenty of algorithms have been designed to handle label noise by predicting well on average, but we show how they still fail on specific individuals.

April 19, 2025 at 11:04 PM

Sujay Nagaraj

@snagaraj.bsky.social

Many ML models predict labels that don’t reflect what we care about, e.g.:
– Diagnoses from unreliable tests
– Outcomes from noisy electronic health records

In a new paper w/@berkustun, we study how this subjects individuals to a lottery of mistakes.
Paper: bit.ly/3Y673uZ
🧵👇

April 19, 2025 at 11:04 PM

Sujay Nagaraj

@snagaraj.bsky.social

Our algorithm can improve safety and performance by flagging regretful predictions for abstention or for data cleaning. For example, we demonstrate how abstaining from prediction on these instances can reduce mistakes compared to standard approaches:

April 19, 2025 at 10:09 PM

Sujay Nagaraj

@snagaraj.bsky.social

We develop a method to anticipate regretful predictions by training models over plausible clean datasets.

This helps us spot when a model is unreliable at the individual-level.

April 19, 2025 at 10:09 PM

Sujay Nagaraj

@snagaraj.bsky.social

We capture this effect with a simple measure: regret.

Regret is inevitable with label noise -- it tells us where models silently fail, and how we can guide safer predictions.

April 19, 2025 at 10:09 PM

Sujay Nagaraj

@snagaraj.bsky.social

We can frame this as learning from noisy labels.

Plenty of algorithms have been designed to handle label noise by predicting well on average —
But we show how they can still fail on specific individuals.

April 19, 2025 at 10:09 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news