Lightnews — Scholar-powered news

Shuvom Sadhuka

@shuvoms.bsky.social

10/ Also, I will be sticking around in San Diego for NeurIPS afterwards to present SSME. See my co-author’s thread on that here:
bsky.app/profile/dmsh...

Divya Shanmugam @dmshanmugam.bsky.social · Oct 17

New #NeurIPS2025 paper: how should we evaluate machine learning models without a large, labeled dataset? We introduce Semi-Supervised Model Evaluation (SSME), which uses labeled and unlabeled data to estimate performance! We find SSME is far more accurate than standard methods.

November 18, 2025 at 6:40 PM

Shuvom Sadhuka

@shuvoms.bsky.social

9/ This was joint work with
Sophia Lin, @bergerlab.bsky.social , and @emmapierson.bsky.social ! A special shoutout to Sophia Lin who was a high school student when we started this project.

Arxiv: arxiv.org/pdf/2511.11684
Code: github.com/shuvom-s/fun...

November 18, 2025 at 6:39 PM

Shuvom Sadhuka

@shuvoms.bsky.social

8/ We find that the estimated mortality risk required to admit male patients into the ICU (4.5% is lower than that for female patients (5.1%). We find a similar disparity in hospital admissions as well.

November 18, 2025 at 6:38 PM

Shuvom Sadhuka

@shuvoms.bsky.social

7/ We then apply our model to MIMIC-IV, a dataset of medical records from a Boston-area emergency department. We model the flow of patients from the ED to the hospital to the ICU using our funnel model.

November 18, 2025 at 6:38 PM

Shuvom Sadhuka

@shuvoms.bsky.social

6/ We introduce a funnel model that predicts both the ground truth label and the human decisions, while accounting for unobserved covariates that affect both the label and the decisions.

November 18, 2025 at 6:38 PM

Shuvom Sadhuka

@shuvoms.bsky.social

5/ This censoring of the ground truth happens over multiple stages and isn't random. Patients who make it to later stages tend to be at higher risk of cancer. If we train only on the population with observed labels, we’ll be learning from a biased subset of the population.

November 18, 2025 at 6:38 PM

Shuvom Sadhuka

@shuvoms.bsky.social

4/ We only observe the ground truth outcome (biopsy result) for patients who make it to the last stage.

November 18, 2025 at 6:37 PM

Shuvom Sadhuka

@shuvoms.bsky.social

3/ For instance, in a breast cancer diagnosis, a clinician may first administer a breast exam, then order a mammogram for patients with concerning exams, before ordering a biopsy for patients with concerning mammograms.

November 18, 2025 at 6:37 PM

Shuvom Sadhuka

@shuvoms.bsky.social

2/ In many decision-making settings we only observe ground truth labels only after a sequence of human decisions.

November 18, 2025 at 6:37 PM

Shuvom Sadhuka

@shuvoms.bsky.social

Relatedly, @dmshanmugam.bsky.social is on the academic job market and I strongly recommend working with her! She has an impressive array of reliable ML work, often with applications in biomedical/health settings, and I'd recommend talking to her if you can.

October 17, 2025 at 5:45 PM

Shuvom Sadhuka

@shuvoms.bsky.social

congrats! sad I missed it :/

May 10, 2025 at 1:42 AM

Shuvom Sadhuka

@shuvoms.bsky.social

It's interesting because "Shuvom" is definitely the highest signal for gender out of the list mentioned, but it probably just doesn't know this, since my name is pretty rare.

March 27, 2025 at 3:59 PM

Shuvom Sadhuka

@shuvoms.bsky.social

Some past writing I like:

Democracy and the CLT: shuvom-s.github.io/blog/2020/de...

Privacy vs collaboration in genomics: shuvom-s.github.io/blog/2023/ov...

Please check out the others too if interested! [3/3]

Democracy and the Central Limit Theorem | Shuvom Sadhuka

Is democracy just an average of preferences?

shuvom-s.github.io

February 13, 2025 at 11:00 PM

Shuvom Sadhuka

@shuvoms.bsky.social

Relatedly, I finally consolidated my scattered writing into a blog: shuvom-s.github.io/blog/

For now, I'm hosting on my (rebuilt) personal website, but curious if anyone has recs on a platform to blog (Medium? Substack? Personal site?). I've turned on comments with giscus. [2/3]

blog | Shuvom Sadhuka

A simple, whitespace theme for academics. Based on [*folio](https://github.com/bogoli/-folio) design.

shuvom-s.github.io

February 13, 2025 at 11:00 PM

Shuvom Sadhuka

@shuvoms.bsky.social

context: https://www.nyc.gov/assets/oti/html/nyc-core-framework/subway-icons.html

December 5, 2024 at 12:25 AM

Shuvom Sadhuka

@shuvoms.bsky.social

@dmshanmugam @manish_raghavan @mit_caml @lab_berger @2plus2make5 Labeled examples give access to samples from (s,y) but unlabeled examples still provide samples from the marginal for s. We should use those samples! By turning evaluation into a semi-supervised problem, we can obtain more reliable ...

December 5, 2024 at 12:25 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news