Kenny Peng
@kennypeng.bsky.social
CS PhD student at Cornell Tech. Interested in interactions between algorithms and society. Princeton math '22.

kennypeng.me
Reposted by Kenny Peng
New #NeurIPS2025 paper: how should we evaluate machine learning models without a large, labeled dataset? We introduce Semi-Supervised Model Evaluation (SSME), which uses labeled and unlabeled data to estimate performance! We find SSME is far more accurate than standard methods.
October 17, 2025 at 4:29 PM
Being Divya's labmate (and fellow ferry commuter) has been a real pleasure, and I've learned a ton from both her research itself and her approach to research (and also from the other random things she knows about).
I am on the job market this year! My research advances methods for reliable machine learning from real-world data, with a focus on healthcare. Happy to chat if this is of interest to you or your department/team.
October 14, 2025 at 4:02 PM
"those already relatively advantaged are, empirically, more able to pay time costs and navigate administrative burdens imposed by the mechanisms."

This point by @nkgarg.bsky.social has greatly shaped my thinking about the role of computer science in public service settings.
New piece, out in the SIGecom Exchanges! It's my first solo-author piece, and the closest thing I've written to being my "manifesto." #econsky #ecsky
arxiv.org/abs/2507.03600
August 12, 2025 at 1:04 PM
How do we reconcile excitement about sparse autoencoders with negative results showing that they underperform simple baselines? Our new position paper makes a distinction: SAEs are very useful tools for discovering *unknown* concepts, but less good for acting on *known* concepts.
August 5, 2025 at 5:26 PM
One-paragraph pitch for why sparse autoencoders are cool (they learn *interpretable* text embeddings)
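(Hedged sketch, not our code: a top-k sparse autoencoder trained on off-the-shelf text embeddings, where the sparse codes play the role of the "interpretable embedding." Names and dimensions are illustrative.)

```python
# Minimal top-k sparse autoencoder over text embeddings (illustrative only).
import torch
import torch.nn as nn

class TopKSAE(nn.Module):
    def __init__(self, d_embed=768, d_hidden=8192, k=16):
        super().__init__()
        self.k = k
        self.encoder = nn.Linear(d_embed, d_hidden)
        self.decoder = nn.Linear(d_hidden, d_embed)

    def forward(self, x):
        # Encode, then keep only the k largest activations per example:
        # each text is described by a handful of reusable features.
        acts = torch.relu(self.encoder(x))
        topk = torch.topk(acts, self.k, dim=-1)
        sparse = torch.zeros_like(acts).scatter_(-1, topk.indices, topk.values)
        return self.decoder(sparse), sparse

# Train to reconstruct embeddings; the sparse codes are the interpretable embedding.
sae = TopKSAE()
opt = torch.optim.Adam(sae.parameters(), lr=1e-4)
embeddings = torch.randn(256, 768)  # stand-in for real sentence embeddings
for _ in range(50):
    recon, codes = sae(embeddings)
    loss = ((recon - embeddings) ** 2).mean()
    opt.zero_grad(); loss.backward(); opt.step()
```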
July 30, 2025 at 5:22 PM
We're presenting two papers Wednesday at #ICML2025, both at 11am.

Come chat about "Sparse Autoencoders for Hypothesis Generation" (west-421), and "Correlated Errors in LLMs" (east-1102)!

Short thread ⬇️
July 16, 2025 at 5:09 AM
Are LLMs correlated when they make mistakes? In our new ICML paper, we answer this question using responses from >350 LLMs. We find substantial correlation. On one dataset, LLMs agree on the wrong answer ~2x more than they would at random. 🧵(1/7)

arxiv.org/abs/2506.07962
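(For intuition, here's an illustrative way to compute "agreement on the wrong answer" against a random-errors baseline. This is my toy sketch, not the paper's code.)

```python
# Illustrative only: do two models agree on *wrong* answers more often
# than independent random errors would?
import numpy as np

def wrong_answer_agreement(ans_a, ans_b, truth):
    """Among questions both models get wrong, how often do they give the same answer?"""
    both_wrong = (ans_a != truth) & (ans_b != truth)
    return float(np.mean(ans_a[both_wrong] == ans_b[both_wrong]))

n_questions, n_options = 10_000, 4
rng = np.random.default_rng(0)
truth = rng.integers(0, n_options, n_questions)

def random_wrong(truth):
    # Chance baseline: pick a uniformly random wrong option on every question.
    shift = rng.integers(1, n_options, truth.shape)  # never 0, so never the true answer
    return (truth + shift) % n_options

baseline = wrong_answer_agreement(random_wrong(truth), random_wrong(truth), truth)
print(f"chance agreement on wrong answers ~ {baseline:.2f}")  # about 1/3 with 4 options
# The finding (on one dataset): observed agreement is roughly 2x this baseline.
```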
July 3, 2025 at 12:54 PM
Reposted by Kenny Peng
New work 🎉: conformal classifiers return sets of classes for each example, with a probabilistic guarantee the true class is included. But these sets can be too large to be useful.

In our #CVPR2025 paper, we propose a method to make them more compact without sacrificing coverage.
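(For context, a minimal split-conformal sketch of the kind of set-valued classifier being compacted. This is the standard recipe, not the paper's method; names and thresholds are illustrative.)

```python
# Standard split-conformal prediction sets (illustrative; not the paper's compaction method).
import numpy as np

def conformal_sets(cal_probs, cal_labels, test_probs, alpha=0.1):
    """Return prediction sets that contain the true class with probability ~1 - alpha."""
    n = len(cal_labels)
    # Nonconformity score: one minus the predicted probability of the true class.
    scores = 1.0 - cal_probs[np.arange(n), cal_labels]
    # Quantile with the finite-sample correction from split conformal prediction.
    q = np.quantile(scores, np.ceil((n + 1) * (1 - alpha)) / n, method="higher")
    # Keep every class whose score falls below the threshold.
    return [np.where(1.0 - p <= q)[0] for p in test_probs]

# Toy usage with random "softmax" outputs.
rng = np.random.default_rng(0)
cal_probs = rng.dirichlet(np.ones(10), size=500)
cal_labels = rng.integers(0, 10, 500)
test_probs = rng.dirichlet(np.ones(10), size=5)
for s in conformal_sets(cal_probs, cal_labels, test_probs):
    print(s)  # sets can get large, which is what the paper aims to shrink
```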
June 14, 2025 at 3:00 PM
Reposted by Kenny Peng
We'll present HypotheSAEs at ICML this summer! 🎉
Draft: arxiv.org/abs/2502.04382

We're continuing to cook up new updates for our Python package: github.com/rmovva/Hypot...

(Recently, "Matryoshka SAEs", which help extract coarse and granular concepts without as much hyperparameter fiddling.)
May 5, 2025 at 9:27 PM
Reposted by Kenny Peng
I’m really excited to share the first paper of my PhD, “Learning Disease Progression Models That Capture Health Disparities” (accepted at #CHIL2025)! ✨ 1/

📄: arxiv.org/abs/2412.16406
May 1, 2025 at 12:57 PM
Reposted by Kenny Peng
The US government recently flagged my scientific grant in its "woke DEI database". Many people have asked me what I will do.

My answer today in Nature.

We will not be cowed. We will keep using AI to build a fairer, healthier world.

www.nature.com/articles/d41...
My ‘woke DEI’ grant has been flagged for scrutiny. Where do I go from here?
My work in making artificial intelligence fair has been noticed by US officials intent on ending ‘class warfare propaganda’.
www.nature.com
April 25, 2025 at 5:19 PM
Our lab had a #dogathon 🐕 yesterday where we analyzed NYC Open Data on dog licenses. We learned a lot of dog facts, which I’ll share in this thread 🧵

1) Geospatial trends: Cavalier King Charles Spaniels are common in Manhattan; the opposite is true for Yorkshire Terriers.
April 2, 2025 at 2:16 PM
Reposted by Kenny Peng
Migration data lets us study responses to environmental disasters, social change patterns, policy impacts, etc. But public data is too coarse, obscuring these important phenomena!

We build MIGRATE: a dataset of yearly flows between 47 billion pairs of US Census Block Groups. 1/5
March 28, 2025 at 3:25 PM
Reposted by Kenny Peng
💡New preprint & Python package: We use sparse autoencoders to generate hypotheses from large text datasets.

Our method, HypotheSAEs, produces interpretable text features that predict a target variable, e.g. features in news headlines that predict engagement. 🧵1/
March 18, 2025 at 3:17 PM
(1/n) New paper/code! Sparse Autoencoders for Hypothesis Generation

HypotheSAEs generates interpretable features of text data that predict a target variable: What features predict clicks from headlines / party from congressional speech / rating from Yelp review?

arxiv.org/abs/2502.04382
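(Rough sketch of the overall pipeline, not the package's actual API: compute sparse SAE activations per text, then select the features most predictive of the target; the selected features are the candidate hypotheses to interpret. All names and data below are illustrative.)

```python
# Generic sketch of the second step: selecting SAE features that predict the target.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
# Stand-in for SAE activations of 2,000 headlines over 512 learned features (mostly zero).
sae_features = np.abs(rng.normal(size=(2000, 512))) * (rng.random((2000, 512)) < 0.05)
# Toy target: headlines activating feature 7 or 42 get clicked.
clicked = ((sae_features[:, 7] > 0) | (sae_features[:, 42] > 0)).astype(int)

# L1-regularized regression keeps only a handful of predictive features.
selector = LogisticRegression(penalty="l1", solver="liblinear", C=0.5).fit(sae_features, clicked)
predictive = np.nonzero(selector.coef_[0])[0]
print("candidate hypothesis features:", predictive)  # should recover 7 and 42 (plus noise)
# Each selected feature is then described in natural language (e.g., via the texts that
# activate it most), giving human-readable hypotheses to test on held-out data.
```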
March 18, 2025 at 3:29 PM
Reposted by Kenny Peng
Please repost to get the word out! @nkgarg.bsky.social and I are excited to present a personalized feed for academics! It shows posts about papers from accounts you're following: bsky.app/profile/pape...
March 10, 2025 at 3:12 PM
In new work, we show a "No Free Lunch Theorem" for human-AI collaboration (w/ @nkgarg.bsky.social and Jon Kleinberg).

(And if you're at #AAAI, I'm presenting at 11:15am today in the Humans and AI session. Poster 12:30-2:30.)

arxiv.org/abs/2411.15230
February 27, 2025 at 2:30 PM
I'm at #NeurIPS, presenting work w/ @nkgarg.bsky.social where we study algorithmic monoculture using a matching markets model: If many firms or colleges all use the same algorithm to evaluate applicants, what happens?

Poster is in a few hours, come chat!
Wed 11am-2pm | West Ballroom A-D #5505
December 11, 2024 at 4:16 PM