Itamar Avitan
@avitanit.bsky.social
PhD candidate at Ben-Gurion University.
Kudos to our NeurIPS 2025 reviewers for thoughtful, human-generated reviews. I’ll be presenting poster #2010 in San Diego on Fri, 5 Dec from 4:30–7:30 p.m. PT. Come say hi!
arXiv: arxiv.org/abs/2510.23321
Code and data: github.com/brainsandmachines/oddoneout_model_recovery
Model-Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn't the Right One
November 20, 2025 at 2:05 PM
Our work reveals a sharp trade-off between predictive accuracy and model identifiability. Flexible mappings maximize predictivity, but blur the distinction between competing computational hypotheses.
November 20, 2025 at 2:05 PM
Further analyses showed that linear probing was the culprit. The linear fit warps each model's original feature space, erasing its unique signature and making all aligned models converge toward a human-like representation.
November 20, 2025 at 2:05 PM
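One way to picture that convergence (a sketch added for illustration, not the paper's actual analysis): compare two models' representational similarity matrices before and after each model's own fitted linear map. The names feats1, feats2, W1, W2 below are hypothetical placeholders.

```python
import numpy as np

def rsm(feats):
    """Representational similarity matrix: cosine similarity between all image embeddings."""
    z = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    return z @ z.T

def rsm_agreement(f1, f2):
    """Correlation between two models' RSMs over the off-diagonal entries."""
    off_diag = ~np.eye(f1.shape[0], dtype=bool)
    return np.corrcoef(rsm(f1)[off_diag], rsm(f2)[off_diag])[0, 1]

# Hypothetical usage: feats1/feats2 are two models' raw features, W1/W2 their
# separately fitted linear maps. The convergence claim predicts the second
# number sits much closer to 1 than the first:
# rsm_agreement(feats1, feats2) vs. rsm_agreement(feats1 @ W1, feats2 @ W2)
```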
The key dependent measure is how often the data-generating model actually achieves the highest prediction accuracy. The surprising result: even with massive datasets (millions of trials), the best-performing model is often not the right one.
November 20, 2025 at 2:05 PM
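In code, that measure is just a "recovery rate": how often the winner of the comparison is also the generator. A toy illustration of the bookkeeping (the 20×20 accuracy matrix here is random stand-in data, not the paper's results):

```python
import numpy as np

rng = np.random.default_rng(0)
# acc[g, m]: prediction accuracy of candidate model m on data generated by model g.
# Random stand-in values, purely to show the computation.
acc = rng.random((20, 20))
best_fit = acc.argmax(axis=1)                       # which candidate wins for each generator
recovery_rate = (best_fit == np.arange(20)).mean()  # how often the winner IS the generator
print(f"recovery rate: {recovery_rate:.0%}")
```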
Each simulation worked like this: (1) pick one model from 20 candidate NNs and fit it to human responses; (2) sample a synthetic dataset from that model using NEW triplets; (3) test all 20 models on this generated data, measuring cross-validated prediction accuracy.
November 20, 2025 at 2:05 PM
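Schematically, the loop might look like the sketch below. This is a simplified reading of the three steps (a Luce-choice rule over pair similarities, candidates scored on fixed, pre-aligned features rather than refit with cross-validation); the real implementation is in github.com/brainsandmachines/oddoneout_model_recovery.

```python
import numpy as np

def choice_probs(feats, triplets):
    """Probability of each position being the odd one out, via a softmax over
    pair similarities (a Luce-choice simplification assumed for this sketch)."""
    a, b, c = (feats[triplets[:, k]] for k in range(3))
    # s[:, k] = similarity of the pair that EXCLUDES position k;
    # the more similar that pair, the more likely position k is 'odd'.
    s = np.stack([(b * c).sum(-1), (a * c).sum(-1), (a * b).sum(-1)], axis=1)
    e = np.exp(s - s.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def model_recovery(aligned_feats, new_triplets, n_sims=100, seed=0):
    """aligned_feats: dict {model_name: (n_images, d) array} of features already
    fitted to human responses (step 1). Returns each model's recovery rate."""
    rng = np.random.default_rng(seed)
    recovery = {}
    for gen_name, gen_feats in aligned_feats.items():
        p = choice_probs(gen_feats, new_triplets)   # step 2: generate from this model on NEW triplets
        wins = 0
        for _ in range(n_sims):
            sim_choices = np.array([rng.choice(3, p=pi) for pi in p])
            # step 3: score every candidate on the synthetic data
            accs = {name: (choice_probs(f, new_triplets).argmax(1) == sim_choices).mean()
                    for name, f in aligned_feats.items()}
            wins += max(accs, key=accs.get) == gen_name
        recovery[gen_name] = wins / n_sims
    return recovery
```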
We ran model recovery simulations using models fitted to the massive THINGS odd-one-out data shared by @martinhebart.bsky.social, @cibaker.bsky.social, et al. Each simulation tested whether a neural network model would “win” the model comparison if it had generated the behavioral data.
November 20, 2025 at 2:05 PM
In our new NeurIPS 2025 paper, we ask: does better predictive accuracy necessarily mean better mechanistic correspondence between neural networks and human representations? neurips.cc/virtual/2025...
NeurIPS Poster: Model–Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn’t the Right One (NeurIPS 2025)
November 20, 2025 at 2:05 PM
They also showed that nudging the NN representations toward human judgments, by linearly transforming the representation space itself, boosts cross-validated prediction accuracy almost to the reliability bound. arxiv.org/abs/2211.01201
Human alignment of neural network representations
November 20, 2025 at 2:05 PM
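A minimal sketch of what such a linear alignment could look like (assumptions on my part: frozen features, dot-product similarity, and a softmax/cross-entropy objective over the three pairs; the published method differs in its details):

```python
import torch

def fit_linear_alignment(feats, triplets, odd_idx, epochs=200, lr=1e-2):
    """Learn a linear map W on frozen NN features so that dot-product
    similarities in the transformed space predict human odd-one-out choices.

    feats:    (n_images, d) float tensor of NN embeddings
    triplets: (n_trials, 3) long tensor of image indices per trial
    odd_idx:  (n_trials,) long tensor in {0, 1, 2}: which position was chosen as odd
    """
    d = feats.shape[1]
    W = torch.eye(d, requires_grad=True)            # start from the identity map
    opt = torch.optim.Adam([W], lr=lr)
    for _ in range(epochs):
        z = feats @ W                               # transformed representations
        a, b, c = (z[triplets[:, k]] for k in range(3))
        # Similarities of the three pairs, ordered (a,b), (a,c), (b,c).
        sims = torch.stack([(a * b).sum(-1), (a * c).sum(-1), (b * c).sum(-1)], dim=1)
        # The pair the participant implicitly kept together: odd position 0 -> (b,c),
        # 1 -> (a,c), 2 -> (a,b), i.e. class index 2 - odd_idx.
        loss = torch.nn.functional.cross_entropy(sims, 2 - odd_idx)
        opt.zero_grad(); loss.backward(); opt.step()
    return W.detach()
```

Cross-validated accuracy would then come from fitting W on training triplets and scoring the argmax prediction on held-out ones.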
@lukasmut.bsky.social, @lorenzlinhardt.bsky.social, et al. showed that neural network representations can be strong predictors of human odd-one-out judgments: the image humans select as “odd” among three is often the one whose activation pattern differs most from the other two.
November 20, 2025 at 2:05 PM
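For concreteness, here is that decision rule as a small function (dot-product similarity is an assumption; any similarity measure slots in the same way):

```python
import numpy as np

def predict_odd_one_out(emb_a, emb_b, emb_c):
    """Return 0, 1, or 2: the index of the predicted odd one out.
    The two most similar embeddings form a pair; the leftover item is 'odd'."""
    embs = [emb_a, emb_b, emb_c]
    pairs = [(0, 1), (0, 2), (1, 2)]
    sims = [float(np.dot(embs[i], embs[j])) for i, j in pairs]
    i, j = pairs[int(np.argmax(sims))]
    return ({0, 1, 2} - {i, j}).pop()

# Toy check: c is a noisy copy of a, so b should come out as the odd one.
rng = np.random.default_rng(0)
a, b = rng.normal(size=64), rng.normal(size=64)
c = a + 0.1 * rng.normal(size=64)
print(predict_odd_one_out(a, b, c))  # -> 1
```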