Itamar Avitan
@avitanit.bsky.social
PhD candidate at Ben-Gurion University.
Kudos to our NeurIPS 2025 reviewers for thoughtful, human-generated reviews. I’ll be presenting poster #2010 in San Diego on Fri, 5 Dec from 4:30–7:30 p.m. PT. Come say hi!
arXiv: arxiv.org/abs/2510.23321
Code and data: github.com/brainsandmachines/oddoneout_model_recovery
Model-Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn't the Right One
November 20, 2025 at 2:05 PM
Our work reveals a sharp trade-off between predictive accuracy and model identifiability. Flexible mappings maximize predictivity, but blur the distinction between competing computational hypotheses.
November 20, 2025 at 2:05 PM
Further analyses showed that linear probing was the culprit. The linear fit warps each model's original feature space, erasing its unique signature and making all aligned models converge toward a human-like representation.
November 20, 2025 at 2:05 PM
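One way to picture that convergence (a sketch added for illustration, not the paper's actual analysis): compare two models' representational similarity matrices before and after each model's own fitted linear map. The names feats1, feats2, W1, W2 below are hypothetical placeholders.

```python
import numpy as np

def rsm(feats):
    """Representational similarity matrix: cosine similarity between all image embeddings."""
    z = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    return z @ z.T

def rsm_agreement(f1, f2):
    """Correlation between two models' RSMs over the off-diagonal entries."""
    off_diag = ~np.eye(f1.shape[0], dtype=bool)
    return np.corrcoef(rsm(f1)[off_diag], rsm(f2)[off_diag])[0, 1]

# Hypothetical usage: feats1/feats2 are two models' raw features, W1/W2 their
# separately fitted linear maps. The convergence claim predicts the second
# number sits much closer to 1 than the first:
# rsm_agreement(feats1, feats2) vs. rsm_agreement(feats1 @ W1, feats2 @ W2)
```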
The key dependent measure is how often the data-generating model actually achieves the highest prediction accuracy. The surprising result: even with massive datasets (millions of trials), the best-performing model is often not the right one.
November 20, 2025 at 2:05 PM
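In code, that measure is just a "recovery rate": how often the winner of the comparison is also the generator. A toy illustration of the bookkeeping (the 20×20 accuracy matrix here is random stand-in data, not the paper's results):

```python
import numpy as np

rng = np.random.default_rng(0)
# acc[g, m]: prediction accuracy of candidate model m on data generated by model g.
# Random stand-in values, purely to show the computation.
acc = rng.random((20, 20))
best_fit = acc.argmax(axis=1)                       # which candidate wins for each generator
recovery_rate = (best_fit == np.arange(20)).mean()  # how often the winner IS the generator
print(f"recovery rate: {recovery_rate:.0%}")
```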
Each simulation worked like this: (1) pick one model from 20 candidate NNs and fit it to human responses; (2) sample a synthetic dataset from that model using NEW triplets; (3) test all 20 models on this generated data, measuring cross-validated prediction accuracy.
November 20, 2025 at 2:05 PM
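Schematically, the loop might look like the sketch below. This is a simplified reading of the three steps (a Luce-choice rule over pair similarities, candidates scored on fixed, pre-aligned features rather than refit with cross-validation); the real implementation is in github.com/brainsandmachines/oddoneout_model_recovery.

```python
import numpy as np

def choice_probs(feats, triplets):
    """Probability of each position being the odd one out, via a softmax over
    pair similarities (a Luce-choice simplification assumed for this sketch)."""
    a, b, c = (feats[triplets[:, k]] for k in range(3))
    # s[:, k] = similarity of the pair that EXCLUDES position k;
    # the more similar that pair, the more likely position k is 'odd'.
    s = np.stack([(b * c).sum(-1), (a * c).sum(-1), (a * b).sum(-1)], axis=1)
    e = np.exp(s - s.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def model_recovery(aligned_feats, new_triplets, n_sims=100, seed=0):
    """aligned_feats: dict {model_name: (n_images, d) array} of features already
    fitted to human responses (step 1). Returns each model's recovery rate."""
    rng = np.random.default_rng(seed)
    recovery = {}
    for gen_name, gen_feats in aligned_feats.items():
        p = choice_probs(gen_feats, new_triplets)   # step 2: generate from this model on NEW triplets
        wins = 0
        for _ in range(n_sims):
            sim_choices = np.array([rng.choice(3, p=pi) for pi in p])
            # step 3: score every candidate on the synthetic data
            accs = {name: (choice_probs(f, new_triplets).argmax(1) == sim_choices).mean()
                    for name, f in aligned_feats.items()}
            wins += max(accs, key=accs.get) == gen_name
        recovery[gen_name] = wins / n_sims
    return recovery
```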
We ran model recovery simulations using models fitted to the massive THINGS odd-one-out data shared by @martinhebart.bsky.social, @cibaker.bsky.social, et al. Each simulation tested whether a neural network model would “win” the model comparison if it had generated the behavioral data.
November 20, 2025 at 2:05 PM
In our new NeurIPS 2025 paper, we ask: does better predictive accuracy necessarily mean better mechanistic correspondence between neural networks and human representations? neurips.cc/virtual/2025...
NeurIPS Poster: Model–Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn’t the Right One (NeurIPS 2025)
November 20, 2025 at 2:05 PM
They also showed that nudging the NN representations toward human judgments, by linearly transforming the representation space itself, boosts cross-validated prediction accuracy almost to the reliability bound. arxiv.org/abs/2211.01201
Human alignment of neural network representations
November 20, 2025 at 2:05 PM
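A minimal sketch of what such a linear alignment could look like (assumptions on my part: frozen features, dot-product similarity, and a softmax/cross-entropy objective over the three pairs; the published method differs in its details):

```python
import torch

def fit_linear_alignment(feats, triplets, odd_idx, epochs=200, lr=1e-2):
    """Learn a linear map W on frozen NN features so that dot-product
    similarities in the transformed space predict human odd-one-out choices.

    feats:    (n_images, d) float tensor of NN embeddings
    triplets: (n_trials, 3) long tensor of image indices per trial
    odd_idx:  (n_trials,) long tensor in {0, 1, 2}: which position was chosen as odd
    """
    d = feats.shape[1]
    W = torch.eye(d, requires_grad=True)            # start from the identity map
    opt = torch.optim.Adam([W], lr=lr)
    for _ in range(epochs):
        z = feats @ W                               # transformed representations
        a, b, c = (z[triplets[:, k]] for k in range(3))
        # Similarities of the three pairs, ordered (a,b), (a,c), (b,c).
        sims = torch.stack([(a * b).sum(-1), (a * c).sum(-1), (b * c).sum(-1)], dim=1)
        # The pair the participant implicitly kept together: odd position 0 -> (b,c),
        # 1 -> (a,c), 2 -> (a,b), i.e. class index 2 - odd_idx.
        loss = torch.nn.functional.cross_entropy(sims, 2 - odd_idx)
        opt.zero_grad(); loss.backward(); opt.step()
    return W.detach()
```

Cross-validated accuracy would then come from fitting W on training triplets and scoring the argmax prediction on held-out ones.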
@lukasmut.bsky.social, @lorenzlinhardt.bsky.social, et al. showed that neural network representations can be strong predictors of human odd-one-out judgments: the image humans select as “odd” among three is often the one whose activation pattern differs most from the other two.
November 20, 2025 at 2:05 PM
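For concreteness, here is that decision rule as a small function (dot-product similarity is an assumption; any similarity measure slots in the same way):

```python
import numpy as np

def predict_odd_one_out(emb_a, emb_b, emb_c):
    """Return 0, 1, or 2: the index of the predicted odd one out.
    The two most similar embeddings form a pair; the leftover item is 'odd'."""
    embs = [emb_a, emb_b, emb_c]
    pairs = [(0, 1), (0, 2), (1, 2)]
    sims = [float(np.dot(embs[i], embs[j])) for i, j in pairs]
    i, j = pairs[int(np.argmax(sims))]
    return ({0, 1, 2} - {i, j}).pop()

# Toy check: c is a noisy copy of a, so b should come out as the odd one.
rng = np.random.default_rng(0)
a, b = rng.normal(size=64), rng.normal(size=64)
c = a + 0.1 * rng.normal(size=64)
print(predict_odd_one_out(a, b, c))  # -> 1
```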