Lennart Purucker
@lennartpurucker.bsky.social
PhD student supervised by Frank Hutter; researching automated machine learning and foundation models for (small) tabular data!

Website: https://ml.informatik.uni-freiburg.de/profile/purucker/
TabArena is a living benchmark. With the community, we will continually update it!

Authors: @nickerickson.bsky.social Lennart Purucker @atschalz.bsky.social @dholzmueller.bsky.social Prateek Desai David Salinas Frank Hutter
Leaderboard: tabarena.ai
Paper: arxiv.org/abs/2506.16791
Code: tabarena.ai/code
12/
June 23, 2025 at 10:15 AM
The TabArena team consists of experienced researchers and open-source developers. At the same time, we are also authors of some of the methods benchmarked in our work. We challenge you to find any mistakes or biases in our work to further improve TabArena!

11/
June 23, 2025 at 10:15 AM
We are continuing to improve TabArena and its usability. You can already use our implementations of all benchmarked models through scikit-learn-like interfaces:
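A minimal sketch of what such a workflow looks like; the import path and the model class name below are assumptions for illustration only, not the confirmed TabArena API (see tabarena.ai/code for the actual interface).

```python
# Hypothetical sketch: import path and class name are assumptions,
# not the confirmed TabArena API (see tabarena.ai/code).
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split

from tabarena.models import RealMLPClassifier  # hypothetical import path

# Load a small tabular classification dataset.
X, y = load_breast_cancer(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The benchmarked models follow the familiar fit/predict/score pattern.
model = RealMLPClassifier()
model.fit(X_train, y_train)
print("Test accuracy:", model.score(X_test, y_test))
```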

10/
June 23, 2025 at 10:15 AM
Many benchmarks evaluate methods using holdout validation. We show that this distorts the relative ranking of methods and leads to much worse peak performance! Non-ensemble methods like RealMLP and ModernNCA gain more from 8-fold CV.
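To make the distinction concrete, here is a plain scikit-learn sketch contrasting a single holdout model with 8-fold CV plus bagging (averaging the fold models' predicted probabilities); this mirrors the idea, not TabArena's exact evaluation protocol.

```python
# Holdout validation vs. 8-fold CV with bagging (illustrative sketch).
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import HistGradientBoostingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import KFold, train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# (a) Holdout: one model trained on a single train/validation split.
holdout_model = HistGradientBoostingClassifier(random_state=0)
holdout_model.fit(X_train, y_train)
print("holdout:", accuracy_score(y_test, holdout_model.predict(X_test)))

# (b) 8-fold CV with bagging: train one model per fold, then average the
# fold models' predicted probabilities on the test set.
probas = []
for train_idx, _ in KFold(n_splits=8, shuffle=True, random_state=0).split(X_train):
    fold_model = HistGradientBoostingClassifier(random_state=0)
    fold_model.fit(X_train[train_idx], y_train[train_idx])
    probas.append(fold_model.predict_proba(X_test))
bagged_pred = np.mean(probas, axis=0).argmax(axis=1)
print("8-fold CV bagging:", accuracy_score(y_test, bagged_pred))
```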

9/
June 23, 2025 at 10:15 AM
The value of a model lies not only in its individual performance but also in its contribution to a multi-model ensemble. We build an ensemble of strong and diverse model configurations and show that it significantly outperforms the current SOTA on tabular data, AutoGluon.

8/
June 23, 2025 at 10:15 AM
In terms of training and inference time, tree-based methods still shine compared to modern neural networks.

7/
June 23, 2025 at 10:15 AM
We evaluate three foundation models. TabDPT runs on every dataset and places mid-field overall, with good regression results. TabPFNv2 and TabICL achieve very good results on the subsets of the benchmark that fit their respective dataset constraints (left: TabPFNv2, right: TabICL).

6/
June 23, 2025 at 10:15 AM
On the full benchmark, the recent deep learning models RealMLP and TabM take the top spots with weighted ensembling, slightly outperforming boosted trees on average, although boosted trees are faster. Without ensembling, CatBoost wins.

5/
June 23, 2025 at 10:15 AM
Where possible, we coordinate with authors to obtain good hyperparameter search spaces. For tree-based baselines, we took implementations from AutoGluon and improved them by carefully optimizing their search spaces, so these might be the best baselines out there.
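For a sense of what a tuned tree-based search space looks like, here is an illustrative example using scikit-learn's random search; the ranges and the choice of model are made up for this sketch and are not the search spaces used in TabArena.

```python
# Illustrative search space for a boosted-tree baseline (not TabArena's).
from scipy.stats import loguniform, randint
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import HistGradientBoostingClassifier
from sklearn.model_selection import RandomizedSearchCV

search_space = {
    "learning_rate": loguniform(1e-2, 3e-1),
    "max_leaf_nodes": randint(16, 256),
    "min_samples_leaf": randint(2, 64),
    "l2_regularization": loguniform(1e-4, 1e1),
}

search = RandomizedSearchCV(
    HistGradientBoostingClassifier(early_stopping=True, random_state=0),
    param_distributions=search_space,
    n_iter=20,      # number of sampled configurations
    cv=8,           # inner cross-validation
    random_state=0,
)

X, y = load_breast_cancer(return_X_y=True)
search.fit(X, y)
print(search.best_params_)
```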

4/
June 23, 2025 at 10:15 AM
We curated datasets by 𝗺𝗮𝗻𝘂𝗮𝗹𝗹𝘆 checking 1053 datasets from prior benchmarks. Only 51 were realistic tabular IID predictive tasks with 500-250K samples, which we share via OpenML. Together with the community, we aim to extend TabArena's datasets in the future!

3/
June 23, 2025 at 10:15 AM
TabArena implements best practices for SOTA performance: 8-fold inner cross-validation with bagging, outer cross-validation for evaluation, early stopping where possible, extensive tuning, and weighted ensembles of hyperparameter configurations to obtain peak performance.
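As a rough sketch of the last step, weighted ensembles over hyperparameter configurations can be built with greedy ensemble selection on validation predictions, in the spirit of Caruana et al. (2004); this is an illustrative implementation, not the exact TabArena code.

```python
# Greedy weighted ensemble selection over validation predictions (sketch).
import numpy as np

def greedy_weighted_ensemble(val_probas, y_val, n_rounds=25):
    """Greedily select configurations (with replacement) to maximize
    validation accuracy of their averaged predicted probabilities.

    val_probas: list of (n_samples, n_classes) arrays, one per configuration.
    Returns normalized weights over the configurations.
    """
    counts = np.zeros(len(val_probas))
    running_sum = np.zeros_like(val_probas[0], dtype=float)
    for r in range(1, n_rounds + 1):
        best_idx, best_score = 0, -np.inf
        for i, p in enumerate(val_probas):
            candidate = (running_sum + p) / r          # average of r members
            score = (candidate.argmax(axis=1) == y_val).mean()
            if score > best_score:
                best_idx, best_score = i, score
        running_sum += val_probas[best_idx]
        counts[best_idx] += 1
    return counts / counts.sum()
```

The returned weights are then applied to the corresponding configurations' test-time predictions.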

2/
June 23, 2025 at 10:15 AM
🚨What is SOTA on tabular data, really? We are excited to announce 𝗧𝗮𝗯𝗔𝗿𝗲𝗻𝗮, a living benchmark for machine learning on IID tabular data with:

📊 an online leaderboard (submit!)
📑 carefully curated datasets
📈 strong tree-based, deep learning, and foundation models

🧵
June 23, 2025 at 10:15 AM
The tabular foundation model TabPFN v2 is finally public 🎉🥳
This is excellent news for (small) tabular ML! Check out our Nature article (nature.com/articles/s41...) and code (github.com/PriorLabs/Ta...)
January 9, 2025 at 8:33 AM