Sylvain Combettes
banner
sylvaincom.bsky.social
Sylvain Combettes
@sylvaincom.bsky.social
Co-founding CEO at Formel AI • PhD from ENS Paris-Saclay

🔗 https://sylvaincom.github.io
Reposted by Sylvain Combettes
Today at #EuroScipy2025, @glemaitre58.bsky.social and I presented a tutorial on pitfalls of machine learning for imbalanced classification problems.

We discussed what (not) to do when fitting a classifier and obtaining degenerate precision or recall values.

probabl-ai.github.io/calibration-...
Imbalanced classification: pitfalls and solutions — Probabilistic calibration of cost-sensitive learning
probabl-ai.github.io
August 19, 2025 at 11:58 AM
Reposted by Sylvain Combettes
✨️💥skrub: machine learning with dataframes

New release 💫 0.6
A huge one, with the super powerful new "DataOps", and many improvements all over the library.
Exciting!!
⚡ Release 0.6.0 is now out! ⚡

🚀 Major update! Skrub DataOps, various improvements for the TableReport, new tools for applying transformers to the columns, and a new robust transformer for numerical features are only some of the features included in this release.
July 24, 2025 at 4:16 PM
Reposted by Sylvain Combettes
Come work with us on tslearn in beautiful Rennes!

(deadline for application is soon!)

jobs.inria.fr/public/class...
Python software engineer for tslearn
Offre d'emploi Inria
jobs.inria.fr
February 20, 2025 at 9:59 AM
Reposted by Sylvain Combettes
Just put on line a talk I gave summarizing what I have learned across the years as a maintainer of open source.

It's _opinions_ (been there, done that), but I'm willing to defend them, having stewarded my share of successful open source projects.
speakerdeck.com/gaelvaroquau...
Open source software: how to live long and go far
An opinionated guide to building open-source software tools with a focus on Python and science A talk that I gave when I was stepping down as a lead…
speakerdeck.com
February 6, 2025 at 8:31 PM
Our first flagship feature is the `EstimatorReport`. You feed it your scikit-learn compatible estimator and your dataset, and it displays a helper with metrics and plots to help you investigate your estimator. Computed for you in one-line of code. Blazing fast thanks to caching. Check out our docs!
🚀 Meet skore—the @scikit-learn.org sidekick!

🔹 Offers guidance on modeling
🔹 Automated reports & key metrics
💡 Built by scikit-learn maintainers. Open-source & ready to use!

Try it 👉 tinyurl.com/bdhszwtn
GitHub - probabl-ai/skore: the scikit-learn sidekick
the scikit-learn sidekick. Contribute to probabl-ai/skore development by creating an account on GitHub.
tinyurl.com
January 23, 2025 at 3:49 PM
Reposted by Sylvain Combettes
❄️ The Christmas release is here! ❄️

Introducing scikit-learn 1.6 with:

🟢 2 major features & 34 improvements
🔵 5 efficiency boosts & 21 enhancements
🟡 14 API changes
🔴 30 fixes
👥 160 amazing contributors

youtu.be/7wiHChpwJe8
scikit-learn Version 1.6.0 Release Highlights
YouTube video by scikit-learn
youtu.be
December 20, 2024 at 9:44 AM
Reposted by Sylvain Combettes
Merci @lemonde.fr pour un joli résumé de mes aventures scientifiques et logiciels 📈📠
www.lemonde.fr/sciences/art...

Beaucoup de messages qui me tiennent à cœur : travail d'équipe, logiciel libre, rigueur scientifique

Merci aux collègues et amis qui ont témoigné, je suis ému de lire
Gaël Varoquaux, vedette de l’intelligence artificielle et défenseur du logiciel libre
L’informaticien et chercheur à l’Inria est l’expert français le plus cité dans les publications scientifiques portant sur l’IA. Avec Scikit-learn, un programme de machine learning dont il est le cocré...
www.lemonde.fr
December 15, 2024 at 5:36 AM
Reposted by Sylvain Combettes
This year, there are 16 positions at CNRS in computer science (8 in "applied" domains → ask me - 8 on "fundamental" domains → ask the other David).

@mathurinmassias.bsky.social has a good list of advice mathurinm.github.io/cnrs_inria_a...

Official 🔗 www.ins2i.cnrs.fr/en/cnrsinfo/...

Don't wait!
November 23, 2024 at 7:33 PM
Reposted by Sylvain Combettes
Sometimes you think you are right by doing everything "by the book." But sometimes the book is just a tiny part of the full story. Keep digging and writing a new chapter with more insights is actually fun...
New podcast episode! This one is about imbalanced-learn and how the maintainer looks back with some lessons learned.

If you are dealing with imbalanced classification use-cases, like fraud, you'll want to listen in on this one!

youtu.be/npSkuNcm-Og
Imbalanced-learn: regrets and onwards - with Guillaume Lemaitre, core-maintainer
YouTube video by probabl
youtu.be
December 5, 2024 at 10:15 AM
Reposted by Sylvain Combettes
🎉⚡️Release 0.4:
◼ Easily use deep learning for text entries
◼ TableVectorizer can remove columns with too many missing values
◼ TableReport more robust and prettier
...

1/5
November 27, 2024 at 8:46 PM
Reposted by Sylvain Combettes
I recently shared some of my reflections on how to use probabilistic classifiers for optimal decision-making under uncertainty at @pydataparis.bsky.social 2024.

Here is the recording of the presentation:

www.youtube.com/watch?v=-gYn...
November 27, 2024 at 2:17 PM