Max kuhn
banner
topepo.bsky.social
Max kuhn
@topepo.bsky.social
Writing modeling packages at @posit.co (née RStudio). Opinions are my own. https://max-kuhn.org/
Pinned
I last posted here about 6 months ago. Here's what I've been working on and/or thinking about. #rstats, #statistics, #ml

package upkeep: we are doing major preventive maintenance on the tidymodels packages ("upkeep week!"). It's rote but very rewarding work. Error messages are 100x better.

1/3
Applied Machine Learning for Tabular Data
aml4td.org
I just gave a 20m overview of the #TabPFN deep learning model at R/Pharma.

The slides are at: topepo.github.io/2025-r-pharma/

(link to Quarto files on the second slide)
November 5, 2025 at 7:10 PM
We've made a blog post about a new major version of the #rstats tune package! Two main changes: parallel processing frameworks and the ability to tune postprocessors.

tidyverse.org/blog/2025/11...
tune version 2.0.0
A new version of tune brings significant updates for model tuning and parallel processing.
tidyverse.org
November 5, 2025 at 7:06 PM
Reposted by Max kuhn
I'm excited to share side::kick(), an experimental open-source coding agent for RStudio built entirely in R. It can interact with your files, communicate with your active #rstats session, and run code.

Check it out: github.com/simonpcouch/...
November 5, 2025 at 3:57 PM
Reposted by Max kuhn
I'm excited to share a new version of the Air formatter for #rstats, with support for tabular formatting! Super useful with `tibble::tribble()` calls or `data.table::fcase()`.

It's currently experimental as we're looking to get feedback on the feature, so please let us know what you think.
October 21, 2025 at 10:34 AM
Before #NoKings in Hartford CT...
October 18, 2025 at 7:33 PM
Reposted by Max kuhn
Shannon's slides are always so unbelievably clear and helpful!!!

github.com/shannonpileg...

I'm having "Ohhhhh that's what that means" moments every 10 seconds here.
#positconf2025
September 18, 2025 at 3:09 PM
Reposted by Max kuhn
I'm all in on @topepo.bsky.social and co's new {important} and other variable importance/feature selection tools in tidymodels! #rstats
September 12, 2025 at 6:17 PM
Reposted by Max kuhn
Once again, @davisvaughan.bsky.social's extrachecks have saved me from a likely CRAN rejection for an upcoming #RStats package submission. Thanks Davis! github.com/DavisVaughan...
GitHub - DavisVaughan/extrachecks
Contribute to DavisVaughan/extrachecks development by creating an account on GitHub.
github.com
September 3, 2025 at 7:57 PM
Reposted by Max kuhn
Simon Wood, the GOAT of generalized additive models & creator of the mgcv #rstats package, has an Annual Review of Statistics essay on GAMs, available open access #statssky #mlsky

www.annualreviews.org/content/jour...
September 10, 2025 at 2:15 AM
Reposted by Max kuhn
ML success ≠ Kaggle leaderboard. The real world rewards:
- Clear explanations
- Thoughtful metrics
- Collaboration with domain experts

A 0.01 lift in F1 score won’t save you if no one understands your model.

#DataSciene #MachineLearning #AI #RStats
September 9, 2025 at 8:45 PM
Reposted by Max kuhn
Announcing a new blog series on LLMs from @veerle.hypebright.nl!

In Part 2, “Talking to LLMs: From Prompt to Response”, we get hands-on with LLM-powered apps. This guide is for #Python & #RStats users who want to go beyond the basics.

Check it out here: shiny.posit.co/blog/posts/s...
September 5, 2025 at 4:53 PM
Slides from my #rstats talk “Measuring LLM Effectiveness” at #dataconfAI with @simonpcouch.com.

topepo.github.io/2025_NYR/

Video in about a month.

Great conference!
August 27, 2025 at 11:10 PM
Reposted by Max kuhn
🧠📊 3 days. 2 workshops. 20 talks. 1 amazing community.
#dataconfAI is officially wrapped!

Thanks for showing up with insights, ideas, inspiration, and curiosity. And to all who made it unforgettable—speakers, attendees, sponsors, and volunteers.

See you at the next one! 🚀
August 27, 2025 at 10:00 PM
Reposted by Max kuhn
In working on an eval for an experimental tidymodels AI assistant, I realized that today's frontier LLMs know much more about #rstats tidymodels than I thought.

www.simonpcouch.com/blog/2025-08...
August 26, 2025 at 6:42 PM
It's a lot of fun! Everyone gets something out of it.

Plus, @davisvaughan.bsky.social always finds a great barista!
We still have spots available for tidyverse dev day on Sept 19: www.tidyverse.org/blog/2025/07.... Please come along to contribute to the tidyverse and have a bunch of fun along the way! It's open to all, but is most convenient if you're coming to posit::conf or live near Atlanta #rstats
Tidyverse developer day 2025
Join us in Atlanta for tidyverse developer day on September 19, 2025!
www.tidyverse.org
August 20, 2025 at 10:26 PM
Reposted by Max kuhn
Start off The NY Data Science & AI Conference w/ hands-on workshops on Aug 25 in NYC or online:

📊 Machine Learning in R w/ Max Kuhn
🤖 Intro to LLMs/AI w/ Daniel Chen

🎟️ Learn more & register at dataconf.ai/nyc

#RStats #AI #Workshops #databs @topepo.bsky.social @chendaniely.bsky.social
August 15, 2025 at 5:25 PM
Reposted by Max kuhn
Excited to share my newest quarto revealjs plugin: imagemover

Easily reposition and resize images directly in your quarto revealjs slides for a much smoother slidecrafting experience

github.com/EmilHvitfeld...
#quarto
August 13, 2025 at 7:20 PM
Reposted by Max kuhn
Time to convert this into an LLM powered snippet using {chores} by @simonpcouch.com. #useR2025 #rstats
CreateBranding shiny app is now available here: umair.shinyapps.io/create_brand...

You can now download the palettes, scales and a theme for ggplot2. See a demo here: youtu.be/C7-rhLPrA3o

Try it out and let me know your suggestions for improvements. #rstats
August 10, 2025 at 1:28 AM
Reposted by Max kuhn
Welp, {chores} by @simonpcouch.com is an immediate install for sure.

Basically it's {usethis} plus llm bundled into RStudio/Positron key encoding.

Excited!!! 🧹🧺

#useR2025 #rstats #couchverse?
August 9, 2025 at 7:44 PM
Reposted by Max kuhn
Don't miss out learning from the best, Max Kuhn! @topepo.bsky.social

#dataBS #Tidymodels #MachineLearning
📊 Want to level up your R modeling skills? Max Kuhn’s Machine Learning in R workshop is an intro to tidymodels, covering data prep, resampling, tuning & evaluation using real workflows!

📍Aug 25 in NYC or online
🎟️ & info: dataconf.ai/nyc

#RStats #Tidymodels #MachineLearning @topepo.bsky.social
August 5, 2025 at 7:00 PM
Reposted by Max kuhn
📊 Want to level up your R modeling skills? Max Kuhn’s Machine Learning in R workshop is an intro to tidymodels, covering data prep, resampling, tuning & evaluation using real workflows!

📍Aug 25 in NYC or online
🎟️ & info: dataconf.ai/nyc

#RStats #Tidymodels #MachineLearning @topepo.bsky.social
July 31, 2025 at 4:57 PM
We are super excited to have you join us for the day!
posit.co Posit @posit.co · Jul 29
Check out our Modeling & ML with #RStats workshops at posit::conf!

🔢 Intro to ML w/ tidymodels @simonpcouch.com
🏗️ Feature Engineering & Tuning @topepo.bsky.social @emilhvitfeldt.bsky.social
↔️ Causal Inference @malcolmbarrett.malco.io @lucystats.bsky.social

Learn more: pos.it/conf-2025-workshops
July 29, 2025 at 8:30 PM
Positron is definitely visually more than RStudio, and this is a helpful overview.
posit.co Posit @posit.co · Jul 28
Take a quick tour of Positron, Posit's next-generation data science IDE, built by the creators of RStudio.

Read the blog to learn more: posit.co/blog/a-quick...
July 28, 2025 at 4:57 PM
Reposted by Max kuhn
The call for papers for LatinR 2025 (online) is now open! You can present in English, Spanish, or Portuguese 🗣️ #RStats latinr.org/en/blog/en/2...
Call for papers – LatinR 2024
latinr.org
July 25, 2025 at 10:15 AM