Charlie Gao
banner
shikokuchuo.net
Charlie Gao
@shikokuchuo.net
Software Engineer, Open Source @posit.co
Reposted by Charlie Gao
To be effective, data science agents need to be able to read plots reliably. @sara-altman.bsky.social and I wrote about some concerning findings on LLMs' ability to interpret plots when the content contradicts their expectations on the @posit.co blog.

posit.co/blog/introdu...
When plotting, LLMs see what they expect to see - Posit
Data science agents need to accurately read plots even when the content contradicts their expectations. Our testing shows today's LLMs still struggle here.
posit.co
November 13, 2025 at 3:07 PM
I put out a patch release of mirai today. Version 2.5.2 really improves the OpenTelemetry integration so you can more easily see into your async workflows. Other key ecosystem packages will roll out with this enabled - next up: Shiny!

mirai.r-lib.org

#Rstats
Minimalist Async Evaluation Framework for R
Designed for simplicity, a mirai evaluates an R expression asynchronously, locally or distributed over the network. Built on nanonext and NNG for modern networking and concurrency, scales efficiently ...
mirai.r-lib.org
November 5, 2025 at 10:37 PM
Reposted by Charlie Gao
purrr 1.2.0 out now — mostly removing long deprecated functions but a few small performance nad parallel processing improvements. Learn more at www.tidyverse.org/blog/2025/11... #rstats
purrr 1.2.0
This release tightens up the package by removing long-deprecated functions, making `map_chr()` and predicate functions more type-safe, and requiring a newer version of carrier to make `in_parallel()`...
www.tidyverse.org
November 4, 2025 at 2:10 PM
Reposted by Charlie Gao
mcptools 0.2.0 is on CRAN! mcptools implements the Model Context Protocol in #rstats, allowing ellmer users to share context and tools more easily between LLM applications.

Read more: github.com/posit-dev/mc...
October 30, 2025 at 12:13 PM
Reposted by Charlie Gao
reticulate v1.44 is on CRAN!

Positron integration is now enabled by default. Enjoy R and Python REPL consoles powered by concurrent threads in the same process, zero‑copy R↔Python data transfer, and full Variables Pane support.

Full changelog: github.com/rstudio/reti...

#rstats #python #pydata
github.com
October 25, 2025 at 3:37 PM
Reposted by Charlie Gao
I'm excited to share a new version of the Air formatter for #rstats, with support for tabular formatting! Super useful with `tibble::tribble()` calls or `data.table::fcase()`.

It's currently experimental as we're looking to get feedback on the feature, so please let us know what you think.
October 21, 2025 at 10:34 AM
Reposted by Charlie Gao
Gave a talk about orbital last week at Socal RUG, it was delightful as always to spend time with these people.

Slides: emilhvitfeldt.github.io/talk-orbital...
Recording: www.youtube.com/live/pm-Ve3w...
October 20, 2025 at 6:31 PM
Reposted by Charlie Gao
I'll be keynoting at R/Pharma in a few weeks! The conference is free and virtual; all are welcome to join. My talk "Practical AI for data science" is on November 5th and should be broadly accessible for #rstats users, regardless of industry. :)

Learn more & register: rinpharma.com
R/Pharma
The conference is a scientifically & industry oriented, collegial event focused on the use of R in the development of pharmaceuticals.
rinpharma.com
October 20, 2025 at 4:50 PM
Reposted by Charlie Gao
I recently added a Deepwiki badge to mirai.r-lib.org.

Not because I think the current documentation is lacking 😅, but it takes you to an `Ask DeepWiki` prompt with the full mirai source code as context so you can ask questions (and get answers) in *any* language.

Any feedback welcome! #rstats
Minimalist Async Evaluation Framework for R
Designed for simplicity, a mirai evaluates an R expression asynchronously in a parallel process, locally or distributed over the network. Modern networking and concurrency, built on nanonext and NNG, ...
mirai.r-lib.org
October 13, 2025 at 2:46 PM
Reposted by Charlie Gao
I wrote a paper about "Examining the Interface Design of Tidyverse" 📝

TL;DR to follow in future 😊

arxiv.org/abs/2510.10382
Examining the Interface Design of Tidyverse
The tidyverse is a popular meta-package comprising several core R packages to aid in various data science tasks, including data import, manipulation and visualisation. Although functionalities offered...
arxiv.org
October 14, 2025 at 3:31 AM
I recently added a Deepwiki badge to mirai.r-lib.org.

Not because I think the current documentation is lacking 😅, but it takes you to an `Ask DeepWiki` prompt with the full mirai source code as context so you can ask questions (and get answers) in *any* language.

Any feedback welcome! #rstats
Minimalist Async Evaluation Framework for R
Designed for simplicity, a mirai evaluates an R expression asynchronously in a parallel process, locally or distributed over the network. Modern networking and concurrency, built on nanonext and NNG, ...
mirai.r-lib.org
October 13, 2025 at 2:46 PM
Reposted by Charlie Gao
Thanks to @paleolimbot.bsky.social's great work, the sedonadb R package is now also available to Windows via R-universe 🎉 I'm excited particularly because I believe sedonadb is the easiest tool for R users to access GeoParquet data. #rstats

apache.r-universe.dev/sedonadb
sedonadb: Bindings for Apache SedonaDB
apache.r-universe.dev
October 11, 2025 at 5:02 PM
Reposted by Charlie Gao
Seriously cool stuff from @shikokuchuo.net on mirai+ellmer+shiny+httr2 with otel for telemetry at Shiny in Production!

You can use it to improve the performance of your Shiny apps by getting really granular information about it what's going on!

#rstats
October 9, 2025 at 9:44 AM
mirai 2.5.1 released.

mirai.r-lib.org

This is a patch release, but brings a new synchronous mode: `daemons(sync = TRUE)` runs your code in the same process. This means you can now drop into a browser session *within* your mirai #rstats code!

p.s. amazing hex stickers made by someone @posit.co!!
October 6, 2025 at 10:48 AM
Reposted by Charlie Gao
Take your plots from drab to fab! #ggplot2
🎨 Theming got a huge overhaul with the latest #ggplot2 release. In honour of that @teunbrand.bsky.social has written a comprehensive deep-dive into styling your plots, covering both old and new functionality. Grab a coffee and dive in!

#rstats
ggplot2 styling
This post discusses one function in ggplot2: `theme()`. Find out about the glamour of graphics in this deep-dive article.
www.tidyverse.org
October 1, 2025 at 8:13 AM
Posit::conf(2025)

"Purrrfectly parallel, purrrfectly distributed"

The slides from the talk are available at: shikokuchuo-posit2025.share.connect.posit.cloud

See why we now have parallel purrr, powered by mirai, and how to use it to distribute compute to other machines.

#positconf2025 #tidyverse
Purrrfectly parallel Purrrfectly distributed
shikokuchuo-posit2025.share.connect.posit.cloud
September 19, 2025 at 3:11 AM
Reposted by Charlie Gao
Can't wait to see everyone at posit::conf(2025)!!

Here's my annual "things to do" list for food, coffee, bakeries, and bars near conf!

#positconf2025 #rstats

docs.google.com/document/d/1...
atlanta-posit-conf-2025
posit::conf 2025 Hotel Coffee ⭐ I think this looks the most promising, there are other locations that look decent even if this one seems new and has few reviews and is inside another building 7 minu...
docs.google.com
September 13, 2025 at 4:01 PM
I'm hoping you'll join our session on Strengthening the R Ecosystem at posit::conf(2025) on the Thursday morning!

Find out about the hottest new thing in the tidyverse: "purrrfectly parallel, purrrfectly distributed" (yes, it's something about cats).

#rstats #tidyverse #PositConf2025

pos.it/conf
September 11, 2025 at 10:53 AM
Reposted by Charlie Gao
{otel} + #rstats is becoming a crazy partnership.
This project has driven me to use/write {promises} in ways that makes my 🧠 ache 😅. The reward is worth it!

See you at #PositConf2025 where I'll talk about this in more detail
More #rstats + OpenTelemetry news: @schloerke.com finally got all of the async issues in {ellmer} and {shiny} sorted out and tracing now works!

Here's what combining {shiny} + {ellmer} + {httr2} + {mirai} + {otel} looks like today.

(All credit for the screenshots to @schloerke.com.)
September 10, 2025 at 7:13 PM
Reposted by Charlie Gao
tidyverse blog: mirai 2.5.0

www.tidyverse.org/blog/2025/09...

This milestone release delivers enhanced observability through OpenTelemetry, reproducible parallel RNG, and key user interface improvements.

We’ve also packed in twice as many changes as usual!
#Rstats #tidyverse
mirai 2.5.0
mirai - minimalist async evaluation framework for R - brings production-grade parallel and distributed computing to the ecosystem.
www.tidyverse.org
September 5, 2025 at 3:21 PM
tidyverse blog: mirai 2.5.0

www.tidyverse.org/blog/2025/09...

This milestone release delivers enhanced observability through OpenTelemetry, reproducible parallel RNG, and key user interface improvements.

We’ve also packed in twice as many changes as usual!
#Rstats #tidyverse
mirai 2.5.0
mirai - minimalist async evaluation framework for R - brings production-grade parallel and distributed computing to the ecosystem.
www.tidyverse.org
September 5, 2025 at 3:21 PM
tidyverse blog: Breaking down language barriers in data science

nanonext is a messaging and concurrency toolkit, connecting R to polyglot data science pipelines involving modules written in Python, Rust, Go and C++ etc.

www.tidyverse.org/blog/2025/09...

#RStats #tidyverse
nanonext 1.7.0
nanonext is a communications and concurrency toolbox that facilitates fast, powerful and reliable data exchange in polyglot data science workflows.
www.tidyverse.org
September 2, 2025 at 6:37 PM
Reposted by Charlie Gao
If you'd like to contribute to the tidyverse or tidymodels packages -- or learn how to -- join us at the tidyverse developer day! There are some tickets left for underrepresented minorities. #RStats #RLadies

More info in the blog post: www.tidyverse.org/blog/2025/07...
Tidyverse developer day 2025
Join us in Atlanta for tidyverse developer day on September 19, 2025!
www.tidyverse.org
September 1, 2025 at 2:48 PM
Reposted by Charlie Gao
I'll be talking about Quarto at PyData in September.

Focus will be on how Quarto can be used to get your Notebooks Analysis to the world easily, and under different forms.

Don't hesitate to share anything you would like me to tackle, and if it relates I am happy to !

And maybe see you there !
📢 Talk Announcement

"From Jupyter Notebook to Publish-Ready Report: Effortless Sharing with Quarto", by @cderv.bsky.social from @posit.co.

📜 Talk info: pretalx.com/pydata-paris-2025/talk/K7CAAE
📅 Schedule: pydata.org/paris2025/schedule
🎟 Tickets: pydata.org/paris2025/tickets
August 28, 2025 at 4:58 PM
Reposted by Charlie Gao
We still have spots available for tidyverse dev day on Sept 19: www.tidyverse.org/blog/2025/07.... Please come along to contribute to the tidyverse and have a bunch of fun along the way! It's open to all, but is most convenient if you're coming to posit::conf or live near Atlanta #rstats
Tidyverse developer day 2025
Join us in Atlanta for tidyverse developer day on September 19, 2025!
www.tidyverse.org
August 20, 2025 at 9:57 PM