Paulius Alaburda
alaburda.bsky.social
Paulius Alaburda
@alaburda.bsky.social
Love all things R, data, medicine and energy! Head of Data Analytics @ Ignitis Lithuania 🇱🇹
Reposted by Paulius Alaburda
I gave a talk on #QA in #GitHub: ditch the spreadsheets, embed QA in your workflow, make it a habit.

More details ⬇️

🛝 slides: statsrhian.github.io/qa-in-github
📝 blog post: rhian.rbind.io/posts/2025-1...

#rstats #rap #quality-assurance
Ditch the Spreadsheets
statsrhian.github.io
November 19, 2025 at 4:55 PM
Reposted by Paulius Alaburda
Thoughtful (as always) blog post from Nicholas Carlini. "Are large language models worth it?" A nice read giving his perspective on risks of ML models.

Post: nicholas.carlini.com/writing/2025...

For people who prefer, this is the video of the talk from @colmweb.org www.youtube.com/watch?v=PngH...
November 19, 2025 at 4:56 PM
Reposted by Paulius Alaburda
Methodology
November 18, 2025 at 9:47 AM
Reposted by Paulius Alaburda
Why does feature development slow & what can we do about it? (This problem because acute with vibe coding.) tidyfirst.substack.com/p/why-does-d...

tl;dr use the time between features to create options. What? You have no time between features? We'll talk about that later...
Why Does Development Slow?
It's the options
tidyfirst.substack.com
November 19, 2025 at 4:01 PM
Reposted by Paulius Alaburda
😂
Dicing an Onion, the Mathematically Optimal Way
There is more than one way to dice an onion…
pudding.cool
November 19, 2025 at 4:11 PM
Reposted by Paulius Alaburda
it never ceases to amaze me that I can refer to duckdb functions in R expressions as though they were R functions and everything gets translated to SQL
November 19, 2025 at 7:47 PM
Reposted by Paulius Alaburda
I recently discovered Conventional Comments (conventionalcomments.org) for providing a pseudo-standard set of labels for feedback and just tried it for an article review and it was really helpful to specify issues vs. thoughts vs. suggestions, etc. Hopefully it's helpful for the authors too!
November 17, 2025 at 3:52 PM
Reposted by Paulius Alaburda
For Day 16 of #30DayMapChallenge: Cell, use mapgl's `turf_voronoi()` to create Voronoi "cells" from input points.

Even better - make your Voronoi polygons interactive and dynamic in Shiny!

#rstats #GIS
November 16, 2025 at 11:08 PM
Reposted by Paulius Alaburda
For people trying to teach themselves more about statistics, go read about these different approaches and try to make sense of why they don't exactly agree. What are they doing differently? Use wikipedia. Look up new terms along the way.
My #rstats cheat code for today is the binom.confint function in the binom package that will spit out *12* different ways of calculating a CI for a proportion.

Also, this is why you use R for statistics...

(and of course the correct CI method is bayes 😎)
November 16, 2025 at 10:41 AM
Reposted by Paulius Alaburda
Our paper on improving statistical reporting in psychology is now online 🎉

As a part of this paper, we also created the Transparent Statistical Reporting in Psychology checklist, which researchers can use to improve their statistical reporting practices

www.nature.com/articles/s44...
November 14, 2025 at 8:43 PM
Reposted by Paulius Alaburda
I wrote something about publication bias at statmodeling.stat.columbia.edu/2025/11/14/t...
November 14, 2025 at 9:17 PM
Reposted by Paulius Alaburda
testthat 3.3.0 out now! This is a massive release with tons of improvements including better failure messages, new expectations, improved snapshotting, new vignettes, and much much more: tidyverse.org/blog/2025/11... Post includes some thoughts on developing an #rstats package with Claude Code.
testthat 3.3.0
testthat 3.3.0 brings improved expectations with better error messages, new expectations for common testing patterns, and lifecycle changes including the removal of `local_mock()` and `with_mock()`. I...
tidyverse.org
November 13, 2025 at 5:24 PM
Reposted by Paulius Alaburda
Adding citations of people who might review the paper
November 14, 2025 at 10:06 AM
Reposted by Paulius Alaburda
Pretty cool is an understatement.

There is 1.5 million hours of video game play recorded, via telemetry data! This is a very cool study🎮
We released a pretty cool dataset/preprint today looking at video game play, cognition, time-use and a ton of self-reported psych measures at osf.io/preprints/ps... with @nballou.bsky.social @matti.vuorre.com @thomashakman.bsky.social @rpsychologist.com and @shuhbillskee.bsky.social RRs coming soon
November 14, 2025 at 5:02 PM
Reposted by Paulius Alaburda
New Data Scientists when they find out their job is mostly dashboarding and data engineering
November 14, 2025 at 8:25 PM
Reposted by Paulius Alaburda
I wrote a lil post on the amazing work that
@ginareynolds.bsky.social does championing ggplot2 extension developers and teaching others to build their own!

The post features the Scrollytelling Quarto extension and the group's cute #RStats hex 🐱:

rworks.dev/posts/ggplot...
An Introduction to Writing Your Own ggplot2 Geoms – R Works
The ggextenders club provides inspiration and resources for those venturing into the exciting world of creating custom ggplot2 extensions.
rworks.dev
November 3, 2025 at 3:22 PM
Reposted by Paulius Alaburda
Project structure for scientific coding projects
- the latest in my Better Code, Better Science series open.substack.com/pub/russpold...
Project structure for scientific coding projects
Better Code, Better Science: Chapter 6, Part 3
open.substack.com
November 11, 2025 at 2:51 PM
Reposted by Paulius Alaburda
Any #rstats people on here have experience with hierarchal generalized additive models (HGAM) in ecology?

I’m in need of some help in possibly using one with some data!

🦇🌎🧪🧫
November 11, 2025 at 6:20 PM
Reposted by Paulius Alaburda
🤏🤏🤏
November 10, 2025 at 9:14 PM
Reposted by Paulius Alaburda
It's here! I've just released my latest book, "Twin Wolves: Balancing risk and reward to make the most of AI."

This is a tight, executive-level read on how to approach AI (both ML/AI and genAI) in your company.

twinwolvesai.com

#dataBS
Twin Wolves AI
Balancing risk and reward to make the most of AI
TwinWolvesAI.com
November 7, 2025 at 2:45 PM
Reposted by Paulius Alaburda
We are looking for #rstats community feedback on 3 new dplyr functions!

We're aiming to expand the `filter()` family:

- `filter()` to keep rows
- `filter_out()` to drop rows
- `when_any()` and `when_all()` as modifiers

Read more and leave feedback here:
github.com/tidyverse/ti...
November 7, 2025 at 4:03 PM
Reposted by Paulius Alaburda
My keynote about data science tools at posit::conf is now online! I originally meant it to be a talk about Positron, but as I was writing it, it took a left turn through the history of RStudio and into the philosophy of tool design & how to build stuff for people.

www.youtube.com/watch?v=tGre...
10 Years of Data Science Tools...and What Happens Next (Jonathan McPherson) | posit::conf(2025)
YouTube video by Posit PBC
www.youtube.com
November 7, 2025 at 6:11 PM
Reposted by Paulius Alaburda
What are your favorite books/articles/resources about graphic design for academic posters and presentations?
November 4, 2025 at 1:44 AM
Reposted by Paulius Alaburda
On the blog: Think for Yourself

"By skimming past the friction necessary for learning, the pursuit of convenience can end up deskilling rather than enhancing skills."

kevlinhenney.medium.com/think-for-yo...
Think for Yourself
Understand and improve on LLM-generated code
kevlinhenney.medium.com
November 4, 2025 at 4:39 PM
Reposted by Paulius Alaburda
For when your old windows machine catches fire?
November 4, 2025 at 10:22 PM