Collin Berke
@collinberke.bsky.social
Media Research Analyst | #rstats | data enthusiast | news, sports, and podcast aficionado
Website: https://www.collinberke.com/
GitHub: https://github.com/collinberke
LinkedIn: https://www.linkedin.com/in/collinberke/
Pinned
Collin Berke
@collinberke.bsky.social
· Jul 28
That feeling of nostalgia when you look at some old #RStats code using magrittr's `%>%`.
Taking a moment to pause and reflect on some good times.
November 10, 2025 at 7:51 PM
This … This is why I’m here.
No notes.
Do you teach #rstats? Do your students complain about how lame and old-fashioned dplyr is? Don't worry: I have the solution for you: github.com/hadley/genzp....
genzplyr is dplyr, but bussin fr fr no cap.
GitHub - hadley/genzplyr: dplyr but make it bussin fr fr no cap
dplyr but make it bussin fr fr no cap. Contribute to hadley/genzplyr development by creating an account on GitHub.
github.com
November 7, 2025 at 2:59 AM
Reposted by Collin Berke
Thank you so much, Jen! If all goes well, that submission will be the final #TidyTuesday for 2025! It is *so* helpful when people submit datasets! I've been able to implement features to catch errors in datasets, and I have more on the way. Maybe I'll even finally finish the easier to use website!
Woot, just submitted another dataset to #TidyTuesday and now there are 5 PRs waiting for Jon to review.
I screenrecorded the process, so if you want to see how EASY it is to contribute a dataset to this AMAZING #rstats community, check out this video.
youtu.be/Kp7pyYwLcwc
October 29, 2025 at 1:03 PM
Today I read: How I build software quickly from @evanhahn.com: evanhahn.com/how-i-build-...
Some great advice within. Three sections really landed for me:
1. embrace rough drafts
2. make small changes.
3. avoid wandering the code.
Good read. I highly suggest others check it out.
How I build software quickly
Know how good your code needs to be for the task at hand. Start with a rough draft. Try to soften requirements if you can. Don't get distracted. Make small changes. Practice specific skills.
evanhahn.com
October 24, 2025 at 4:47 PM
Reposted by Collin Berke
I only need *5* more dataset curations (and to review those 5 plus the open PR) to be done curating #TidyTuesday for 2025! I can definitely do at least one myself, so I just need 4 of you to curate a dataset! Check out github.com/rfordatascie... for #RStats instructions.
github.com
October 24, 2025 at 11:28 AM
Reposted by Collin Berke
We need to have a conversation about random seeds. Don't use 42.
blog.genesmindsmachines.com/p/if-your-ra...
If your random seed is 42 I will come to your office and set your computer on fire🔥
Figuratively. More likely you'll get a stern talking to.
blog.genesmindsmachines.com
October 22, 2025 at 12:49 PM
TIL: Set expiry dates on metrics.
Recently came across this idea in two blog posts from @adrianhoward.com:
🔗 adrianhoward.com/posts/give-y...
🔗 adrianhoward.com/posts/three-...
I also really like the three-question checklist for reviewing data products (e.g., metrics, dashboards) 👇
#dataBS 1/2
Give Your Metrics an Expiry Date
TL;DR: Set expiry dates for your metrics!
adrianhoward.com
October 20, 2025 at 4:55 PM
A follow up TIL to this: I always had an intuitive sense that #RStats `...` was about passing arguments to other functions.
A net new concept for me was the *capturing* of the arguments using `list(...)`. @josiahparry.com's explainer was very helpful (04M45S): www.youtube.com/watch?v=oIMF...
October 17, 2025 at 5:06 PM
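A minimal base R sketch of the two uses of `...` mentioned in the post above: forwarding arguments to another function, and capturing them with `list(...)`. The helper names `label()` and `capture_dots()` are hypothetical and chosen only for illustration; they do not come from the linked video.

```r
# Forwarding: `...` is passed straight through to base::paste().
# `label()` is a hypothetical helper used only for this sketch.
label <- function(x, ...) {
  paste("value:", x, ...)
}

# Capturing: list(...) evaluates the arguments and collects them,
# names included, so the function can inspect them itself.
# `capture_dots()` is also a hypothetical helper.
capture_dots <- function(...) {
  args <- list(...)
  args
}

label(10, "units")

captured <- capture_dots(a = 1, b = "two")
names(captured)   # the argument names supplied by the caller
length(captured)  # how many arguments were supplied
```

Capturing is useful when a function needs to validate or count what it received before deciding what to do with it, whereas forwarding simply hands the arguments on untouched.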
Attended the @rconsortium.bsky.social's 'How to use pointblank to understand, validate, and document your data' workshop led by @richmeister.bsky.social: r-consortium.org/webinars/how...
Some past workshop materials are here: github.com/rich-iannone...
#RStats #dataBS 1/2
How to use pointblank to understand, validate, and document your data – R Consortium
r-consortium.org
October 16, 2025 at 8:41 PM
TIL: The #RStats testthat package has a `skip()` function: testthat.r-lib.org/reference/sk...
I used to comment out tests I didn't want to fix. I'd often forget about these. With skip() I get a nice reminder. Now I'll be sure to go back and fix broken tests I left for later ... yeah, later 😂
Skip a test — skip
skip_if() and skip_if_not() allow you to skip tests, immediately concluding a test_that() block without executing any further expectations. This allows you to skip a test without failure, if for some ...
testthat.r-lib.org
October 15, 2025 at 9:06 PM
I'm often amazed at how much typing I've saved myself by using `dput(names())` when working with #RStats.
I wrote about this back in 2024, and I was reminded about its utility again today: www.collinberke.com/til/posts/20...
Use base::dput() to easily create and save objects – Collin K. Berke, Ph.D.
Need to create and store an object quickly, use this trick
www.collinberke.com
October 10, 2025 at 9:24 PM
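A small sketch of the `dput(names())` trick from the post above, using a made-up data frame (the column names here are assumptions for illustration, not from the linked write-up). `dput()` prints R code that recreates an object, so calling it on `names(df)` gives a ready-to-paste character vector.

```r
# A made-up data frame; the columns are illustrative only.
df <- data.frame(
  id = 1:2,
  team = c("a", "b"),
  score = c(3.5, 4.1)
)

# Prints R code for the column names, ready to copy into a script
# and trim down to the columns you want to keep.
dput(names(df))

# For example, after pasting and deleting "id":
keep <- c("team", "score")
df[, keep]
```

The same idea works for any small object you want to freeze into a script, such as a vector of factor levels.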
Reposted by Collin Berke
Interesting field work in the AI jungle.
Here's an earlier small scale study I did with Claude
nightingaledvs.com/what-does-ai...
October 10, 2025 at 2:17 AM
TIL: Microsoft Teams can be configured to use ISO 8601 date formatting.
I don't know why this makes me so excited. Perhaps it's the analyst in me who's had to wrestle with and wrangle various date formats in the past ...
Obligatory @xkcd.com link: xkcd.com/1179/
#dataBS
ISO 8601
xkcd.com
October 8, 2025 at 2:46 PM
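For anyone wondering why ISO 8601 is worth getting excited about, here is a rough base R sketch (all values are made up for illustration). It shows that a slash-separated date can be read two ways, while `YYYY-MM-DD` strings parse without a format string and sort correctly as plain text.

```r
d <- as.Date("2025-10-08")  # ISO 8601 strings parse with no format given

iso_date <- format(d, "%Y-%m-%d")  # ISO 8601 calendar date
us_date <- format(d, "%m/%d/%Y")   # month/day/year style

# The same slash-separated string means different days depending
# on which convention the reader assumes.
ambiguous <- "03/04/2025"
as_mdy <- as.Date(ambiguous, format = "%m/%d/%Y")  # March 4
as_dmy <- as.Date(ambiguous, format = "%d/%m/%Y")  # April 3

# ISO 8601 dates sort chronologically even as character strings.
iso_sorted <- sort(c("2025-10-08", "2024-12-31", "2025-01-15"))
```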
Interesting framing of the use of LLMs for coding in a post I recently bumped into: codemanship.wordpress.com/2025/09/30/c... (via news.ycombinator.com/item?id=4542...).
I'm certainly not AI-averse when performing coding tasks. But, 'Comprehension Debt' is something to be mindful of.
Comprehension Debt: The Ticking Time Bomb of LLM-Generated Code
An effect that’s being more and more widely reported is the increase in time it’s taking developers to modify or fix code that was generated by Large Language Models. If you’ve wo…
codemanship.wordpress.com
September 30, 2025 at 5:24 PM
I just came across the `data.tree` #RStats package:
📦: cran.r-project.org/web/packages...
🧑‍💻: github.com/gluc/data.tree
📚: cran.r-project.org/web/packages...
I was looking for a way to create simple tree diagrams of various outcomes and probabilities. Any #dataBS folks have other suggestions?
data.tree: General Purpose Hierarchical Data Structure
Create tree structures from hierarchical data, and traverse the tree in various orders. Aggregate, cumulate, print, plot, convert to and from data.frame and more. Useful for decision trees, machine le...
cran.r-project.org
September 29, 2025 at 5:07 PM
I got introduced to @randyau.com's 'Data Cleaning IS Analysis, Not Grunt Work' post during the #dataBS Conf this week: www.counting-stuff.com/data-cleanin... . I just finished--it was a great read.
Here are some quotes and thoughts I'm walking away with 👇
1/9 #RStats
Data Cleaning IS Analysis, Not Grunt Work
Also, most data cleaning articles suck
www.counting-stuff.com
September 28, 2025 at 4:59 AM
Just registered for the @rconsortium.bsky.social's 'How to use `pointblank` to understand, validate, and document your data' online workshop. I'm looking forward to learning more from @richmeister.bsky.social.
Sign up here: r-consortium.org/webinars/how...
#RStats #DataBS
How to use pointblank to understand, validate, and document your data – R Consortium
r-consortium.org
September 26, 2025 at 3:13 PM
Great resource! 👀
Thanks, @emilhvitfeldt.bsky.social!
I'm excited to announce a new resource about making slides with quarto and revealjs. This book is the combination of all the work I have done in this area, reordered and polished up.
There isn't a lot of new information yet, but this format allows me to add more easily
slidecrafting-book.com
#quarto
September 24, 2025 at 4:26 PM
Reposted by Collin Berke
I am beyond excited to announce that ggplot2 4.0.0 has just landed on CRAN.
It's not every day we have a new major #ggplot2 release but it is a fitting 18 year birthday present for the package.
Get an overview of the release in this blog post and be on the lookout for more in-depth posts #rstats
ggplot2 4.0.0
A new major version of ggplot2 has been released on CRAN. Find out what is new here.
www.tidyverse.org
September 11, 2025 at 11:20 AM
September 10, 2025 at 2:50 PM
I've been excitedly waiting for this feature to be released for the #rstats bigrquery package. Now it's finally here! 🏎️💨
bigrquery 1.6.0 is now on CRAN. Big feature is that it now uses bigrquerystorage (by Bruno Tremblay) if installed, which should make your downloads way way faster! #rstats
September 9, 2025 at 8:44 PM
I recently enjoyed using the {gganimate} #RStats package for #DataVis animations. So, I explored it further and drafted some notes. What resulted was some example animations using the palmer `penguins` dataset and some B1G QB passing data from {cfbfastR}.
I was pleased with the outcome. Links 👇
August 26, 2025 at 4:04 AM