pgdataviz.bsky.social
@pgdataviz.bsky.social
Statistical Programmer (Epidemiologist by training) using #Rstats and #Python mostly.
#DataScience #Dataviz

https://pgdatavizandstats.netlify.app/
My submission for #PydyTuesday. A quick jaunt back to Python and some Violin plots

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #python | #dataviz | #TidyTuesday
November 6, 2025 at 11:01 AM
Anyone here better with the ggsankey package than I am? - I am trying to label the flow with the number of people transitioning through each stage. I have found no examples on the internet of labeling the flow (plenty labeling the nodes).

I have below what I need to try

#rstats #dataviz #ggplot2
November 5, 2025 at 10:58 AM
Looked at the percentage of Oxbridge educated people who have won various British Literary Prizes for #TidyTuesday. I used ggbrick and, as the bricks of look like books, tried to make it look like they were in bookshelves with meh results.

Code here: tinyurl.com/bddsuuc3

#rstats | #dataviz
October 30, 2025 at 9:09 AM
If anybody has any good tips/resources for a statistical programmer who has a technical interview as a data scientist, let me know. I've not done one before

(Background is epidemiology and clinical trials but this would be in health data science looking at massive datasets)

#rstats #datascience
October 23, 2025 at 8:06 PM
As a Welshman, taking the opportunity to ignore the English and focus only on Wales for #TidyTuesday. Looking at the average weather for Wales in 2024 and now I've made myself homesick.

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #dataviz
October 23, 2025 at 5:56 AM
It's Monday which mean my chaotic self is submitting last weeks #TidyTuesday. I chose to look at water access as most people had looked at food in much nicer ways than me.

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #dataviz
October 20, 2025 at 5:36 PM
Holiday and job-hunting sucking my soul mean I am a week behind and needed to do a(nother) rush job for #TidyTuesday. I have been playing with Reactable though

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #dataviz
October 14, 2025 at 10:59 AM
I do like #positron but RStudio allows me to create .tex files from new.

Anyone know how I could get this on #positron, or is not a thing yet?
September 29, 2025 at 2:00 PM
Tried the datapasta package out, which is a form of minor miracle to get the country codes for the players.

Managed to crack a map of grandmasters per country before flying home to the UK for a break

Code here:

pgdatavizandstats.netlify.app/data_visuali...

#dataviz | #rstats | #tidytuesday
September 23, 2025 at 8:51 AM
A LLM trained on UK Biobank data to predict future disease presented uncritically by the media?
September 17, 2025 at 4:22 PM
September 16, 2025 at 10:26 AM
Missed last week's #TidyTuesday / #pydyTuesday due to work/toddler commitments but I'm ahead for this one. Such is life

Tried making a dashboard in python Whoever coded #streamlit is a genius

Dashboard here

tidytuespassportdashboard.streamlit.app

Code here

pgdatavizandstats.netlify.app/datavis
September 8, 2025 at 6:25 PM
For #TidyTuesday / #PydyTuesday I have struggled with the port of patchwork over to python. If anyone has any ideas on how to fix the titles of the graphs, I'm all ears.

Also Plotnine is a minor miracle.

Code Here:
pgdatavizandstats.netlify.app/data_visuali...

#dataviz | #python
August 27, 2025 at 6:58 PM
I logged on, opened RStudio, and accidentally pressed CTRL+SHIFT+P, and what sort of magic is this?
August 25, 2025 at 7:40 AM
(Reposted because I can't spell) Back to the warm blanket that is ggplot2 for #TidyTuesday.

A Sankey diagram of how the classifications of Scottish Munros have changed over time

#rstats | #dataviz | #ggplot2

Code here:

pgdatavizandstats.netlify.app/data_visuali...
August 19, 2025 at 3:49 PM
In a vain attempt to keep learning python I have tried imputing some CDISC data using #Quarto and so on - the report is rough around the edges but at least it worked.

#rstats | #python | #imputation

Code Here: pgdatavizandstats.netlify.app/biostatistic...
August 19, 2025 at 3:48 PM
Another #TidyTuesday / #PyDytuesday another week of thinking that making graphs in Python just isn't as nice as it is in R (Skill issue, I know)

Code here:
pgdatavizandstats.netlify.app/data_visuali...

#dataviz | #python
August 13, 2025 at 4:20 PM
August 7, 2025 at 8:39 AM
I've been cursing out my Propensity Score Matching project all week at work so a very brief #TldyTuesday and #PyDyTuesday. If I have time I'll try and do a better job.

Code here: pgdatavizandstats.netlify.app/data_visuali...

#python | #dataviz | #rstats
August 5, 2025 at 4:53 PM
Anyone got any useful guides on Network Meta-Analysis in R?

Context I have done 1 in my life and despised it and swore I would never do one again. Alas that was not to be.

If I could avoid WINBUGS entirely that would be great

#rstats #episky
July 29, 2025 at 3:33 PM
Created an Interactive map with Folium of all the New York MTA artworks for #TIdyTuesday and #PyDyTuesday

#python | #dataviz | #gis

Code here: pgdatavizandstats.netlify.app/data_visuali...
July 23, 2025 at 8:34 AM
One of things that's stopped me using Quarto full time as opposed to officeR and an R script is the need to use loops to repeat image and table production.

I've now worked out how to run a loop within an R code chunk and images, tables as the loop runs so I get a nice doc

#Quarto #rstats #python
July 18, 2025 at 1:02 PM
Back to ggplot2 because I was in a rush. British Library Funding

Significant downturn in funding post 2010. As prospective employers may see this one day, I don't speculate on the reason.

code here: pgdatavizandstats.netlify.app/data_visuali...

#RStats | #TidyTuesday | #DataViz
July 15, 2025 at 6:08 PM
So I've split out the "Data Science/R Programmer/Python" part of my "main" profile because I found the fact people may be following me for Data content and just seeing relentless posts about Welsh rugby.
July 1, 2025 at 5:22 PM