Lightnews — Scholar-powered news

Gordon Forbes

@gforb.bsky.social

100 followers 100 following 55 posts

I love Netflix for their data science blog and the BBC for their ggplot2 resources.


Gordon Forbes

@gforb.bsky.social

On tabular health data, time and time again, I see linear (or generalised linear) models perform as well as or better than machine learning algorithms that avoid linearity assumptions.

I am surprised by this, as the linearity assumption is unlikely to be true.

Does anyone else see this? Why is this?

The optimal machine learning model was linear regression

May 22, 2025 at 9:10 AM

Gordon Forbes

@gforb.bsky.social

On the reproducibility crisis coming for epidemiology, from 2008. I think it is still true today, although there are probably a lot more good examples now.

Pre-specified primary endpoints + analysis plans, and corrections for multiple testing are not routinely used in observational studies.

t.co/HPKYqhP1de

December 18, 2024 at 4:09 PM

Gordon Forbes

@gforb.bsky.social

I would love to see that post. I've used the end argument to get better results. The plot on the right is from using

+ scale_colour_viridis_d(end = 0.8)

Two plots, one with a pale yellow for one line which does not stand out against the background and the other with a clearer green.
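A minimal sketch of the comparison described above, assuming ggplot2 is installed. The toy data frame `df` and the plot names `p_default` and `p_trimmed` are hypothetical and only there for illustration; the one call taken from the post is `scale_colour_viridis_d(end = 0.8)`.

```r
library(ggplot2)

# Hypothetical toy data: three groups, one line each.
df <- data.frame(
  x = rep(1:10, times = 3),
  y = c(1:10, (1:10) * 1.5, (1:10) * 2),
  group = rep(c("a", "b", "c"), each = 10)
)

p <- ggplot(df, aes(x, y, colour = group)) +
  geom_line()

# Default discrete viridis scale: the last group is drawn in pale yellow.
p_default <- p + scale_colour_viridis_d()

# Truncated scale: end = 0.8 stops short of the yellow end of the palette,
# so the last group is drawn in a green that contrasts better with the panel.
p_trimmed <- p + scale_colour_viridis_d(end = 0.8)
```

Printing `p_default` and `p_trimmed` side by side should show the same kind of difference as the two plots in the post, though the exact shades depend on the number of groups.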

December 18, 2024 at 12:39 PM

Gordon Forbes

@gforb.bsky.social

*The excellent Netflix data science blog

netflixtechblog.com/experimentat...

December 18, 2024 at 12:23 PM

Gordon Forbes

@gforb.bsky.social

I got this far:

November 7, 2024 at 12:47 PM
