Emily Riederer
emilyriederer.bsky.social
Emily Riederer
@emilyriederer.bsky.social
Here for data, data science, analytics engineering, rstats, books
Mostly a review of pretty standard methods, but some fun examples of enabling expression expansion (the magic behind column selectors!), more complex objects in a dataframe (models, vectors), and breaking the paradigm to go back to partitioned dataframes + list comprehensions

(2/2)
November 16, 2025 at 4:15 PM
The way python and R foster inclusion directly contributes to their success: joyful places to exist, a steady flow of new maintainers, and a delightful collection of niche tools empowered by wildly different expertise coming together

Watch the new python documentary for more on PSF’s work here
October 28, 2025 at 12:20 AM
NGL it's not perfect or right for all use cases (e.g. batch only, limited model types) so YMMV. Still some rough edges and bugs, too. I might not rush to deploy in enterprise tomorrow but definitely a project to watch and something I can definitely imagine using in some personal/volunteer work

5/5
August 16, 2025 at 3:38 PM
Exciting because:

- dbt has some of the boilerplate needed for MLOps (tests, logs, orchestration)
- DBs integrated with other systems like CRMs/dashboard so its easy to serve predictions from there

But gotta tweak both orbital and dbt to get the most benefit, with an assist from {sqlglot}

4/
August 16, 2025 at 3:34 PM
A favorite part of my Chicago commute: walking past a building engraved “to safeguard wealth men established banks” that was built in… 1929

I’ve yet to find the complementary building that says “to safeguard banks men established the FDIC”
July 28, 2025 at 5:02 PM
Benefits include staying reproducible, compartmentalizing your thinking/writing to focus on what matters, and writing final docs in Qmd plaintext for easier version control and collaborating with non-technical coauthors who don't have Jupyter spun up

5/n
July 27, 2025 at 1:18 PM
This is useful bcs we can work in a linear order, reacting to previous views in our analysis. But then we can streamline our story to focus on takeaways instead of the analysis process

4/n
July 27, 2025 at 1:17 PM
seaborn.objects out here just utterly roasting me in its error messages #data
July 3, 2025 at 6:19 PM
I have not gotten to kick the tires on @duckdb.org 's DuckLake yet as a technical solution, but deeply enjoying their release blog. It's a lovely piece that covers many of the bigger trends of the last ~5 years with heart, humor, and brevity

duckdb.org/2025/05/27/d...

#datasky
June 4, 2025 at 9:45 AM
So many questions on this incoherent X AI sliding into my DMs

It thinks I’ll DM 1k random people one at a time?

Am I supposed to say “hi data scientist! My vibes say you have no data strategy so lemme tell you how to do your job”

Is it likewise putting my name on random lists?

Bizarre
January 10, 2025 at 6:21 PM
Having to contribute to AI training data (for Captcha) just to fill out a contact form for my Congressman just kinda sums it all up...
November 11, 2024 at 12:35 AM
I can't explain this but your dog is my dog as a baby. Don't worry - he turns out great 💙
November 9, 2024 at 2:57 PM
My most eloquent texts summing up this week
November 9, 2024 at 3:52 AM
Man I love simulation studies.. such a great way to stress test your methods
November 5, 2024 at 1:10 PM
The team explores the relationship between your Bayesian DGP and causal DAG and recommends using "causal queries" to be sure you're looking at implicity assumptions and backdoor paths
November 5, 2024 at 1:07 PM
I always love the rigor at CDSM

The team talks about not just how to do a thing but how to simulate and study their understanding of the thing
November 5, 2024 at 12:28 PM
#ThrowbackThursday to Halloween 22 years ago 🗽

Speaking of liberty, if that is a thing you enjoy, have you voted yet?

Make a plan today: iwillvote.com
October 31, 2024 at 2:24 PM
Making things "pretty" has never been where I find the most joy in data work. So, I'm super excited about @posit.co 's {brand.yml} framework for adding unified styling across Quarto, Shiny, and more at minimal effort 🤩

posit-dev.github.io/brand-yml/

Site also featuring best at-a-glance roadmap
October 31, 2024 at 1:25 PM
October 30, 2024 at 1:12 AM
And it's under the search bar?
October 29, 2024 at 2:03 AM
Another awesome thing about @DataPolars (for #rstats folks and beyond) -- it inspires equally ergonomic open-source addons

Neat project here finally makes calc'ing model metrics in a df as easy as it should be like any aggregation

github.com/abstractqqq/...
February 25, 2024 at 7:09 PM
New Year, New Site -- after many hacks to preserve all my post / talk / RSS links, I'm finally ending 2023 by switching my blog down to Quarto!

Didn't write much in 2023, but I'm hoping Quarto's nice, lightweight framework will lead to a productive 2024
December 31, 2023 at 10:26 PM
Who wore it better?

Union Station DC
vs
Union Station Chicago
December 8, 2023 at 2:55 PM
I'm not 100% clear if this is a typo or a pun, but I'm certain that all API documentation should be titled "Started Getting"
November 27, 2023 at 3:42 AM
posit::conf shirts (2020, 2023) are a bar chart of the ever expanding package ecosystem
September 23, 2023 at 8:03 PM