Erika Pullum (she/her)
erikapullum.bsky.social
Erika Pullum (she/her)
@erikapullum.bsky.social
Engineer, climber, currently working as Head of Data at Hex.

www.erikapullum.com
Reposted by Erika Pullum (she/her)
We're looking for someone to join our lovely team at Data Orchard as an Analytics Engineer! This is a really diverse and exciting role working on a wide range of #data projects for our wonderful nonprofit clients and our own data products. Please do apply if this sounds up your street 👩‍💻 #dataSky
June 4, 2025 at 10:45 AM
Reposted by Erika Pullum (she/her)
I wish there was a way for the dbt Cloud Explore DAG to filter to content that exists *between* two nodes. Anyone know of a fun work-around for this with other tools?

#dataBS
January 15, 2025 at 5:51 PM
What's on my mind this week (as Head of Data at a growth stage startup)

* Hiring 🔜
* Upcoming vendor contract negotiations
* Priorities pulse check with our stakeholders
* Couple lil IC tasks to standardize a metric a bit better

#dataBS
January 13, 2025 at 8:09 PM
Naming is hard.

#dataBS
January 13, 2025 at 4:53 PM
Reposted by Erika Pullum (she/her)
Perpetually flip flopping on self serve analytics

Some days it feels like it could be the holy grail - enabling downstream users to do their own exploration and analysis on governed metrics. Fewer ad hoc requests! Reporting automation!

Other days: So what? Does that actually move the needle?
January 7, 2025 at 6:53 PM
Reposted by Erika Pullum (she/her)
Don’t think of it as whether the data has error or not. Most of the time it will.

Think of it as whether the error in the data will cause you to make a different decision than if the data was perfectly clean. Most of the time it won’t.
January 2, 2025 at 4:49 PM
Reposted by Erika Pullum (she/her)
One of the biggest tips I have for anyone doing data analysis, especially data from people, is to spend some time drilling down to the most granular data and just looking at individual records. You will find the craziest shit you never imagined and your analysis will be better for it #databs
January 2, 2025 at 2:40 PM
My #dataBS new year's resolution is to stop feeling guilty when I tell someone to go self serve their QQ. What's yours?
January 2, 2025 at 8:39 PM
Reposted by Erika Pullum (she/her)
Dear Santa, All I want for Christmas is for any of the abandoned GitHub repos that half-implemented the Hemingway writing app as a VS Code extension to finish the job. Please put your best TypeScript dev elf on this 🙏
December 20, 2024 at 1:18 AM
Watching Hex's internal hack week demos is like Christmas early 😍
December 13, 2024 at 9:09 PM
Reposted by Erika Pullum (she/her)
Has anyone cracked the code on the most efficient way to align on metric definitions with your customers - key word being efficient?
December 13, 2024 at 4:01 PM
Reposted by Erika Pullum (she/her)
a poem by poetry:

poetry run dbt compile
command not found: dbt

poetry show | grep dbt
dbt 1.0.0.38.22

pip freeze | grep dbt
dbt==1.0.0.38.22

which dbt
dbt not found

(look, no one said it was a happy poem)
#dataBS
December 12, 2024 at 2:50 PM
Fascinating! Confirmed this works in DuckDB, but would need some concepts ported to Snowflake to work in our warehouse. Nice work on this! #dataBS
Wait I found a solution in pure SQL using list transformations (works in DuckDB): gist.github.com/aranke/74206...
December 11, 2024 at 6:40 PM
Reposted by Erika Pullum (she/her)
Wait I found a solution in pure SQL using list transformations (works in DuckDB): gist.github.com/aranke/74206...
December 11, 2024 at 4:44 PM
Super #SQL brainteaser for your Tuesday that kept a few folks on our team thinking pretty hard until someone made an elegant solution.

The desired behavior per unit:
* First event qualifies
* Subsequent events qualify only if it's been more than 90 days since the last

Sample dates below #dataBS
December 10, 2024 at 8:12 PM
Continuing my theme of banger PRs -- this one improved documentation for a field confusing to a lot of folks

#dataBS
December 9, 2024 at 10:35 PM
Just had a great chat about being the first data person at a startup.

Pros
* Building at high velocity is fun
* Work with smart people

Challenges
* Always more than you can do
* Hard to know how good to build when

Folks that have done it, what's on your pro/challenge list?

#datasky #dataBS
December 6, 2024 at 4:57 PM
Reposted by Erika Pullum (she/her)
dbt job: "Can't parse a JSON blob, I'm out."

me: *looks for a needle in the 100M-row haystack*

me: "Aha! This looks fine? Guess I'll paste it in a text editor to double check."

text editor: "Uh, boss"

me: "Is that...is that a tiny red 'BS'?"

BS broke my data. I can't make this shit up.
#dataBS
December 5, 2024 at 3:17 PM
Reposted by Erika Pullum (she/her)
This is a great post about a thoughtful data warehouse redesign!

My favorite part is where @afioritto.bsky.social talks about how Hex’s schemas were organized before and their names, and how they were organized afterward and their names!

#dataBS
Life is messy. So is analytics engineering. Sometimes your warehouse looks like the "clothes pile" your partner has been slowly building in the corner of your closet.

The good news is that you CAN make your warehouse a wareHOME. And that I wrote you a how-to guide.

tinyurl.com/5h35ktj2 #dataBS
How we renovated our data warehouse without interruption | Hex
Blue-green deployment is a strategy for updating dbt projects that allows for iterative development without interrupting production.
hex.tech
December 5, 2024 at 2:10 AM
Amanda did absolutely fabulous work on this project! We got a clean and shiny data warehouse, you get an interesting read. Everyone wins.

#dataBS
Life is messy. So is analytics engineering. Sometimes your warehouse looks like the "clothes pile" your partner has been slowly building in the corner of your closet.

The good news is that you CAN make your warehouse a wareHOME. And that I wrote you a how-to guide.

tinyurl.com/5h35ktj2 #dataBS
How we renovated our data warehouse without interruption | Hex
Blue-green deployment is a strategy for updating dbt projects that allows for iterative development without interrupting production.
hex.tech
December 4, 2024 at 10:18 PM
Your Monday reminder:
Lots of people who seem smart, talented, accomplished, and confident on social media also feel insecure, silly, and stupid sometimes.

What matters is that you show up and keep learning, day after day.

#dataBS
December 2, 2024 at 3:09 PM
Them: These numbers don't match
Me: Yes, because they are different
Them: Oh

Tale as old as time, song as old as rhyme, cohorts confuse everyone

#dataBS
November 26, 2024 at 8:10 PM
Reposted by Erika Pullum (she/her)
I am convinced that the YML-based semantic layer was as much invented by a data professional as the stiletto heel was invented by a woman

(the stiletto heel was invented by a man)

#dataBS
November 25, 2024 at 9:35 PM