Carlos
@carlos.ag
Data things, building @hashboard.com
7. The data pipeline ends in the minds of people in your organization.

8. Your data team should be distributed and gain as much context on the domain as possible.

9. People who consume data should get context on business rules and get involved in how data is constructed.
December 3, 2024 at 5:25 PM
5. Data is most powerful when it scales empathy: you can talk to a single user or talk to a single patient. You can't talk to a hundred thousand users. Data can let you talk to 100,000 users.

6. Think about the end-to-end value of your pipeline: from when data is collected to where it is consumed.
December 3, 2024 at 5:25 PM
4. Only start to use data when you have enough of it. When you're early and you only have ten users, you can just ask them what they think.
December 3, 2024 at 5:25 PM
The trick is that archiving is not deleting. You can restore from the trash later. But it doesn’t show up and is not discoverable and there is a red banner across the top. So best of both worlds: doesn’t clutter your project, but it’s there if you need to restore it later.
November 20, 2024 at 6:11 PM
Yup, Hashboard has configurable rules for automatically archiving resources. The default is that resources that aren’t verified and haven’t been viewed in 30 days are archived. Want to keep something around? Just verify it. hashboard.com/changelog/au...
Auto archive
Every data team knows that countless resources in their BI tool are stale and unused. While maintaining a clean project containing only active resources is a good practice for data teams, it can still...
hashboard.com
November 20, 2024 at 4:14 AM
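A minimal sketch of the default rule described above (unverified and not viewed in 30 days gets archived, and archiving is reversible). This is not Hashboard's implementation or API; `Resource`, `apply_auto_archive`, `restore`, and `max_idle_days` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Resource:
    # Hypothetical shape of a BI resource; the field names are assumptions.
    name: str
    verified: bool
    last_viewed: datetime
    archived: bool = False


def apply_auto_archive(resources, now, max_idle_days=30):
    """Archive (never delete) resources that are unverified and idle too long."""
    cutoff = now - timedelta(days=max_idle_days)
    for r in resources:
        # Verified resources are always kept, however stale they are.
        if not r.verified and r.last_viewed < cutoff:
            r.archived = True
    return [r.name for r in resources if r.archived]


def restore(resource):
    """Archiving is not deleting: bring a resource back from the trash."""
    resource.archived = False
```

Verifying a resource is the opt-out here: a stale but verified dashboard is never touched by the rule.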
Sounds awesome, I'm always happy to chat through it. Maybe not analytics, but happy to point you in the right direction. DMs open
November 7, 2024 at 3:21 PM
Nope, but happy to get a coffee while you’re in town
November 6, 2024 at 10:52 PM
@imightbemary.bsky.social on building a business review process: www.datacouncil.ai/talks/buildi...
Building a Business Review Program from Scratch
www.datacouncil.ai
November 6, 2024 at 10:19 PM
I'll also thread a couple of old talks.
@abhisivasailam.bsky.social on data trees: www.datacouncil.ai/talks/design...
Designing and Building Metric Trees
www.datacouncil.ai
November 6, 2024 at 10:18 PM
Reposted by Carlos
Text-to-sql is just being oversold today. Text-to-sql is being sold as a way for the CEO to replace the data analyst.

When in reality it's a copilot for sql writers (which is awesome and really helpful). All about expectations.
November 6, 2024 at 5:06 PM
I haven't posted anything on this platform yet and got to 400 followers. Not sure who all these people are or how they found me, but they seem like real data people
November 6, 2024 at 5:08 PM
Let data teams work on and incubate products. That's one killer AI use case I've seen

Two examples:
- Notion's AI product was started by the data team trying to analyze embeddings in customer data without viewing sensitive data.
- Similar story at Replit.
November 6, 2024 at 5:03 PM
A way to approximate this: some sort of propensity score matching. Take all of the accounts that the team took on, and take the obvious predictors / correlates of churn and risk. Match each selected customer with a similar account from a year ago and track that account's future performance.
October 31, 2024 at 1:46 AM
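A rough sketch of the matching idea above. To stay dependency-free it swaps in nearest-neighbor matching on standardized covariates rather than a fitted propensity score; with a real propensity model (for example a logistic regression predicting selection) you would match on that single score instead. The field names (`seats`, `tickets`, `churned_within_year`) and all function names are assumptions for illustration.

```python
from statistics import mean, pstdev


def standardize(rows, keys):
    """Return (mean, std) per covariate so distances are scale-free."""
    stats = {}
    for k in keys:
        vals = [r[k] for r in rows]
        sd = pstdev(vals)
        stats[k] = (mean(vals), sd if sd > 0 else 1.0)
    return stats


def distance(a, b, keys, stats):
    """Euclidean distance between two accounts on standardized covariates."""
    return sum(((a[k] - b[k]) / stats[k][1]) ** 2 for k in keys) ** 0.5


def match_accounts(selected, historical, keys):
    """Pair each selected account with its closest year-ago account.

    Matching is done with replacement: one historical account may be
    matched to several selected accounts.
    """
    stats = standardize(selected + historical, keys)
    matches = {}
    for s in selected:
        best = min(historical, key=lambda h: distance(s, h, keys, stats))
        matches[s["id"]] = best["id"]
    return matches


def benchmark_churn(matches, historical):
    """Churn rate of the matched historical accounts over the following year."""
    by_id = {h["id"]: h for h in historical}
    outcomes = [by_id[m]["churned_within_year"] for m in matches.values()]
    return sum(outcomes) / len(outcomes)
```

The matched accounts' churn rate is the counterfactual benchmark: what similar accounts did a year ago without the team's involvement.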
Naive approach: what were your selection criteria for these accounts? Roll the data back a year, apply the same selection criteria, and see what performance was over the following quarter / year. That is the benchmark for that team.
October 31, 2024 at 12:27 AM
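The naive backtest above can be sketched in a few lines, assuming you have a snapshot of accounts as they looked a year ago plus what happened to each afterwards. The rule, the field names (`health_score`, `arr`, `churned_next_year`), and the function name are all hypothetical.

```python
def backtest_benchmark(snapshot, selection_rule, outcome_key="churned_next_year"):
    """Apply today's selection rule to a year-old snapshot.

    Returns the outcome rate of the accounts the rule would have picked,
    or None if the rule selects nothing.
    """
    cohort = [a for a in snapshot if selection_rule(a)]
    if not cohort:
        return None
    return sum(a[outcome_key] for a in cohort) / len(cohort)


def at_risk_rule(account):
    # Hypothetical criteria: low health score on a sizeable account.
    return account["health_score"] < 50 and account["arr"] > 10_000
```

Comparing the team's actual churn rate against `backtest_benchmark(snapshot, at_risk_rule)` gives a first-pass read on whether the team moved the number.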