datadungeoneer.bsky.social
@datadungeoneer.bsky.social
I hate the idea of more jargon, but "data obesity" and "data metabolism" are very useful concepts for advancing better data discussions for AI www.dataengineeringweekly.com/p/the-dark-d...
The Dark Data Tax: How Hoarding is Poisoning Your AI
Storage is cheap. Attention is finite. Hallucinations are expensive. It’s time to stop building Data Lakes and start managing Data Metabolism
www.dataengineeringweekly.com
December 26, 2025 at 5:03 PM
Couldn't agree more: one of the reasons I love riding my motorcycle www.ssp.sh/blog/owning-...
Boredom is the New Luxury
Why people are ditching algorithms for old iPods, typewriters, and books. Explore the shift toward single-use devices, boredom, and true ownership.
www.ssp.sh
December 22, 2025 at 5:04 PM
An often overlooked aspect of sharing open source ubc-library-rc.github.io/rdm/content/...
License Research Data
Research Data Management
ubc-library-rc.github.io
December 12, 2025 at 5:03 PM
I've been playing with Opal and I love the implications for app development opal.google/landing/
Opal [Experiment]
opal.google
December 10, 2025 at 6:03 PM
If you need a lot of search data quick, this is a pretty slick tool serpapi.com/search-api
Google Search Engine Results API - SerpApi
Scrape Google search results in JSON format automatically using custom parameters. Search for keywords by location, date and more with SerpApi.
serpapi.com
December 8, 2025 at 5:04 PM
"we expect a growing number of proofs of the form 'AI got there first' without clear evidence of
the form 'this could not have been done without AI'" arxiv.org/abs/2509.18057
Reinforced Generation of Combinatorial Structures: Applications to Complexity Theory
We explore whether techniques from AI can help discover new combinatorial structures that improve on known limits on efficient algorithms. Specifically, we use AlphaEvolve (an LLM coding agent) to…
arxiv.org
December 5, 2025 at 5:07 PM
Always a great review of the data landscape: a must read as we go into 2026 witha focus on practical AI solutions www.mattturck.com/mad2025
Matt Turck
I invest in early-stage AI, ML, and data startups at FirstMark Capital, and I write about the space here
www.mattturck.com
December 1, 2025 at 5:04 PM
Came across this while researching Netflix's use of knowledge graphs and need to try this out! github.com/Netflix/EVCa...
GitHub - Netflix/EVCache: A distributed in-memory data store for the cloud
A distributed in-memory data store for the cloud. Contribute to Netflix/EVCache development by creating an account on GitHub.
github.com
November 28, 2025 at 5:05 PM
Many great points made on getting data AI ready ai.gopubby.com/what-it-mean...
What it means to get your data ready for AI
(or) How AI Agents are changing the job of a Data Engineer
ai.gopubby.com
November 24, 2025 at 5:04 PM
Lengthy, but great read: Circuit Tracing: Revealing Computational Graphs in Language Models transformer-circuits.pub/2025/attribu...
Circuit Tracing: Revealing Computational Graphs in Language Models
We describe an approach to tracing the “step-by-step” computation involved when a model responds to a single prompt.
transformer-circuits.pub
November 17, 2025 at 8:00 PM
A nice little scraping tool github.com/ZA1815/canis...
GitHub - ZA1815/caniscrape
Contribute to ZA1815/caniscrape development by creating an account on GitHub.
github.com
November 14, 2025 at 5:05 PM
They had me at "runs on ESP32" micropythonos.com
MicroPythonOS - The Ultimate MicroPython Operating System
The Only Operating System Built with MicroPython!
micropythonos.com
November 12, 2025 at 5:05 PM
I'm always impressed with the science and data of weather forecasting, so this is a very cool read spectrum.ieee.org/ai-weather-f...
What Makes WindBorne's AI Weather Forecasts so Accurate?
Autonomous weather balloons surf air currents to gather data from remote locations, staying aloft for weeks.
spectrum.ieee.org
November 7, 2025 at 5:05 PM
This is a must read for new data engineers: great points to think about www.dataengineeringweekly.com/p/thinking-l...
Thinking Like a Data Engineer
A Journey Beyond Code — Toward Systems, Curiosity, and Confidence
www.dataengineeringweekly.com
November 5, 2025 at 5:05 PM
Great read on data governance for indigenous data www.gida-global.org/care
CARE Principles — Global Indigenous Data Alliance
CARE Principles of Indigenous Data Governance
www.gida-global.org
November 3, 2025 at 5:06 PM
This is part of discussion I have with many small businesses: sometimes Excel is enough. www.kdnuggets.com/heres-when-y...
Here's When You Would Choose Spreadsheets Over SQL
Spreadsheets might seem obsolete in the world of relational databases. They’re not! Here are situations when spreadsheets easily topple SQL.
www.kdnuggets.com
October 31, 2025 at 4:05 PM