Data Code 101
datacode101.bsky.social
Data / Software Engineering
Token-Oriented Object Notation (TOON) is a compact, human-readable serialization format designed for passing structured data to Large Language Models with significantly reduced token usage. It's intended for LLM input as a lossless, drop-in representation of JSON data.
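To make the token-saving idea concrete, here is a minimal sketch of how a uniform list of records can be written as a TOON-style tabular block: the field names are declared once in a header, and each record becomes one comma-separated row. The function name `to_toon_table` is hypothetical. The sketch handles only flat records with integers and plain strings; it is not the official encoder and skips the quoting and nesting rules a real implementation needs.

```python
import json

def to_toon_table(key, rows):
    """Encode a uniform list of flat dicts as a TOON-style tabular block (simplified)."""
    fields = list(rows[0].keys())
    # The tabular form only applies when every row has the same fields.
    if any(list(r.keys()) != fields for r in rows):
        raise ValueError("rows must share identical fields")
    # Header: key, row count, and the field names declared once.
    header = f"{key}[{len(rows)}]{{{','.join(fields)}}}:"
    lines = ["  " + ",".join(str(r[f]) for f in fields) for r in rows]
    return "\n".join([header] + lines)

users = [{"id": 1, "name": "Alice"}, {"id": 2, "name": "Bob"}]
toon_text = to_toon_table("users", users)
json_text = json.dumps({"users": users})

print(toon_text)
print(len(toon_text), "characters vs", len(json_text), "for JSON")
```

Character counts are only a rough proxy here; the actual token savings depend on the tokenizer and on how uniform the data is.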

#dataengineering #llm
November 6, 2025 at 6:01 AM
RAG Stack

Building with Retrieval-Augmented Generation (RAG) isn't just about choosing the right LLM. It's about assembling an entire stack—one that's modular, scalable, and future-proof.
#ai #rag #dataengineering
October 27, 2025 at 10:36 AM
ETL vs ELT vs EtLT

All three methods begin with Extract (E) and end with Load (L), but the placement of transformation dictates their suitability for different infrastructure, data types, and business needs.
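The difference in ordering can be sketched with plain Python, using a dict as a stand-in for the warehouse. Every function name and the sample data below are hypothetical; the point is only where the light cleanup ("t") and the heavy business logic ("T") run relative to the load step.

```python
def extract():
    # Stand-in for reading from a source system.
    return [{"email": " Ana@Example.com ", "amount": "10"},
            {"email": "bob@example.com", "amount": "5"}]

def light_transform(rows):
    # Small "t": per-record cleanup that needs no other data (trim, lowercase).
    return [{**r, "email": r["email"].strip().lower()} for r in rows]

def heavy_transform(rows):
    # Big "T": business logic such as type casting and aggregation.
    return {"total_amount": sum(int(r["amount"]) for r in rows),
            "row_count": len(rows)}

def etl(warehouse):
    # Transform fully before loading; the warehouse only sees the final result.
    warehouse["report"] = heavy_transform(light_transform(extract()))

def elt(warehouse):
    # Load raw data first, then transform inside the warehouse.
    warehouse["raw"] = extract()
    warehouse["report"] = heavy_transform(light_transform(warehouse["raw"]))

def etlt(warehouse):
    # Light cleanup in flight, load, then heavy transformation in the warehouse.
    warehouse["staged"] = light_transform(extract())
    warehouse["report"] = heavy_transform(warehouse["staged"])

etl_wh, elt_wh, etlt_wh = {}, {}, {}
etl(etl_wh)
elt(elt_wh)
etlt(etlt_wh)
print(etl_wh["report"], sorted(elt_wh), sorted(etlt_wh))
```

All three produce the same report; what changes is which intermediate data lands in the warehouse (none, raw, or lightly cleaned).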

#dataengineering
October 19, 2025 at 4:45 AM
RAG stands for Retrieval-Augmented Generation. RAG helps reduce hallucinations in LLMs by providing them with relevant context from external knowledge sources.

Understanding how RAG works from scratch is important for AI/ML Engineers.
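A from-scratch sketch of the retrieve-then-augment steps might look like the following. It is a toy under stated assumptions: word overlap stands in for embedding similarity, the function names (`tokenize`, `retrieve`, `build_prompt`) and the sample documents are hypothetical, and the final call to an LLM is left out.

```python
import re

def tokenize(text):
    # Lowercased word set; a real system would typically use embeddings instead.
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(question, documents, k=1):
    # Rank documents by how many words they share with the question.
    q = tokenize(question)
    ranked = sorted(documents, key=lambda d: len(q & tokenize(d)), reverse=True)
    return ranked[:k]

def build_prompt(question, documents, k=1):
    # Augment the question with the retrieved context before generation.
    context = "\n".join(retrieve(question, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

docs = [
    "ETL transforms data before loading it into the warehouse.",
    "RAG retrieves relevant documents and adds them to the LLM prompt.",
]
question = "How does RAG use documents?"
prompt = build_prompt(question, docs)
print(prompt)
```

The resulting prompt is what would be sent to the model, so the answer is grounded in the retrieved text rather than in the model's memory alone.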
#dataengineering #rag #aiengineering #llm
October 17, 2025 at 1:25 PM
In 1970, while working at IBM, Edgar F. Codd published his paper “A Relational Model of Data for Large Shared Data Banks.” It introduced the relational model, which became the foundation of Relational Database Management Systems (RDBMS).
www.seas.upenn.edu/~zives/03f/c...
#sql
www.seas.upenn.edu
October 7, 2025 at 12:18 PM
AI Agent Frameworks

The framework shapes how your agent thinks, acts, and connects to tools and data. LLMs are the brain, frameworks are the wiring connecting different parts.

Image by /in/rakeshgohel01
September 16, 2025 at 3:15 PM
While operational databases are the engines behind your day-to-day applications, analytical systems are designed not for rapid, small transactions but for complex, large-scale queries and aggregations that unlock insights from vast amounts of historical information.

#DataEngineer #OLAP
September 15, 2025 at 10:46 PM
Understanding the Types of Databases

Choosing the right database is a critical architectural decision. Each type is a specialized tool designed for a specific job.

Here’s a breakdown of the essentials:
September 15, 2025 at 10:20 PM
Big Data Pipelines Across AWS, Azure & GCP

As data engineers, we often work across different cloud platforms. While the concepts stay the same (ingestion ➝ storage ➝ compute ➝ warehouse ➝ visualization), the tools differ.
September 15, 2025 at 10:07 PM
Here are 10 AI prompts for turning your AI into a personalized language tutor.
September 15, 2025 at 10:03 PM
Free Resource: Data Engineering Design Patterns (O'Reilly)

Data engineers across companies keep solving the same problems over and over. Bartosz Konieczny provides battle-tested design patterns that tackle common challenges from data ingestion to observability.

Buf is offering the book for free.
Data Engineering Design Patterns
This hands-on guide from O’Reilly Media focuses on critical data engineering design patterns, including data ingestion, data quality, and idempotency.
buf.build
September 15, 2025 at 9:28 PM
The async/await pattern is a widespread concept found in many modern programming languages. It's syntactic sugar for managing asynchronous operations that lets asynchronous code read like synchronous code, which makes it easier to follow and debug.
Wait, What? Async-Await Explained
Matthew Hess demystifies the often misunderstood workings of async-await in C#. While many developers expect async-await to enable parallel execution within individual functions, Hess explains that...
codemag.com
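The point that `await` by itself does not run things in parallel can be illustrated with a small sketch. This uses Python's `asyncio` rather than the C# discussed in the article, and `fetch`, `sequential`, `concurrent`, and `timed` are hypothetical names; `asyncio.sleep` stands in for a non-blocking I/O call.

```python
import asyncio
import time

async def fetch(name, delay):
    # asyncio.sleep stands in for a non-blocking I/O call.
    await asyncio.sleep(delay)
    return name

async def sequential():
    # Each await finishes before the next one starts, so the waits add up.
    return [await fetch("a", 0.2), await fetch("b", 0.2)]

async def concurrent():
    # gather schedules both coroutines so their waits overlap.
    return await asyncio.gather(fetch("a", 0.2), fetch("b", 0.2))

def timed(coro_fn):
    # Run a coroutine function to completion and measure wall-clock time.
    start = time.perf_counter()
    result = asyncio.run(coro_fn())
    return result, time.perf_counter() - start

seq_result, seq_time = timed(sequential)
con_result, con_time = timed(concurrent)
print(f"sequential: {seq_time:.2f}s, concurrent: {con_time:.2f}s")
```

Both versions return the same values; only the version that explicitly starts the operations together lets their waiting overlap.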
September 3, 2025 at 1:37 AM
Kiro vs Cursor: How Amazon’s AI IDE Is Redefining Developer Productivity
#programming #development #software #engineering #ai #vscode #agentic
July 22, 2025 at 8:41 PM
Bend is a high-level, massively parallel programming language, offering the feel and features of expressive languages like Python and Haskell.

**Made in Brazil** #bolhadev
February 7, 2025 at 4:54 PM
DeepSeek R1 is the game-changer in AI right now. It’s on par with o1, but it’s free, faster, and a massive leap forward that no one saw coming.

No wonder some folks are scrambling to discredit the brilliant Chinese students behind this innovation.
January 28, 2025 at 1:38 PM
One Big Table (OBT) is a data modeling approach that emphasizes the use of a single, wide table to store and manage data for analytics.
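As a rough illustration, the sketch below pre-joins a small fact table with two dimension lookups into one wide table and then aggregates it without any joins. The table names, columns, and the `build_obt` helper are hypothetical sample data, not a prescribed schema.

```python
orders = [
    {"order_id": 1, "customer_id": 10, "product_id": 100, "quantity": 2},
    {"order_id": 2, "customer_id": 11, "product_id": 100, "quantity": 1},
]
customers = {10: {"customer_name": "Ana", "country": "BR"},
             11: {"customer_name": "Bob", "country": "US"}}
products = {100: {"product_name": "Keyboard", "unit_price": 50}}

def build_obt(orders, customers, products):
    # Pre-join every dimension onto the fact row so analysts query one table.
    table = []
    for o in orders:
        row = {**o, **customers[o["customer_id"]], **products[o["product_id"]]}
        row["revenue"] = row["quantity"] * row["unit_price"]
        table.append(row)
    return table

obt = build_obt(orders, customers, products)

# Analytics become a single scan over the wide table, with no joins.
revenue_by_country = {}
for row in obt:
    country = row["country"]
    revenue_by_country[country] = revenue_by_country.get(country, 0) + row["revenue"]
print(revenue_by_country)
```

The trade-off is redundancy: dimension attributes such as `product_name` are repeated on every row in exchange for simpler, join-free queries.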

#dataengineering #datamodeling
January 16, 2025 at 4:43 PM
5 books for Data Engineers
#dataengineer
January 14, 2025 at 5:33 PM
Brief History of Big Data 🧵
October 23, 2024 at 10:44 PM
Amid the generative AI hype, technology leaders have high expectations about the impact of AI on software development.
www.expresscomputer.in/guest-blogs/...
#development #ai #bolhadev
Gartner: AI won't replace software engineers, but could boost demand for their skills - Express Computer
By Philip Walsh, Senior Principal Analyst, Gartner Amid the generative AI hype, technology leaders have high expectations about the impact of AI on software development. Vendors are making bold claims...
www.expresscomputer.in
October 12, 2024 at 6:02 PM
4 stages of using Linux for development inside Windows (thread follows 🧵)

#bolhadev
October 11, 2024 at 3:59 PM
Technical Debt

Ward Cunningham is the inventor of the technical debt metaphor. Ward describes debt as the natural result of writing code about something we don't have a proper understanding of.
#software #engineering #agile
The Definitive Guide to Technical Debt 🦠
How to prevent it, measure it, and eventually pay it back. Backed by deep research and tens of stories from the community.
hybridhacker.email
October 7, 2024 at 10:21 PM
"Football Analytics with Python and R" by Eric Eager and Richard Erickson. Teaches how to use data science to analyze American football using both Python and R.
#datascience #analytics #python
October 4, 2024 at 6:14 PM
SQL Mindmap by @brijpandeyji
DDL, DML, DCL, GROUPING, ORDERING, FUNCTIONS, WINDOW FUNCTIONS
#sql
October 4, 2024 at 5:55 PM