Ricardo Castro
mccricardo.bsky.social
Ricardo Castro
@mccricardo.bsky.social
Senior Principal Engineer, tech speaker & writer, @DevOpsPorto and @DevOpsDaysPT, @CDeliveryFdn Ambassador, martial arts amateur, and metal lover. Opinions are my own.

mccricardo.com
DevOps is not about tools, DevOps "teams," or "engineers".

It's about aligning different people around shared goals rather than working in silos, each with their own goals and ambitions.

Tools help, no doubt. But that's not the point.

DevOps is about the culture you set.
November 13, 2025 at 6:01 PM
"Build Your Kubernetes Monitoring Foundation with kube-prometheus-stack" by Anjali Udasi

last9.io/blog/kube-pr...
Build Your Kubernetes Monitoring Foundation with kube-prometheus-stack | Last9
Set up production-grade Kubernetes monitoring with kube-prometheus-stack using Prometheus, Grafana, and Alertmanager.
last9.io
November 13, 2025 at 5:02 PM
If you know me, you know I've always been in favor of using the best tool for the job.

But that doesn't mean you should be adding tech to your stack *all the time*.

The best tool for the job requires context.

Adding new tech to your stack requires you to think carefully if it's really worth it.
November 12, 2025 at 6:03 PM
"Trixter: A Chaos Proxy for Simulating Network Faults" by Viacheslav Biriukov

biriukov.dev/posts/trixte...
Trixter: A Chaos Proxy for Simulating Network Faults
Posted: Oct 2025 Github: https://github.com/brk0v/trixter Contents Chaos Engineering and Network Fault Injection Introducing Trixter – A Chaos Monkey for TCP Why Trixter vs GNU/Linux tc netem (Kernel…
biriukov.dev
November 12, 2025 at 5:02 PM
"SRE math every engineer should know: a practical guide" by Srivatsa RV

one2n.io/blog/sre-mat...
SRE math every engineer should know: a practical guide | One2N Blog
Curious how top engineers keep systems reliable? This guide breaks down the math behind Site Reliability Engineering into simple, real-life examples whether it’s understanding error budgets, decoding…
one2n.io
November 12, 2025 at 1:01 PM
Companies advertise "DevOps".

And then have a product team building an application and a "DevOps team" running it in production.

Find the mistake.
November 11, 2025 at 6:04 PM
Do Chad1000x, they said.

It will be fun, they said.

It's not.

But it's a very good way to challenge yourself and raise awareness for a very important topic.

chad1000x.com
HOME - CHAD1000X
Sara Wilkinson, Rogue, and CrossFit present the hero workout "CHAD" in honor of Navy SEAL Chad Wilkinson who took his life on October 29, 2018, due to the
chad1000x.com
November 11, 2025 at 5:10 PM
"Contributing the Unroll Processor to the OpenTelemetry Collector Contrib" by Keith Schmitt

opentelemetry.io/blog/2025/co...
Contributing the Unroll Processor to the OpenTelemetry Collector Contrib
The idea for unrolling bundled logs inside the OpenTelemetry Collector didn’t start with a processor. By “unrolling,” I mean taking a single log record that contains multiple logical events—for…
opentelemetry.io
November 11, 2025 at 1:02 PM
"You Should Write An Agent" by Thomas Ptacek

fly.io/blog/everyon...
You Should Write An Agent
They're like riding a bike: easy, and you don't get it until you try.
fly.io
November 10, 2025 at 5:01 PM
"Faster root cause for slow traces with ClickStack Event Deltas" by Dale McDiarmid

clickhouse.com/blog/%20fast...
Faster root cause for slow traces with ClickStack Event Deltas
Read how ClickStack's improved Event Deltas make it effortless to pinpoint the root causes of performance outliers in observability data - turning complex trace analysis into instant, actionable…
clickhouse.com
November 10, 2025 at 1:01 PM
"Announcing Istio 1.28.0"

istio.io/latest/news/...
Announcing Istio 1.28.0
Istio 1.28 Release Announcement.
istio.io
November 8, 2025 at 6:01 PM
"Cloud Native Computing Foundation Announces Graduation of Crossplane"

www.cncf.io/announcement...
Crossplane’s Graduation Announcement
Graduation marks Crossplane’s readiness for widespread use and its evolution from a control plane framework to groundwork for intelligent, secure, and scalable cloud operations and platform…
www.cncf.io
November 7, 2025 at 5:01 PM
TicketOps is perfectly fine for relatively stable stuff.

At scale, it breaks.
November 7, 2025 at 2:51 PM
"SQL expressions in Grafana: Combine and manipulate data from multiple sources" by Sam Jewell and Kyle Brandt

grafana.com/blog/2025/10...
SQL expressions in Grafana: Combine and manipulate data from multiple sources | Grafana Labs
SQL expressions are a versatile and powerful feature that opens up all sorts of creative possibilities by manipulating and combining data from different data sources.
grafana.com
November 7, 2025 at 1:01 PM
In the dawn of a new wave of AI, if you're still thinking about infrastructure as code and not infrastructure as software, you're living in the past.
November 7, 2025 at 12:56 PM
SRE is much more than just incident response.

I thought this needed to be highlighted since many are talking about "AI SRE", which mostly focuses on incident response.
November 6, 2025 at 6:03 PM
"OTel Updates: Consistent Probability Sampling Fixes Fragmented Traces" by Anjali Udasi

last9.io/blog/consist...
OTel Updates: Consistent Probability Sampling Fixes Fragmented Traces | Last9
One sampling decision, propagated everywhere. OpenTelemetry's Consistent Probability Sampling fixes fragmented traces across services.
last9.io
November 6, 2025 at 1:01 PM
Consistency is underrated.

Many people believe in a "big bang" event that propels their career. And while there are certain cases where that's true, consistency is usually a better investment of your time.

Invest in being consistent and you'll reap rewards.
November 5, 2025 at 6:02 PM
"Introducing Agent HQ: Any agent, any way you work" by Kyle Daigle

github.blog/news-insight...
Introducing Agent HQ: Any agent, any way you work
At Universe 2025, GitHub's next evolution introduces a single, unified workflow for developers to be able to orchestrate any agent, any time, anywhere.
github.blog
November 5, 2025 at 5:01 PM
"Effortless Observability - Integrating CloudWatch Application Signals with OpenTelemetry" by Tobias Schmidt

awsfundamentals.com/blog/cloudwa...
How to Use AWS CloudWatch Application Signals with OpenTelemetry on ECS Fargate and Lambda
This guide shows how to connect CloudWatch Application Signals with OpenTelemetry. See simple steps for ECS Fargate and Lambda. Example code included. Get clear metrics and traces fast.
awsfundamentals.com
November 5, 2025 at 1:01 PM
"Go and enhance your calm: demolishing an HTTP/2 interop problem" by Lucas Pardue and Zak Cutner

blog.cloudflare.com/go-and-enhan...
Go and enhance your calm- demolishing an HTTP:2 interop problem
HTTP/2 implementations often respond to suspected attacks by closing the connection with an ENHANCE_YOUR_CALM error code. Learn how a common pattern of using Go's HTTP/2 client can lead to unintended…
blog.cloudflare.com
November 4, 2025 at 5:04 PM