Caju Pereira
cajupereira.bsky.social
Caju Pereira
@cajupereira.bsky.social
Software Engineer working with SRE
jpereira.me
Also make sure to checkout its companion article, an in-depth guide on How to Build and Deploy your Infrastructure as Code (IaC) with #Terraform and #Ansible on #DigitalOcean!

jpereira.me/hands-on-how...
How to Build and Deploy your Infrastructure as Code (IaC)
In our previous post on Infrastructure as Code, we covered the theoretical aspects of IaC, including its benefits, common tooling, and best-practices. Now, let's put that knowledge into practice by bu...
jpereira.me
November 15, 2024 at 2:57 AM
Moral of the story: While monitoring IS essential, effective troubleshooting is key to handling production incidents. It's about systematic investigation and attention to detail. Even the simplest things can trip us up! #SRE #DevOps (5)
November 6, 2024 at 12:08 PM
The issue: The feature flag system expected a string to be exactly equals to "on", and we had mistyped a line-break at the end of this string, which made the feature flag be interpreted as "off"🤦 (4)
November 6, 2024 at 12:08 PM
All of our monitors and SLOs were healthy, the feature flag system was behaving exactly like we expected and hadn't been changed in a long time. Now, take a guess! What do you think was the underlying cause of this incident? (3)
November 6, 2024 at 12:08 PM
The line of code that wasn't working was a single "if" statement. At one point, I was paged and we had 4 engineers in the call, some with masters in computer science, debugging the issue. We got to the point of adding a breakpoint in production! (2)
November 6, 2024 at 12:07 PM
Hey, Chris! I'd love to join the SRE, DevRel'iens and Cloud Native lists!

Been working with SRE for >6 years, and am currently running a blog on reliability and software engineering: jpereira.me :)
November 6, 2024 at 12:05 PM