EN

KUBICAST 175 - DevOps VS SRE - with Luriel Santana

Has DevOps really become a job title, or are we managing to implement SRE even all the way over in HR?

mansplainer

João Brito

In episode 175 of Kubicast, we welcome specialist Luriel Santana for a duel of ideas between DevOps and Site Reliability Engineering (SRE). Over coffee and laughs, we dive into discussions about organizational culture, infrastructure automation, reliability metrics, and field practices ranging from data centers in Angola to modern cloud pipelines.

1. The Landscape: DevOps and SRE in the Market

Since its emergence, the DevOps movement has brought a breath of speed and integration between development and operations teams. Meanwhile SRE, conceptualized by Google, raised the bar by introducing clear metrics (SLIs, SLOs, and SLAs) and error management processes. In this battle, there is no "single winner": DevOps accelerates delivery; SRE ensures it happens without interruptions.

2. Field Lessons in Angola

Luriel shared with us his adventures in physical data centers, running Linux and configuring Cisco routers in one of the most challenging regions of the African continent. The message was clear: without minimal automation, keeping servers operating under extreme conditions becomes a bottleneck. It was there that we learned the importance of Infrastructure as Code and configuration versioning.

3. Culture vs. Tooling

Often, teams fall in love with tools and forget the culture. We discussed how CI/CD pipelines, containers, and Kubernetes orchestration only make sense when there is a mindset of collaboration and shared responsibility. Otherwise, they just become another "box of tricks" without consistent results.

4. Reliability Metrics: SLOs and SLIs in Practice

We explored examples of SLOs for critical applications and saw that defining acceptable error budgets is both an art and a science. We talked about the trade-offs between speed and stability, and how incident routing can be supported by well-configured dashboards — without forgetting about alerts that prevent alert fatigue.

5. Pandemic and Accelerated Adoption

The global crisis pushed many businesses to the cloud and into automation practices. We discussed how remote work reinforced the need for automation and resilient infrastructure, and reflected on use cases of pipelines that were born in a matter of days to support unexpected spikes.



Conclusion and Next Steps

We left this episode with one certainty: DevOps and SRE are not antagonists, but partners in the journey of delivering software with speed and reliability. If you are starting out, start by defining your SLIs. For veterans, the tip is to revisit processes and invest in culture.

Links and Recommendations:

Join our early access program and have a safer environment in moments! https://getup.io/zerocve


🎧 Also listen to Kubicast on Spotify, and share it with the whole DevOps crowd who doesn't have an engineering degree but is an engineer anyway!

Newsletter Getup.

Atualizações sobre Kubernetes e Software Supply Chain Security todos os meses.

Operating Kubernetes in production for more than 13 years. With Quor, this experience extends to software supply chain security as well.