<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Observability on Danilo Falcão da Silva</title><link>https://falcao.org/tags/observability/</link><description>Recent content in Observability on Danilo Falcão da Silva</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Tue, 26 May 2026 18:17:06 -0300</lastBuildDate><atom:link href="https://falcao.org/tags/observability/index.xml" rel="self" type="application/rss+xml"/><item><title>NewRelic vs Datadog in 2026: My Opinionated Choice</title><link>https://falcao.org/posts/newrelic-vs-datadog-in-2026-opinionated-choice/</link><pubDate>Tue, 26 May 2026 18:17:06 -0300</pubDate><guid>https://falcao.org/posts/newrelic-vs-datadog-in-2026-opinionated-choice/</guid><description>&lt;p>I’ll make this simple up front: I love New Relic for APM. I also love what Datadog built as an integrated operational platform.&lt;/p>
&lt;p>If you only want a tribal answer, stop here: for app-centric teams with strong APM focus, New Relic still punches above its weight. For platform-heavy operations across infra, security, delivery, and incident flow, Datadog usually wins.&lt;/p>
&lt;p>But the right answer depends on team shape, telemetry habits, and how much operational burden you are willing to carry.&lt;/p></description></item><item><title>The Kubernetes Operator's Batman Utility Belt: Day-2 Tools That Actually Earn Their Keep</title><link>https://falcao.org/posts/batman-belt-kubernetes-tools/</link><pubDate>Mon, 25 May 2026 14:35:00 -0300</pubDate><guid>https://falcao.org/posts/batman-belt-kubernetes-tools/</guid><description>&lt;p>&lt;code>kubectl&lt;/code> is the Swiss Army knife. Nobody disputes this. But Swiss
Army knives are terrible at most of the individual jobs they claim to
do, and &lt;code>kubectl&lt;/code> is no different: it can tail logs, but only one pod
at a time. It can switch contexts, but with zero guardrails. It can
describe resources, but in a wall of YAML that buries the thing you
actually care about.&lt;/p>
&lt;p>Day-2 operations — the part where the cluster is live, traffic is
flowing, and someone pages you at 2 a.m. — need sharper instruments.
What follows is the utility belt I&amp;rsquo;d recommend to any Kubernetes
operator building their toolkit in 2026. Not everything here is new.
Some of these tools have been around since 2018. The point is that
they&amp;rsquo;re still maintained, still solve real problems, and still faster
than the &lt;code>kubectl&lt;/code> incantation you&amp;rsquo;d otherwise be typing.&lt;/p></description></item><item><title>SLA, SLO, SLI, and Error Budgets: A DevOps Reality Check</title><link>https://falcao.org/posts/sla-slo-sli-error-budget-devops-practical-guide/</link><pubDate>Sun, 24 May 2026 22:50:00 -0300</pubDate><guid>https://falcao.org/posts/sla-slo-sli-error-budget-devops-practical-guide/</guid><description>&lt;p>Most teams get SLAs, SLOs, and SLIs wrong. Not because the concepts are hard, but because they treat them as compliance paperwork instead of operational tools. The result is dashboards nobody trusts, targets nobody chose deliberately, and on-call rotations that burn people out chasing noise.&lt;/p>
&lt;p>This post is a field guide for teams that actually run production systems and want reliability engineering to work as an engineering discipline — not a slide deck exercise.&lt;/p></description></item><item><title>eBPF Is Eating Kubernetes' iptables Plumbing</title><link>https://falcao.org/posts/ebpf-eating-kubernetes-iptables/</link><pubDate>Wed, 20 May 2026 19:30:00 -0300</pubDate><guid>https://falcao.org/posts/ebpf-eating-kubernetes-iptables/</guid><description>&lt;p>For most of Kubernetes&amp;rsquo; life, the cluster data path has been a tower of
&lt;strong>iptables&lt;/strong> rules. Pod-to-service routing, NAT, network policy, even
the way &lt;code>kube-proxy&lt;/code> programs a Service IP — all of it expressed as
netfilter chains evaluated linearly on every packet. It worked. It
also aged badly.&lt;/p>
&lt;p>In 2026, the answer the ecosystem has converged on is &lt;strong>eBPF&lt;/strong>, and the
project doing most of the convergence is &lt;strong>Cilium&lt;/strong>. The shift is
no longer aspirational: kube-proxy itself shipped an &lt;strong>nftables mode&lt;/strong>
that is expected to go &lt;strong>GA in Kubernetes 1.33&lt;/strong>, the old &lt;strong>IPVS
backend is deprecated as of v1.35&lt;/strong>, and the major managed Kubernetes
providers (EKS, GKE, AKS) all offer a Cilium-powered data plane as a
first-class option. Azure CNI Powered by Cilium is GA on K8s 1.33.&lt;/p></description></item></channel></rss>