<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Self-Host on Danilo Falcão da Silva</title><link>https://falcao.org/tags/self-host/</link><description>Recent content in Self-Host on Danilo Falcão da Silva</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Mon, 18 May 2026 20:30:00 -0300</lastBuildDate><atom:link href="https://falcao.org/tags/self-host/index.xml" rel="self" type="application/rss+xml"/><item><title>Hetzner Isn't the Cheap Default Anymore — And the AI Boom Is Why</title><link>https://falcao.org/posts/hetzner-not-cheap-anymore/</link><pubDate>Mon, 18 May 2026 20:30:00 -0300</pubDate><guid>https://falcao.org/posts/hetzner-not-cheap-anymore/</guid><description>&lt;p>I&amp;rsquo;ve been running personal infrastructure on &lt;strong>Hetzner&lt;/strong> for the better
part of a decade. Cloud VMs for side projects. A couple of dedicated
boxes for the things I didn&amp;rsquo;t want to babysit on AWS pricing. The
calculus was always the same: pay a third of what the hyperscalers
charge, manage it yourself, accept the lack of a managed-service safety
net. The deal was very, very good.&lt;/p>
&lt;p>&lt;strong>On April 1, 2026, that deal got noticeably worse.&lt;/strong>&lt;/p></description></item><item><title>Ollama in 2026: The Boring Choice for Open-Source LLMs (And Why That's the Whole Point)</title><link>https://falcao.org/posts/ollama-the-boring-standard/</link><pubDate>Mon, 18 May 2026 17:30:00 -0300</pubDate><guid>https://falcao.org/posts/ollama-the-boring-standard/</guid><description>&lt;p>There&amp;rsquo;s a particular kind of project I love: the one that takes something
new and weird and turns it into something you barely have to think about.
Docker did this for containers. Hugo did it for static sites. In 2026,
&lt;strong>Ollama is doing it for open-source language models.&lt;/strong>&lt;/p>
&lt;p>You install one binary. You type &lt;code>ollama pull &amp;lt;model&amp;gt;&lt;/code>. You type
&lt;code>ollama run &amp;lt;model&amp;gt;&lt;/code>. Your application talks to &lt;code>localhost:11434&lt;/code> over an
OpenAI-compatible HTTP API. That&amp;rsquo;s the whole user manual. The fact that
this is now boring — that nobody has to argue about it on Hacker News
anymore — is the point of this post.&lt;/p></description></item></channel></rss>