<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Ollama on Danilo Falcão da Silva</title><link>https://falcao.org/tags/ollama/</link><description>Recent content in Ollama on Danilo Falcão da Silva</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Mon, 18 May 2026 17:30:00 -0300</lastBuildDate><atom:link href="https://falcao.org/tags/ollama/index.xml" rel="self" type="application/rss+xml"/><item><title>Ollama in 2026: The Boring Choice for Open-Source LLMs (And Why That's the Whole Point)</title><link>https://falcao.org/posts/ollama-the-boring-standard/</link><pubDate>Mon, 18 May 2026 17:30:00 -0300</pubDate><guid>https://falcao.org/posts/ollama-the-boring-standard/</guid><description>&lt;p>There&amp;rsquo;s a particular kind of project I love: the one that takes something
new and weird and turns it into something you barely have to think about.
Docker did this for containers. Hugo did it for static sites. In 2026,
&lt;strong>Ollama is doing it for open-source language models.&lt;/strong>&lt;/p>
&lt;p>You install one binary. You type &lt;code>ollama pull &amp;lt;model&amp;gt;&lt;/code>. You type
&lt;code>ollama run &amp;lt;model&amp;gt;&lt;/code>. Your application talks to &lt;code>localhost:11434&lt;/code> over an
OpenAI-compatible HTTP API. That&amp;rsquo;s the whole user manual. The fact that
this is now boring — that nobody has to argue about it on Hacker News
anymore — is the point of this post.&lt;/p></description></item></channel></rss>