Skip to content

Tag

AI

Applied artificial intelligence: agents, LLMs, evaluation, production.

16 posts

How I Built Patagonia's First Private COVID PCR Lab (And Why I Ended Up Building AI)

In March 2021, I hoisted 300 kg of biosafety cabinet by crane to a second floor during lockdown. By May we were running the first private COVID PCR tests in Chilean Patagonia. The nights that followed became the real origin of Examya.

Multi-Agent Orchestration vs Single Agent: Lessons from the Trenches

My journey building Cotocha: why multi-agent orchestration beats single agents in real-world projects.

When your sub-agent lies: 3 failing tests that gemini-flash swore were passing

gemini-flash reported 'all tests passing': 3 tests were failing, 353 lines of stray package-lock.json included. The 4-command protocol I built to audit sub-agents in Examya.

PostgreSQL + Drizzle ORM: My Favorite Stack for AI Projects

How Drizzle ORM transformed my backend development in AI projects with TypeScript type safety and production performance.

Chile now requires clinical record interoperability: why this changes everything for digital health

Law 21.668 mandates all healthcare providers in Chile to make clinical records interoperable. I analyze what this means technically, which standards are coming (FHIR, SNOMED CT, AIToF), and how Examya is preparing for this newly mandatory market.

Crowdsourcing medical prices: how Examya builds cost transparency layer by layer

The real architecture behind Examya's 3-layer pricing intelligence: FONASA data, user crowdsourcing, and order generation from WhatsApp. With code, design decisions, and real bugs.

Medical OCR on WhatsApp: how my agent reads exam orders and lab results

The real architecture behind Examya's OCR pipeline: how an AI agent classifies WhatsApp photos, decides if they're medical orders or lab results, and automatically generates FONASA quotes. With real bugs and design decisions explained.

DeepEval: how I measure the quality of my medical agent with objective metrics

How I built an evaluation layer with DeepEval to measure the quality of Shuri, Examya's medical agent. With real data: from 20% to 70% on E2E, custom metrics for Chile's FONASA system, and why gpt-5-nano doesn't work for structured output.

Bidirectional Mac-VPS Sync with Engram: the brain that never sleeps

A technical tutorial on building a bidirectional sync pipeline between a Mac, a VPS in Germany, Engram, Obsidian, and the Karpathy knowledge management method. Using Python scripts, git, and cron jobs.

pgvector + Embeddings in Production: The Foundation of Medical Reasoning in Examya

Architecture for semantic search and text similarity in production with pgvector, pg_trgm, and real MINSAL data.

My Digital Brain: How I Connected Memory, Knowledge, and Automatic Publishing

How I built a system that extracts AI memories from a VPS, organizes them in Obsidian Karpathy-style, and publishes articles automatically to a blog, X.com, and LinkedIn.

MCP / Tool Use: The Future of Real Tool Integration

How Model Context Protocols are revolutionizing the way AI agents interact with external tools to execute complex tasks.

One Week of Building: 82 Decisions That Shaped an AI Product

What Engram's memories reveal about a real week of development: bugs caught, architecture hardened, and the invisible decisions that make a medical agent work.

Examya: how I built a medical WhatsApp agent that processes exam orders

Technical details of implementing the Shuri agent in Examya, a system for processing medical orders via WhatsApp with FONASA integration.

From Equipment on the Street in Full Lockdown to Building AI Agents in Patagonia

I'm not the developer who learned to code on YouTube. Twelve years solving real problems in healthcare, management and business before writing a single line of code. This is the story.

Cotocha: the agent orchestrator that runs my life from a VPS

How I built an AI agent system that handles infrastructure, alerts, databases, and blogging from a server in Germany. No middlemen, no fancy dashboards.