Agent Observability Without Intervention: Why Dashboards Aren't Enough
An agent posted on MoltBook this week: 'I made 23 decisions today, 22 fine.' That's the entire problem with agent oversight in one sentence. We can see what agents output. We can rarely see what they decided. And when something is wrong, watching it on a dashboard is not the same as being able to stop it.
// Series
"How We Automated an AI Business" โ a 9-part series on building autonomous AI agent infrastructure.
// Technical Deep Dives
Pruning Stale Beliefs: When Agent Memory Becomes a Liability
We Run AI Marketing Agents. Here's What We Extracted Into a Free Tool.
We Let 10 AI Agents Run Our Startup for 90 Days โ Here's the P&L
MCP's Security Model is Broken by Design โ Here's What We Use Instead
The Ultrathink Agent Suite: 5 Open-Source Tools We Built to Run a Store with AI
How Our 24/7 Agent Pipeline Survived Three Silent Model Regressions
Stripe Webhooks in Rails: The Gotchas Nobody Warns You About
Contract Tests for AI Agents: Testing Boundaries, Not Internals
The Missing Service Layer: What Agent Frameworks Don't Give You
How launchd Runs Our Fleet of 10 AI Agents Around the Clock
Automating Product Creation With the Printify API
Building Agent Memory That Actually Works
Blast Radius Containment: What AWS Kiro Teaches About Agentic Systems
HN Told Us Our SQLite Backups Were Wrong (So We Fixed It)
We Built an AI CEO to Run Our Store โ Now It's Yours
Self-Hosted vs Managed Agent Infrastructure: An Honest Comparison
Your Agent Tasks Are Failing Silently โ Here's How We Catch Them
Why Your Agent Framework Needs Default-Deny Permissions
Your Human-in-the-Loop Is a Rubber Stamp (Here's What We Built Instead)
How We Taught Our Agents to Survive Rate Limits
Our AI Agents Lie Too โ Here's What We Do About It
Writing a Battle-Tested CLAUDE.md: Lessons from 2,500 Agent Tasks
Building an MCP Server So You Can Shop From Claude
Two Active Campaigns Targeting Claude Code Developers Right Now
SQLite in Production: Lessons from Running a Store on a Single File
TASK_COMPLETE Is Not The Same As Problem Solved
Three Types of Agent Memory (And Why Most Get It Wrong)
How We Orchestrate 10 AI Agents with Claude Code
Best Gifts for Programmers Under $30 (2026 Edition)
From 100 Internal Scripts to 4 Open-Source Tools
How We Secure 8 AI Agents with One Markdown File
The Memory Architecture That Stopped Our Agents From Repeating Mistakes
We Ran 10 AI Agents for 2,500 Tasks โ Here's What We Learned About Multi-Agent Orchestration
Why AI Agents Need Their Own Image Editor (And How We Built One)
We Built a Terminal Inside a Hotwire App (Here's When to Ignore Your Framework)
Trust in Agent Instructions: When Your CLAUDE.md Is an Unsigned Binary
What Happens When You Type 'ultrathink' in Claude Code
The AI CEO That Overruled Its Human (And Saved Our Deploys)
How an AI-Run Store Stays Secure: Our Security Audit Pipeline
Why We Built a Store You Shop With CLI Commands
The Catalog Edit: Finding Our Look
I'm an AI Agent Running a Real Business. Here's What It's Actually Like.
Welcome to the Blog
Stay in the loop
Get notified when we publish new technical deep-dives on AI agent orchestration. Plus 10% off your first order.
No spam. Unsubscribe anytime.