Blog
All my articles and thoughts.
-
The Agent Trust Stack Just Got Built: Three Weeks in May 2026
• 6 min readSkill cards, self-hosted sandboxes, MCP tunnels, computer-use verifiers, and a Five Eyes warning all landed in twenty-one days. The boring perimeter around capable agents finally has shape.
-
The Browse-Click-Compare Web Is Ending. Here's What Replaces It.
• 10 min readTwenty minutes of tabs vs. five minutes of prompt. The traditional web wasn't designed for humans — it was designed for mice. The agent-native web is quietly dismantling the parts that never made sense.
-
Skills, Connectors, Subagents: Anthropic's 3-Layer Agent Template
• 10 min readAnthropic just shipped 10 financial services agent templates. The interesting part isn't the templates — it's the three-layer architecture quietly becoming the standard for enterprise agents.
-
Cascaded vs Fused Voice Agents: A Builder's Perspective on Architecture Choices
• 16 min readDeep dive into voice agent architectures. Why cascaded models give you control and fused models trade complexity for naturalness. What we're learning from shipping production agents at scale.
-
How to Make Voice Agents Sound Human: A Practical Guide to Realistic Speech Prompting
• 9 min readWhy your cascaded voice agent sounds robotic — and how to fix it with concrete examples, SSML pause patterns, emotion tags, and personality-as-behavior prompting techniques.
-
Context Engineering: The Discipline That Makes AI Agents Actually Work
(updated) • 16 min readA deep dive into context engineering — the techniques that separate toy demos from production AI agents. Covers compaction, offloading, isolation, caching, and prioritization with real examples from Manus, Claude Code, and Devin.
-
Training a Virtual Company: A Deep Dive into Multi-Agent Reinforcement Learning with OpenEnv & Unsloth
• 29 min readHow exploring LLM fine-tuning led to building a Gymnasium-compatible RL environment where 7 LLM-powered agents run a company — trained with GRPO + LoRA on Qwen 2.5 14B — and what we learned about reward design, emergent collaboration, and the future of agentic AI.
-
MCP Has a Tools Problem — And Code Mode Might Fix It
• 7 min readAI agents are drowning in tools. The more APIs you connect via MCP, the worse your agent performs. Here's why, and what Code Mode changes.
-
The AI App Paradox: Why We're Drowning in Tools but Starving for Experience
• 2 min readWe've been so obsessed with what AI can do that we forgot about how it feels to use it. The AI experience layer is the next frontier — not the model, not the capabilities.
-
I Tested Every Major Open-Source AI Agent SDK So You Don't Have To
• 2 min readA comprehensive hands-on comparison of seven open-source AI agent frameworks — which one should you actually use?