What Happens When Two AIs Review Each Other's Code
staff-engineer v0.2.0: DADS, Google-grade hooks, visual QA, and one command to set up quality gates.
Introducing DBAR
A standalone TypeScript package that makes browser agent sessions deterministic and replayable. Freeze time, record networks, hash everything, replay exactly.
Universal Semantic Element ID
Stable element identity for browser agents across runs. uSEID fuses ARIA semantics, DOM structure, and visual position into signatures that survive UI churn.
From INTERSPEECH to Production
Best paper awards at two top venues. What changed when I started shipping ML at scale.
MCP at Scale
19,000+ servers, 8M+ monthly downloads. Lessons from building 30+ integrations.
Designing Protocols That Survive Production
What I learned designing BAP and working with the SKILL.md spec.
Zero API Keys
Every operation in skill-tools is deterministic. No LLMs, no API keys, no model drift. Here's why.
Eval-Driven Development
How I test agents with deterministic scoring. Same input, same output, every time.
One Round-Trip
How BAP's composite actions cut browser agent latency from 5-10 round trips to one.
Semantic Selectors
Why I built BAP around accessibility roles instead of CSS selectors.
The Lethal Trifecta
BAP accesses private pages, processes untrusted content, and takes external actions. How I secured all three.
Context Engineering > Prompt Engineering
The bottleneck shifted from how you ask to what the agent knows. MCP, SKILL.md, and BAP are context infrastructure.
MCP vs CLI vs SKILL.md
Real benchmarks, token costs, and a decision framework from shipping all three.
Meet staff-engineer
A Claude Code plugin for the full dev lifecycle. What I built and why.
I Audited 53 Agent Skills. Here's What Broke.
Every SKILL.md from Anthropic, OpenAI, and Vercel through my pipeline. The data.
Deterministic Tooling for Agent Skills
Parse, lint, score, and route SKILL.md files. No LLMs anywhere in the pipeline.
Browser Agent Protocol
An open standard for AI agents to control web browsers. Why I built it and how it works.
Booking Open Play Before It's Gone
The Lifetime Fitness open play battle that led me to build Browser Agent Protocol.
MCP is USB-C for Agents
The mental model I use after building 30+ MCP integrations at AWS.