Prompt Engineering

4 articles

Intermediate

intermediate

Retiring Production Agents: The Checklist Nobody Wrote

Launches get conference talks. Retirements get archived repos and live credentials. Five sequential phases — audit, extract, shadow, communicate, shut down — and the security blast radius when you skip any of them.

Mar 27, 20267 min

Advanced

advanced

Platform

Most Skill Files Never Trigger. The Description Field Is Why.

Roughly nine in ten skill files fail one of five basic checks. The body is rarely the problem. The description is — that 100-token blurb is the only thing the agent reads when deciding whether to load you. Engineer it, or stay invisible.

Oct 30, 20257 min

Platform

Your Traces Are Green. Output Quality Is Collapsing.

Latency, error rate, and token cost stay green while LLM output quality degrades for weeks. The infrastructure layer cannot see semantic failure. Sampled evals, prompt hash drift, and distribution alerts are the signals that catch it before users do.

Apr 22, 20265 min

Platform

Prompt Contract Versioning: The Missing Discipline for Multi-Agent Systems

How to apply semantic versioning and consumer-driven contract testing to AI agent system prompts — treating prompts as versioned API contracts with explicit breaking change classification, agent manifests, and CDC-style registration for multi-agent production systems.

Apr 27, 20266 min