Skip to content
Holding·last review24 Apr 2026

The CMU TheAgentCompany 2026 benchmark figure (30.3% task completion for best-in-class frontier model, up from 24% in 2024) is the current capability constraint for enterprise agentic AI. Capability trajectory projects to ~40% by late 2027, which does not cross the 95% production-readiness threshold within the 3-year TCO horizon enterprise business cases operate against. The Stanford DEL 12% durable cohort operates within the 30.3% (narrow scope + human-in-the-loop + GAUGE-dimensional governance discipline), not around it. Capability is not the variable that separates the 12% from the 88%.

Third of three claim-archive signature pieces (after AM-029 Stanford 88% and AM-030 McKinsey 23%). 60-day review cadence. Watches: (1) frontier model crossing 50% on TheAgentCompany without corresponding deployment-pattern change, (2) cross-enterprise analyses showing capability-wait deployments equivalent to governance-discipline deployments, (3) benchmark refresh shifting the easy/medium/hard distribution such that more of the enterprise task space lands in the viable scope envelope.

Published
24 Apr 2026
Last reviewed
24 Apr 2026
Next review
+55d· 23 Jun 2026
Embed this claimiframe + oEmbed
HTML iframe
Paste-the-URL (Substack, Medium, Notion, WordPress)

The card auto-updates when the claim's status, last-reviewed date, or correction log changes. Embedders never need to refresh — the card is rendered live from the canonical record.