Skip to content
Holding·last review12 May 2026

In production agentic systems documented across the publicly observable 2025–2026 deployments, the observable band of internal LLM calls per user-facing request sits between 1:18 and 1:60 across documented deployments, with tail cases regularly exceeding 1:400 — meaning unit-economics, latency budgets, and observability scopes built on a 1:1 mental model under-provision by one to two orders of magnitude.

Claim created at publish; review on 60-day cadence (the model-capability and agentic-framework release tempo means the fan-out ratio band can shift within a single quarter). The 1:18 floor is derived from constrained, well-instrumented production deployments documented in LangSmith multi-step agent case studies and the Anthropic Building Effective Agents post (December 2024); the 1:60 upper band reflects research and orchestration agents documented in the Microsoft Magentic-One paper (arXiv 2411.04468) and Helicone cost-per-task analysis; the 1:400 tail reflects unbounded ReAct agents documented in OpenAI Agents SDK warnings and LangSmith trace data. The band is asserted as the observable range across documented case studies, not as a measured median across a systematic sample — the systematic cross-deployment study does not yet exist in the public literature. Three triggers to revisit before next cadence: (a) a frontier model that internalises multi-step reasoning within a single API call, collapsing the billing-layer fan-out ratio, which would require reframing the cost section while preserving the latency and observability sections; (b) a major observability platform (LangSmith, Helicone, Datadog AI) publishing a cross-deployment fan-out ratio study that either validates or revises the 1:18–1:60 band with systematic data; (c) provider-level task-completion pricing replacing per-call pricing, which changes the unit-economics framing without changing the detection or latency implications. Sister claims: AM-013 (Q1 2026 agentic AI thresholds — the detection-time and observability themes are directly related), AM-148 (GPT-5.5 vs Opus 4.7 cost-per-task framing — the fan-out ratio is a multiplier on the per-call cost comparison). The MTTD-for-Agents tool-use-frequency Z-score tripwire described in this article is the primary detection mechanism; full methodology at /mttd/.

Published
12 May 2026
Last reviewed
12 May 2026
Next review
+59d· 11 Jul 2026
Embed this claimiframe + oEmbed
HTML iframe
Paste-the-URL (Substack, Medium, Notion, WordPress)

The card auto-updates when the claim's status, last-reviewed date, or correction log changes. Embedders never need to refresh — the card is rendered live from the canonical record.

About this register

The Reporting register tracks claims published from articles addressed to senior enterprise IT leaders — CIOs, IT directors, heads of platform. Claims are reviewed on a 30–90 day cadence; each review either reaffirms the claim, marks one substantive part as Partial, or marks it Not holding once the underlying evidence has been overtaken.

Recent corrections in Reporting

  • AM-002 · Not holding · 06 May 2026

    URL state changed. The /the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/ slug now serves a deliberately rewritten retrospective (claimId AM-130, "Agentic AI 2024-2025 retrospective", published 04 May 2026) against audited primary sources. The 28 Apr 2026 redirect to /retractions/ has been lifted to allow that. AM-002 the claim remains Not holding — the original $3.50/dollar + 70% failure-rate framing was withdrawn and is not restored. AM-130 is a separate claim with its own evidence chain. Readers arriving at /holding/AM-002 see the withdrawal here; the article link surfaces the new piece at the URL the original lived at, with this entry as the audit trail.

  • AM-121 · Holding · 2 May 2026

    Klarna walk-back primary-source upgrade — added Siemiatkowski verbatim quotes via Bloomberg-cited-by-Fortune (9 May 2025) and the Uber-style freelance hiring detail via Entrepreneur. Closes the highest-priority evidence gap from the source dossier.

  • AM-115 · Holding · 29 Apr 2026

    Initial publication 29 Apr 2026 — the first Quarterly Claim Review Bulletin. The claim itself is recursive: it asserts that the bulletin will ship quarterly, and the next review (30 Jul 2026) tests whether the Q3 bulletin actually appeared. Status starts as 'up' because the claim is currently true (the Q2 bulletin shipped). The verdict at end of July 2026 will move to Holding, Partial (bulletin shipped but on a delayed cadence), or Not holding (no bulletin shipped). REVIEW: Peter — please verify claim text + cadence wording before removing rewriteInProgress flag.

Reviews coming up in Reporting

  • AM-003 · Holding · next +6d (19 May 2026)

    GPT-5 Pro's tiered-subscription model forces enterprises to classify problems by computational difficulty — $200/month…

  • AM-136 · Holding · next +22d (4 Jun 2026)

    Across the 24-month window May 2024 to April 2026, every major foundation-model provider (Anthropic, OpenAI, Google, AW…

  • AM-020 · Holding · next +36d (18 Jun 2026)

    The 40-60% TCO underestimate on enterprise agentic-AI deployments is not a cost-visibility failure — it is a cross-depa…