Skip to content
Holding·last review12 May 2026

Across the publicly documented 2025–2026 enterprise deployments, single-agent architectures with structured tool-calling outperform multi-agent orchestrations on accuracy, cost, and MTTD for tasks below approximately 12 distinct tool-domains; multi-agent only pays back above that threshold and only when inter-agent state is bounded by a shared structured artifact rather than free-text handoff.

Claim created at publish; review on 60-day cadence. The 'approximately 12' figure is a heuristic derived from the documented deployment record, not a measured constant from a controlled trial; it is expected to refine to an 8–15 band as more enterprise deployment data enters the public record. Anchor sources: Anthropic building-effective-agents guidance (Dec 2024, explicit caution against premature multi-agent complexity); OpenAI Agents SDK production guidance (single-orchestrator pattern documented as default); Microsoft AutoGen Magentic-One paper (arXiv:2411.04468, multi-agent benchmark methodology showing audit complexity); LangGraph production case studies (structured-artifact state pattern in highest-performing deployments). Salesforce Agentforce and ServiceNow Agent Studio are correctly classified as single-orchestrator architectures for the purposes of this claim; they do not provide evidence for or against the multi-agent threshold. Trigger conditions to revisit before next cadence: (a) a peer-reviewed benchmark overturning the ~12 tool-domain threshold using 50+ enterprise deployments with a methodology that separates tool-domain count from task complexity — would move status to Partial or Not holding depending on the direction; (b) a frontier capability shift eliminating delegation-hop overhead through native multi-agent coordination at inference time rather than at the application layer — would move the threshold upward and narrow the single-agent advantage below it; (c) publication of additional enterprise deployment data that refines the single-number heuristic to the expected 8–15 band — would move status to Partial as the band supersedes the single number without invalidating the directional finding. The claim is expected to shift from Holding to Partial within 60 days as the band refinement is the most likely near-term development; this is not a weakness in the claim but the intended review posture.

Published
12 May 2026
Last reviewed
12 May 2026
Next review
+59d· 11 Jul 2026
Embed this claimiframe + oEmbed
HTML iframe
Paste-the-URL (Substack, Medium, Notion, WordPress)

The card auto-updates when the claim's status, last-reviewed date, or correction log changes. Embedders never need to refresh — the card is rendered live from the canonical record.

About this register

The Reporting register tracks claims published from articles addressed to senior enterprise IT leaders — CIOs, IT directors, heads of platform. Claims are reviewed on a 30–90 day cadence; each review either reaffirms the claim, marks one substantive part as Partial, or marks it Not holding once the underlying evidence has been overtaken.

Recent corrections in Reporting

  • AM-002 · Not holding · 06 May 2026

    URL state changed. The /the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/ slug now serves a deliberately rewritten retrospective (claimId AM-130, "Agentic AI 2024-2025 retrospective", published 04 May 2026) against audited primary sources. The 28 Apr 2026 redirect to /retractions/ has been lifted to allow that. AM-002 the claim remains Not holding — the original $3.50/dollar + 70% failure-rate framing was withdrawn and is not restored. AM-130 is a separate claim with its own evidence chain. Readers arriving at /holding/AM-002 see the withdrawal here; the article link surfaces the new piece at the URL the original lived at, with this entry as the audit trail.

  • AM-121 · Holding · 2 May 2026

    Klarna walk-back primary-source upgrade — added Siemiatkowski verbatim quotes via Bloomberg-cited-by-Fortune (9 May 2025) and the Uber-style freelance hiring detail via Entrepreneur. Closes the highest-priority evidence gap from the source dossier.

  • AM-115 · Holding · 29 Apr 2026

    Initial publication 29 Apr 2026 — the first Quarterly Claim Review Bulletin. The claim itself is recursive: it asserts that the bulletin will ship quarterly, and the next review (30 Jul 2026) tests whether the Q3 bulletin actually appeared. Status starts as 'up' because the claim is currently true (the Q2 bulletin shipped). The verdict at end of July 2026 will move to Holding, Partial (bulletin shipped but on a delayed cadence), or Not holding (no bulletin shipped). REVIEW: Peter — please verify claim text + cadence wording before removing rewriteInProgress flag.

Reviews coming up in Reporting

  • AM-003 · Holding · next +6d (19 May 2026)

    GPT-5 Pro's tiered-subscription model forces enterprises to classify problems by computational difficulty — $200/month…

  • AM-136 · Holding · next +22d (4 Jun 2026)

    Across the 24-month window May 2024 to April 2026, every major foundation-model provider (Anthropic, OpenAI, Google, AW…

  • AM-020 · Holding · next +36d (18 Jun 2026)

    The 40-60% TCO underestimate on enterprise agentic-AI deployments is not a cost-visibility failure — it is a cross-depa…