Skip to content
Not holding·last review28 Apr 2026

Agentic AI's compounding economics show up in back-office operations (AP, IT ticket triage, HR onboarding, procurement, close-cycle reconciliation), not in front-office customer-facing workflows. The 12% of deployments that clear 300%+ ROI cluster there for structural reasons: per-action savings × action frequency × task-specification tightness × existing process instrumentation.

Note hygiene 10 Jun 2026 (claim already Not holding since 28 Apr 2026): the Stanford DEL bimodal-distribution anchor this note previously cited failed primary-source verification — the playbook contains no 12/88 ROI distribution (see AM-029 correction, 10 Jun 2026). Verified anchors that remain on the record: Gartner Q1 2026 28% pay-off rate, OneReach 2026 171% average, Futurum 71% operational median vs 40% high-automation. Anthropic AP-processing + Salesforce tier-1 support + Microsoft Copilot-Dynamics as back-office case anchors. The claim text and the 19 Apr 2026 correction entries retain their original 12/88 wording as the immutable historical record. No further reviews scheduled.

Published
19 Jul 2025
Last reviewed
28 Apr 2026
Next review

Correction log

  1. 19 Apr 2026Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114.
  2. 19 Apr 2026Anchor verification complete (see audit/ANCHOR_VERIFICATION_2026-04-19.md). 'Sarah Chen' and the 2 AM Munich-hotel scenario are fully fabricated — the article's narrative protagonist does not correspond to any real executive. The underlying framework (back-office cost compounding faster than front-office wins; per-action delta × frequency) IS defensible against McKinsey + Futurum operational-AI-ROI data. Rewrite required before the article can move to Holding.
  3. 19 Apr 2026Body rewritten. Fabricated 'Sarah Chen' narrative frame removed entirely. Claim spine sharpened: original was 'back-office cost compounding faster than front-office'; new version adds the structural explanation (per-action × frequency × task-specification × measurement instrumentation) and specific 2026 benchmark anchors (Stanford DEL 12%/88%, Gartner 28%, Futurum 71% vs 40%). Status moves from Partial to Up. Cross-links to AM-020 (TCO), AM-021 (measurement discipline), AM-022 (bimodal ROI) explicitly drawn in the body. Next review 18 Jun 2026.
  4. 28 Apr 2026Article retracted 28 Apr 2026. Slug structure (fictional protagonist, 'discovered her competitors' secret weapon') is the fabricated narrative frame the body had to remove on 19 Apr 2026. Body rewritten with Stanford DEL / McKinsey / Futurum sourcing (preserved in archived/) but the slug is the structural problem. URL now redirects to /retractions/?retired=the-executive-who-discovered-her-competitors-secret-weapon. Claim withdrawn — status moves to Not holding, no further reviews scheduled.
Embed this claimiframe + oEmbed
HTML iframe
Paste-the-URL (Substack, Medium, Notion, WordPress)

The card auto-updates when the claim's status, last-reviewed date, or correction log changes. Embedders never need to refresh — the card is rendered live from the canonical record.

About this register

The Reporting register tracks claims published from articles addressed to senior enterprise IT leaders — CIOs, IT directors, heads of platform. Claims are reviewed on a 30–90 day cadence; each review either reaffirms the claim, marks one substantive part as Partial, or marks it Not holding once the underlying evidence has been overtaken.

Recent corrections in Reporting

  • AM-132 · Partial · 10 Jun 2026

    One of four legs unanchored on re-review. The claim text attributes '12% of deployments clearing 300%+ ROI with 88% at or below break-even at 12-18 months' to the Stanford DEL 2026 Enterprise AI Playbook. Full-text verification on 10 Jun 2026 found no such figure in that source: the playbook (Pereira, Graylin, Brynjolfsson, Apr 2026) studies 51 successful deployments by design and contains no ROI distribution, no 300%-plus cohort, and no break-even measurement point (full finding at AM-029, correction of 10 Jun 2026). The only verified figure carrying the same 12/88 numerals is IDC research with Lenovo (via CIO.com, Mar 2025): roughly 88% of AI proof-of-concepts never reach production and roughly 12% graduate — a pilot-to-production graduation metric, not an ROI distribution. The Gartner 28%, McKinsey 23%/17%, and MIT NANDA 95% legs verify; they support a small high-performing tail and a large struggling body, but none documents the two-peak bimodal shape the claim asserts. Status Up -> Partial.

  • AM-129 · Partial · 10 Jun 2026

    One of three read-against anchors unanchored on re-review. The claim text cites 'Stanford Digital Economy Lab Enterprise AI Playbook (12/88 bimodal ROI distribution at 12-18 months)' and frames the realistic ROI band around 'the highest-discipline 12% cohort'. Full-text verification on 10 Jun 2026 found the playbook contains no 12/88 distribution, no bimodal ROI shape, and no 12-18-month ROI measurement point (full finding at AM-029, correction of 10 Jun 2026). The claim's core negative finding — no mid-market enterprise has produced a documented +240% ROI in 90 days under audited conditions — is unaffected; the McKinsey State of AI 2025 and MIT NANDA legs verify and continue to support it. The '12% cohort' framing has no verifiable referent. The only verified figure carrying the 12/88 numerals is IDC's pilot-graduation finding (roughly 88% of AI proof-of-concepts never reach production; via CIO.com, Mar 2025), a different metric. Status Up -> Partial.

  • AM-201 · Partial · 10 Jun 2026

    One of four named datasets unanchored on review. The claim text names 'Stanford DEL's 12% clearing 300%+ ROI vs 88% at or below break-even' as one of four independent datasets. Full-text verification on 10 Jun 2026 found the Stanford DEL Enterprise AI Playbook contains no such distribution — it studies 51 successful deployments by design and carries no ROI-realisation failure data (full finding at AM-029, correction of 10 Jun 2026). The McKinsey (23% scaling, 17% EBIT-attribution), Gartner (28% fully paying off), and MIT NANDA (95% no measurable P&L impact) datasets verify; the claim's spine stands on three datasets rather than four. The only verified figure carrying the 12/88 numerals is IDC's pilot-graduation finding (roughly 88% of AI proof-of-concepts never reach production; via CIO.com, Mar 2025), a different metric from an ROI distribution. Status Up -> Partial.

Reviews coming up in Reporting

  • AM-063 · Holding · next +15d (27 Jun 2026)

    AI agents executing financial transactions need a four-control bundle (action-approval gates by blast radius, kill-swit…

  • AM-061 · Holding · next +15d (27 Jun 2026)

    Production agentic-AI costs at scale routinely run multiples of POC projections, and a layered optimisation programme c…

  • AM-003 · Partial · next +15d (27 Jun 2026)

    GPT-5 Pro's tiered-subscription model forces enterprises to classify problems by computational difficulty — $200/month…