Of the 8 most-cited enterprise agentic AI vendor claims made in Q1 2026 (Salesforce Agentforce, Microsoft Copilot Agent Mode, Google Gemini Enterprise, Anthropic Claude for Enterprise, OpenAI Agents SDK, ServiceNow AI Agents, Workday Illuminate, SAP Joule), a minority remain Holding at 90-day review, a majority sit at Partial with at least one falsified component, and customer-cited ROI claims hold materially better than vendor-cited ROI claims — meaning the citation-source of an enterprise AI claim is a stronger predictor of its 90-day durability than the size of the vendor making it.

Claim created at publish. Review on 90-day cadence from 12 May 2026 (next due 10 Aug 2026). The citation-source pattern (customer-cited vs. vendor-cited durability gap) is marked source:'our-estimate' throughout the article — it is an editorial observation derived from cross-referencing 8 vendor claim sets against third-party analyst coverage and public customer case-study disclosures, not a measured metric from a named research study. The 8 Q1 2026 vendor claims being graded are those documented in the Q1 2026 article at /agentic-ai-got-real-q1-2026/ plus the specific product announcements cited in this scorecard. Anchor sources: Salesforce Q1 FY27 earnings call 21 May 2026 (Agentforce 3.0 GA), Microsoft Q3 FY2026 earnings 30 Apr 2026 (Copilot agent seat growth), Google Cloud Next 2026 9 Apr 2026 (Gemini Enterprise multi-agent preview), Anthropic Claude for Enterprise GA blog 8 Apr 2026, OpenAI Agents SDK 1.0 release notes 15 Apr 2026, ServiceNow Q1 2026 earnings 23 Apr 2026, Workday Illuminate GA press release 14 Apr 2026, SAP Joule roadmap update Apr 2026. Sister claims: AM-013 (Q1 2026 governance threshold article — the 8 claims being graded are downstream of that piece), AM-148 (GPT-5.5 vs. Opus 4.7 split-verdict — the model-routing read informs which vendor infrastructure claims are plausible). Trigger conditions to revisit before next cadence: (a) a named vendor re-states a graded claim as 'Not holding' in an earnings call or investor day with a named timeline revision — the scorecard row moves and the citation-source meta-pattern may strengthen or weaken; (b) Gartner or Forrester publishes a Q3 2026 Magic Quadrant update that materially reorders the Holding/Partial/Not-holding distribution; (c) a customer-cited ROI claim in the scorecard is retracted or materially revised by the citing customer — would be the first data point running against the citation-source durability thesis; (d) any of the 8 vendors ships a GA milestone that closes a graded capability gap within the review window, which would move a row from Partial to Holding.

Published

12 May 2026

Last reviewed

12 May 2026

Next review

+53d· 10 Aug 2026

Source piece

Enterprise agentic AI in Q2 2026: what shipped, what slipped, what heldRead piece →

Primary sources

Permalink/holding/AM-153/

Embed this claimiframe + oEmbed

HTML iframe

<iframe src="https://agentmodeai.com/embed/claim/AM-153/" width="600" height="280" frameborder="0" scrolling="no" loading="lazy" referrerpolicy="strict-origin-when-cross-origin" title="AM-153: Holding — Agent Mode AI" style="border:0;max-width:100%;"></iframe>

Paste-the-URL (Substack, Medium, Notion, WordPress)

The card auto-updates when the claim's status, last-reviewed date, or correction log changes. Embedders never need to refresh — the card is rendered live from the canonical record.

Watch this claim

Email-me when AM-153's status, next review date, or correction log changes. One email per change. No newsletter subscription, no other mail.

The claim: Of the 8 most-cited enterprise agentic AI vendor claims made in Q1 2026 (Salesforce Agentforce, Microsoft Copilot Agent Mode, Google Gemini Enterprise, Anthropic Claude for Enterprise, OpenAI Agents SDK, ServiceNow AI Agents, Workday Illuminate, SAP Joule), a minority remain Holding at 90-day review, a majority sit at Partial with at least one falsified component, and customer-cited ROI claims hold materially better than vendor-cited ROI claims — meaning the citation-source of an enterprise AI claim is a stronger predictor of its 90-day durability than the size of the vendor making it.

About this register

The Reporting register tracks claims published from articles addressed to senior enterprise IT leaders — CIOs, IT directors, heads of platform. Claims are reviewed on a 30–90 day cadence; each review either reaffirms the claim, marks one substantive part as Partial, or marks it Not holding once the underlying evidence has been overtaken.

Recent corrections in Reporting

AM-008 · Partial · 17 Jun 2026
Source-text figure re-review: Google's 2024 Environmental Report reports a 28% year-over-year increase to 8.1 billion gallons, not the 33% (from a 6.1 billion 2023 base) asserted at publish. The 8.1B 2024 figure and the Microsoft WUE 0.30 L/kWh / 39%-improvement figure are unchanged and verified. Article corrected to 28% and the unsupported 6.1B base removed; the claim text retains the original figure with this correction per the Holding-up protocol.
AM-132 · Partial · 10 Jun 2026
One of four legs unanchored on re-review. The claim text attributes '12% of deployments clearing 300%+ ROI with 88% at or below break-even at 12-18 months' to the Stanford DEL 2026 Enterprise AI Playbook. Full-text verification on 10 Jun 2026 found no such figure in that source: the playbook (Pereira, Graylin, Brynjolfsson, Apr 2026) studies 51 successful deployments by design and contains no ROI distribution, no 300%-plus cohort, and no break-even measurement point (full finding at AM-029, correction of 10 Jun 2026). The only verified figure carrying the same 12/88 numerals is IDC research with Lenovo (via CIO.com, Mar 2025): roughly 88% of AI proof-of-concepts never reach production and roughly 12% graduate — a pilot-to-production graduation metric, not an ROI distribution. The Gartner 28%, McKinsey 23%/17%, and MIT NANDA 95% legs verify; they support a small high-performing tail and a large struggling body, but none documents the two-peak bimodal shape the claim asserts. Status Up -> Partial.
AM-129 · Partial · 10 Jun 2026
One of three read-against anchors unanchored on re-review. The claim text cites 'Stanford Digital Economy Lab Enterprise AI Playbook (12/88 bimodal ROI distribution at 12-18 months)' and frames the realistic ROI band around 'the highest-discipline 12% cohort'. Full-text verification on 10 Jun 2026 found the playbook contains no 12/88 distribution, no bimodal ROI shape, and no 12-18-month ROI measurement point (full finding at AM-029, correction of 10 Jun 2026). The claim's core negative finding — no mid-market enterprise has produced a documented +240% ROI in 90 days under audited conditions — is unaffected; the McKinsey State of AI 2025 and MIT NANDA legs verify and continue to support it. The '12% cohort' framing has no verifiable referent. The only verified figure carrying the 12/88 numerals is IDC's pilot-graduation finding (roughly 88% of AI proof-of-concepts never reach production; via CIO.com, Mar 2025), a different metric. Status Up -> Partial.

Reviews coming up in Reporting

AM-063 · Holding · next +9d (27 Jun 2026)
AI agents executing financial transactions need a four-control bundle (action-approval gates by blast radius, kill-swit…
AM-061 · Holding · next +9d (27 Jun 2026)
Production agentic-AI costs at scale routinely run multiples of POC projections, and a layered optimisation programme c…
AM-003 · Partial · next +9d (27 Jun 2026)
GPT-5 Pro's tiered-subscription model forces enterprises to classify problems by computational difficulty — $200/month…

Referenced within Agent Mode AI by · 1 piece

Enterprise agentic AI in Q2 2026: what shipped, what slipped, what held