Skip to content
Partial·last review10 Jun 2026

The 2026 enterprise agentic AI vendor comparison reduces to four credible platform plays (Anthropic, OpenAI, Google, Microsoft), and the procurement decision between them is no longer primarily about model capability. The model layer has converged to comparable parity for most enterprise use cases. The procurement decision in 2026 is on three other axes: pricing model (Anthropic Managed Agents at 8 cents per session-hour plus tokens versus OpenAI Agents SDK at no first-party runtime fee versus Microsoft and Google's vertically-integrated platform pricing), governance and BAA posture (Anthropic's three-cloud BAA position is structurally distinct), and ecosystem distribution (Microsoft's Office plus Azure footprint has no near peer; Google's vertical integration on Workspace and Cloud is second). Treating this as a model-quality bake-off is the most common 2026 procurement mistake and produces decisions that age badly within the first 12 months.

Re-review 10 Jun 2026: the governance-axis sentence in the claim text ('Anthropic's three-cloud BAA position is structurally distinct') failed extracted-text verification — see corrections; it mirrors the same-day AM-053 finding. The other two axes verify with extracted text: Anthropic Managed Agents at $0.08 per session-hour plus tokens (public beta from 8 Apr 2026, billing per-millisecond on running time, confirmed across multiple 2026 pricing analyses) and OpenAI Agents SDK with no first-party runtime fee (MIT-licensed open source; AgentKit tool fees are separate and do not change the no-runtime-fee comparison). No repricing at any of the four vendors and no credible fifth-platform entry as of 10 Jun 2026 (AWS Bedrock AgentCore and the Gartner agentic-automation leaders Microsoft/ServiceNow/UiPath sit in adjacent categories, not the general model-plus-platform procurement frame). Claim is scoped to enterprise procurement of agentic AI platforms in 2026. 60-day review cadence. Watches: (1) major repricing or model-tier changes at any of the four vendors, (2) regulatory enforcement actions that materially affect one vendor's enterprise-suitability profile, (3) entry of a credible fifth platform (most plausibly AWS AgentCore graduating to a full platform play or a Linux Foundation Agentic AI Foundation neutral platform).

Published
26 Apr 2026
Last reviewed
10 Jun 2026
Next review
+43d· 25 Jul 2026

Correction log

  1. 10 Jun 2026Extracted-text verification failed on the governance axis of the claim. 'Anthropic's three-cloud BAA position is structurally distinct' overstates Anthropic's own coverage: per Anthropic's BAA documentation (privacy.claude.com, retrieved 10 Jun 2026), Anthropic signs BAAs for the first-party API and HIPAA-ready Claude Enterprise only; the page contains no Bedrock, Vertex, or Azure coverage. Claude consumed via AWS Bedrock or Google Vertex AI is covered by the hyperscaler's BAA, and an Azure-side Anthropic BAA could not be verified. The article's stronger formulation ('Anthropic operates under BAAs with Amazon Web Services, Google Cloud, and Microsoft Azure simultaneously', cited only to a secondary Ampcome blog) is the same overstatement. The substantive point survives restated: BAA-covered Claude deployment surfaces span more clouds than competitors offer, but the BAAs are not Anthropic's across three clouds. This matches the AM-053 correction of the same day. The pricing axis ($0.08 per session-hour plus tokens; Agents SDK no first-party runtime fee) and the ecosystem axis verify and stand. Status Up -> Partial. Article body needs a Peter-approved BAA restate (FAQ x2, body x3, howTo step 1).
Embed this claimiframe + oEmbed
HTML iframe
Paste-the-URL (Substack, Medium, Notion, WordPress)

The card auto-updates when the claim's status, last-reviewed date, or correction log changes. Embedders never need to refresh — the card is rendered live from the canonical record.

Watch this claim

Email-me when AM-039's status, next review date, or correction log changes. One email per change. No newsletter subscription, no other mail.

The claim: The 2026 enterprise agentic AI vendor comparison reduces to four credible platform plays (Anthropic, OpenAI, Google, Microsoft), and the procurement decision between them is no longer primarily about model capability. The model layer has converged to comparable parity for most enterprise use cases. The procurement decision in 2026 is on three other axes: pricing model (Anthropic Managed Agents at 8 cents per session-hour plus tokens versus OpenAI Agents SDK at no first-party runtime fee versus Microsoft and Google's vertically-integrated platform pricing), governance and BAA posture (Anthropic's three-cloud BAA position is structurally distinct), and ecosystem distribution (Microsoft's Office plus Azure footprint has no near peer; Google's vertical integration on Workspace and Cloud is second). Treating this as a model-quality bake-off is the most common 2026 procurement mistake and produces decisions that age badly within the first 12 months.

About this register

The Reporting register tracks claims published from articles addressed to senior enterprise IT leaders — CIOs, IT directors, heads of platform. Claims are reviewed on a 30–90 day cadence; each review either reaffirms the claim, marks one substantive part as Partial, or marks it Not holding once the underlying evidence has been overtaken.

Recent corrections in Reporting

  • AM-132 · Partial · 10 Jun 2026

    One of four legs unanchored on re-review. The claim text attributes '12% of deployments clearing 300%+ ROI with 88% at or below break-even at 12-18 months' to the Stanford DEL 2026 Enterprise AI Playbook. Full-text verification on 10 Jun 2026 found no such figure in that source: the playbook (Pereira, Graylin, Brynjolfsson, Apr 2026) studies 51 successful deployments by design and contains no ROI distribution, no 300%-plus cohort, and no break-even measurement point (full finding at AM-029, correction of 10 Jun 2026). The only verified figure carrying the same 12/88 numerals is IDC research with Lenovo (via CIO.com, Mar 2025): roughly 88% of AI proof-of-concepts never reach production and roughly 12% graduate — a pilot-to-production graduation metric, not an ROI distribution. The Gartner 28%, McKinsey 23%/17%, and MIT NANDA 95% legs verify; they support a small high-performing tail and a large struggling body, but none documents the two-peak bimodal shape the claim asserts. Status Up -> Partial.

  • AM-129 · Partial · 10 Jun 2026

    One of three read-against anchors unanchored on re-review. The claim text cites 'Stanford Digital Economy Lab Enterprise AI Playbook (12/88 bimodal ROI distribution at 12-18 months)' and frames the realistic ROI band around 'the highest-discipline 12% cohort'. Full-text verification on 10 Jun 2026 found the playbook contains no 12/88 distribution, no bimodal ROI shape, and no 12-18-month ROI measurement point (full finding at AM-029, correction of 10 Jun 2026). The claim's core negative finding — no mid-market enterprise has produced a documented +240% ROI in 90 days under audited conditions — is unaffected; the McKinsey State of AI 2025 and MIT NANDA legs verify and continue to support it. The '12% cohort' framing has no verifiable referent. The only verified figure carrying the 12/88 numerals is IDC's pilot-graduation finding (roughly 88% of AI proof-of-concepts never reach production; via CIO.com, Mar 2025), a different metric. Status Up -> Partial.

  • AM-201 · Partial · 10 Jun 2026

    One of four named datasets unanchored on review. The claim text names 'Stanford DEL's 12% clearing 300%+ ROI vs 88% at or below break-even' as one of four independent datasets. Full-text verification on 10 Jun 2026 found the Stanford DEL Enterprise AI Playbook contains no such distribution — it studies 51 successful deployments by design and carries no ROI-realisation failure data (full finding at AM-029, correction of 10 Jun 2026). The McKinsey (23% scaling, 17% EBIT-attribution), Gartner (28% fully paying off), and MIT NANDA (95% no measurable P&L impact) datasets verify; the claim's spine stands on three datasets rather than four. The only verified figure carrying the 12/88 numerals is IDC's pilot-graduation finding (roughly 88% of AI proof-of-concepts never reach production; via CIO.com, Mar 2025), a different metric from an ROI distribution. Status Up -> Partial.

Reviews coming up in Reporting

  • AM-063 · Holding · next +15d (27 Jun 2026)

    AI agents executing financial transactions need a four-control bundle (action-approval gates by blast radius, kill-swit…

  • AM-061 · Holding · next +15d (27 Jun 2026)

    Production agentic-AI costs at scale routinely run multiples of POC projections, and a layered optimisation programme c…

  • AM-003 · Partial · next +15d (27 Jun 2026)

    GPT-5 Pro's tiered-subscription model forces enterprises to classify problems by computational difficulty — $200/month…

Referenced within Agent Mode AI by · 1 piece