Why does the headline 99.9% number undersell the actual buying-committee question?

Because 99.9% is calculated against a denominator and an exclusions list the buying committee rarely reads. The exclusions typically include scheduled maintenance windows (which can be substantial at the hyperscaler tier), force-majeure events, customer-caused outages, third-party-caused outages including upstream dependencies, and a class of 'partial availability' events that the vendor treats as compliant even when the customer's perceived availability is degraded. The buying committee that compares 99.9% against 99.95% on the headline number is comparing two numbers calculated against different denominators; the actual comparison requires reading the SLA's full text against the customer's specific workload pattern. The 2026 pattern that surfaces at year-two renewal is the customer realising the effective availability has been materially below the headline SLA number because the exclusions list has been activated more often than the customer modelled.

What does the AWS Bedrock SLA actually commit to?

The AWS Bedrock Service Level Agreement commits to 99.9% monthly uptime percentage for the Bedrock invocation API, with the credit-tier structure at 10% (uptime between 99.0% and 99.9%), 25% (between 95.0% and 99.0%), and 100% (below 95.0%). The credit is applied against the monthly service charge for the affected Region, not against the entire AWS bill. Exclusions include scheduled maintenance (announced 48+ hours in advance), force majeure, customer-side issues (incorrect IAM configuration, exceeded service quotas), and third-party-source outages. The AgentCore-specific SLA inherits the underlying Bedrock SLA at the GA tier with model-specific commitments varying by region; the buying committee should read the per-region availability commitment separately from the headline.

What are the five comparison dimensions the buying committee should price?

Dimension one, uptime commitment percentage and the denominator it is calculated against (monthly uptime against the customer's actual usage minutes, not against calendar minutes; some SLAs use one definition and some use another). Dimension two, latency commitment (P95 and P99 latency targets the vendor commits to, which is the dimension most underspecified in public SLAs; the buying committee should require it in the customer's MSA). Dimension three, support response tier (the response-time commitment per severity level: P1 acknowledgement under 15 minutes is the enterprise-grade default; the buying committee should evaluate the tier per the customer's actual operational hours). Dimension four, credit calculation (against what does the SLA credit apply: monthly service charge for the affected region, monthly service charge across regions, the entire vendor relationship, or a separate credit pool; the differences materially change the credit's practical value). Dimension five, exclusions list scope (the categorisation of events the vendor treats as SLA-compliant despite customer-perceived degradation: scheduled maintenance, capacity constraints, content-policy actions, third-party-source outages, partial-availability events).

How does this article track its own claim?

Claim AM-179 in the Holding-up ledger, 60-day review on 26 Jul 2026. Trigger conditions: (1) any major vendor publishing a new SLA tier or restructuring the credit mechanism materially shifts the matrix and moves the claim toward Partial; (2) a published industry-wide outage event (a 2026 analog to the December 2024 AWS Bedrock content-filter outage or the January 2025 Azure OpenAI service degradation) provides concrete precedent that changes the exclusions-list framing; (3) NIST AI RMF or sector-specific cybersecurity rules requiring specific SLA commitments for regulated workloads warrants a matrix update; (4) the Anthropic and OpenAI Enterprise SLA documentation reaching the same transparency level as the hyperscaler tier would change the model-vendor comparison materially. Sibling pieces: AM-174 (security platform TCO/ROI) covers the cost-side calculation; AM-167 (NHI procurement clauses) covers the contract-side instruments; /agentic-ai-sla-architecture/ covers the customer-side SLA design that this piece is the supply-side companion to.

Enterprise AI vendor SLA + uptime comparison 2026

At a glance

Claim

The 2026 enterprise AI infrastructure vendor SLA conversation resolves on five dimensions (uptime commitment with the denominator named explicitly, latency commitment at P95 and P99 negotiated into the MSA addendum because public SLAs typically omit it, support response tier per severity level, credit calculation scope and cap, exclusions list scope including scheduled-maintenance window, content-policy actions, capacity constraints, partial-availability events, and third-party-source outages); the publicly disclosed headline numbers (AWS Bedrock 99.9% monthly with 10/25/100% credit tiers, Azure OpenAI Service inheriting Azure platform 99.9% with PTU separate availability, Google Vertex AI 99.5%-99.9% varying per-model and per-region, OpenAI Enterprise 99.9%-99.99% per-customer in MSA, Anthropic Enterprise commitments per-customer with no public uniform tier) understate the year-two operational reality because exclusions list scope and credit calculation scope vary materially across vendors; the buying-committee discipline is to populate the per-vendor matrix at short-list rather than discover the gaps at year-one operational experience or year-two renewal.

Supporting figure

AWS Bedrock posts a 99.9% monthly uptime SLA with credit tiers at 10% (95-99.9% uptime), 25% (90-95%), and 100% (below 90%); Azure OpenAI Service inherits the Azure platform SLA at 99.9% for most regions with credits at 10/25/100% tiers; Google Vertex AI posts varying SLAs per generative AI model (typically 99.5% to 99.9%); OpenAI Enterprise tier discloses 99.9% to 99.99% uptime targets in MSA addenda but the public Status page and the SLA-credit mechanism are less transparent than the hyperscaler tier; Anthropic does not publish a uniform public SLA, with commitments delivered per-customer at the Enterprise tier

Date

27 May 2026

Verdict

Holding(AM-179)

Next review

26 Jul 2026(+38d)

The customer-side question “what does the SLA architecture for an agentic AI workflow look like” is treated at the publication’s SLA architecture piece, which has accumulated 48 Microsoft Copilot grounding citations against agentmodeai content in the three-month window ending 25 May 2026. The supply-side question — what do the major infrastructure vendors actually commit to in their published SLAs — is structurally different, and is the conversation the buying committee that has read the architecture piece arrives at next.

This piece is the supply-side companion. It walks the five major enterprise AI infrastructure vendors (AWS Bedrock, Microsoft Azure OpenAI Service, Google Vertex AI, OpenAI Enterprise, Anthropic Enterprise) against five comparison dimensions (uptime commitment, latency commitment, support response tier, credit calculation, exclusions list scope) and surfaces the structural gap most agentic AI buying committees discover at year-two renewal: the headline 99.9% uptime is calculated against a denominator and an exclusions list that materially shifts the customer’s effective availability.

Why the headline number undersells the buying-committee question

Three structural reasons the 99.9% comparison number is misleading on its own.

The denominator is rarely the customer’s actual usage minutes. Most vendor SLAs are calculated as a percentage of calendar minutes per month, with the exclusions reducing the calendar-minute denominator before the uptime percentage is computed. The customer whose workload pattern uses the service heavily during peak windows and sparsely during off-peak is comparing a headline number that was calculated against a smoother demand curve.

The exclusions list is heterogeneous across vendors. Scheduled maintenance (with 48-hour notice at the hyperscaler tier; typically a smaller window for the model-vendor tier) is the largest exclusion category. Force-majeure events and customer-caused outages are universally excluded. The categories that vary across vendors are content-policy-enforcement actions (Azure OpenAI in particular treats content-filter behaviour as SLA-compliant in a way the buying committee should read explicitly), capacity constraints during peak periods (Azure OpenAI’s Provisioned Throughput Units carry separate commitments), and partial-availability events (where some functions are degraded but the headline service is technically up).

The credit calculation has materially different practical value. Credit applied against the monthly service charge for only the affected region is materially less valuable than credit applied across the vendor relationship; credit capped at a percentage is materially less valuable than credit uncapped or paid into a separate pool. The 2026 vendor pattern at the hyperscaler tier is regional capped credits at 10/25/100% tiers; the 2026 model-vendor pattern at the Enterprise tier is per-customer negotiated.

AWS Bedrock SLA, the publicly disclosed structure

The AWS Bedrock Service Level Agreement commits to 99.9% monthly uptime percentage for the Bedrock invocation API. The credit-tier structure is 10% at uptime between 99.0% and 99.9%, 25% between 95.0% and 99.0%, and 100% below 95.0%. The credit is applied against the monthly service charge for the affected Region, not the entire AWS bill.

The exclusions cover scheduled maintenance (announced 48 hours or more in advance), force majeure, customer-side issues (incorrect IAM configuration, exceeded service quotas, customer-caused throttling), and third-party-source outages. The AgentCore-specific SLA inherits the Bedrock SLA at the GA tier; model-specific commitments vary by region for the third-party models hosted on Bedrock (Anthropic, Meta, AI21, Stability AI, and the others). The buying committee should read the per-region availability commitment separately from the headline; a 99.9% commitment at the platform level can mask a materially lower availability for a specific model in a specific region.

The latency commitment is not in the public SLA. The buying committee that needs latency commitments for the agent’s tool-use round-trips must negotiate those into the MSA separately; the hyperscaler standard at the Enterprise tier is P95 latency targets per model per region.

Azure OpenAI Service SLA

Azure OpenAI Service inherits the Azure platform Service Level Agreement at 99.9% monthly uptime for the API service, with standard Azure credit tiers (10% at uptime below 99.9%, 25% below 99%, 100% below 95%). Provisioned Throughput Units (PTUs) carry a separate availability commitment that depends on the PTU tier and the region.

The exclusions include scheduled maintenance, capacity constraints during peak periods, customer-quota-exceeded events, and content-policy-enforcement actions. The content-policy behaviour is the row most often surprising at year-two renewal; the Azure content filters can block specific calls in ways the buying committee may perceive as service degradation, but which Azure treats as SLA-compliant by exclusion. The buying committee with use cases that span content-policy boundaries (legal-research agents, medical-information agents, regulated-content agents) should price this explicitly.

The dependency graph from the AM-175 platform comparison carries into the SLA conversation. An Azure OpenAI workload’s effective availability is bounded by the underlying Azure platform availability, the Entra identity-layer availability, and (for production workloads) the Azure Monitor observability-layer availability. The customer should aggregate these into a composite availability target rather than treating the OpenAI API SLA in isolation.

Google Vertex AI SLA

Google Vertex AI posts varying SLAs per generative AI model and per region. The typical commitment is 99.5% to 99.9% monthly uptime with the standard Google Cloud SLA credit structure (10/25/50% credit tiers depending on the breach severity).

The model-specific commitment matters because Gemini variants have different per-region availability. Gemini 1.5 Pro, Gemini 1.5 Flash, Gemini 2.0 Flash, Gemini 2.5 Pro, and Imagen have separate availability commitments that depend on the region the customer deploys in. The buying committee should price the model-and-region pair, not the platform headline; a customer running Gemini 2.5 Pro in a region where the SLA is 99.5% is on a different commitment than a customer running Gemini 1.5 Flash in a region where the SLA is 99.9%.

The exclusions follow the Google Cloud standard pattern: scheduled maintenance, force majeure, customer-side issues, third-party causes. The Vertex AI-specific exclusions add the “model is being updated” category (during model version transitions, the SLA can be paused), which the buying committee planning long-running production workloads should price.

OpenAI Enterprise SLA

OpenAI’s Enterprise tier carries a documented 99.9% to 99.99% uptime target in the customer’s MSA addendum. The OpenAI Status page at status.openai.com is the canonical historical record; the public-tier and Pro-tier services do not carry a contractual SLA.

The Enterprise SLA credit mechanism is delivered per-customer rather than via a published tier structure. The buying committee that has the procurement leverage to negotiate the Enterprise-tier MSA can typically get credit calculations comparable to the hyperscaler standard; the buying committee at smaller scale is structurally weaker here because the negotiation leverage that drives the credit terms is concentrated at the Fortune-500-and-above procurement tier.

The dependency graph for an OpenAI Enterprise workload runs through the OpenAI API surface plus the customer’s chosen identity-federation layer (typically Microsoft Entra or Okta) and the customer’s observability tooling. The composite availability the customer experiences is bounded by the weakest of these; the SLA conversation should price the composite, not just the OpenAI commitment.

Anthropic Enterprise SLA

Anthropic does not publish a uniform public SLA. Enterprise-tier customers receive per-customer commitments in the MSA; the Anthropic Status page at status.anthropic.com is the historical record.

The Enterprise commitments observable in public materials and procurement-team interactions in 2025-2026 cluster around 99.9% uptime targets with negotiated credit mechanisms, but the lack of a public uniform tier means the buying committee should treat each procurement as a fresh negotiation rather than as a tier selection. The dependency-graph framing matters here too; an Anthropic Enterprise workload’s composite availability depends on whether the customer runs Claude via the Anthropic API directly, via Amazon Bedrock, via Google Vertex AI, or via Microsoft Azure (Anthropic is available across multiple hyperscalers in 2026). Each deployment topology has a different composite SLA shape.

The five comparison dimensions, walked

The buying-committee output is a per-vendor matrix scored against the five dimensions. The table below is the 2026 reference shape; the customer fills the specific numbers per the vendor’s current public documentation and the customer’s negotiated MSA additions.

Dimension	What to compare
Uptime commitment + denominator	Headline percentage; calendar-minute or usage-minute denominator; per-region vs platform-level scope; per-model commitment if applicable
Latency commitment	P95 and P99 latency targets the vendor commits to in the MSA; public SLAs typically do not include this; require it in the customer’s MSA addendum
Support response tier	Response-time commitment per severity level (P1 acknowledgement under 15 minutes is the enterprise default); the response-content commitment (acknowledgement vs initial diagnosis vs resolution timeline)
Credit calculation	What the credit applies against (affected-region monthly charge, cross-region, entire relationship); the cap on credit amount; whether credit is automatically applied or customer must claim
Exclusions list scope	Categories of events the vendor treats as SLA-compliant: scheduled maintenance window length, content-policy actions, capacity constraints, partial-availability events, third-party-source outages

The 2026 buying-committee discipline is to populate this matrix at vendor short-list, not at contract negotiation. The MSA addendum work is materially easier when the customer arrives at the negotiation with the matrix in hand than when the matrix is discovered through year-one operational experience.

What this means for the Q3 2026 SLA-aware procurement agenda

Three workstreams operationally tractable in the procurement cycle.

The first is the composite-availability calculation. The customer aggregates the vendor SLA against the upstream dependencies (identity, observability, model, region) to produce the composite availability the customer’s workload actually experiences. The customer’s existing observability tooling can populate this from historical data if the customer is already running the workload; the customer at procurement-time uses the vendor’s public availability data plus the upstream dependency SLAs to model the composite.

The second is the latency commitment negotiation. The P95 and P99 latency targets the vendor will commit to in the MSA addendum are materially more useful for an agentic AI workload than the headline uptime number. The buying committee should require P95 and P99 commitments per model and per region; the procurement counsel should write these as binding obligations with a separate credit mechanism.

The third is the exclusion-scope review. The customer’s compliance, legal, and operations functions read the SLA’s exclusions list against the customer’s specific workload pattern and identify the rows where the customer’s perceived availability could be degraded while the vendor remains SLA-compliant. The output is the SLA-addendum redline that closes those rows or prices them explicitly.

The sibling AM-174 security-platform TCO/ROI piece covers the cost-side calculation that the SLA conversation feeds into. The AM-167 NHI procurement clause work covers the contract-side instruments. The customer-side SLA architecture piece covers the design pattern this supply-side matrix is the companion to. Together the four describe the SLA conversation the 2026 buying committee needs to have at procurement, not at year-two renewal. The SLA matrix is one axis of the broader enterprise AI vendor comparison: SLA specificity is one of the three accountability-surface signals that pillar argues now separate converging platforms.

ShareX / Twitter LinkedIn Email

Cite this article

Pick a citation format. Click to copy.

Spotted an error? See corrections policy →

Disagree with this piece?

Reasoned disagreement is a first-class signal here. Every review cycle weighs documented dissent; material dissent becomes part of the article's change history. This is not a corrections form — use /corrections/ for factual errors.

Referenced by · 2 pieces

Part of the pillar

AI agent procurement →

The contracts, SLAs, and evaluation criteria that distinguish agentic-AI procurement from SaaS procurement. 38 other pieces in this pillar.

Enterprise AI infrastructure vendors: the 2026 SLA and uptime comparison matrix

Why the headline number undersells the buying-committee question

AWS Bedrock SLA, the publicly disclosed structure

Azure OpenAI Service SLA

Google Vertex AI SLA

OpenAI Enterprise SLA

Anthropic Enterprise SLA

The five comparison dimensions, walked

What this means for the Q3 2026 SLA-aware procurement agenda

AI agent procurement →

Related reading

Why the headline number undersells the buying-committee question

AWS Bedrock SLA, the publicly disclosed structure

Azure OpenAI Service SLA

Google Vertex AI SLA

OpenAI Enterprise SLA

Anthropic Enterprise SLA

The five comparison dimensions, walked

What this means for the Q3 2026 SLA-aware procurement agenda

The 60-question agentic AI RFP, built as a procurement tool.

AI agent procurement →

Related reading

AWS vs Microsoft vs Google vs OpenAI vs Anthropic: the enterprise agentic AI framework matrix for 2026

Salesforce platform AI vs Microsoft platform AI: the 2026 full-stack comparison for the buying committee

ISO 42001 is becoming the enterprise AI procurement checkpoint

AI-written analysis, signed by a practitioner. One or two pieces a week.

AI-written analysis, signed by a practitioner. One or two pieces a week.