Why do enterprise procurement teams block MCP server integrations?

Enterprise procurement teams assign liability based on documented security posture. The base MCP specification defines how clients talk to servers but says nothing about uptime commitments, audit logging, rate-limit handling, or data retention. Without a dedicated SLA and security page addressing these gaps, procurement treats your MCP integration as unsigned liability and blocks the deal.

What is the difference between Truto and Arcade.dev for MCP server AI agent integrations?

Arcade.dev is an MCP runtime focused on per-user OAuth delegation and authorization-first design, making it strong for user-in-the-loop consumer applications. Truto is a unified API engine that auto-generates MCP tools from integration documentation with zero data retention and transparent rate-limit pass-through, making it a better fit for autonomous B2B agent workflows and strict enterprise compliance requirements. The two platforms solve fundamentally different problems.

What should an enterprise MCP security page include?

At minimum: service availability targets with measurement methodology, P95/P99 latency SLOs for tool discovery and execution, authentication architecture (token hashing, scoping, expiration), rate-limit pass-through behavior, zero data retention policy, compliance certifications (SOC 2, ISO 27001, GDPR, HIPAA), incident response SLAs, service credit calculations, and change management policies. Each section should have a stable HTML anchor for deep-linking into MSA exhibits.

How should managed MCP platforms handle upstream API rate limits?

The most defensible approach for enterprise deployments is transparent pass-through: when an upstream API returns HTTP 429, the platform passes that error directly to the caller with normalized IETF-standard ratelimit-* headers. This lets the AI agent reason about backoff, switch tasks, or inform users. Automatic silent retries destroy agent context and cause unpredictable latency spikes.

What procurement questions should teams ask when evaluating MCP platform vendors?

Key questions include: Is pricing usage-based or flat-rate? Can the data plane be pinned to a specific region? Are audit logs exportable to a SIEM? Who holds OAuth refresh tokens and how are they encrypted? Does the platform absorb or pass through rate-limit errors? Can MCP server access be time-limited with automatic multi-layer cleanup? Which compliance certifications (SOC 2 Type II, ISO 27001, GDPR DPA) are current?

Back

AI & Agents Security Guides

How to Build a Dedicated SLA & Security Page for Managed MCP Offerings

The operational playbook for safely giving AI agents access to third-party SaaS data: least-privilege scoping, audit log schema, human approval UX, incident response, and HIPAA/GDPR checklists.

Yuvraj Muley · May 27, 2026 · 38 min read

Your AI agent feature passed every internal review. Sales is closing six-figure contracts on the back of it. The champion loves your AI-powered product. The economic buyer signed off on the budget. The contract moves to procurement and IT security for final review—and then the deal stalls indefinitely for nine weeks.

The reason is almost always the same: you exposed Model Context Protocol (MCP) tooling to enterprise buyers without a public, versioned dedicated MCP integration reference documenting uptime, security posture, rate-limit behavior, and data retention guarantees.

Enterprise procurement teams will reject your contract before they ever open your pricing page if your AI agent integrations lack documented governance. When you sell to SMBs, a generic status page and a "best effort" support policy are usually enough to get by. Enterprise software procurement requires a completely different standard. IT buyers do not care about your marketing copy. They care about liability, vendor risk, and architectural guarantees.

This guide is the architectural blueprint senior PMs and engineering leaders need to build that dedicated SLA and security page. Whether your team is evaluating managed MCP platforms like Truto or Arcade.dev for AI agent integrations, the infrastructure choices you make directly determine which SLA commitments you can credibly publish. It is designed to be a single public URL your champion can paste into a procurement ticket and attach to a Master Service Agreement (MSA), structured to survive a 30-minute vendor risk assessment and a 90-minute architecture review.

Why Enterprise Procurement Blocks Undocumented MCP Servers

The shift from SMB to enterprise selling is structural. An SMB buyer reads a feature list to feel confident. An enterprise buyer reads a security datasheet to assign liability.

The base Model Context Protocol (MCP) specification was designed for capability, not compliance. It rapidly became the standard for connecting AI models to external data, solving the integration bottleneck by providing a universal JSON-RPC 2.0 interface. However, it defines how a client talks to a server, not how that server commits to 99.99% availability, how it logs every tool invocation for SOC 2 auditors, or what happens when an LLM hammers a tool past the upstream rate limit. Enterprise procurement treats those omissions as unsigned liability.

Financial regulators and enterprise IT teams flag the lack of formal SLAs and offline fallback in raw MCP as a major operational risk. The security record reinforces this skepticism. The number of exposed MCP servers has nearly tripled to 1,467, and these servers are becoming a vector for direct attacks against cloud infrastructure - attackers are now capable of compromising the cloud services that host them. Researchers analyzing more than 7,000 MCP servers found the same SSRF exposure might be latent in around 36.7% of all MCP servers on the Web today.

It gets worse for vendors who built fast and shipped MCP without governance. In June 2025, productivity giant Asana faced a serious MCP-related privacy breach. After launching a new MCP-powered feature in May, they discovered that a bug had caused some customer information to bleed into other customers' MCP instances. For two weeks, Asana pulled the MCP integration offline while security teams raced to patch the underlying vulnerability. Every enterprise CISO has read that incident report. They will not approve another vendor's MCP server without the artifact they wish Asana had published first.

Unmanaged or local MCP servers create massive security vulnerabilities through credential sprawl and lack of centralized audit trails. Over 1,000 exposed MCP servers have been discovered in the wild, prompting a massive push toward managed MCP solutions with unified authentication and identity management.

If your sales team is pitching AI agents that connect to enterprise SaaS platforms, procurement will ask three core questions:

Who holds the credentials and how are they secured?
Where is the data cached during transit?
What happens when the upstream API goes down or rate limits are hit?

If your answer is a disorganized GitHub README or a generic terms-of-service document, the deal dies. You must document your managed MCP security posture explicitly. For a deeper dive into evaluating this infrastructure, see our 2026 MCP Buyer's Checklist.

graph TD
    A[Vendor Risk Assessment] --> B{Does the vendor use MCP?}
    B -- Yes --> C[Check MCP Security Page]
    B -- No --> D[Standard API Review]
    C --> E[Verify Auth & RBAC]
    C --> F[Verify Data Retention]
    C --> G[Verify Rate Limit Handling]
    E --> H{Passes?}
    F --> H
    G --> H
    H -- Yes --> I[Approve Deal]
    H -- No --> J[Block Deal]

The Core Components of a Managed MCP Security Page

A dedicated SLA and security page must act as a standalone artifact that procurement officers can attach to an MSA. It follows the same blueprint as your enterprise SLA & support page, narrowed to MCP-specific risks.

Procurement teams scan SLA pages for specific structural sections. Missing any of these triggers a follow-up questionnaire that adds weeks to the cycle.

The required sections:

Service availability targets: Document your target uptime (e.g., 99.99%) with the exact measurement window and exclusions (planned maintenance, force majeure, upstream provider outages). Distinguish between the uptime of your managed MCP infrastructure and the third-party APIs you connect to.
Latency commitments: AI agent workflows are highly sensitive to latency. Document your P50, P95, and P99 latency targets for both tools/list discovery and tools/call execution.
Authentication architecture: Describe token format, scope, expiration, hashing, and secondary auth requirements.
Rate limit and error pass-through semantics: Detail the exact HTTP status codes and headers returned.
Data retention and processing scope: This is the single most critical section. Explicitly state your zero data retention policy, detailing which payloads are stored (none), for how long, and in what region.
Compliance certifications: List SOC 2 Type II, ISO 27001, GDPR, and HIPAA compliance where applicable, with audit report availability under NDA.
Incident response and notification SLAs: Define breach notification windows.
Service credits and remedies: Quantify exactly how service credits are calculated as a percentage of monthly fees per missed SLA tier.
Change management and deprecation policy: Provide minimum notice periods for breaking changes.

Publish each section as a stable HTML anchor (#availability, #authentication, #rate-limits). Procurement officers deep-link to these in MSA exhibits. If your page rearranges sections every quarter, you break their templates and reintroduce the questionnaire round trip.

Warning

The single fastest way to fail an MCP security review is to leave the rate-limit and retention sections vague. Both are direct attack vectors for the auditor's risk model. State the exact behavior, not aspirations.

Documenting Authentication, RBAC, and Token Management

The authentication section is where most MCP vendor pages collapse under scrutiny. Generic phrases like "enterprise-grade authentication" mean nothing to a security architect. They want strict context governance, role-based access control (RBAC), auditability, and token lifecycle specifics.

Document these mechanics explicitly:

1. Token Format and Hashing

State that raw tokens are never stored at rest. The MCP server URL contains a cryptographic token, but hashing the token (e.g., HMAC with a server-side signing key) before storage in distributed edge storage means a database compromise does not leak working credentials. Tokens should be returned to the caller exactly once at creation time.

2. Token Scoping and Tenant Isolation

Explain that MCP servers are not global endpoints. Each MCP server URL should be dynamically generated and strictly bound to a single tenant or connected account, rather than a global service identity. This matches how multi-tenant MCP servers should be architected and is the structural fix to the cross-tenant bleed scenario observed in major breaches. When a token is bound to a specific tenant connection, it is impossible for an agent to accidentally query data belonging to another organization.

3. Documentation-Driven Tool Filtering

Enterprise security teams hate "wildcard" API access. Document that operators can constrain a server to read-only methods (get, list), specific resource tags, or named methods. Rather than exposing every possible API endpoint, tools should only be generated if they have explicit documentation records. A finance team's MCP server should be structurally incapable of calling delete_invoice if the policy says read-only. For more on this pattern, see our auto-generated MCP tools architecture guide.

4. Secondary Authentication Layers

By default, an MCP server URL acts as a bearer token. Address the procurement question, "What happens if the MCP URL leaks into a log file?" by offering an optional layer requiring a Bearer API token in the Authorization header in addition to URL possession. This ensures only authenticated systems within your VPC can invoke tools.

5. TTL and Expiration Enforcement

Tokens must support a configurable expires_at. Enforcement must happen at multiple layers: the edge lookup store automatically evicting the token, scheduled background tasks permanently purging configuration records, and request-time validation. There can be no stale credentials left floating in the system.

The OWASP MCP Top 10 explicitly flags hard-coded credentials, long-lived tokens, and secrets stored in model memory or protocol logs as exposing sensitive environments to unauthorized access. Your security page should pre-empt that concern with one paragraph per bullet.

Tip

Actionable Advice for PMs: Include a sample JSON payload on your security page showing how MCP servers are provisioned with strict method filtering (e.g., read-only access) and TTL expirations.

{
  "name": "Support-Team-Read-Only-MCP",
  "config": {
    "methods": ["read"],
    "tags": ["tickets", "users"],
    "require_api_token_auth": true
  },
  "expires_at": "2026-12-31T23:59:59Z"
}

Least Privilege and Auditable Access Controls for AI Agents

Enterprise buyers ask two questions about AI agent access that a rate-limit policy cannot answer: "What is the minimum permission set this agent actually needs?" and "If something goes wrong, can we prove exactly what the agent did?" Both come down to how you engineer least privilege and auditability into the MCP layer itself.

The Least Privilege Model for MCP Tools

Least privilege in the MCP context has four axes. Enforce all four at server creation time, not at call time:

Method scope. Restrict the server to read methods (get, list) unless the workflow provably needs writes. A support-summarizing agent should never hold a token that can call delete_ticket.
Resource scope. Filter by resource tags (support, crm, directory). An agent that summarizes tickets does not need access to payroll records on the same connection.
Tenant scope. Bind each token to a single connected account. Never issue a token that spans customers.
Time scope. Set an expires_at on every token. Standing credentials are the single largest source of blast radius during a compromise.

Publish a matrix on your security page mapping each agent workflow to its issued scope. Procurement teams treat this as evidence that least privilege is enforced by configuration, not by developer discipline.

{
  "workflow": "Nightly compliance audit",
  "issued_scope": {
    "methods": ["read"],
    "tags": ["contacts", "deals"],
    "require_api_token_auth": true,
    "expires_at": "2026-04-30T00:00:00Z"
  },
  "denied_by_configuration": [
    "create_a_salesforce_deal",
    "delete_a_hub_spot_contact",
    "update_a_pipedrive_person_by_id"
  ]
}

Audit Log Schema: What to Log, What Not to Log

Auditable access is only as good as the schema behind it. Log too little and you cannot reconstruct an incident. Log too much and you have just recreated the data retention problem you were trying to avoid.

The rule: log metadata about the call, never the content of the call.

Safe to log (metadata):

{
  "event_id": "evt_01HXK3Z9M7P8QW",
  "timestamp": "2026-05-14T02:14:37.421Z",
  "tenant_id": "tenant_9f2a",
  "environment_id": "env_prod",
  "integrated_account_id": "ia_hubspot_44b1",
  "mcp_token_id": "mcp_7c31",
  "tool_name": "list_all_hub_spot_contacts",
  "resource": "contacts",
  "method": "list",
  "http_status": 200,
  "upstream_status": 200,
  "latency_ms": 187,
  "request_id": "req_01HXK3Z9M7P8QW",
  "ratelimit_remaining": 4823,
  "ratelimit_reset_s": 38,
  "caller_ip_hash": "sha256:9c1b...",
  "user_agent": "ClaudeDesktop/0.7.3",
  "session_id": "sess_01HXK3Z9",
  "actor": {
    "type": "agent",
    "framework": "claude_desktop",
    "operator_user_id": "usr_ab12"
  }
}

Never log (payload content):

Request body fields (contact emails, deal amounts, ticket text, PII of any kind)
Response body content (records returned from the upstream API)
Query parameter values that carry PII (email addresses in a filter clause, phone numbers, national IDs)
OAuth access tokens, refresh tokens, or MCP token strings, even hashed forms should not appear in application logs
LLM prompts or completions
File contents from attachment endpoints

If you need to reconstruct "what data was touched" during an incident, use the upstream SaaS provider's audit log (Salesforce Event Monitoring, HubSpot Activity Log, etc.) cross-referenced by the request_id in your metadata log. That preserves the zero-retention guarantee while still giving auditors a complete trail.

Retention and export. Metadata logs should be retained for at least 12 months (SOC 2 baseline) and be exportable as newline-delimited JSON to a customer-controlled SIEM (Splunk, Datadog, Sumo Logic, or a raw object store the customer owns). Sign each batch with a rotating key so the customer can independently verify the log has not been tampered with after export.

Tip

Publish a redacted sample audit log entry directly on your SLA page. When procurement asks "what does your audit trail look like?", the fastest way to close the question is to show them a real event without them having to sign an NDA first.

Defining Rate Limits, Retries, and Error Handling

This section will be read twice: once by the security team evaluating vendor risk, and once by the customer's platform engineering team. Both need the same answer in different framings.

Many integration platforms attempt to mask upstream rate limits by implementing silent, automatic retries with exponential backoff. While this sounds helpful in marketing copy, it is an architectural nightmare for enterprise systems. Hiding HTTP 429s behind exponential backoff feels helpful until you realize the LLM has now generated 40 redundant tool calls in a 60-second window, each one taking 30 seconds to fail, while the agent loop happily "waits." Silent retries tie up connection pools, cause unpredictable latency spikes, and turn one rate-limit event into a cascading outage.

The Pass-Through Rate Limit Philosophy

Your security page should clearly state your rate limit philosophy. The honest engineering position is a radically transparent approach: do not retry, throttle, or apply backoff on rate limit errors.

When an upstream API (like Salesforce or HubSpot) returns an HTTP 429 Too Many Requests error, your platform should pass that error directly to the caller unchanged. This ensures your AI agents have accurate, real-time visibility into the state of the upstream system.

Standardized IETF Rate Limit Headers

To make this actionable for developers, document how you normalize rate limit information. Normalize upstream rate limit data into standardized headers per the IETF draft specification:

HTTP/1.1 429 Too Many Requests
ratelimit-limit: 100
ratelimit-remaining: 0
ratelimit-reset: 47
retry-after: 47

State this contract explicitly on the page:

Rate limits. Upstream 4xx and 5xx responses are returned to the caller without retry or transformation. Rate-limit metadata is exposed via standardized ratelimit-* and retry-after response headers. Clients are responsible for implementing backoff, circuit breakers, and idempotent retry logic appropriate to their workload.

By documenting this explicit contract, you prove to enterprise architects that your platform is designed for deterministic, observable behavior rather than opaque "magic." For more details, read our guide on handling API rate limits across third-party APIs.

Zero Data Retention and Compliance Guarantees

The fastest way to fail an enterprise security review is to admit that you store third-party SaaS data in an intermediate database before passing it to an LLM. Enterprise buyers are terrified of AI data leakage. Your SLA and security page must include a prominent section detailing your data retention architecture.

The answer pattern that wins reviews: MCP tool calls execute through a stateless proxy. Request and response payloads are not persisted. Only operational metadata (request IDs, timestamps, status codes, latency) is retained for observability and billing.

The Proxy Execution Model

This is only credible if the architecture actually supports it. A proxy-based MCP gateway pulls credentials, forwards the request to the upstream integration, streams the response back, and discards the payload. The query and body parameters correspond to the integration's actual API format. There is no intermediate unified model mapping layer caching HubSpot contact records or NetSuite invoice line items. The data passes through the infrastructure in memory, is returned to the MCP client, and is immediately discarded.

Documenting this flow is critical for passing GDPR and SOC 2 audits. It proves that you are acting strictly as a data processor and transport layer, not a data store.

Ephemeral Edge Storage for Tokens

While payloads are not stored, you must be transparent about what is stored. Document that MCP tokens and configuration metadata are stored in distributed edge storage for fast validation. Emphasize that this storage is ephemeral. Truto enforces MCP server expiration at multiple layers. When an expires_at timestamp is reached, the edge storage automatically evicts the token, and scheduled background tasks permanently purge the configuration records from the database. There are no stale credentials left floating in the system.

Document these specifics:

Payload retention: Zero, with an explicit list of fields that are retained (timestamps, integration name, HTTP method, status code, latency).
Credential storage: OAuth refresh tokens encrypted at rest; access tokens refreshed shortly before expiry and never logged.
Region pinning: Data plane region selectable at the tenant level (e.g., EU vs. US hosting) for residency compliance.
Sub-processors: A public list, updated when it changes, with notification SLAs.
Audit logs: Retention period, export format, and SIEM integration capability.

For a comprehensive look at building compliant systems, review our documentation on Zero Data Retention MCP Servers.

sequenceDiagram
    participant Agent as AI Agent<br>(Claude / ChatGPT)
    participant MCP as Managed MCP Gateway
    participant Upstream as Upstream SaaS API
    Agent->>MCP: tools/call (hashed token)
    MCP->>MCP: Validate token, scope, TTL
    MCP->>Upstream: Authenticated request<br>(payload not stored)
    Upstream-->>MCP: Response or 429<br>+ rate-limit headers
    MCP-->>Agent: Pass-through response<br>+ normalized headers
    Note over MCP: Only metadata retained:<br>timestamp, status, latency

Zero Data Retention with LLM Providers: The Other Half of the Equation

A managed MCP gateway can promise zero payload retention for the leg between your infrastructure and the upstream SaaS API. But agents send prompts, tool call arguments, and tool responses to an LLM provider on the other side of that call. If OpenAI or Anthropic stores that traffic under default retention, your MCP-level zero data retention guarantee does not carry end-to-end. Procurement will catch this gap. Address it explicitly on your security page.

What ZDR Means at the LLM Provider Layer

Provider-side ZDR is not a default setting on either major platform. Both OpenAI and Anthropic operate default retention windows for the API and offer ZDR only under a specific contractual arrangement.

OpenAI. By default, abuse monitoring logs may contain customer content, such as prompts and responses, and are retained for up to 30 days unless legally required to retain them longer. Eligible customers may have their customer content excluded from these abuse monitoring logs by getting approved for the Zero Data Retention or Modified Abuse Monitoring controls, subject to prior approval by OpenAI and acceptance of additional requirements. Enabling ZDR also changes endpoint behavior: the store parameter for /v1/responses and v1/chat/completions will always be treated as false, even if the request attempts to set the value to true. Some endpoints (fine-tuning, Assistants, file uploads) may still store application state, even if Zero Data Retention is enabled.

Anthropic. For the Claude API, standard API log retention was reduced from 30 days to just 7 days, and data is never used for training as of September 14, 2025. ZDR is a per-organization arrangement subject to approval, under which customer data is not stored at rest after the API response is returned, except where needed to comply with law or combat misuse. Even under ZDR, Anthropic still retains User Safety classifier results in order to enforce their Usage Policy, and if a session is flagged for a policy violation, Anthropic may retain the associated inputs and outputs for up to 2 years. ZDR covers direct Anthropic API traffic and Claude Code on Enterprise plans; for Claude deployments on Amazon Bedrock, Google Vertex AI, or Microsoft Foundry, refer to those platforms' data retention policies.

Two things procurement will always double-check:

ZDR is not the same as no-training. On OpenAI, excluding your data from model training is a setting available on all enterprise accounts, independent of whether you have ZDR - training exclusion and data retention are two separate controls. A no-training clause does not imply data is not stored.
ZDR does not cover every endpoint. Not every API surface falls under ZDR - image generation, fine-tuning, and file upload endpoints can have different retention rules than chat completions. If your MCP-powered agents use these endpoints, list them explicitly on the security page and describe what is retained where.

Sample Contractual and DPA Language Customers Can Request

Enterprise customers frequently ask what language to put in their MSA or DPA to bind end-to-end ZDR. The clauses below are drafting starting points, not legal advice - route them through counsel before signing. Structure them as a rider to the DPA that names each processor in the chain (your platform, the managed MCP gateway, and the LLM provider).

Clause 1 - Vendor obligation on payload retention:

Vendor shall not persist Customer Content transmitted through the MCP tool-call path, including request payloads, response payloads, and tool arguments. Vendor may retain only operational metadata (timestamps, request identifiers, HTTP status codes, and latency measurements) required for billing, observability, and incident response. All Customer Content shall be processed in memory and discarded upon completion of the tool invocation.

Clause 2 - Subprocessor flow-through:

Where Vendor uses a Subprocessor to provide model inference (including but not limited to OpenAI, Anthropic, Google, or AWS Bedrock), Vendor shall configure such Subprocessor with a Zero Data Retention arrangement covering all endpoints used to process Customer Content. Vendor shall provide, on Customer request, written evidence of ZDR enrollment at each Subprocessor, including account identifiers and covered endpoints.

Clause 3 - Change notification:

Vendor shall notify Customer in writing at least thirty (30) days prior to (a) adding a new Subprocessor to the model inference chain, or (b) modifying the ZDR status of any existing Subprocessor. Customer shall have the right to terminate the affected services without penalty if such change materially increases retention.

Clause 4 - Region and residency:

Vendor shall route all Customer Content through inference endpoints located in Customer's designated processing region (EU, US, or APAC). Fallback to a non-designated region requires Customer's prior written consent, except where required to maintain service availability during a regional outage.

Attach the OpenAI and Anthropic ZDR confirmation records (or their public policy links) as exhibits. Procurement treats this attachment as the "proof" that the SLA page claims are real.

Shared Responsibility Map

End-to-end ZDR is a chain. Break any link and the guarantee fails. Publish this split on the security page so procurement can see exactly which layer your platform controls and which layer requires customer action.

flowchart LR
    A["AI Agent Runtime<br>(Claude Desktop, Cursor, custom app)"] --> B["LLM Provider<br>(OpenAI / Anthropic API)"]
    A --> C["Managed MCP Gateway"]
    C --> D["Upstream SaaS API<br>(Salesforce, HubSpot, ...)"]
    subgraph cust ["Customer configures"]
      A
      B
    end
    subgraph vend ["Vendor enforces"]
      C
    end
    subgraph ext ["Third-party retention policy"]
      D
    end

Layer	Controlled by	Responsibility
AI agent runtime (Claude Desktop, Cursor, custom LangChain app)	Customer	Choose a ZDR-eligible model tier; avoid logging prompts client-side; sanitize inputs before sending.
LLM provider inference (OpenAI API, Anthropic Claude API)	Customer contract with provider	Enable ZDR at the account/org level; enroll in the correct endpoint set; keep proof of enrollment.
Managed MCP gateway (Truto or equivalent)	Vendor	Do not persist request/response payloads; expose only operational metadata; scope tokens per tenant.
Credential storage	Vendor	Encrypt OAuth refresh tokens at rest; hash MCP tokens; refresh access tokens shortly before expiry without logging.
Upstream SaaS API (Salesforce, HubSpot, etc.)	SaaS vendor	Governed by the SaaS vendor's own retention policy - outside the MCP gateway's control.
Audit and observability	Shared	Metadata-only logs on the gateway; agent-side logs sanitized before ingestion into SIEM.

The gateway can enforce its half of the map by architecture (stateless proxy, in-memory forwarding, encrypted token store). The LLM provider half is a contractual configuration that customers must actively enable. Do not let procurement conflate the two.

Customer checklist for end-to-end ZDR:

MCP gateway contract includes zero-retention clause for tool call payloads
LLM provider ZDR enabled at the organization level with written confirmation
Endpoint coverage verified against the provider's ZDR eligibility table (chat completions, responses, embeddings, etc.)
Agent framework not logging prompts locally or to external observability tools
SIEM ingestion filters exclude prompt and response bodies; retain only metadata
Sub-processor list reviewed; each processor's ZDR posture confirmed in writing
Region pinning configured at the MCP gateway and the LLM provider
DPA rider signed covering all links in the chain, with 30-day change notification

How to Verify Provider-Side ZDR

"We have ZDR" is a claim, not evidence. Auditors want artifacts. Point customers to the specific places they can independently confirm ZDR is live at each layer:

OpenAI. Once an organization is approved, a Data Retention tab appears within Settings → Organization → Data controls. Screenshot the tab showing ZDR enabled and attach it to the audit binder. Cross-reference the endpoint eligibility table in OpenAI's platform documentation to confirm the endpoints your agents actually call are covered - endpoints marked as not ZDR-eligible may still store application state even when ZDR is enabled.
Anthropic. Claude Platform users can confirm that ZDR is applied to their account under Settings > Privacy Controls > Data retention period. Zero data retention requests are reviewed and applied on a per-organization basis, so if you have multiple organizations, verify each one separately. Export the DPA and the ZDR arrangement letter for the compliance file.
Managed MCP gateway. Ask the vendor for a metadata-only log sample from a real tool invocation. If the sample includes any request body content, the vendor's "no payload retention" claim is not actually enforced. A gateway operating a genuine proxy model retains only request IDs, timestamps, HTTP methods, status codes, latency, and integration identifiers.
Trust portal artifacts. OpenAI publishes SOC 2 Type 2 attestation and HIPAA BAA availability on its Trust Portal. Anthropic publishes equivalent artifacts on its Trust Center. Pull the latest reports, confirm the audit period covers your contract term, and attach to the MSA.
Pen test the assumption. During a controlled test, send a distinctive, unique canary string through the full agent → MCP → LLM path. Wait 24 hours. Contact the provider's support and ask whether that string exists in any log or storage tied to your organization. A genuine ZDR configuration returns nothing.

Document this verification workflow directly on your SLA page. Enterprise buyers will not run these checks themselves during procurement, but linking to the exact settings pages and eligibility tables signals that you have done the work and expect them to audit.

Every AI agent that can write to an enterprise SaaS system needs an approval boundary. The question is not whether to add human approval, it is where to place it in the workflow so users are neither rubber-stamping every call nor blocked from getting work done.

When to Require Human-in-the-Loop Approval

Map approvals to blast radius, not to tool category. A create_hub_spot_note call is low-risk. A delete_salesforce_opportunity call on a $2M pipeline record is not. The four tiers that survive procurement review:

Risk Tier	Trigger	Approval Pattern
Auto (T0)	Read-only tools; internal, non-PII resources	No approval; log and continue
Notify (T1)	Writes to low-value objects (notes, comments, tags)	No pre-approval; async notification to the operator
Confirm (T2)	Writes to high-value objects (deals, invoices, tickets); bulk operations > N records	Inline approval prompt with structured diff
Escalate (T3)	Deletes; financial writes; PII exports; multi-tenant fan-out	Two-person approval or manager override, plus signed audit record

Configure tier assignment per tool and per integration, and expose the mapping in the MCP server configuration so customers can tighten it further per workflow.

UI Patterns That Actually Work

Three patterns handle the majority of enterprise approval flows. Pick the pattern that matches the tier, not the one that looks most impressive.

1. Inline structured diff (for T2). Before executing a write, the agent renders the exact change as a structured diff the user can approve, edit, or reject.

Claude wants to update Salesforce deal "Acme Q2 Renewal":

  amount:      $180,000  →  $220,000
  stage:       Negotiation  →  Closed Won
  close_date:  2026-06-30  →  2026-05-30

[Approve]  [Edit]  [Reject]  [Approve all similar for 15 min]

The "approve all similar for 15 min" affordance matters. Without it, users disable approvals entirely out of frustration. With it, they get bounded batch approval that expires automatically.

2. Just-in-time OAuth elicitation (for first-time scopes). When an agent first requests a scope it does not yet hold, surface a browser-based consent flow that clearly names the tenant, the scope, and the requesting agent. The URL Elicitation SEP standardizes this pattern in MCP.

3. Out-of-band approval (for T3). For deletes, financial writes, or bulk exports, hand off approval to Slack, email, or a dedicated approvals dashboard. The agent pauses, a human approver receives a signed link, and the approval is bound to a single tool call with a short TTL (5-15 minutes).

Consent copy is a legal artifact. Vague language ("This app wants to access your account") loses in court and in vendor risk reviews. Use language that names the actor, the data classes, the scope, and the retention posture.

End-user consent, initial connection:

{Agent name} is requesting access to your {Integration name} account.

If you approve, {Agent name} will be able to:

Read the following data: {Resource list, e.g., "contacts, deals, and notes"}

Write the following data: {Resource list, or "none"}

Your credentials are stored encrypted by {Your company} and are never shared with {Agent name} directly. {Your company} does not retain the content of any requests or responses. Access will expire on {expiry date} unless renewed.

You can revoke access at any time from {settings URL}. A full audit log of every action taken on your behalf is available at {audit URL}.

[Approve] [Deny]

Per-call write approval:

{Agent name} wants to {method} a {resource} in {Integration name}.

{Structured diff of proposed change}

This action will be logged with your user ID and cannot be undone by {Your company}. To reverse it, use {Integration name}'s native undo or audit tools.

[Approve once] [Approve for {timeframe}] [Reject]

Elevated-risk approval (T3):

{Agent name} is requesting a high-risk action: {delete / bulk export / financial write}.

Approver: {Second approver name and role} Scope: {Exact records affected, count, and estimated value} TTL: This approval expires in {5-15 minutes}

By approving, you accept responsibility for this action under {your organization}'s AI governance policy.

[Approve] [Reject] [Request more information]

Publish these templates as an appendix on your security page. Enterprise legal teams routinely copy this language directly into their own consent flows, which shortens deal cycles and reduces the surface area for future disputes.

Structuring the Page for Vendor Risk Assessments (VRAs)

Procurement officers do not want to read a novel, and they do not read top-to-bottom. They Ctrl-F for the words their internal risk assessment tools use. Structure your dedicated SLA & security page using the following exact hierarchy, mirroring the standard SIG Lite questionnaire flow, so those searches succeed on the first try.

Recommended Page Outline

Section	Anchor	What Procurement Extracts
Executive Summary	`#summary`	A two-sentence statement confirming commitment to enterprise security, zero data retention, and MCP compliance.
Service Availability	`#uptime`	Monthly target uptime % (e.g., 99.99%), measurement methodology, exclusions.
Performance SLOs	`#latency`	P95/P99 latency for tool discovery and execution.
Authentication	`#auth`	Token hashing at rest, scoping, expiration, secondary auth (`require_api_token_auth`).
Authorization & RBAC	`#rbac`	Role mapping, scope/method filtering (read vs. write), audit trail.
Rate Limits & Errors	`#rate-limits`	Explicit pass-through behavior for HTTP 429 errors, standardized `ratelimit-*` headers.
Data Retention	`#retention`	"Zero Data Retention" payload policy, operational log retention, region pinning.
LLM Provider ZDR	`#llm-zdr`	Shared responsibility map, subprocessor ZDR status, verification steps.
Compliance	`#compliance`	SOC 2 Type II, ISO 27001, GDPR DPA link, HIPAA status.
Incident Response	`#incidents`	Notification SLA, status page, postmortem policy.
Service Credits	`#credits`	Service credit calculation matrix per missed SLA tier.
Change Management	`#changes`	Deprecation notice period, breaking change policy.

End with a downloadable PDF version under NDA that includes the SOC 2 Type II report and pen test summary. The PDF is what gets attached to the MSA. The HTML page is what your champion forwards before the procurement call.

Version the page in the footer (e.g., Last updated: 2026-05-15 · v3.2). Procurement teams compare versions during renewal cycles. A page that has not changed in 18 months reads as either stable or abandoned. Frequent updates with a visible changelog read as a vendor under active security investment.

Truto vs Arcade.dev: How Platform Architecture Shapes Your SLA Commitments

When teams evaluate managed MCP server platforms for enterprise AI agent integrations, the platform's architecture directly dictates what you can promise on your SLA page. This is not an abstract comparison - it determines whether your procurement artifact holds up under scrutiny.

Arcade.dev positions itself as an MCP runtime built around per-user OAuth delegation. Its core thesis is that authorization is the hardest problem in agentic AI. Arcade manages OAuth 2.0 flows so agents can act on behalf of specific users, and co-authored the URL Elicitation SEP with Anthropic - a genuine contribution that standardizes how MCP servers trigger browser-based OAuth flows from within an agent conversation. This makes Arcade a strong fit for user-in-the-loop consumer applications and internal productivity tools where the end user is actively present. Arcade lists around 112 first-party OAuth providers and charges usage-based pricing with separate rates for "standard" and "pro" tool executions.

Truto starts from the opposite premise: the hardest problem is data normalization across hundreds of SaaS APIs with zero integration-specific code. MCP tools are auto-generated from integration documentation at runtime, rate limits are passed through transparently with IETF-standard headers, and API payloads never touch persistent storage. Each MCP server is cryptographically scoped to a single tenant connection with configurable method filtering, tag-based grouping, and TTL-based expiration.

The SLA page implications are concrete:

SLA Section	Arcade.dev Implications	Truto Implications
Data Retention	Platform-managed credential and token storage; audit trail specifics vary by tier	Zero data retention for API payloads; only operational metadata retained
Rate Limit Behavior	Runtime provides automatic failover for rate limits and transient errors	Pass-through: upstream 429s returned immediately with standardized `ratelimit-*` headers
Authentication	Per-user OAuth delegation with URL Elicitation	Per-tenant cryptographic token with optional secondary Bearer token auth
Tool Scoping	Per-toolkit granularity	Method filtering (read/write/custom) + tag-based groups + TTL expiration
Pricing Predictability	Usage-based with separate "standard" ($0.01) and "pro" ($0.50) tool call rates, plus per-challenge fees	Contact for current pricing
Integration Coverage	~112 first-party OAuth providers; community servers for broader coverage	100+ integrations with zero integration-specific code; tools auto-generated from documentation

If your agents operate autonomously in B2B SaaS workflows - syncing CRM data, running compliance audits, managing ticketing pipelines - the zero-retention, pass-through architecture lets you write stronger SLA commitments because there is less to defend. If your primary use case is user-present, consumer-facing agent interactions where per-user OAuth delegation is non-negotiable, Arcade's authorization-first model deserves evaluation.

For a full architectural comparison covering tool generation, rate-limit handling, and security models in depth, see our Truto vs Arcade.dev deep dive.

Enterprise Case Study: Multi-CRM Agent Workflow Under Rate-Limit Pressure

Abstract SLA commitments mean nothing without production evidence. Here is a representative scenario illustrating how managed MCP infrastructure behaves under real-world conditions - rate-limit handling, latency, and TTL enforcement.

Scenario: A compliance automation platform deploys AI agents that audit contact records across three CRMs (Salesforce, HubSpot, and Pipedrive) for a financial services client. The agent runs nightly, pulling contact lists to verify data completeness against regulatory requirements. The MCP server is provisioned with read-only access (methods: ["read"]) and a 30-day TTL tied to the audit engagement period.

Rate-limit event at 2:14 AM:

The agent's Salesforce connection hits a 429 after listing 4,800 contacts in rapid succession. With transparent rate-limit pass-through, the agent receives the error immediately - not after a hidden 60-second retry cycle - along with normalized headers:

HTTP/1.1 429 Too Many Requests
ratelimit-limit: 5000
ratelimit-remaining: 0
ratelimit-reset: 38
retry-after: 38

The agent framework reads ratelimit-reset: 38, logs the event, and pivots to auditing HubSpot contacts while the Salesforce rate window resets. Total time lost to the rate-limit event: zero. With an automatic-retry platform, the same event would have frozen the agent's context for 38 seconds with no explanation, potentially triggering a framework-level timeout and aborting the entire audit run.

Latency profile:

For this type of workflow, representative production latency breaks down as:

Metric	`tools/list` (discovery)	`tools/call` (execution)
P50	~45 ms	~180 ms
P95	~95 ms	~420 ms
P99	~150 ms	~850 ms

The managed MCP gateway overhead (token validation, scope checks, header normalization) accounts for low single-digit milliseconds per request. The dominant latency factor is the upstream CRM API response time, which varies by provider and query complexity. A list call returning 100 Salesforce contacts will inherently take longer than a get call for a single HubSpot deal. Your SLA page should commit to the overhead your platform adds, not to end-to-end latency controlled by a third party.

TTL enforcement at day 30:

When the audit engagement ends, the MCP server's expires_at timestamp triggers cleanup at three layers simultaneously: the edge token lookup automatically stops returning the credential, a scheduled background task permanently deletes the configuration record from the database, and any subsequent tool calls fail immediately with an authentication error. The compliance team's auditor can confirm in their report that access was automatically terminated - no manual intervention, no stale credentials lingering in distributed storage.

This is the kind of production behavior your SLA page must describe. Generic uptime percentages are necessary but insufficient. Procurement teams want to know what happens at 2:14 AM when the rate limit hits, and whether expired tokens are cleaned up or forgotten.

Incident Detection Signals and Prioritized Actions

Assume that at some point an agent will be compromised. Either its host machine is breached, a prompt injection payload turns it into a confused deputy, or its token leaks in a Git commit. Your SLA page should document exactly which signals you monitor and what the customer's response playbook looks like.

Detection Signals (Ranked by Precedence)

Prioritize signals by the ratio of true positives to false positives. Chasing every low-signal anomaly burns the on-call team out before real incidents happen.

P1 - Immediate page:

Token used from an unknown network segment. MCP token used from an IP or ASN that has never appeared in this tenant's history within the last 90 days.
Cross-tenant token reuse. The same hashed token appears against two different tenant scopes in any 24-hour window. This should be architecturally impossible; if it fires, treat it as a critical bug or compromise.
Impossible-travel token use. The same token used from geographically distant IPs within a window shorter than plausible travel time.
Anomalous write-to-read ratio. A read-only workflow suddenly generates write attempts. Because the token is scoped to read, the writes fail at the gateway, but the pattern indicates a compromised or manipulated agent.

P2 - Alert within 15 minutes:

Rate of 4xx errors above baseline. A tenfold increase in 401/403/429 responses in a 5-minute window.
Sudden expansion in tool diversity. An agent that historically calls 3 tools starts calling 20 in a session.
Bulk enumeration pattern. Sequential list calls with monotonically incrementing cursors across an entire dataset (common exfiltration signature).
Token used after operator logout. MCP token continues to be exercised more than N minutes after the associated operator's session has ended in your product.

P3 - Batch review daily:

Slow drift in latency or error rate at the tenant level
New user-agent strings appearing against long-lived tokens
Off-hours activity outside the tenant's declared business hours

Prioritized Response Actions

Publish a decision tree that a customer's SOC team can execute without paging you. The first three moves must be executable in under 60 seconds.

Revoke the affected MCP token (see next section). This is the single highest-impact action and it stops the bleeding at the gateway.
Rotate the upstream OAuth refresh token for the affected integrated account. Even if the MCP token is dead, the underlying OAuth grant may still be valid if the attacker exfiltrated it.
Terminate active agent sessions by invalidating the operator's API tokens and session cookies in your product. If the agent runs inside a customer app, kill the app's session server-side.
Freeze the integrated account at the connector level so no new MCP tokens can be minted against it while investigation continues.
Pull the metadata audit log for the incident window into a forensic bucket. Sign and hash it so you can prove log integrity later.
Cross-reference with the upstream SaaS provider's audit log to identify which records were actually touched. Your metadata log tells you which tool was called; the upstream log tells you which rows were read or modified.
Notify the customer per your breach-notification SLA (usually 24-72 hours) with a preliminary scope statement.

Revoking and Rotating MCP Tokens Quickly

The "how fast" is the whole point. If revoking a compromised token takes an on-call engineer 30 minutes and a bespoke script, your incident response promise is not real.

Immediate Revocation Endpoint

Every MCP platform should expose a synchronous revocation API that propagates to every layer where the token is cached. Document the exact call on your security page so customers can wire it into their own SOAR playbooks.

DELETE /integrated-account/{integrated_account_id}/mcp/{mcp_server_id}
Authorization: Bearer <api_token>

Expected behavior on a well-architected platform:

The database record for the MCP token is marked deleted immediately.
The entry in distributed edge storage is evicted so subsequent lookups fail instantly.
Any scheduled expiration task tied to the token is cancelled.
Subsequent tool calls receive HTTP 401 with a clear reason code (token_revoked) rather than a generic error.

The end-to-end propagation should complete in seconds, not minutes. If a vendor's answer is "eventually consistent within 5 minutes," treat that as a red flag during procurement.

Bulk Revocation and Emergency Rotation

For "revoke everything for this tenant right now" scenarios, ship a bulk endpoint:

POST /environment/{environment_id}/mcp/revoke-all
Authorization: Bearer <api_token>
Content-Type: application/json
 
{
  "reason": "suspected_compromise",
  "integrated_account_ids": ["ia_44b1", "ia_9f2a"],
  "notify_operators": true
}

Behavior:

Every MCP token for the named accounts is invalidated in a single transaction.
New token issuance for the affected accounts is blocked until an operator manually clears the freeze.
Operators are notified out-of-band (email, Slack) with the reason and a link to reissue tokens under fresh scopes.

Rotating Upstream OAuth Credentials

Revoking the MCP token does not rotate the underlying OAuth grant. If you suspect the OAuth refresh token itself has leaked, the sequence is:

Revoke all MCP tokens bound to the integrated account.
Trigger a refresh-token rotation on the upstream provider (most providers support this via a /token endpoint with the current refresh token, or via forced re-consent).
Update the encrypted refresh token in your credential store.
Reissue MCP tokens with fresh scopes and shorter TTLs than the originals.
Force operators to reconnect through the OAuth flow if the upstream provider requires a new authorization grant.

Document the expected wall-clock time for each step. A realistic target is under 5 minutes for MCP token revocation and under 30 minutes for full OAuth rotation.

Warning

Test this playbook quarterly with a fire drill. Revoke a real (non-production) token and time each step. Post the drill results on the SLA page's changelog. Enterprise buyers weight demonstrated response time far more heavily than aspirational commitments.

Performance Benchmarks and Best Practices for Production MCP

When documenting performance SLOs on your SLA page, ground your targets in real MCP server behavior rather than aspirational numbers.

What drives MCP tool call latency:

Managed MCP gateway overhead on cache-hit paths (token lookup, RBAC check, scope validation) typically adds only a few milliseconds per request. Against the surrounding LLM inference call (hundreds of milliseconds to seconds) and the upstream API response, gateway overhead is rarely the bottleneck. General-purpose tools/call latency is dominated by three factors:

Upstream API response time - the largest variable, ranging from 50ms to 2s+ depending on provider, payload size, and query complexity
Authentication overhead - token hashing and edge lookup, typically low single-digit milliseconds
Schema generation for tools/list - dependent on the number of tools exposed; auto-generated tool lists that reflect live schemas may be marginally slower than static catalogs, but they eliminate the stale-schema problem

Best practices for SLA page performance commitments:

Separate your infrastructure latency from upstream latency. Commit to the overhead your platform adds, not end-to-end numbers you cannot control. A statement like "Platform overhead P99 < 50ms; end-to-end latency depends on the upstream provider" is honest and defensible.
Instrument tools/list separately from tools/call. Discovery requests and execution requests have very different profiles. Document both.
Publish rate-limit response time, not just rate-limit policy. Enterprise teams care about how fast a 429 is surfaced. If your platform passes rate-limit errors through immediately, quantify "immediately" (sub-100ms including header normalization). If it retries, state the maximum retry window.
Report latency at P95 and P99, not just averages. P50 numbers look good in marketing. P99 numbers predict the worst experience your most active customers will have. Procurement teams increasingly ask for P99 specifically.

Deployment Options, Security, and Procurement Checklist

Before finalizing your SLA page, align it with the deployment model and security posture your enterprise customers actually require. Different managed MCP platforms offer different options, and procurement teams will probe these specifics.

Deployment Models

Model	Description	Compliance Implications
Managed cloud	Vendor hosts the MCP infrastructure; you receive a URL	Simplest to operate; requires vendor SOC 2 + DPA in place
VPC deployment	Vendor infrastructure runs in your cloud account	Data never leaves your network; higher operational burden
On-premises	Self-hosted within your data center	Maximum control; requires internal team to manage, patch, and monitor

Most B2B SaaS companies building AI agent features will start with managed cloud. VPC and on-premises options matter for regulated industries (healthcare, financial services) where data residency is a contractual requirement, not a preference.

Procurement Questions to Ask Any MCP Platform Vendor

Use these questions during vendor evaluation to stress-test the claims on any SLA page - whether you are comparing Truto, Arcade.dev, or another managed MCP platform:

Pricing model: Is pricing usage-based (per tool call), per connection, or flat-rate? Are there separate tiers for different tool types? Unpredictable pricing models are a procurement red flag - some platforms charge different rates for "standard" vs "pro" tool executions, which makes cost forecasting difficult at scale.
Data residency: Can the data plane be pinned to a specific region (EU, US, APAC)? Is this configurable per tenant or only at the account level?
Audit log export: Can logs be streamed to your SIEM? What format (JSON, CEF)? What retention period?
Credential management: Who holds the OAuth refresh tokens? Are they encrypted at rest? How are access tokens refreshed - on a schedule or just-in-time?
Incident response: What is the breach notification window? Is there a public status page? What is the postmortem commitment?
Integration coverage: How many first-party integrations exist vs. community-maintained servers? Community servers have variable quality and maintenance timelines.
Lock-in and portability: If you switch platforms, can you export your integration configurations, or do you rebuild from scratch?
Rate-limit transparency: Does the platform absorb upstream rate-limit errors, or pass them through? If it absorbs them, what is the maximum retry window, and how does that affect agent context and reasoning?
TTL and access revocation: Can MCP server access be time-limited? When access expires, is cleanup immediate and multi-layered (edge storage, database, audit log), or does it depend on a single mechanism?
Compliance certifications: SOC 2 Type II, ISO 27001, GDPR DPA, HIPAA BAA - which are current, and are audit reports available under NDA?
LLM subprocessor ZDR: Does the platform document its LLM subprocessor list, and can it show written evidence of ZDR enrollment at each subprocessor (OpenAI, Anthropic, or equivalent)?

Bring this checklist to the procurement call and cross-reference the vendor's SLA page against their live answers. Discrepancies between the two are the highest-signal risk indicator you will find.

Generic SOC 2 language is table stakes. Regulated industries need domain-specific commitments backed by contractual instruments. This section maps the artifacts you must produce for the two most common high-risk domains.

HIPAA (Healthcare)

If any agent touches Protected Health Information (PHI), the MCP gateway is a Business Associate under HIPAA. That triggers specific obligations.

Checklist:

Business Associate Agreement (BAA) signed before any PHI touches the gateway. A no-cost, standard BAA available on request shortens deals by weeks compared to a bespoke negotiation.
Minimum necessary rule enforced by scope. Method and tag filtering restrict tools to the smallest set required for the treatment/payment/operations purpose.
Access logs retained for 6 years per HIPAA §164.316(b)(2). Metadata-only logs meet this requirement without expanding PHI storage.
PHI never in payload logs. The zero-retention proxy model is the technical control that satisfies §164.312(b) (audit controls) without creating a §164.502 (uses and disclosures) violation.
Breach notification within 60 days of discovery, per §164.410. Set the contractual SLA at 24-72 hours to give the covered entity room to meet their own obligations.
Subprocessor BAAs in place for every downstream processor that could see PHI, including the LLM provider. OpenAI and Anthropic both offer BAAs on qualifying plans; confirm coverage on the exact endpoints your agents use.
Workforce training documentation available on request.
De-identification path documented for any analytics use of log data, per Safe Harbor or Expert Determination.

Sample BAA insertion clause:

Business Associate acknowledges that MCP tool invocations may transmit Protected Health Information between Covered Entity's designated SaaS systems and Covered Entity's authorized AI agents. Business Associate shall process such PHI solely as a conduit, shall not persist PHI in any log, cache, or database, and shall pass all such PHI through in-memory execution only. Any subprocessor engaged by Business Associate for model inference on PHI-containing requests shall be subject to a Business Associate Agreement executed prior to processing.

If any agent processes data of EU residents, GDPR applies regardless of where your infrastructure runs.

Checklist:

Data Processing Agreement (DPA) available on the security page as a downloadable PDF, pre-executed on your side (customer-countersign only).
Article 28 processor commitments enumerated: purpose limitation, subprocessor list, security measures, breach notification, deletion/return on termination, audit rights.
Data residency controls documented: the data plane is region-pinned to EU-hosted infrastructure for EU tenants. Cross-border transfers use Standard Contractual Clauses (SCCs) with a completed Transfer Impact Assessment (TIA).
Subprocessor list published with 30-day advance notice on additions or changes. Include the LLM provider(s) and their ZDR posture.
Lawful basis mapping for each processing activity: legitimate interest, contract performance, or consent. The MCP gateway itself typically operates under Article 6(1)(b) (contract) with the customer, but the underlying agent workflow may rely on Article 6(1)(a) (consent) or (f) (legitimate interest).
Data subject request (DSR) handling. Because the gateway does not retain payloads, most DSRs resolve at the upstream SaaS layer. Document how customers export the metadata log to complete an access request within 30 days.
Breach notification within 72 hours of awareness per Article 33.
DPIA support for high-risk processing. Provide a template DPIA input covering the gateway's technical and organizational measures.
International transfers. SCCs 2021/914 Module 2 (controller-to-processor) attached to the DPA. TIA covering the destination region for LLM inference (US-hosted providers require an assessment).

Sample DPA rider clause for GDPR:

Processor confirms that MCP tool call request and response payloads are processed exclusively in volatile memory within the region designated by Controller under Section {residency}. No copy of Controller's Personal Data is written to any persistent storage operated by Processor. Operational metadata (timestamps, request identifiers, HTTP status codes, and latency measurements) is retained solely to fulfill Processor's obligations under Article 32 (security of processing) and Article 33 (breach notification) and does not constitute Personal Data of Data Subjects other than the Controller's authenticated operators.

Cross-Domain Controls That Apply Everywhere

Some controls apply regardless of regulatory regime. Bundle them into a single "regulated workloads" appendix on your SLA page:

Encryption at rest (AES-256) and in transit (TLS 1.2+ minimum)
Key management with a customer-managed key (CMK) option for regulated tiers
Annual third-party penetration testing with summary reports available under NDA
Quarterly access reviews of internal staff with a documented least-privilege posture
Segregation of duties between engineering, security, and support access
Incident response runbook tested at least annually with results posted to the changelog

The Final Review and Sales Motion

The SLA & security page is not a marketing exercise. It is the second-most-trafficked page in your enterprise sales cycle, after pricing. Treat it as product surface area: owned by a PM, reviewed quarterly by security and legal, and instrumented for which sections cause drop-off in deal cycles.

Before publishing this page, run it past your own engineering and legal teams. Every commitment on this page is a legal liability. Do not promise automated retries if your system does not support them. Do not promise 100% uptime when you rely on third-party APIs that regularly go down. Radical honesty about system boundaries, rate limits, and error handling builds far more trust with enterprise architects than impossible guarantees.

If you are still building MCP servers for AI agents and working toward this, take these three concrete next moves:

Audit your current MCP implementation against the section list above. Anything you cannot answer in one paragraph today is a deal blocker waiting to happen.
Publish the page even if some sections start as "in progress." A versioned commitment to ship by Q3 is worth more than silence.
If your MCP infrastructure cannot back the commitments (per-tenant token scoping, zero retention, pass-through rate limits, signed audit logs), the gap is architectural, not editorial. Fix it before you publish, or pick a managed MCP platform that already encodes those guarantees so you can ship the page next quarter instead of next year.

Stop letting your six-figure AI deals die in procurement. Build the artifact they need, prove your architectural rigor, and close the contract.

FAQ

Why do enterprise procurement teams block MCP server integrations?: Enterprise procurement teams assign liability based on documented security posture. The base MCP specification defines how clients talk to servers but says nothing about uptime commitments, audit logging, rate-limit handling, or data retention. Without a dedicated SLA and security page addressing these gaps, procurement treats your MCP integration as unsigned liability and blocks the deal.
What is the difference between Truto and Arcade.dev for MCP server AI agent integrations?: Arcade.dev is an MCP runtime focused on per-user OAuth delegation and authorization-first design, making it strong for user-in-the-loop consumer applications. Truto is a unified API engine that auto-generates MCP tools from integration documentation with zero data retention and transparent rate-limit pass-through, making it a better fit for autonomous B2B agent workflows and strict enterprise compliance requirements. The two platforms solve fundamentally different problems.
What should an enterprise MCP security page include?: At minimum: service availability targets with measurement methodology, P95/P99 latency SLOs for tool discovery and execution, authentication architecture (token hashing, scoping, expiration), rate-limit pass-through behavior, zero data retention policy, compliance certifications (SOC 2, ISO 27001, GDPR, HIPAA), incident response SLAs, service credit calculations, and change management policies. Each section should have a stable HTML anchor for deep-linking into MSA exhibits.
How should managed MCP platforms handle upstream API rate limits?: The most defensible approach for enterprise deployments is transparent pass-through: when an upstream API returns HTTP 429, the platform passes that error directly to the caller with normalized IETF-standard ratelimit-* headers. This lets the AI agent reason about backoff, switch tasks, or inform users. Automatic silent retries destroy agent context and cause unpredictable latency spikes.
What procurement questions should teams ask when evaluating MCP platform vendors?: Key questions include: Is pricing usage-based or flat-rate? Can the data plane be pinned to a specific region? Are audit logs exportable to a SIEM? Who holds OAuth refresh tokens and how are they encrypted? Does the platform absorb or pass through rate-limit errors? Can MCP server access be time-limited with automatic multi-layer cleanup? Which compliance certifications (SOC 2 Type II, ISO 27001, GDPR DPA) are current?

Updates

Jul 15, 2026 Added operational playbook sections covering least-privilege scoping and audit log schema, human approval UX with consent language templates, incident detection signals, MCP token revocation and OAuth rotation procedures, and a HIPAA/GDPR compliance checklist with sample BAA/DPA clauses.
Jul 3, 2026 Added a new section on end-to-end zero data retention that covers what ZDR means at the OpenAI and Anthropic API layers, sample DPA/contractual clauses customers can request, a shared responsibility diagram and checklist across the agent-LLM-gateway-SaaS chain, and a verification workflow for provider-side ZDR; added a corresponding row to the VRA page outline and a new LLM subprocessor ZDR question in the procurement checklist.
Jun 16, 2026 Added four new sections: Truto vs Arcade.dev managed MCP platform comparison for SLA readiness, enterprise case study showing multi-CRM agent behavior under rate-limit pressure with TTL enforcement, production MCP performance benchmarks and best practices, and a deployment options and procurement checklist with 10 vendor evaluation questions.

FAQ

More from our Blog

Truto vs Arcade.dev: Which MCP Server Platform Is Best for Enterprise AI Agents? (2026)

Best MCP Server Platforms for AI Agents Connecting to Enterprise SaaS in 2026

Zero Data Retention MCP Servers: Building SOC 2 & GDPR Compliant AI Agents

Auto-Generated MCP Tools: Documentation-Driven Tool Creation for AI Agents (2026)

The 2026 MCP Buyer's Checklist and Quick-Start Guide for B2B SaaS