What is API response standardization for LLM consumption?

It is the process of normalizing disparate API responses from multiple SaaS providers (like different ATS platforms) into a consistent envelope schema with predictable field names, pagination structure, and metadata - so LLMs can parse tool results without hallucinating field names or wasting tokens on structural noise.

Which ATS fields should be redacted before sending to an LLM?

It depends on the use case. For pipeline triage, redact emails, phone numbers, resume text, and EEO data. For candidate sourcing, include names and emails but redact phone numbers and EEO data. EEO demographic fields should never appear at the individual candidate level - only in aggregated compliance reports. Free-text fields like recruiter notes should be treated as toxic by default due to embedded PII.

How do you prevent PII leakage in MCP tool responses?

Use a three-layer strategy: schema-driven field stripping (only allow-listed fields get serialized), regex-based pattern scanning on free-text fields for structured PII like emails and SSNs, and NLP-based entity recognition (such as Microsoft Presidio) for contextual PII like names and addresses that do not follow fixed patterns.

Why should MCP tool input schemas be flattened?

LLMs are unreliable at constructing deeply nested JSON objects. Flattening input schemas to top-level properties reduces malformed tool calls. The mapping layer re-nests the flat inputs into the shape the upstream API expects before proxying the request.

How do you handle long-running ATS API calls in MCP servers?

Use an async polling pattern. The initial MCP tool returns immediately with a poll_id and status of accepted. A second tool (like check_ats_export_status) accepts the poll_id and returns either a running status with an estimated completion time, or a complete status with the result payload. The tool description instructs the LLM to poll rather than retry.

Back

AI & Agents Guides Engineering

How to Publish a Dedicated MCP Integration Reference for Enterprises

Enterprise deals stall when CISOs cannot verify your AI agent security. Learn how to publish a dedicated MCP integration reference - with ATS field selection matrices, PII redaction patterns, and LLM-ready envelope schemas - to unblock procurement.

Nachi Raman · May 26, 2026 · 38 min read

If your enterprise deals are stalling in security review because a CISO asked how your AI agent integrations handle token passthrough and you couldn't point to a single public document, this is the playbook to fix it.

When you tell a Chief Information Security Officer (CISO) that your platform is "AI-ready," they do not hear a value proposition. They hear an unquantified security risk. Enterprise buyers will not connect their internal AI agents to your platform based on a marketing page. They require technical proof that your infrastructure can handle autonomous tool calling without exposing their entire database to a prompt injection attack.

The shift happened faster than most product teams realized. A May 2026 CTO survey found that 78% of surveyed enterprises now have the Model Context Protocol (MCP) in production, and 67% of CTOs named MCP their default integration standard for the next 12 months. A separate enterprise deployment study reported that 28% of Fortune 500 companies have implemented MCP for production AI workflows in under 18 months. These companies are building custom LangGraph agents, deploying Claude Desktop across their workforce, and integrating ChatGPT Enterprise into their daily operations. They need these agents to read and write data to your B2B SaaS platform.

The protocol is no longer optional, and neither is the documentation that proves your implementation is safe to plug in. This guide breaks down exactly what enterprise procurement teams are looking for, the architectural security requirements you must document, and how to structure your public-facing MCP reference to bypass security reviews and accelerate your sales cycle.

The Procurement Reality: Why "AI-Ready" Fails Security Reviews

A dedicated MCP integration reference is a public, versioned document that describes exactly how AI agents authenticate to your platform, which tools they can invoke, what data flows through the connection, and how your server behaves under failure conditions. It is not a blog post. It is not a Loom demo. It is structured technical content that a security reviewer can map to their internal checklist.

The reason it has become non-negotiable: the same CTO survey above identified machine identity, gateway security, and token passthrough as the dominant blockers to enterprise MCP adoption. Procurement is not delayed by feature gaps anymore. It is delayed by the absence of credible answers to security questions.

There is also a positioning angle that competitors are exploiting. Kong is loudly positioning itself as the unified AI control plane for enterprise AI traffic, while security vendors like SentinelOne and Sysdig are publishing detailed write-ups on MCP risks. If you do not publish a counter-narrative grounded in your own architecture, the buyer will internalize someone else's framing of what an AI-ready integration looks like.

AI Agent Integration Enterprise Buyer Requirements

Enterprise procurement teams evaluate MCP integrations against a remarkably consistent checklist. Your dedicated MCP integration reference must explicitly address these core requirements without forcing the buyer to ask. Do not bury this information in a general API reference. Create a dedicated "AI Agent Security & Architecture" page that covers the following pillars:

Tenant isolation model. Is each MCP server scoped to a single connected account, or does a single token grant cross-tenant visibility?
Credential handling. Where are upstream OAuth refresh tokens stored, who can read them, and how are they rotated?
Zero Data Retention. Does the MCP server cache responses, log payloads, or persist tool inputs and outputs? The gold standard is a strict pass-through architecture where payloads are proxied directly to the underlying integration and responses return to the client un-persisted.
Audit trail. Can the customer pull an audit log of every tool invocation, with timestamps, tool names, and request IDs?
Authentication layers. Is the MCP URL alone sufficient to call tools, or is a second factor (API token, mTLS, SSO session) required?
Tool scope controls. Autonomous agents hallucinate. Can the customer restrict an MCP server to read-only (get and list), completely removing create, update, or delete tools from the LLM's context window?
Expiration. Can servers be issued with a fixed Time-To-Live (TTL) or expires_at parameter for contractors, agents, or short-lived workflows?
Failure modes. What happens when the upstream API returns 429, 5xx, or a malformed payload?

The last point is where most public MCP documentation falls apart. Vendors describe the happy path and stay silent on rate limits, partial failures, and upstream outages. A buyer's risk model lives in the failure cases, so that is where your reference needs to be most precise. See the 2026 MCP buyer's checklist for the full procurement matrix.

Tip

Treat each requirement as a heading in your reference document. Buyers ctrl-F for these exact terms. If they don't find them, they assume you don't handle them.

Addressing MCP Server Enterprise Security Requirements

Security is the primary reason MCP server platforms face procurement friction. As SentinelOne highlighted, a single breached MCP server without authentication controls can expose an entire organization's integrated databases. Your integration reference must include a dedicated security addendum detailing your architectural safeguards.

Defeating Credential Aggregation and the Blast Radius Problem

Many naive MCP implementations aggregate API keys in a single centralized server configuration. This creates a massive honeypot. If the server is compromised, every connected integration is exposed. Your reference needs to make three things explicit:

Scope per server. Document that each MCP server URL is bound to one tenant's one connected account, not a multi-tenant fleet.
Token storage at rest. State whether raw tokens are stored or only cryptographic hashes. Detail how your architecture stores tokens securely (e.g., hashed in a fast Key-Value store) and how the MCP server URL itself acts as a cryptographic token mapping to a specific, isolated tenant connection.
One-time disclosure. Confirm that the raw URL/token is shown exactly once at creation and never retrievable from logs or admin UIs.

Token Passthrough and Dual Authentication

The most common procurement objection is "the URL is the credential." By default, possessing an MCP server URL grants access to its tools. For enterprise deployments, this is insufficient. A clean answer is to document an optional second authentication layer.

Explain how your platform supports an "API Token Required" flag. When enabled, the MCP client (whether it is Claude Desktop or a custom agent) cannot just connect to the URL - it must also pass a valid API token in the Authorization header.

Show the exact request shape in your reference:

POST /mcp/<server-token> HTTP/1.1
Authorization: Bearer <tenant-api-token>
Content-Type: application/json
 
{"jsonrpc":"2.0","id":1,"method":"tools/call","params":{...}}

With that pattern documented, leaking the URL into a config file or log no longer constitutes a usable credential. You can visualize this dual-auth flow to build immediate technical credibility:

sequenceDiagram
    participant Agent as AI Agent (Claude/Custom)
    participant MCP as MCP Server Router
    participant KV as Token Store
    participant Proxy as Proxy API Layer
    participant SaaS as Target SaaS API

    Agent->>MCP: POST /mcp/:token<br>Header: Authorization Bearer :api_key
    MCP->>KV: Hash URL token & validate expiry
    KV-->>MCP: Validation successful
    MCP->>MCP: Validate API Key (Dual Auth)
    MCP->>Proxy: Execute Tool (e.g., list_contacts)
    Proxy->>SaaS: Proxied Request with OAuth Credentials
    SaaS-->>Proxy: Raw API Response
    Proxy-->>MCP: Normalized Response
    MCP-->>Agent: JSON-RPC Tool Result

Transparent Rate Limit Handling and the 429 Contract

AI agents are notorious for hitting rate limits. They operate at machine speed and will rapidly iterate through pagination cursors until they exhaust a quota. A surprising amount of enterprise security review focuses on rate limits because they are the most common cause of cascading failures during AI agent bursts.

Your documentation must be radically honest about how you handle rate limits. Do not claim to magically absorb or silently retry rate limit errors. If you hide HTTP 429 errors from the LLM, the agent will assume the tool failed for a functional reason and will likely hallucinate a workaround, causing infinite loops.

The honest contract to publish is this: when the upstream provider returns HTTP 429, the MCP server passes that error directly to the calling agent along with normalized rate limit headers, leaving retry decisions to the client. State clearly that you normalize upstream rate limit information into standard IETF headers (ratelimit-limit, ratelimit-remaining, ratelimit-reset).

Document the behavior, and recommend a client-side exponential backoff pattern with jitter to prove to enterprise engineers that they can build predictable workflows:

async function callTool(req: ToolRequest, attempt = 0): Promise<ToolResult> {
  const res = await fetch(mcpUrl, { method: 'POST', body: JSON.stringify(req) });
  if (res.status !== 429) return res.json();
 
  const reset = Number(res.headers.get('ratelimit-reset') ?? 1);
  const backoff = Math.min(reset * 1000, 2 ** attempt * 250) + Math.random() * 100;
  await new Promise(r => setTimeout(r, backoff));
  return callTool(req, attempt + 1);
}

For a deeper treatment, point readers to a dedicated runbook on handling API rate limits and retries across third-party APIs.

Model Context Protocol Documentation Best Practices

A good MCP integration reference reads like a SOC 2 control narrative crossed with an API reference. It is dry, specific, and verifiable. It is an operational manual for AI developers. Here is the structure that has held up across enterprise reviews:

Section	What to include
Overview	One paragraph naming the protocol version (e.g. `2024-11-05`), supported transports, and the JSON-RPC method surface (`initialize`, `tools/list`, `tools/call`, `ping`).
Tenancy & isolation	Diagram showing one MCP server per connected account. State explicitly that no tool call crosses tenant boundaries.
Authentication	The two-layer model: URL token + optional API token. Include the exact `Authorization` header format.
Tool catalog	Auto-generated list of tools with names, descriptions, JSON Schema for inputs, and the upstream endpoint each tool maps to.
Scope controls	How to restrict a server to `read`, `write`, specific resources, or custom methods. Show the create payload.
Rate limits & errors	The normalized rate limit headers, the 429 passthrough contract, and the structured error envelope.
Lifecycle	TTL behavior, rotation, revocation, and what happens on tenant offboarding.

Beyond this structural table, you must include explicit, actionable guides for the engineers actually wiring up the connection:

1. Client Configuration Instructions

Do not assume the buyer knows how to wire up an MCP server. Provide exact, copy-paste instructions for the major clients.

For Claude Desktop: Show the exact JSON block required in the claude_desktop_config.json file.

{
  "mcpServers": {
    "your_saas_integration": {
      "command": "npx",
      "args": [
        "-y",
        "@modelcontextprotocol/server-everything"
      ],
      "url": "https://api.yourdomain.com/mcp/a1b2c3d4e5f6..."
    }
  }
}

For ChatGPT: Provide the step-by-step UI path: Settings -> Apps -> Advanced settings -> Enable Developer mode -> Add Custom Connector.

2. Auto-Generated Tool Schemas

Enterprise developers need to know exactly what tools the LLM will see. Document your naming conventions. Explain that tools are generated with descriptive, snake_case names (e.g., list_all_crm_contacts or update_a_support_ticket_by_id).

Buyers want to see properties, required, and enum values inline, not prose summaries. Highlight how required fields are strictly enforced before the tool executes.

3. Pagination Directives

Document how your tools instruct the LLM to handle pagination. For example, explain that list methods automatically inject a limit and next_cursor property into the query schema.

Show the exact prompt instruction embedded in your tool descriptions: "Always send back exactly the cursor value you received without decoding, modifying, or parsing it." This level of detail proves to engineering buyers that your platform is built specifically for the quirks of LLM behavior.

Tip

Documentation Quality is a Security Feature
In the MCP ecosystem, documentation acts as a quality gate. If an endpoint lacks a human-readable description and a strict JSON schema, it should not be exposed as a tool. Explicitly stating this policy in your reference builds massive trust with enterprise security teams.

Standardizing ATS API Responses for LLM Consumption

The documentation structure above gets you past procurement. But the engineers on the buyer's side still have to wire your platform into their LLM pipelines - and that is where most ATS integrations fall apart. Applicant tracking systems like Greenhouse, Lever, Ashby, and Workday all model the same concepts (jobs, candidates, applications, interviews) using wildly different schemas, field names, and nesting depths. An LLM consuming raw responses from three different ATS providers will waste tokens parsing structural noise, hallucinate field names it saw in a different provider's response, and potentially leak PII that was never intended for the model's context.

Standardizing API responses for LLM consumption is not just a developer experience improvement. It is a token cost problem, a compliance problem, and a reliability problem all at once. The rest of this section is the implementation guide.

Field Selection: When to Include vs Redact

Not every field in an ATS response belongs in an LLM's context window. Sending a full candidate record - with home address, phone number, EEO demographic data, and interviewer notes - to an agent whose job is to move a pipeline stage is wasteful and dangerous. The right approach is to define a minimal safe-field list per use case.

The matrix below covers the four primary ATS agent use cases against the unified data model. "Include" means the field is sent to the LLM. "Redact" means it is stripped before the response leaves your normalization layer. "Hash" means the value is replaced with a deterministic, non-reversible token (useful for join operations without exposing the raw value).

Field	Career Page / Job Board	Candidate Sourcing	Pipeline Triage	Compliance Reporting
`job.title`	Include	Include	Include	Include
`job.description`	Include (truncated)	Include (truncated)	Redact	Redact
`job.department`	Include	Include	Include	Include
`job.office_location`	Include	Include	Redact	Include
`candidate.first_name`	Redact	Include	Include	Hash
`candidate.last_name`	Redact	Include	Include	Hash
`candidate.email`	Redact	Include	Redact	Redact
`candidate.phone`	Redact	Redact	Redact	Redact
`candidate.resume_text`	Redact	Include (truncated)	Redact	Redact
`application.stage`	Redact	Redact	Include	Include
`application.reject_reason`	Redact	Redact	Include	Include
`scorecard.rating`	Redact	Redact	Include	Redact
`scorecard.notes`	Redact	Redact	Include (truncated)	Redact
`interview.scheduled_at`	Redact	Redact	Include	Redact
`eeoc.race`	Redact	Redact	Redact	Include (aggregated only)
`eeoc.gender`	Redact	Redact	Redact	Include (aggregated only)
`eeoc.veteran_status`	Redact	Redact	Redact	Include (aggregated only)
`activity.note_body`	Redact	Redact	Redact	Redact
`user.name` (interviewer)	Redact	Redact	Include	Redact

Two patterns to notice. First, the EEO fields should never appear in an individual candidate context - they are only safe when aggregated for reporting. Second, free-text fields like job.description, candidate.resume_text, and scorecard.notes should be truncated to a token budget (typically 500-1000 tokens) even when included, because a single verbose job description can consume 20% of a smaller model's context window.

Warning

activity.note_body is a consistent source of PII leakage. Recruiters paste phone numbers, salary expectations, and personal circumstances into notes. Treat this field as toxic by default and redact it unless the use case explicitly requires it with a PII scan applied.

The LLM-Ready Envelope Schema

LLMs perform better when every tool response follows a predictable structure. A standard envelope wrapping every ATS response gives the model a consistent shape to parse, reduces hallucinated field access, and gives your telemetry layer a stable contract to hook into.

Here is the envelope pattern that works across list and get operations:

{
  "resource": "candidates",
  "method": "list",
  "integration": "greenhouse",
  "timestamp": "2026-06-15T08:30:00Z",
  "pagination": {
    "next_cursor": "eyJpZCI6IDQ1Nn0=",
    "has_more": true,
    "limit": 25
  },
  "data": [
    {
      "id": "cand_8a3f",
      "first_name": "Jamie",
      "last_name": "Chen",
      "current_stage": "Onsite",
      "applied_job_title": "Senior Backend Engineer"
    }
  ],
  "_meta": {
    "fields_redacted": ["email", "phone", "resume_text"],
    "fields_truncated": [],
    "token_estimate": 187
  }
}

Key design decisions in this envelope:

resource and method are top-level so the LLM can identify what it is looking at without re-reading the tool call.
pagination is always present (even for get operations where has_more is false) so the model never has to guess whether more records exist.
_meta.fields_redacted lists fields that were present in the upstream response but stripped before delivery. This prevents the LLM from assuming the field does not exist in the ATS and hallucinating a workaround to fetch it.
_meta.token_estimate gives the calling agent a budget signal. An orchestration layer can use this to decide whether to fetch the next page or summarize what it has.

This envelope should be the contract your MCP tools return. When docs and runtime share the same envelope, your reference document becomes a testable spec rather than aspirational prose.

Mapping and JSONata Recipes for ATS Normalization

The gap between "Greenhouse calls it current_stage" and "Lever calls it stage.text" and "Workday nests it three levels deep in staffing_event.disposition.status_name" is where unified API normalization earns its keep. A data-driven mapping architecture - where integration behavior is defined entirely in configuration, not code - solves this with declarative JSONata expressions.

JSONata is a lightweight query and transformation language for JSON. It lets you declare the mapping as a pure expression with no procedural code, and because expressions have no side effects, they are safe to evaluate in sandboxed or edge environments.

Here are three practical recipes that cover the most common ATS normalization patterns:

Recipe 1: Flatten a nested stage name

Greenhouse returns current_stage.name as a string. Lever nests it in stage.text. This expression normalizes both to a top-level current_stage field:

{
  "current_stage": $exists(current_stage.name)
    ? current_stage.name
    : $exists(stage.text)
      ? stage.text
      : "unknown"
}

Recipe 2: Coalesce email from multiple shapes

Some ATS providers return email as a string, others as an array of {type, value} objects. This normalizes to a single primary email:

{
  "email": $type(emails) = "array"
    ? emails[type = "primary"].value
    : $type(email) = "string"
      ? email
      : null
}

Recipe 3: Truncate free-text fields to a token budget

For fields like resume_text or job.description, cap the output to approximately 800 tokens (~3200 characters) to avoid blowing out context windows:

{
  "description_truncated": $length(description) > 3200
    ? $substring(description, 0, 3200) & "...[truncated]"
    : description
}

These expressions are stored alongside integration configuration, not embedded in handler code. When a new ATS provider is added, the mapping is a new config record - not a pull request. For a deep dive on the three-level override architecture (platform, environment, account) that makes this scale to hundreds of tenants, see the JSONata mapping guide.

PII Redaction and Hashing Patterns

Field selection handles the known fields. PII redaction handles the unknown ones - the recruiter who pasted a Social Security Number into a candidate note, the phone number buried in a free-text description field, the email address hiding inside a JSON blob your schema didn't anticipate.

A layered strategy works best:

Layer 1: Schema-driven field stripping. Before any response reaches the LLM, strip fields that are categorically unsafe based on the use-case matrix above. This is the fastest and most reliable layer - if the field is not in the allow-list, it does not get serialized. No regex, no ML models, no false positive risk.

Layer 2: Pattern-based scanning on free-text fields. For fields that pass the allow-list but contain free-text content (notes, descriptions, scorecard comments), apply regex-based detection for structured PII patterns: emails, phone numbers, SSNs, credit card numbers. Replace matches with typed placeholders like [EMAIL_REDACTED] or [PHONE_REDACTED] so the LLM knows data was there but cannot access the raw value.

Common regex patterns worth implementing:

const PII_PATTERNS: Record<string, RegExp> = {
  email:    /[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}/g,
  usPhone:  /(?:\+?1[-\s.]?)?\(?\d{3}\)?[-\s.]?\d{3}[-\s.]?\d{4}/g,
  ssn:      /\b\d{3}-\d{2}-\d{4}\b/g,
  ipv4:     /\b(?:\d{1,3}\.){3}\d{1,3}\b/g,
};
 
function redactFreeText(text: string): string {
  let cleaned = text;
  for (const [type, pattern] of Object.entries(PII_PATTERNS)) {
    cleaned = cleaned.replace(pattern, `[${type.toUpperCase()}_REDACTED]`);
  }
  return cleaned;
}

Layer 3: NLP-based entity recognition for contextual PII. Pattern matching misses names, addresses, and context-dependent identifiers that do not follow fixed formats. For these, Microsoft Presidio is the standard open-source option. It combines NLP models, regex patterns, and rule-based logic to detect entities like names, locations, and medical IDs, then anonymizes them through redaction, hashing, or masking. Presidio runs as a Python service or HTTP sidecar. Wire it into your normalization pipeline as a post-processing step on any free-text field that passes Layers 1 and 2.

Tip

Hashing vs Redaction Trade-Off
Hashing preserves referential integrity - you can still join on a hashed candidate_id across two different tool calls. But hashed values of low-cardinality fields (like names) can be reversed with rainbow tables. Use hashing for IDs and high-cardinality identifiers. Use full redaction (replacement with a placeholder) for names, addresses, and demographic data.

MCP Tool Generation and Flattened Input Patterns

LLMs are bad at constructing deeply nested JSON. Every level of nesting increases the probability of a malformed tool call. When generating MCP tools from ATS resource definitions, flatten the input schema as much as possible.

Consider a create_application tool. The underlying ATS API might expect:

{
  "candidate": { "id": "cand_8a3f" },
  "job": { "id": "job_12ab" },
  "source": { "type": "referral", "referrer": { "name": "Alex" } }
}

The MCP tool schema should flatten this to top-level properties:

{
  "name": "create_an_ats_application",
  "description": "Create a new application linking a candidate to a job.",
  "inputSchema": {
    "type": "object",
    "properties": {
      "candidate_id": { "type": "string", "description": "The candidate ID" },
      "job_id": { "type": "string", "description": "The job ID" },
      "source_type": { "type": "string", "enum": ["referral", "agency", "direct", "internal"] },
      "source_referrer_name": { "type": "string", "description": "Name of the referrer, if source_type is referral" }
    },
    "required": ["candidate_id", "job_id"]
  }
}

The mapping layer re-nests these flat inputs into the shape the upstream API expects before proxying the request. The LLM never sees the nesting. This pattern applies to every write operation across the ATS data model - create_candidate, update_application_stage, submit_scorecard - and is especially important for tools that map to JobFormFields, where the custom fields per role can number in the dozens.

Two additional patterns that improve LLM tool-call accuracy:

Descriptive names over short names. list_all_ats_candidates beats list_candidates because the LLM has less ambiguity when multiple integrations are loaded in the same context.
Enum values in the schema, not the description. Put allowed values in "enum": ["phone_screen", "onsite", "offer"] rather than describing them in prose. The LLM's structured output mode will enforce the constraint.

Downloadable Flattened Input Schema Examples for ATS Tools

A production-quality MCP reference should publish a complete, copy-paste-ready catalog of flattened tool schemas for every write operation. Here are four canonical examples covering the highest-value ATS write paths. Ship these verbatim in your reference so LLM orchestration teams can hardcode them into their agent prompts and test harnesses.

create_an_ats_candidate - Flattens nested contact, address, and social profile objects into a single tier.

{
  "name": "create_an_ats_candidate",
  "description": "Create a new candidate record in the ATS. All contact fields are top-level.",
  "inputSchema": {
    "type": "object",
    "properties": {
      "first_name":       { "type": "string" },
      "last_name":        { "type": "string" },
      "primary_email":    { "type": "string", "format": "email" },
      "primary_phone":    { "type": "string" },
      "linkedin_url":     { "type": "string", "format": "uri" },
      "github_url":       { "type": "string", "format": "uri" },
      "address_city":     { "type": "string" },
      "address_country":  { "type": "string", "description": "ISO 3166-1 alpha-2 code" },
      "source_channel":   { "type": "string", "enum": ["referral", "agency", "direct", "job_board", "linkedin", "internal"] },
      "source_referrer_id": { "type": "string", "description": "Employee user ID if source_channel is referral" }
    },
    "required": ["first_name", "last_name", "primary_email", "source_channel"]
  }
}

update_an_ats_application_stage_by_id - The canonical pipeline triage tool. Note the enum on target_stage and the required application_id.

{
  "name": "update_an_ats_application_stage_by_id",
  "description": "Move an application to a new stage. Fails if target_stage is not valid for the application's job.",
  "inputSchema": {
    "type": "object",
    "properties": {
      "application_id":  { "type": "string", "description": "The application ID. Required." },
      "target_stage":    { "type": "string", "enum": ["application_review", "phone_screen", "technical_screen", "onsite", "reference_check", "offer", "hired", "rejected"] },
      "reject_reason_id": { "type": "string", "description": "Required only when target_stage is 'rejected'" },
      "note_body":       { "type": "string", "description": "Optional context for the stage change. Free text is PII-scanned before persistence." }
    },
    "required": ["application_id", "target_stage"]
  }
}

submit_an_ats_scorecard - Flattens per-attribute ratings that would otherwise be a nested array of {attribute, rating, notes} objects into a fixed set of top-level fields.

{
  "name": "submit_an_ats_scorecard",
  "description": "Submit an interview scorecard. Attribute ratings are 1-5 integers.",
  "inputSchema": {
    "type": "object",
    "properties": {
      "application_id":         { "type": "string" },
      "interview_id":           { "type": "string" },
      "interviewer_user_id":    { "type": "string" },
      "overall_recommendation": { "type": "string", "enum": ["strong_yes", "yes", "neutral", "no", "strong_no"] },
      "rating_technical":       { "type": "integer", "minimum": 1, "maximum": 5 },
      "rating_communication":   { "type": "integer", "minimum": 1, "maximum": 5 },
      "rating_culture":         { "type": "integer", "minimum": 1, "maximum": 5 },
      "notes_technical":        { "type": "string" },
      "notes_communication":    { "type": "string" },
      "notes_culture":          { "type": "string" }
    },
    "required": ["application_id", "interview_id", "interviewer_user_id", "overall_recommendation"]
  }
}

create_an_ats_job - Demonstrates flattening of the custom_fields array. Instead of expecting the LLM to construct [{key, value}, ...], expose known custom fields as named top-level properties driven by per-account schema detection.

{
  "name": "create_an_ats_job",
  "description": "Create a job posting. Custom fields (custom_*) are surfaced per-tenant from the ATS schema.",
  "inputSchema": {
    "type": "object",
    "properties": {
      "title":              { "type": "string" },
      "department_id":      { "type": "string" },
      "office_location_id": { "type": "string" },
      "employment_type":    { "type": "string", "enum": ["full_time", "part_time", "contract", "intern"] },
      "hiring_manager_user_id": { "type": "string" },
      "salary_currency":    { "type": "string", "description": "ISO 4217, e.g. USD" },
      "salary_min":         { "type": "number" },
      "salary_max":         { "type": "number" },
      "custom_cost_center": { "type": "string", "description": "Tenant-specific custom field" },
      "custom_req_type":    { "type": "string", "enum": ["new_headcount", "backfill", "conversion"] }
    },
    "required": ["title", "department_id", "employment_type"]
  }
}

Every one of these schemas is single-tier. There are no oneOf, no nested objects, no arrays of objects the LLM has to construct. That constraint is what keeps tool-call accuracy above 95% on production Claude and GPT-4-class models.

Flattened-to-Nested Rehydration Recipe

The mirror image of flattening is rehydration: when the tool call arrives with flat inputs, the mapping layer must reconstruct the nested shape the upstream ATS API actually expects. This is a mechanical transform, and JSONata handles it cleanly.

Given the LLM sends this flat payload for create_an_ats_candidate:

{
  "first_name": "Jamie",
  "last_name": "Chen",
  "primary_email": "jamie@example.com",
  "primary_phone": "+1 415 555 0142",
  "linkedin_url": "https://linkedin.com/in/jamiechen",
  "address_city": "San Francisco",
  "address_country": "US",
  "source_channel": "referral",
  "source_referrer_id": "user_9f21"
}

A rehydration JSONata expression rebuilds the Greenhouse-shaped payload:

{
  "first_name": first_name,
  "last_name": last_name,
  "email_addresses": [
    { "value": primary_email, "type": "personal" }
  ],
  "phone_numbers": $exists(primary_phone) ? [
    { "value": primary_phone, "type": "mobile" }
  ] : [],
  "social_media_addresses": $exists(linkedin_url) ? [
    { "value": linkedin_url }
  ] : [],
  "addresses": ($exists(address_city) or $exists(address_country)) ? [
    {
      "city": address_city,
      "country_code": address_country,
      "type": "home"
    }
  ] : [],
  "applications": [],
  "source": {
    "type": source_channel,
    "referrer": source_channel = "referral" and $exists(source_referrer_id) ? {
      "type": "user",
      "id": source_referrer_id
    } : null
  }
}

Same flat input, different upstream. The Lever-shaped rehydration is a separate JSONata expression stored against the Lever integration config:

{
  "name": first_name & " " & last_name,
  "emails": [primary_email],
  "phones": $exists(primary_phone) ? [
    { "value": primary_phone, "type": "mobile" }
  ] : [],
  "links": $exists(linkedin_url) ? [linkedin_url] : [],
  "location": $exists(address_city) ? { "name": address_city } : null,
  "origin": source_channel,
  "sources": $exists(source_referrer_id) ? [source_referrer_id] : []
}

The pattern generalizes. Any nested-array-of-objects field on the upstream API becomes an optional array in the rehydration expression, gated on $exists() so absent flat inputs do not produce empty arrays that the upstream might reject. Any oneOf variant (like source.referrer differing between employee referrals and agency referrals) becomes a ternary that inspects a flat discriminator field (source_channel).

Three rules make rehydration safe:

Never introduce fields the LLM did not send. If linkedin_url is missing, omit social_media_addresses entirely rather than emitting an empty array. Some upstream ATS APIs treat empty arrays as "clear all existing entries."
Rehydration is one-way. The response coming back from the upstream is flattened again through a separate JSONata expression before returning to the LLM. There is no round-trip identity requirement.
Validate against the upstream OpenAPI schema in CI. For every flat -> nested mapping, run the rehydrated output through an OpenAPI validator against the upstream's spec. This catches drift when the ATS provider adds a required field to their nested structure.

Sample Generated MCP Tool JSON for LangChain and LangGraph

MCP tools generated on the fly need to work seamlessly with the frameworks enterprises actually use. LangChain and LangGraph consume MCP tools through the @langchain/mcp-adapters package, which converts each MCP tool descriptor into a LangChain StructuredTool at runtime. The tool JSON you emit from tools/list is the input to that adapter, so shape matters.

Here is the full tools/list response as an ATS-focused MCP server would produce it, ready to be picked up by a LangGraph ReAct agent:

{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "tools": [
      {
        "name": "list_all_ats_candidates",
        "description": "List candidates from the ATS. Supports filtering by job_id, stage, and updated_at_gte. Pagination via next_cursor - pass back the cursor value exactly as received.",
        "inputSchema": {
          "type": "object",
          "properties": {
            "job_id":          { "type": "string", "description": "Filter to candidates on a specific job" },
            "stage":           { "type": "string", "enum": ["application_review", "phone_screen", "technical_screen", "onsite", "reference_check", "offer", "hired", "rejected"] },
            "updated_at_gte":  { "type": "string", "format": "date-time" },
            "limit":           { "type": "string", "description": "Number of records to fetch, max 100" },
            "next_cursor":     { "type": "string", "description": "Cursor from previous response. Pass exactly as received." }
          }
        }
      },
      {
        "name": "get_single_ats_candidate_by_id",
        "description": "Retrieve a single candidate by ID.",
        "inputSchema": {
          "type": "object",
          "properties": {
            "id": { "type": "string", "description": "The candidate ID. Required." }
          },
          "required": ["id"]
        }
      },
      {
        "name": "update_an_ats_application_stage_by_id",
        "description": "Move an application to a new stage. Fails if target_stage is not valid for the application's job.",
        "inputSchema": {
          "type": "object",
          "properties": {
            "application_id":   { "type": "string" },
            "target_stage":     { "type": "string", "enum": ["application_review", "phone_screen", "technical_screen", "onsite", "reference_check", "offer", "hired", "rejected"] },
            "reject_reason_id": { "type": "string" },
            "note_body":        { "type": "string" }
          },
          "required": ["application_id", "target_stage"]
        }
      },
      {
        "name": "submit_an_ats_scorecard",
        "description": "Submit an interview scorecard with per-attribute ratings (1-5).",
        "inputSchema": {
          "type": "object",
          "properties": {
            "application_id":         { "type": "string" },
            "interview_id":           { "type": "string" },
            "interviewer_user_id":    { "type": "string" },
            "overall_recommendation": { "type": "string", "enum": ["strong_yes", "yes", "neutral", "no", "strong_no"] },
            "rating_technical":       { "type": "integer", "minimum": 1, "maximum": 5 },
            "rating_communication":   { "type": "integer", "minimum": 1, "maximum": 5 },
            "rating_culture":         { "type": "integer", "minimum": 1, "maximum": 5 },
            "notes_technical":        { "type": "string" },
            "notes_communication":    { "type": "string" },
            "notes_culture":          { "type": "string" }
          },
          "required": ["application_id", "interview_id", "interviewer_user_id", "overall_recommendation"]
        }
      }
    ]
  }
}

Wiring this into a LangGraph agent takes about 20 lines:

import { MultiServerMCPClient } from '@langchain/mcp-adapters';
import { ChatAnthropic } from '@langchain/anthropic';
import { createReactAgent } from '@langchain/langgraph/prebuilt';
 
const mcp = new MultiServerMCPClient({
  mcpServers: {
    ats: {
      url: process.env.ATS_MCP_URL!,
      headers: { Authorization: `Bearer ${process.env.MCP_API_TOKEN}` },
      transport: 'http',
    },
  },
});
 
const tools = await mcp.getTools();
// tools is now a LangChain StructuredTool[] whose schemas match the flat
// inputSchema above one-to-one. Zod schemas are inferred automatically.
 
const agent = createReactAgent({
  llm: new ChatAnthropic({ model: 'claude-sonnet-4-5', temperature: 0 }),
  tools,
});
 
const result = await agent.invoke({
  messages: [{
    role: 'user',
    content: 'Move application app_7a2b to the onsite stage and add a note that the recruiter loop was strong.',
  }],
});

Because each inputSchema is a single-tier JSON Schema with explicit enums and required fields, the LangChain adapter derives a matching Zod schema with no ambiguity. The agent's tool call arrives at your MCP server as { application_id: "app_7a2b", target_stage: "onsite", note_body: "..." } and the rehydration JSONata reconstructs the nested Greenhouse or Lever payload before proxying. This is the pattern LLM compatible API formats for ATS should follow across every provider you support.

Testing Checklist and Telemetry Signals

A normalized ATS response is only trustworthy if you can prove it stays normalized. Ship CI tests alongside your mapping configuration, and instrument telemetry events that catch drift before it reaches production agents.

CI tests for field normalization:

Every ATS integration mapping should have a snapshot test that runs against a fixture of the upstream provider's raw response and asserts the normalized output matches the expected envelope shape.

import { describe, it, expect } from 'vitest';
import { normalize } from './ats-normalizer';
import greenhouseFixture from './fixtures/greenhouse-candidates.json';
import leverFixture from './fixtures/lever-candidates.json';
 
describe('candidate normalization', () => {
  it('greenhouse: produces envelope with required fields', () => {
    const result = normalize('greenhouse', 'candidates', 'list', greenhouseFixture);
    expect(result).toHaveProperty('resource', 'candidates');
    expect(result).toHaveProperty('pagination.has_more');
    expect(result.data[0]).toHaveProperty('id');
    expect(result.data[0]).toHaveProperty('first_name');
    expect(result.data[0]).not.toHaveProperty('email'); // redacted
  });
 
  it('lever: same envelope shape as greenhouse', () => {
    const result = normalize('lever', 'candidates', 'list', leverFixture);
    expect(Object.keys(result.data[0]).sort())
      .toEqual(Object.keys(
        normalize('greenhouse', 'candidates', 'list', greenhouseFixture).data[0]
      ).sort());
  });
 
  it('rejects unknown fields in output', () => {
    const result = normalize('greenhouse', 'candidates', 'list', greenhouseFixture);
    const allowedFields = ['id', 'first_name', 'last_name', 'current_stage', 'applied_job_title'];
    for (const record of result.data) {
      for (const key of Object.keys(record)) {
        expect(allowedFields).toContain(key);
      }
    }
  });
});

The key assertion is the second test: normalized output from different ATS providers should produce the same field set for the same resource and use case. If Greenhouse returns current_stage but Lever returns stage_name, the mapping is broken and the LLM will hallucinate.

Telemetry events to instrument:

Event	What it catches	Alert threshold
`ats.response.field_count`	Unexpected schema changes upstream. If Greenhouse adds 15 new fields overnight, your allow-list is stale.	> 20% deviation from baseline
`ats.response.token_estimate`	Token budget blow-outs. Tracks the `_meta.token_estimate` value per response.	> 2x the p95 for that resource
`ats.response.truncation_applied`	Indicates a free-text field exceeded the token budget and was cut. High rates mean the budget is too tight or job descriptions are getting longer.	> 30% of responses
`ats.response.pii_detected`	A Layer 2 or Layer 3 scan found PII in a free-text field.	Any occurrence
`ats.response.stale_cursor`	The cursor from a previous pagination call is no longer valid upstream. Common after ATS bulk imports.	> 5% of paginated requests
`ats.mapping.fallback_used`	A JSONata expression hit the fallback branch (the `"unknown"` case in Recipe 1). Means a provider returned a shape you have not mapped.	Any occurrence in production

These events feed into a freshness and correctness dashboard. When the pii_detected counter spikes, you know a recruiter started pasting sensitive data into a field you were passing through. When fallback_used fires, you know an ATS provider changed their API response shape.

Operational Playbook for Long-Running ATS Endpoints

Not every ATS API call returns in 200ms. Bulk candidate exports, full pipeline snapshots, and compliance data pulls can take 30 seconds to several minutes. LLM tool calls have timeout expectations - Claude Desktop, for example, assumes a tool responds within a few seconds. If your MCP server blocks for 45 seconds waiting on a Workday bulk export, the agent will assume the tool failed and retry, potentially triggering duplicate operations.

Handle these with an async polling pattern:

The MCP tool returns immediately with a status: "accepted" response and a poll_id.
A second MCP tool - check_ats_export_status - accepts the poll_id and returns either status: "running" with an estimated completion time, or status: "complete" with the result payload.
The tool description instructs the LLM to call check_ats_export_status after a delay instead of retrying the original export.

{
  "resource": "candidates",
  "method": "bulk_export",
  "status": "accepted",
  "poll_id": "exp_7f2a9c",
  "retry_after_seconds": 15,
  "data": null,
  "_meta": {
    "instruction": "Do not retry this tool. Call check_ats_export_status with poll_id exp_7f2a9c after 15 seconds."
  }
}

The _meta.instruction field is read by the LLM as part of the tool response and steers it away from retrying. This pattern also applies to interview scheduling endpoints (which often depend on calendar availability lookups) and scorecard submission endpoints that trigger downstream webhooks.

End-to-End Multi-Tool Async Flow: The Full JSON-RPC Trace

A reference document that only shows one message from the polling flow leaves engineers guessing about the rest. Publish the complete round trip so orchestration teams can implement it without a support ticket. Below is every JSON-RPC message exchanged between an LLM agent and the MCP server for a Workday-backed bulk candidate export.

Step 1: The agent inspects the tool catalog and sees both the async initiator and the status checker.

{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/list",
  "params": {}
}

The server returns (abbreviated to the two async tools):

{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "tools": [
      {
        "name": "start_ats_candidate_bulk_export",
        "description": "Initiate a bulk export of candidates. Returns immediately with a poll_id. This tool does NOT return the export data. To retrieve results, call check_ats_export_status with the poll_id after the retry_after_seconds delay. Do not retry this tool if you receive status='accepted'.",
        "inputSchema": {
          "type": "object",
          "properties": {
            "job_id":          { "type": "string", "description": "Optional: scope export to one job" },
            "updated_at_gte":  { "type": "string", "format": "date-time" },
            "format":          { "type": "string", "enum": ["json", "csv"], "default": "json" }
          }
        }
      },
      {
        "name": "check_ats_export_status",
        "description": "Check the status of a previously initiated export. Returns status='running' (call again later), status='complete' (data is included), or status='failed' (see error field). Call this tool AFTER the retry_after_seconds delay from the previous response.",
        "inputSchema": {
          "type": "object",
          "properties": {
            "poll_id": { "type": "string", "description": "The poll_id returned by start_ats_candidate_bulk_export or a previous check_ats_export_status call." }
          },
          "required": ["poll_id"]
        }
      }
    ]
  }
}

Step 2: The agent initiates the export.

{
  "jsonrpc": "2.0",
  "id": 2,
  "method": "tools/call",
  "params": {
    "name": "start_ats_candidate_bulk_export",
    "arguments": {
      "updated_at_gte": "2026-06-01T00:00:00Z",
      "format": "json"
    }
  }
}

Step 3: The MCP server enqueues the export job upstream and returns immediately with the accepted envelope.

{
  "jsonrpc": "2.0",
  "id": 2,
  "result": {
    "content": [{
      "type": "text",
      "text": "{\"resource\":\"candidates\",\"method\":\"bulk_export\",\"integration\":\"workday\",\"timestamp\":\"2026-06-15T08:30:00Z\",\"status\":\"accepted\",\"poll_id\":\"exp_7f2a9c\",\"retry_after_seconds\":15,\"data\":null,\"_meta\":{\"instruction\":\"Do not retry this tool. Call check_ats_export_status with poll_id exp_7f2a9c after 15 seconds.\",\"estimated_total_seconds\":45}}"
    }]
  }
}

Step 4: The agent waits 15 seconds, then calls the status tool.

{
  "jsonrpc": "2.0",
  "id": 3,
  "method": "tools/call",
  "params": {
    "name": "check_ats_export_status",
    "arguments": { "poll_id": "exp_7f2a9c" }
  }
}

Step 5: The export is still running upstream. The server returns a running envelope with an updated retry delay.

{
  "jsonrpc": "2.0",
  "id": 3,
  "result": {
    "content": [{
      "type": "text",
      "text": "{\"resource\":\"candidates\",\"method\":\"bulk_export\",\"integration\":\"workday\",\"timestamp\":\"2026-06-15T08:30:15Z\",\"status\":\"running\",\"poll_id\":\"exp_7f2a9c\",\"progress_percent\":42,\"retry_after_seconds\":20,\"data\":null,\"_meta\":{\"instruction\":\"Export is 42% complete. Call check_ats_export_status again with the same poll_id after 20 seconds.\",\"records_processed\":8400,\"records_total\":20000}}"
    }]
  }
}

Step 6: The agent waits 20 more seconds and polls again.

{
  "jsonrpc": "2.0",
  "id": 4,
  "method": "tools/call",
  "params": {
    "name": "check_ats_export_status",
    "arguments": { "poll_id": "exp_7f2a9c" }
  }
}

Step 7: The upstream export has completed. The server returns the full result payload using the same standard envelope, with status: "complete" and data populated.

{
  "jsonrpc": "2.0",
  "id": 4,
  "result": {
    "content": [{
      "type": "text",
      "text": "{\"resource\":\"candidates\",\"method\":\"bulk_export\",\"integration\":\"workday\",\"timestamp\":\"2026-06-15T08:30:35Z\",\"status\":\"complete\",\"poll_id\":\"exp_7f2a9c\",\"pagination\":{\"next_cursor\":null,\"has_more\":false,\"limit\":20000},\"data\":[{\"id\":\"cand_8a3f\",\"first_name\":\"Jamie\",\"last_name\":\"Chen\",\"current_stage\":\"Onsite\",\"applied_job_title\":\"Senior Backend Engineer\"}],\"_meta\":{\"fields_redacted\":[\"email\",\"phone\"],\"records_returned\":20000,\"token_estimate\":184000,\"instruction\":\"Export complete. Do not call check_ats_export_status again for this poll_id.\"}}"
    }]
  }
}

Three properties of this flow are worth calling out explicitly in your reference:

The envelope is the same at every stage. status is the only field that changes between accepted, running, and complete. The agent parses one shape, never two.
_meta.instruction is the steering wheel. Every intermediate response tells the LLM exactly what to do next. Without this, agents will call start_ats_candidate_bulk_export again after 30 seconds of silence and duplicate the job upstream.
retry_after_seconds can change mid-flow. The initial response might suggest 15 seconds; the running response might extend it to 20 seconds based on real progress. Agents that treat this as ground truth avoid hammering the status endpoint.

A production implementation should also expose a cancel_ats_export tool that takes a poll_id and terminates the upstream job, so agents can abort work when the user changes their mind. Document its schema alongside the polling tools.

Document these async tools explicitly in your MCP reference, including the expected polling interval and the maximum time before the operation is considered failed. Enterprise buyers running compliance audits or diversity reporting workflows will hit these long-running endpoints first, and their agents need a clear contract for how to wait.

Connecting AI Agents to Oracle NetSuite and SAP: A Practical Blueprint

The two ERPs that block the most enterprise AI deals are Oracle NetSuite and SAP. Both are old, both are deeply customized per tenant, and both punish naive integrations with cryptic errors, aggressive rate limits, and multi-surface APIs (REST, SOAP, OData v4, RFC/BAPI). If your MCP reference does not have concrete NetSuite and SAP guidance, procurement will assume you have never done it before.

This section is the code-first path. A runnable starter repo layout, the exact JSON-RPC an agent sends, a JSONata mapping recipe for per-account NetSuite variation, and a working LangChain demo that produces a read-only invoice summary from either ERP.

MCP Server Starter: Repo Layout and Runbook

Publishing a starter repo alongside your reference gives buyers a working example in under 10 minutes. The layout that has held up across NetSuite and SAP deployments:

mcp-erp-starter/
├── src/
│   ├── server.ts              # JSON-RPC 2.0 handler over HTTP
│   ├── tools/
│   │   ├── registry.ts        # tool discovery + schema generation
│   │   ├── netsuite.ts        # NetSuite tool bindings (SuiteQL + REST)
│   │   └── sap.ts             # SAP tool bindings (OData v4 + BAPI proxy)
│   ├── mappings/
│   │   ├── netsuite/
│   │   │   ├── invoices.jsonata
│   │   │   └── contacts.jsonata
│   │   └── sap/
│   │       ├── invoices.jsonata
│   │       └── business_partners.jsonata
│   ├── auth/
│   │   ├── netsuite-tba.ts    # OAuth 1.0 TBA / HMAC-SHA256
│   │   └── sap-oauth.ts       # OAuth 2.0 client credentials
│   ├── envelope.ts            # LLM-ready response envelope
│   └── ratelimit.ts           # IETF header normalization + backoff
├── fixtures/
│   ├── netsuite-invoices.json
│   └── sap-invoices.json
├── tests/
│   └── normalization.test.ts
├── examples/
│   ├── langchain-invoice-summary.ts
│   └── curl-json-rpc.sh
└── README.md

The runbook is deliberately short:

# 1. Clone and install
git clone https://github.com/your-org/mcp-erp-starter
cd mcp-erp-starter && pnpm install
 
# 2. Configure upstream credentials (both optional; enable only what you need)
cp .env.example .env
# NETSUITE_ACCOUNT_ID=1234567
# NETSUITE_TOKEN_ID=... NETSUITE_TOKEN_SECRET=...
# SAP_HOST=https://my-tenant.s4hana.cloud.sap
# SAP_CLIENT_ID=... SAP_CLIENT_SECRET=...
 
# 3. Run the MCP server locally
pnpm dev
# → JSON-RPC endpoint listening on http://localhost:8787/mcp/local-dev
 
# 4. Register with Claude Desktop or Cursor
# Point the MCP client at http://localhost:8787/mcp/local-dev
 
# 5. Run the demo agent
pnpm demo:invoices

Keep three principles in the README. First, credentials never leave the server process. Second, the server is stateless per request (no session, no response cache). Third, all mapping happens through JSONata files that a non-engineer can edit and hot-reload without a redeploy.

Agent → MCP Flow: A JSON-RPC Example

Enterprise engineers want to see the wire format, not a wrapper SDK call. The full round trip for an agent listing NetSuite invoices looks like this.

Step 1: The agent enumerates available tools.

{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/list",
  "params": {}
}

Step 2: The server returns the tool catalog (abbreviated to two tools):

{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "tools": [
      {
        "name": "list_all_netsuite_invoices",
        "description": "List invoices from NetSuite. Supports filtering by issue_date and status. Use next_cursor for pagination.",
        "inputSchema": {
          "type": "object",
          "properties": {
            "invoice_type": { "type": "string", "enum": ["bill", "invoice"] },
            "issue_date_gte": { "type": "string", "format": "date" },
            "limit": { "type": "string" },
            "next_cursor": { "type": "string" }
          },
          "required": ["invoice_type"]
        }
      },
      {
        "name": "list_all_sap_invoices",
        "description": "List supplier invoices from SAP S/4HANA via OData v4.",
        "inputSchema": {
          "type": "object",
          "properties": {
            "posting_date_gte": { "type": "string", "format": "date" },
            "limit": { "type": "string" },
            "next_cursor": { "type": "string" }
          }
        }
      }
    ]
  }
}

Step 3: The agent invokes a tool.

{
  "jsonrpc": "2.0",
  "id": 2,
  "method": "tools/call",
  "params": {
    "name": "list_all_netsuite_invoices",
    "arguments": {
      "invoice_type": "invoice",
      "issue_date_gte": "2026-01-01",
      "limit": "25"
    }
  }
}

Step 4: The server executes the SuiteQL query, applies JSONata normalization, wraps the result in the standard envelope, and returns:

{
  "jsonrpc": "2.0",
  "id": 2,
  "result": {
    "content": [
      {
        "type": "text",
        "text": "{\"resource\":\"invoices\",\"method\":\"list\",\"integration\":\"netsuite\",\"timestamp\":\"2026-06-15T08:30:00Z\",\"pagination\":{\"next_cursor\":\"b2Zmc2V0PTI1\",\"has_more\":true,\"limit\":25},\"data\":[{\"id\":\"91827\",\"number\":\"INV-2026-0142\",\"status\":\"OPEN\",\"issue_date\":\"2026-01-14\",\"due_date\":\"2026-02-13\",\"total_amount\":48250.00,\"currency\":\"USD\",\"contact_name\":\"Acme Industries\"}],\"_meta\":{\"fields_redacted\":[],\"token_estimate\":142}}"
      }
    ]
  }
}

Two things to notice. The inner payload is a JSON string inside content [0].text per the MCP spec, and it uses the exact envelope defined earlier. Because the envelope is provider-agnostic, the same agent code works when the tool name changes from list_all_netsuite_invoices to list_all_sap_invoices.

Normalized Rate-Limit Headers and Agent Backoff for ERPs

NetSuite and SAP both enforce rate limits, but they surface them differently. NetSuite uses a concurrency governor plus HTTP 429 with an X-NetSuite-Concurrency-Limit-Remaining header on some endpoints and no header on others. SAP S/4HANA Cloud returns 429 with Retry-After and a sap-remaining-request-quota header on OData calls. Neither matches IETF draft ratelimit-* headers.

The MCP server's job is to normalize both into the same IETF-style envelope so the agent's backoff logic does not need to know which ERP it hit:

type NormalizedRateLimit = {
  'ratelimit-limit': string;
  'ratelimit-remaining': string;
  'ratelimit-reset': string; // seconds until reset
};
 
function normalizeNetSuite(res: Response): NormalizedRateLimit {
  return {
    'ratelimit-limit': res.headers.get('x-netsuite-concurrency-limit') ?? '10',
    'ratelimit-remaining': res.headers.get('x-netsuite-concurrency-limit-remaining') ?? '0',
    'ratelimit-reset': res.headers.get('retry-after') ?? '1',
  };
}
 
function normalizeSAP(res: Response): NormalizedRateLimit {
  return {
    'ratelimit-limit': res.headers.get('sap-request-quota') ?? '1000',
    'ratelimit-remaining': res.headers.get('sap-remaining-request-quota') ?? '0',
    'ratelimit-reset': res.headers.get('retry-after') ?? '60',
  };
}

When a 429 occurs, pass the normalized headers through to the agent along with the JSON-RPC error, and let the agent apply exponential backoff with jitter. A production-ready agent-side retry loop:

async function callToolWithBackoff(
  mcpUrl: string,
  body: JsonRpcRequest,
  maxAttempts = 5
): Promise<JsonRpcResponse> {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const res = await fetch(mcpUrl, {
      method: 'POST',
      headers: { 'content-type': 'application/json' },
      body: JSON.stringify(body),
    });
 
    if (res.status !== 429) return res.json();
 
    const reset = Number(res.headers.get('ratelimit-reset') ?? 1);
    const base = Math.max(reset * 1000, 250 * 2 ** attempt);
    const jitter = Math.random() * 250;
    await new Promise(r => setTimeout(r, base + jitter));
  }
  throw new Error('Exceeded max retry attempts');
}

For NetSuite specifically, keep concurrency in mind. A single account allows 5 to 25 concurrent SuiteQL requests depending on tier. If an agent fans out 30 parallel invoice list calls, most will 429 immediately. Serialize the agent's list calls, or use a semaphore in the MCP server to cap upstream concurrency before it reaches NetSuite. For SAP OData v4, prefer $top and $skiptoken cursor pagination over $skip offset pagination - $skip performs poorly on large tables and is more likely to hit the request-quota limit.

Per-Account Schema Overrides with JSONata

NetSuite is the poster child for per-account variation. A OneWorld tenant has subsidiaries. A single-subsidiary tenant does not. A multi-currency tenant has a currency table. A single-currency tenant returns an error on the same query. Your invoice list SuiteQL must adapt to four combinations of these two flags. Hardcoding this into handler code is unmaintainable across dozens of tenants.

The pattern that scales: detect account features at connection time, store them as context.multi_currency and context.multi_subsidiary, then reference them from JSONata mapping expressions that build the SuiteQL query and shape the response.

Feature detection at post-install:

async function detectNetSuiteFeatures(accountId: string) {
  const currencyProbe = await runSuiteQL(accountId, 'SELECT id FROM currency LIMIT 1');
  const subsidiaryProbe = await runSuiteQL(accountId, 'SELECT id FROM subsidiary LIMIT 1');
 
  return {
    multi_currency: !currencyProbe.error?.includes("Record 'currency' was not found"),
    multi_subsidiary: !subsidiaryProbe.error?.includes("Record 'subsidiary' was not found"),
  };
}

Invoice list query mapping (JSONata):

{
  "query": context.multi_currency and context.multi_subsidiary
    ? "SELECT t.id, t.tranid, t.status, t.trandate, t.duedate,
              t.foreigntotal, c.symbol AS currency, s.name AS subsidiary,
              e.entityid AS contact_name
       FROM transaction t
       JOIN entity e ON t.entity = e.id
       LEFT JOIN currency c ON t.currency = c.id
       LEFT JOIN subsidiary s ON t.subsidiary = s.id
       WHERE t.type = 'CustInvc'"
    : context.multi_currency
      ? "SELECT t.id, t.tranid, t.status, t.trandate, t.duedate,
                t.foreigntotal, c.symbol AS currency, e.entityid AS contact_name
         FROM transaction t JOIN entity e ON t.entity = e.id
         LEFT JOIN currency c ON t.currency = c.id
         WHERE t.type = 'CustInvc'"
      : "SELECT t.id, t.tranid, t.status, t.trandate, t.duedate,
                t.foreigntotal, e.entityid AS contact_name
         FROM transaction t JOIN entity e ON t.entity = e.id
         WHERE t.type = 'CustInvc'"
}

Response mapping (JSONata) with status normalization:

{
  "data": items.{
    "id": $string(id),
    "number": tranid,
    "status": status = 'A' ? 'OPEN'
            : status = 'B' ? 'PAID'
            : status = 'C' ? 'CANCELLED'
            : status = 'D' ? 'SUBMITTED'
            : status = 'E' ? 'REJECTED'
            : 'UNKNOWN',
    "issue_date": trandate,
    "due_date": duedate,
    "total_amount": $number(foreigntotal),
    "currency": $exists(currency) ? currency : 'USD',
    "subsidiary": $exists(subsidiary) ? subsidiary : null,
    "contact_name": contact_name
  }
}

Per-account override: A specific NetSuite customer wants their custom custbody_department_code field surfaced on every invoice. Rather than editing the global mapping, layer an account-scoped override on top that merges into the normalized response:

{
  "data": data#$i.$merge([
    $,
    { "department_code": $lookup(%.%.raw.items[$i], "custbody_department_code") }
  ])
}

Overrides are stored per integrated account, applied after the global mapping runs, and evaluated in a sandbox with no I/O. When the tenant's customization changes, an admin edits the override record and the next tool call reflects it. No redeploy, no rebuild.

The same pattern works for SAP. SAP S/4HANA exposes CDS views, custom fields via the extensibility framework, and different SupplierInvoice shapes across on-premise vs cloud. Detect the deployment type at post-install (the $metadata endpoint of the OData service returns different entity sets), store the flavor in context.sap_edition, and branch the JSONata accordingly:

{
  "data": context.sap_edition = 'cloud'
    ? value.{
        "id": SupplierInvoice,
        "number": SupplierInvoiceIDByInvcgParty,
        "status": AccountingDocumentIsPaid ? 'PAID' : 'OPEN',
        "issue_date": PostingDate,
        "total_amount": $number(InvoiceGrossAmount),
        "currency": DocumentCurrency
      }
    : d.results.{
        "id": BelegNr,
        "number": XBLNR,
        "status": AUGBL != '' ? 'PAID' : 'OPEN',
        "issue_date": BLDAT,
        "total_amount": $number(DMBTR),
        "currency": WAERS
      }
}

One mapping file, two SAP deployment models, same envelope out the other end.

Demo: Read-Only Invoice Summary End-to-End

The most common first agent use case for finance teams is a natural-language invoice summary. "Show me all open invoices over $10k from last month." Here is the full round trip using LangChain, targeting the MCP server the starter repo produces.

import { ChatAnthropic } from '@langchain/anthropic';
import { MultiServerMCPClient } from '@langchain/mcp-adapters';
import { createReactAgent } from '@langchain/langgraph/prebuilt';
 
async function main() {
  // 1. Connect to the MCP server(s). Both NetSuite and SAP tools show up
  //    in the same tool list because the envelope is provider-agnostic.
  const mcp = new MultiServerMCPClient({
    mcpServers: {
      erp: {
        url: process.env.MCP_URL!, // e.g. https://api.example.com/mcp/<token>
        headers: { Authorization: `Bearer ${process.env.MCP_API_TOKEN}` },
        transport: 'http',
      },
    },
  });
 
  const tools = await mcp.getTools();
 
  // 2. Build a ReAct agent restricted to read-only ERP tools.
  const readOnlyTools = tools.filter(t =>
    t.name.startsWith('list_all_') || t.name.startsWith('get_single_')
  );
 
  const model = new ChatAnthropic({ model: 'claude-sonnet-4-5', temperature: 0 });
  const agent = createReactAgent({ llm: model, tools: readOnlyTools });
 
  // 3. Run the query.
  const result = await agent.invoke({
    messages: [{
      role: 'user',
      content: 'Summarize all OPEN invoices from January 2026 over $10,000. Group by currency and show total exposure per contact.',
    }],
  });
 
  console.log(result.messages[result.messages.length - 1].content);
  await mcp.close();
}
 
main();

What happens under the hood:

The agent inspects the tool catalog and sees list_all_netsuite_invoices (and optionally list_all_sap_invoices if both are connected).
It calls the tool with { invoice_type: "invoice", issue_date_gte: "2026-01-01", limit: "50" }.
The MCP server executes the account-appropriate SuiteQL variant (chosen by the multi_currency/multi_subsidiary flags), normalizes the response through JSONata, and returns the standard envelope with next_cursor.
The agent reads pagination.has_more, calls the tool again with the returned next_cursor (passed back unchanged, per the tool description), and repeats until has_more is false.
The agent filters the aggregated array to status = "OPEN" and total_amount > 10000, groups by currency, and produces the summary.

The output is deterministic and auditable. Every tool call is a discrete JSON-RPC message you can log and replay. Because the agent is restricted to list_* and get_* tools, no accidental write can reach NetSuite or SAP even if the LLM hallucinates. This is the exact pattern to include in your MCP reference: a working example, a scoped tool set, and a natural-language prompt that produces business value on day one.

For a broader treatment of ERP-specific integration patterns, see the guide on connecting AI agents to NetSuite and SAP with unified accounting APIs.

How to Generate and Publish Your MCP Reference Instantly

The hardest part of maintaining an enterprise-grade MCP reference is not writing it once. It is keeping it accurate across hundreds of resources and dozens of upstream API changes per quarter. If your engineering team has to update a markdown file every time they add a new field to an integration, the documentation will drift. When documentation drifts in an AI context, agents hallucinate missing parameters and break production workflows.

Two patterns prevent drift and work reliably in production.

Pattern 1: Documentation-Driven Tool Generation

Derive every tool definition from a single configuration source. Truto approaches this by treating integration behavior entirely as data. The platform utilizes a "zero integration-specific code" architecture. There are no hardcoded handler functions for individual integrations. Instead, integration capabilities are defined in a standardized JSON configuration (config.resources) and mapped using JSONata expressions.

Because the architecture is purely data-driven, Truto automatically generates MCP tools directly from two existing data sources: the integration's resource definitions (which API endpoints exist) and structured documentation records (descriptions and JSON Schema for each method).

When a customer generates an MCP server URL, the system does not load a static list of tools. On every tools/list request, the server dynamically intersects the integration's available resources with its documentation records. If an endpoint does not have a documentation record, it is silently dropped. The same generator that produces the runtime tool list produces the public reference, so they cannot drift.

flowchart LR
  A[Integration config<br/>+ documentation records] --> B[Tool definitions<br/>name, description, schemas]
  B --> C[Public MCP reference<br/>auto-rendered catalog]
  B --> D[Live MCP server<br/>/mcp/:token endpoint]
  C -. always in sync .- D

When the docs and the runtime share one source, your reference document becomes a contract instead of a screenshot.

Pattern 2: Scoped Server Creation with Predictable URLs

Expose a single API that creates an MCP server scoped to a tenant, with method and tag filters and an optional TTL:

POST /integrated-account/:id/mcp
Content-Type: application/json
 
{
  "name": "Acme Corp - Support agent",
  "config": {
    "methods": ["read"],
    "tags": ["support"],
    "require_api_token_auth": true
  },
  "expires_at": "2026-06-01T00:00:00Z"
}

The response returns a URL like https://api.example.com/mcp/<hashed-token> that the customer pastes into Claude or ChatGPT. This is the single primitive that powers contractor access, time-boxed agent runs, and per-environment isolation. Document the endpoint, document the lifecycle, and your reference covers 80% of the procurement checklist on one page.

Warning

Resist the temptation to add custom retry, caching, or transformation logic inside the MCP server. Every layer of magic between the agent and the upstream API is a layer your security reviewer will demand to audit. Pass-through behavior is easier to defend.

Turn AI Compliance into a Competitive Advantage

Enterprise procurement is a game of risk mitigation. The enterprises buying AI-ready SaaS in 2026 are not rewarding the loudest demos. They are rewarding the vendors whose documentation answers their CISO's questions before the call.

A dedicated, highly technical MCP integration reference transforms your AI capabilities from a perceived security risk into a validated architectural advantage. By transparently documenting your pass-through architecture, your token isolation strategies, and your strict schema enforcement, you eliminate the ambiguity that causes deals to stall.

The build path is clear: lock down the security model (per-tenant scoping, hashed token storage, optional second-factor auth, 429 passthrough), generate the tool catalog from the same config that runs the server, and publish a single URL that maps cleanly to procurement checklists. Stop letting legacy competitors dictate the narrative around AI readiness, and give your enterprise buyers the exact artifact they need to approve the purchase.

FAQ

What is API response standardization for LLM consumption?: It is the process of normalizing disparate API responses from multiple SaaS providers (like different ATS platforms) into a consistent envelope schema with predictable field names, pagination structure, and metadata - so LLMs can parse tool results without hallucinating field names or wasting tokens on structural noise.
Which ATS fields should be redacted before sending to an LLM?: It depends on the use case. For pipeline triage, redact emails, phone numbers, resume text, and EEO data. For candidate sourcing, include names and emails but redact phone numbers and EEO data. EEO demographic fields should never appear at the individual candidate level - only in aggregated compliance reports. Free-text fields like recruiter notes should be treated as toxic by default due to embedded PII.
How do you prevent PII leakage in MCP tool responses?: Use a three-layer strategy: schema-driven field stripping (only allow-listed fields get serialized), regex-based pattern scanning on free-text fields for structured PII like emails and SSNs, and NLP-based entity recognition (such as Microsoft Presidio) for contextual PII like names and addresses that do not follow fixed patterns.
Why should MCP tool input schemas be flattened?: LLMs are unreliable at constructing deeply nested JSON objects. Flattening input schemas to top-level properties reduces malformed tool calls. The mapping layer re-nests the flat inputs into the shape the upstream API expects before proxying the request.
How do you handle long-running ATS API calls in MCP servers?: Use an async polling pattern. The initial MCP tool returns immediately with a poll_id and status of accepted. A second tool (like check_ats_export_status) accepts the poll_id and returns either a running status with an estimated completion time, or a complete status with the result payload. The tool description instructs the LLM to poll rather than retry.

Updates

Jul 16, 2026 Added copy-paste ATS tool schema examples (create_candidate, update_application_stage, submit_scorecard, create_job), a flattened-to-nested JSONata rehydration recipe with Greenhouse and Lever targets, a LangChain/LangGraph-ready tools/list sample, and a complete step-by-step JSON-RPC trace of the async polling multi-tool flow (start -> running -> complete).
Jul 4, 2026 Added a new major section on connecting AI agents to Oracle NetSuite and SAP, including a starter repo layout and runbook, a JSON-RPC agent-to-MCP wire example, IETF-normalized rate-limit headers with agent backoff code, per-account JSONata schema override recipes covering NetSuite OneWorld and SAP S/4HANA variants, and an end-to-end LangChain read-only invoice summary demo.
Jun 15, 2026 Added a major new section 'Standardizing ATS API Responses for LLM Consumption' with seven subsections covering field selection use-case matrices, LLM-ready envelope schemas, JSONata mapping recipes, three-layer PII redaction patterns, flattened MCP tool input schemas, CI testing checklists with telemetry signals, and an operational playbook for async/long-running ATS endpoints.

How to Publish a Dedicated MCP Integration Reference for Enterprises

The Procurement Reality: Why "AI-Ready" Fails Security Reviews

AI Agent Integration Enterprise Buyer Requirements

Addressing MCP Server Enterprise Security Requirements

Defeating Credential Aggregation and the Blast Radius Problem

Token Passthrough and Dual Authentication

Transparent Rate Limit Handling and the 429 Contract

Model Context Protocol Documentation Best Practices

1. Client Configuration Instructions

2. Auto-Generated Tool Schemas

Standardizing ATS API Responses for LLM Consumption

Field Selection: When to Include vs Redact

The LLM-Ready Envelope Schema

Mapping and JSONata Recipes for ATS Normalization

PII Redaction and Hashing Patterns

MCP Tool Generation and Flattened Input Patterns

Downloadable Flattened Input Schema Examples for ATS Tools

Flattened-to-Nested Rehydration Recipe

Sample Generated MCP Tool JSON for LangChain and LangGraph

Testing Checklist and Telemetry Signals

Operational Playbook for Long-Running ATS Endpoints

End-to-End Multi-Tool Async Flow: The Full JSON-RPC Trace

Connecting AI Agents to Oracle NetSuite and SAP: A Practical Blueprint

MCP Server Starter: Repo Layout and Runbook

Agent → MCP Flow: A JSON-RPC Example

Normalized Rate-Limit Headers and Agent Backoff for ERPs

Per-Account Schema Overrides with JSONata

Demo: Read-Only Invoice Summary End-to-End

How to Generate and Publish Your MCP Reference Instantly

Pattern 1: Documentation-Driven Tool Generation

Pattern 2: Scoped Server Creation with Predictable URLs

Turn AI Compliance into a Competitive Advantage

FAQ

More from our Blog

The 2026 MCP Buyer's Checklist and Quick-Start Guide for B2B SaaS

Best Practices for Handling API Rate Limits and Retries Across Multiple Third-Party APIs

How to Generate MCP Servers for Your SaaS Users (2026 Architecture Guide)

Zero Data Retention MCP Servers: Building SOC 2 & GDPR Compliant AI Agents

How to Create a Dedicated MCP-Focused Comparison Guide to Win AI Deals

The Hands-On Guide to Building MCP Servers for AI Agents (2026 Architecture)