Why use JSONata for API transformations instead of writing code?

JSONata is a declarative, Turing-complete transformation language purpose-built for JSON. Because mappings are strings, they can be stored as configuration data, versioned, overridden per customer, and hot-swapped without a code deploy. This eliminates the N² maintenance problem of hardcoded vendor adapters.

How should I handle 429 rate limit errors across multiple SaaS APIs?

Normalize the rate-limit signal but do not hide the error. Translate proprietary headers into the IETF standard headers (ratelimit-limit, ratelimit-remaining, ratelimit-reset), and pass HTTP 429 through to the caller. This ensures retry and backoff policy stays in application code where it belongs.

How do you handle custom fields in unified APIs?

Using a 3-level override hierarchy, custom fields can be mapped at the platform, environment, or individual account level. This allows per-customer customization without altering the core codebase. Preserving the raw payload under a 'remote_data' escape hatch also provides direct access to unmapped fields.

API Schema Normalization Tutorial: End-to-End with JSONata

If you are building B2B SaaS integrations, you eventually hit a wall where writing imperative if (provider === 'salesforce') statements stops scaling. Every new connector starts as a clean adapter and ends as a graveyard of vendor-specific date parsers, fragile pagination loops, and on-call pages at 3 AM because a tenant's HubSpot custom field broke your CRM sync.

Software engineering teams evaluate APIs based on Time to First Call (TTFC). As we discussed in our guide on how to publish end-to-end developer tutorials with API examples, developers will paste your example into a terminal, run it, and decide in under five minutes whether your platform is worth their time. If your integration architecture requires developers to spend hours reverse-engineering undocumented payloads, guessing OAuth scopes, or writing custom retry logic from scratch, your product fails the technical evaluation.

This API schema normalization tutorial provides a code-level blueprint for escaping that trap using declarative JSONata mappings. We will cover how to normalize response payloads, translate unified query parameters into vendor-specific syntax, and standardize error handling and rate limits across any REST API—all without writing hardcoded API adapters.

The target reader is a senior engineer, staff engineer, or product manager at a B2B SaaS company who has felt the maintenance tax of code-first integrations and wants a working architectural pattern. We will treat integrations as data instead of code and discuss the honest engineering trade-offs at the end.

The N² Maintenance Trap: Why Hardcoded API Adapters Fail

Short answer: Every new integration you add as bespoke code multiplies the surface area you have to maintain. The financial drain is not the initial build—it is the next three years of API version bumps, custom-field requests, and pagination quirks.

The initial build phase of an API integration is deceptively simple. Connecting to a REST endpoint and mapping a few fields takes a competent engineer a few days. The numbers, however, show how quickly this compounds. The typical mid-market company now runs roughly 110 to 130 different SaaS tools. To capture enterprise deals, your product must integrate with the specific combination of CRMs, HRIS platforms, and ticketing systems your prospect uses. Integrations drive revenue through reduced churn, increased ACVs, and improved win rates, as highlighted by PartnerFleet's State of SaaS Integrations report.

However, building these connections in-house creates a maintenance nightmare. RevTek Capital reports that SaaS engineering teams now spend 20 to 40% of their time maintaining integrations rather than building core features that fuel growth. Integrations often lack transactional integrity and visibility, leading to data inconsistencies, slow error resolution, and degraded customer trust. As startups scale user bases and operations, what began as a lightweight patchwork becomes brittle infrastructure requiring dedicated engineering resources to maintain.

Most teams solve integration fragmentation with brute force. They build a unified facade, but behind that facade, they maintain separate code paths for each provider. They write a hubspot_adapter.ts and a salesforce_adapter.ts. The architecture problem is structural, not motivational:

Provider-specific code paths mean a bug fix in pagination for HubSpot does not help Pipedrive.
Per-vendor schemas force your downstream code to branch on provider names.
Tribal knowledge about which field maps to which gets buried in pull requests from engineers who have since left the company.

This is the N² maintenance trap. Adding a new integration means writing new code and hoping it does not break the integrations already running in production. The fix is to stop treating each integration as a new program and start treating it as configuration that a generic engine executes.

What Is API Schema Normalization?

API schema normalization is the process of mapping disparate third-party API contracts (field names, types, nested shapes, enums, pagination markers, error envelopes) into a single canonical schema your application consumes.

This goes far beyond simple 1:1 key-value mapping like renaming FirstName to first_name. Done properly, true schema normalization covers:

Field mapping including resolving nested objects, arrays, and polymorphic custom objects (see our JSONata mapping examples for concrete patterns).
Type coercion (e.g., Salesforce Id as a string, HubSpot numeric IDs, converting strings to booleans, or Unix epoch vs ISO 8601 dates).
Enum harmonization across vendor-specific picklists.
Query translation between SOQL, OData, GraphQL, and REST filter syntaxes.
Error normalization so retry logic does not need to know the upstream provider.
Pagination unification across cursor, page, offset, and link-header styles.

If you fail to normalize data properly, downstream systems break. Airbyte emphasizes that proper schema normalization is required to prevent AI agents from hallucinating on stale or unstructured data. Group BWT highlights that external data extraction without schema normalization leads to enterprise-grade failures and multi-quarter rebuilds. If your application expects an array of email addresses, but the upstream API returns a comma-separated string, your application will crash. For a deeper look at these architectural challenges, read why schema normalization is the hardest problem in SaaS integrations.
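The comma-separated email case above is the kind of mismatch a small coercion step can absorb at the boundary. The sketch below is illustrative only: toEmailList and toBoolean are hypothetical helper names, not part of JSONata or any library, and the accepted input forms are assumptions.

```javascript
// Hypothetical coercion helpers; names and accepted inputs are assumptions.

// Accepts an array, a comma-separated string, or nothing; always returns an array.
function toEmailList(value) {
  if (value === undefined || value === null) return []
  const parts = Array.isArray(value) ? value : String(value).split(',')
  return parts.map(p => String(p).trim()).filter(p => p.length > 0)
}

// Accepts real booleans or the strings "true"/"false" in any casing.
function toBoolean(value) {
  if (typeof value === 'boolean') return value
  if (typeof value === 'string') return value.trim().toLowerCase() === 'true'
  return false
}
```

Downstream code can then rely on one type per field regardless of which vendor produced the value.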

The right tool for this is a declarative transformation language. JSONata is a powerful, industry-standard declarative query and transformation language for JSON. It allows for complex data manipulations, including filtering, mapping, and reducing, all without writing imperative code. It is Turing complete while staying compact enough to store a full mapping in a single database column.

Tutorial Prerequisites: The Sample Repo

This tutorial is framework-agnostic. To follow along locally, you can create a simple Node.js script:

mkdir schema-normalization-demo && cd schema-normalization-demo
npm init -y
npm install jsonata

Create a run.mjs file that we will reuse for evaluating our expressions:

import jsonata from 'jsonata'
 
export async function normalize(expressionStr, input) {
  const expression = jsonata(expressionStr)
  return await expression.evaluate(input)
}

Info

Alternative: You do not need to clone a repository or run Node to follow this guide. The examples below are self-contained JSONata expressions. You can test them directly in the JSONata Exerciser by pasting the sample input JSON and the expression.

We will work with sample contact payloads from HubSpot and Salesforce and normalize both into a single unified Contact schema:

{
  "id": "string",
  "first_name": "string",
  "last_name": "string",
  "name": "string",
  "email": "string",
  "phone": "string",
  "company_name": "string",
  "created_at": "ISO-8601 string",
  "is_active": "boolean",
  "remote_data": { }
}

Step 1: Normalizing the Response Payload with JSONata

Let us map two completely different CRM contact responses into our unified Contact schema.

Here is what HubSpot returns for a contact list (simplified):

{
  "results": [{
    "id": "851",
    "properties": {
      "firstname": "Ada",
      "lastname": "Lovelace",
      "email": "ada@analytical.engine",
      "phone": "+1-555-0100",
      "createdate": "2026-01-12T10:33:00Z",
      "hs_is_contact": "true"
    },
    "associatedCompany": {
      "properties": {
        "name": "HubSpot"
      }
    }
  }],
  "paging": { "next": { "after": "852" } }
}

And here is the equivalent Salesforce response shape—flat, PascalCase, with a totally different timestamp format and boolean representation:

{
  "records": [{
    "Id": "003xx000004TmiQAAS",
    "FirstName": "Ada",
    "LastName": "Lovelace",
    "Email": "ada@analytical.engine",
    "Phone": "+1-555-0100",
    "CreatedDate": "2026-01-12T10:33:00.000+0000",
    "IsDeleted": false,
    "Account": {
      "Name": "Salesforce Inc"
    }
  }],
  "nextRecordsUrl": "/services/data/v60.0/query/01gxx-2000"
}

The HubSpot JSONata Mapping

The HubSpot response mapping is a single JSONata expression that we store in our database:

results.{
  "id": $string(id),
  "first_name": properties.firstname,
  "last_name": properties.lastname,
  "name": properties.firstname & ' ' & properties.lastname,
  "email": properties.email,
  "phone": properties.phone,
  "company_name": associatedCompany.properties.name,
  "created_at": properties.createdate,
  "is_active": properties.hs_is_contact = "true" ? true : false,
  "remote_data": $
}

The Salesforce JSONata Mapping

The Salesforce mapping looks structurally identical, only the field paths and coercion logic differ:

records.{
  "id": $string(Id),
  "first_name": FirstName,
  "last_name": LastName,
  "name": FirstName & ' ' & LastName,
  "email": Email,
  "phone": Phone,
  "company_name": Account.Name,
  "created_at": CreatedDate,
  "is_active": IsDeleted ? false : true,
  "remote_data": $
}

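Both expressions produce the same unified shape, even though the values themselves (IDs, company names) still come from each vendor. JSONata handles the coercion inline: the $string() function ensures IDs are always strings, and the ternary operators (? :) convert the string "true" in HubSpot and the inverted IsDeleted boolean in Salesforce into a standard is_active boolean. Note that created_at is passed through as-is here, so Salesforce's +0000 offset form differs from HubSpot's Z suffix; strict consumers may want a date conversion like the one in Recipe 4.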
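One JSONata behavior is worth guarding against when consuming these mappings: a path step that matches exactly one item yields that item itself rather than a one-element array, so a page containing a single contact may come back as a bare object. JSONata's array-forcing syntax (for example results[].{ ... }) is one way to address this inside the mapping. A consumer-side guard is another; the sketch below uses a hypothetical helper named asArray.

```javascript
// Hypothetical consumer-side guard: always hand downstream code an array,
// whether the mapping returned nothing, a single object, or a list.
function asArray(value) {
  if (value === undefined || value === null) return []
  return Array.isArray(value) ? value : [value]
}

// Usage: iterate safely over whatever the mapping produced.
const contacts = asArray({ id: '851', first_name: 'Ada' })
for (const contact of contacts) {
  console.log(contact.id)
}
```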

Crucially, the original payload is preserved under remote_data: $ (where $ represents the current context). This means callers who need vendor-specific fields (like custom Salesforce objects or specific HubSpot company associations) can still reach them without requiring an engineering deploy. This pattern—canonical fields plus an escape hatch—is what makes schema normalization survive contact with real enterprise customers.

For a deeper look at syntax patterns and edge cases (nested arrays, recursive lookups, conditional cascades), see our developer tutorial on building JSONata mappings.

Step 2: Translating Query Parameters End-to-End

Schema normalization is bi-directional. Mapping responses is the easy half. The harder half is translating inbound queries. A single unified parameter like ?updated_after=2026-01-01T00:00:00Z&status=active needs to become whatever complex syntax each upstream provider understands.

Salesforce expects a SOQL query. HubSpot expects a JSON payload containing filterGroups sent via a POST request to their search endpoint.

Mapping the Unified Query to HubSpot (Filter Groups)

HubSpot requires a structured JSON body for complex filtering. We use JSONata to map the unified query object into HubSpot's filterGroups array.

(
  $filters := [];
  $filters := query.updated_after ? $append($filters, {
    "propertyName": "lastmodifieddate",
    "operator": "GTE",
    "value": query.updated_after
  }) : $filters;
  
  $filters := query.status = 'active' ? $append($filters, {
    "propertyName": "hs_is_contact",
    "operator": "EQ",
    "value": "true"
  }) : $filters;
 
  {
    "limit": query.limit ? $number(query.limit) : 100,
    "after": query.next_cursor,
    "filterGroups": $count($filters) > 0 ? [
      {
        "filters": $filters
      }
    ] : undefined
  }
)

This expression transforms a simple REST query string into a complex, nested JSON payload required by the vendor's POST search endpoint.

Mapping the Unified Query to Salesforce (SOQL)

Salesforce wants SOQL passed via the q query parameter. We use a JSONata expression to construct the SOQL WHERE clause dynamically based on the presence of unified query parameters.

(
  $conditions := [];
  $conditions := query.updated_after ? $append($conditions, "LastModifiedDate >= " & query.updated_after) : $conditions;
  $conditions := query.status = 'active' ? $append($conditions, "IsDeleted = false") : $conditions;
  
  $whereClause := $count($conditions) > 0 ? " WHERE " & $join($conditions, " AND ") : "";
  
  {
    "q": "SELECT Id, FirstName, LastName, Email, Phone, CreatedDate, LastModifiedDate, IsDeleted, Account.Name FROM Contact" 
         & $whereClause 
         & " ORDER BY LastModifiedDate DESC LIMIT " 
         & (query.limit ? query.limit : "50")
  }
)

Notice what is happening: the unified caller never knows whether the upstream uses SOQL, OData, or a JSON filter object. Two different expressions, both stored as data, both swappable without a deploy. The same pattern handles GraphQL backends like Linear or Monday—the mapping just outputs a GraphQL query string instead of REST query params.
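Because the SOQL mapping concatenates query.updated_after and query.limit directly into the query string, it is worth validating the unified parameters before the expression runs. The following is a minimal sketch under stated assumptions: validateUnifiedQuery is a hypothetical function, the allowed status values and the limit ceiling of 200 are arbitrary placeholders, and the timestamp pattern only accepts UTC values with a Z suffix.

```javascript
// Hypothetical pre-validation step run before the query mapping is evaluated.
// The status whitelist and the limit ceiling are assumptions for illustration.
const ISO_UTC = /^\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d{1,3})?Z$/
const ALLOWED_STATUS = ['active', 'inactive']

function validateUnifiedQuery(query) {
  const clean = {}
  if (query.updated_after !== undefined) {
    const value = String(query.updated_after)
    // Reject anything that is not a plain timestamp, e.g. injected SOQL fragments.
    if (!ISO_UTC.test(value) || Number.isNaN(Date.parse(value))) {
      throw new Error('updated_after must be an ISO 8601 UTC timestamp')
    }
    clean.updated_after = value
  }
  if (query.limit !== undefined) {
    const limit = Number(query.limit)
    if (!Number.isInteger(limit) || limit < 1 || limit > 200) {
      throw new Error('limit must be an integer between 1 and 200')
    }
    clean.limit = limit
  }
  if (query.status !== undefined) {
    if (!ALLOWED_STATUS.includes(query.status)) {
      throw new Error('status is not a recognized value')
    }
    clean.status = query.status
  }
  if (query.next_cursor !== undefined) clean.next_cursor = String(query.next_cursor)
  return clean
}
```

Only the cleaned object is then passed to the mapping as query, so the expression never sees free-form input.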

A quick sketch of the full request lifecycle:

flowchart LR
    A[Unified Request<br/>?updated_after=2026-01-01] --> B[Load JSONata Mapping]
    B --> C{Evaluate Query Expression}
    C -->|HubSpot| D[POST filterGroups body]
    C -->|Salesforce| E[GET q=SOQL]
    D --> F[Raw Vendor Response]
    E --> F
    F --> G[Evaluate Response Expression]
    G --> H[Unified Normalized Response]
    
    style A fill:#f9f,stroke:#333,stroke-width:2px
    style H fill:#bbf,stroke:#333,stroke-width:2px

Step 3: Normalizing API Errors

Handling errors across dozens of APIs is a massive engineering headache. Every vendor has its own theory of what an error response should look like. Salesforce returns an array with errorCode and message. HubSpot returns status, message, and a correlationId. Zendesk returns an error object with title and detail. Some return HTTP 200 OK with an error payload inside the body.

Your retry loop should not need to know any of this. We use an error JSONata expression that evaluates the failing response body and returns a normalized shape:

(
  /* Salesforce style: top-level array of errors */
  $exists(response[0].errorCode) ? {
    "code": response[0].errorCode,
    "message": response[0].message,
    "retryable": response[0].errorCode in ["REQUEST_LIMIT_EXCEEDED", "UNABLE_TO_LOCK_ROW"],
    "reauth_required": response[0].errorCode = "INVALID_SESSION_ID"
  } :
  /* HubSpot style: object with category */
  $exists(response.category) ? {
    "code": response.category,
    "message": response.message,
    "retryable": response.category = "RATE_LIMIT",
    "reauth_required": status = 401
  } : 
  /* Fallback */
  {
    "code": response.error.code ? response.error.code : "unknown",
    "message": response.error.message ? response.error.message : $string(response),
    "retryable": status = 429 or status >= 500,
    "reauth_required": status = 401
  }
)

Now your caller catches the exact same { code, message, retryable, reauth_required } shape regardless of the upstream provider. For more on the chaos of vendor error responses, see 404 reasons third-party APIs can't get their errors straight.
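On the caller side, the normalized shape lets one small policy function cover every provider. This is a sketch, not a prescribed policy: nextAction, the action names, and the default of three attempts are all hypothetical.

```javascript
// Hypothetical policy function consuming the normalized error shape
// { code, message, retryable, reauth_required }. Action names are illustrative.
function nextAction(err, attempt, maxAttempts = 3) {
  // Credentials problems cannot be fixed by retrying the same request.
  if (err.reauth_required) return 'reauthenticate'
  if (err.retryable && attempt < maxAttempts) return 'retry'
  return 'fail'
}

// Usage with an error that came out of the mapping above.
const action = nextAction(
  { code: 'REQUEST_LIMIT_EXCEEDED', message: 'limit hit', retryable: true, reauth_required: false },
  0
)
console.log(action)
```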

Step 4: Normalizing Rate Limits (The Transparent Way)

This is where most unified API tutorials lie. They claim the platform "handles rate limits for you" and then quietly serialize all your requests behind a token bucket, which is the wrong default for high-throughput callers.

Rate limiting behavior is highly variable. Some vendors send X-RateLimit-Remaining, others send RateLimit-Remaining, and some put rate-limit data in the response body. A more honest pattern, and the one Truto uses, is to normalize the rate-limit signal but pass the 429 error straight through to the caller.

Truto translates whatever proprietary rate-limit headers the upstream returns into standardized headers per the IETF draft specification:

ratelimit-limit: The total request allowance.
ratelimit-remaining: The number of requests left.
ratelimit-reset: The number of seconds until the limit resets.
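To make the translation concrete, here is a rough sketch of what such a header normalizer could look like. It is not Truto's implementation: normalizeRateLimitHeaders is a hypothetical name, the X-RateLimit-* and Retry-After inputs are common conventions used as assumptions, and the epoch-versus-delta check is a heuristic.

```javascript
// Hypothetical normalizer. The vendor header names are common conventions,
// used here as assumptions rather than a catalogue of any specific API.
function normalizeRateLimitHeaders(headers, nowSeconds = Math.floor(Date.now() / 1000)) {
  // Lower-case the keys so lookups are case-insensitive, as HTTP header names are.
  const h = {}
  for (const [key, value] of Object.entries(headers)) h[key.toLowerCase()] = String(value)

  // Return the first header from the list that is present.
  const pick = (...names) => names.map(n => h[n]).find(v => v !== undefined)
  const out = {}

  const limit = pick('ratelimit-limit', 'x-ratelimit-limit')
  const remaining = pick('ratelimit-remaining', 'x-ratelimit-remaining')
  if (limit !== undefined) out['ratelimit-limit'] = limit
  if (remaining !== undefined) out['ratelimit-remaining'] = remaining

  const reset = pick('ratelimit-reset', 'x-ratelimit-reset', 'retry-after')
  const n = Number(reset)
  if (reset !== undefined && Number.isFinite(n)) {
    // Heuristic: very large values are Unix epoch seconds, small ones are already deltas.
    const delta = n > 1e9 ? Math.max(0, Math.ceil(n - nowSeconds)) : n
    out['ratelimit-reset'] = String(delta)
  }
  return out
}
```

Values that cannot be parsed as numbers (such as a Retry-After HTTP date) are simply omitted in this sketch rather than guessed at.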

When the upstream returns HTTP 429, Truto does not automatically retry, throttle, or apply backoff. It passes that error directly to the caller. Retry, backoff, and circuit-breaking belong in the caller—because only the caller knows whether this request is a critical user action that should retry immediately or a background sync that should defer for an hour. Hiding 429s behind opaque middleware retries is how you end up with mysterious 30-second latency spikes in production.

Here is a minimal client-side retry loop using the normalized headers:

async function callWithBackoff(fn, attempt = 0) {
  const res = await fn()
  // Return any non-429 response, or give up after 5 retries
  if (res.status !== 429 || attempt >= 5) return res
  
  // Read the normalized IETF header (seconds); fall back to 1s if missing or malformed
  const parsed = Number(res.headers.get('ratelimit-reset'))
  const reset = Number.isFinite(parsed) && parsed > 0 ? parsed : 1
  
  // Exponential backoff combined with the upstream reset window
  await new Promise(r => setTimeout(r, reset * 1000 * Math.pow(2, attempt)))
  
  return callWithBackoff(fn, attempt + 1)
}

For a broader treatment of rate-limit strategy across many upstreams, see best practices for handling API rate limits across multiple third-party APIs.

Step 5: Handling API Breaking Changes Without Code Deploys

When you integrate with a single third-party API, a breaking change is an annoyance. When you integrate with fifty, breaking changes become a weekly event. Third-party providers rename fields, restructure response envelopes, change pagination formats, and switch date representations - often on their own schedule with minimal advance notice.

This is the API versioning problem from the consumer's perspective. Standard API versioning advice assumes you control the API. When you are the consumer of dozens of APIs that each version independently, you control nothing. The question is not whether a breaking change will hit your integration layer - it is how fast you can absorb it without disrupting your customers.

With hardcoded adapters, each breaking change requires a code patch, a pull request, CI, and a production deploy - for each affected integration, on each affected customer's timeline. With declarative JSONata mappings stored as configuration data, absorbing a breaking change means updating one expression. The change takes effect immediately. No deploys, no risk to unrelated integrations.

The recipes below cover the four most common breaking-change patterns seen across SaaS APIs. Each recipe includes the provider payload before and after the change, the mapping that breaks, the fixed expression, and test cases you can paste directly into the JSONata Exerciser.

Recipe 1: Field Renames and Casing Changes

The most frequent breaking change in practice. A provider ships a new API version that renames firstname to first_name, or switches from camelCase to snake_case across the board.

Old provider payload (v1):

{
  "id": "c-401",
  "firstname": "Ada",
  "lastname": "Lovelace",
  "emailAddress": "ada@analytical.engine"
}

New provider payload (v2):

{
  "id": "c-401",
  "first_name": "Ada",
  "last_name": "Lovelace",
  "email_address": "ada@analytical.engine"
}

The mapping that breaks - it only handles v1 field names (the engine is assumed to expose the provider payload as response in the evaluation context):

{
  "id": response.id,
  "first_name": response.firstname,
  "last_name": response.lastname,
  "email": response.emailAddress,
  "remote_data": response
}

When the provider ships v2, response.firstname resolves to undefined. Your contacts sync silently drops every name field.

The fixed mapping - handles both versions using fallback chains:

{
  "id": response.id,
  "first_name": response.first_name ? response.first_name : response.firstname,
  "last_name": response.last_name ? response.last_name : response.lastname,
  "email": response.email_address ? response.email_address : response.emailAddress,
  "remote_data": response
}

The ternary fallback (new_field ? new_field : old_field) means this single expression works against both v1 and v2 payloads simultaneously. Customers still on the old API version keep working. Customers on the new version also work. Remove the fallback branch later once the old version is fully deprecated.

Tip

Watch out for falsy values. The ternary pattern treats false, 0, "", and null as falsy, which means it would fall through to the old field name even if the new field exists. For boolean or numeric fields, use $exists(response.new_field) ? response.new_field : response.old_field instead.

Test cases:

Input	Expected `first_name`	Expected `email`
`{ "firstname": "Ada", "emailAddress": "ada@a.e" }`	`"Ada"`	`"ada@a.e"`
`{ "first_name": "Ada", "email_address": "ada@a.e" }`	`"Ada"`	`"ada@a.e"`
`{ "first_name": "Ada", "firstname": "STALE" }`	`"Ada"` (new field wins)	`undefined`

The third case validates that when both fields are present during a transition period, the new field takes precedence.

Recipe 2: Nesting and Wrapper Changes

Providers frequently restructure their response envelope. The most common pattern: results that lived at the top level get wrapped inside a data object, or pagination metadata moves to a separate meta block.

Old payload (flat):

{
  "results": [
    { "id": "1", "name": "Acme Corp" }
  ],
  "total": 1,
  "next_page": "abc123"
}

New payload (wrapped in data with separate meta):

{
  "data": {
    "results": [
      { "id": "1", "name": "Acme Corp" }
    ],
    "total": 1
  },
  "meta": {
    "next_page": "abc123"
  }
}

The mapping that breaks:

{
  "items": results.{ "id": id, "name": name },
  "next_cursor": next_page,
  "remote_data": $
}

After the change, results is undefined at the top level because it moved under data.

The fixed mapping - detects the envelope shape dynamically:

(
  $results := data.results ? data.results : results;
  $cursor := meta.next_page ? meta.next_page : next_page;
  {
    "items": $results.{ "id": id, "name": name },
    "next_cursor": $cursor,
    "remote_data": $
  }
)

The expression checks for data.results first (new shape). If it exists, use it. Otherwise, fall back to the top-level results (old shape). The same logic applies to the cursor field that moved under meta. The reverse scenario - unwrapping a payload that became flat - uses the same pattern with flipped fallback order.

Test cases:

Input shape	Expected `items[0].name`	Expected `next_cursor`
Flat (`results` at root, `next_page` at root)	`"Acme Corp"`	`"abc123"`
Wrapped (`data.results`, `meta.next_page`)	`"Acme Corp"`	`"abc123"`
Mixed (`data.results` exists, `next_page` at root)	`"Acme Corp"`	`"abc123"`

Recipe 3: Pagination and Cursor Format Changes

Cursor format changes are particularly dangerous because they silently corrupt pagination. A provider might rename the cursor field, move it to a new location, or switch from an opaque string to a base64-encoded JSON object.

Old cursor format (field at root):

{
  "results": [{ "id": "1" }, { "id": "2" }],
  "next_cursor": "page2token"
}

New cursor format (nested under pagination, renamed to cursor):

{
  "results": [{ "id": "1" }, { "id": "2" }],
  "pagination": {
    "cursor": "eyJvZmZzZXQiOiAyMH0="
  }
}

That base64 value decodes to {"offset": 20}, but your engine should never care. Cursors are opaque tokens - your mapping extracts and forwards them without parsing their contents.

The mapping that breaks:

{
  "items": results,
  "next_cursor": next_cursor
}

After the change, next_cursor is undefined and your sync stops after the first page.

The fixed response mapping:

{
  "items": results,
  "next_cursor": pagination.cursor ? pagination.cursor : next_cursor
}

If the provider also changed the query parameter name it expects the cursor to be sent back as (e.g., from ?next_cursor= to ?page[cursor]=), update the query mapping as well:

Old query mapping:

{ "next_cursor": query.next_cursor }

Fixed query mapping:

{ "page[cursor]": query.next_cursor }

Two config changes. No code deploy. Both old-format and new-format callers can be supported simultaneously if needed.
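Treating the cursor as opaque also keeps the sync loop trivial. The sketch below assumes a caller-supplied fetchPage function (hypothetical) that returns the unified shape { items, next_cursor }; the maxPages safety cap is likewise an assumption.

```javascript
// Hypothetical sync loop. fetchPage is an assumed caller-supplied async function
// that returns the unified shape { items, next_cursor } for a given cursor.
async function collectAll(fetchPage, maxPages = 1000) {
  const all = []
  let cursor // undefined on the first request
  for (let page = 0; page < maxPages; page++) {
    const { items = [], next_cursor } = await fetchPage(cursor)
    all.push(...items)
    // Stop when the mapping yields no cursor (last page) or the upstream repeats one.
    if (next_cursor === undefined || next_cursor === null || next_cursor === cursor) break
    cursor = next_cursor // forwarded verbatim, never decoded
  }
  return all
}
```

Because the loop never inspects the token, a switch from "page2token" to a base64 value requires no change here.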

Test cases:

Cursor field location	Input	Expected `next_cursor`
`next_cursor` at root	`{ "next_cursor": "page2token" }`	`"page2token"`
`pagination.cursor`	`{ "pagination": { "cursor": "eyJvZmZzZXQiOiAyMH0=" } }`	`"eyJvZmZzZXQiOiAyMH0="`
Neither present (last page)	`{}`	`undefined`

Recipe 4: Date and Time Format Conversions

Date format changes are the second most common breaking change. Providers switch between ISO 8601 strings and Unix timestamps, change timezone offset handling, or alter precision (seconds vs. milliseconds).

Old payload (ISO 8601):

{
  "id": "e-100",
  "created_at": "2026-01-12T10:33:00Z",
  "updated_at": "2026-06-15T14:00:00.000+0000"
}

New payload (Unix timestamps in seconds):

{
  "id": "e-100",
  "created_at": 1768213980,
  "updated_at": 1781532000
}

The mapping that breaks - assumes ISO strings:

{
  "id": response.id,
  "created_at": response.created_at,
  "updated_at": response.updated_at,
  "remote_data": response
}

This technically still runs without errors, but your downstream code expects ISO strings. Date comparisons, sorting, and display all break.

The fixed mapping - detects the type and converts:

(
  $toISO := function($v) {
    $type($v) = "number"
      ? $fromMillis($v < 10000000000 ? $v * 1000 : $v)
      : $v
  };
  {
    "id": response.id,
    "created_at": $toISO(response.created_at),
    "updated_at": $toISO(response.updated_at),
    "remote_data": response
  }
)

The $toISO helper checks $type(). If the value is a number, it converts from Unix to ISO using the built-in $fromMillis(). The $v < 10000000000 guard distinguishes seconds (10 digits) from milliseconds (13 digits) - Unix timestamps in seconds are under 10 billion until the year 2286. If the value is already a string, it passes through unchanged.
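When preparing fixtures, it can help to cross-check expected values outside JSONata. The function below is a plain JavaScript mirror of the same heuristic, offered as a convenience sketch; toISO is a hypothetical name and is not part of the mapping engine.

```javascript
// JavaScript mirror of the $toISO helper, for cross-checking fixture values.
function toISO(value) {
  // Strings (already ISO) and missing values pass through unchanged.
  if (typeof value !== 'number') return value
  // Values under 10 billion are treated as seconds, larger ones as milliseconds.
  const millis = value < 10000000000 ? value * 1000 : value
  return new Date(millis).toISOString()
}

console.log(toISO(1768213980))
console.log(toISO(1768213980000))
```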

Test cases:

Input `created_at`	Expected output
`"2026-01-12T10:33:00Z"`	`"2026-01-12T10:33:00Z"` (pass-through)
`1768213980` (seconds)	`"2026-01-12T10:33:00.000Z"`
`1768213980000` (milliseconds)	`"2026-01-12T10:33:00.000Z"`

Testing Your Mapping Recipes

Every recipe above can be validated without deploying anything. JSONata expressions are pure functions - same input always produces the same output. Build a test harness that runs each expression against known fixtures:

import jsonata from 'jsonata'
 
const fixtures = [
  {
    name: 'v1 payload - field rename recipe',
    expression: `{
      "first_name": response.first_name ? response.first_name : response.firstname,
      "remote_data": response
    }`,
    input: { response: { firstname: 'Ada' } },
    expected: { first_name: 'Ada', remote_data: { firstname: 'Ada' } }
  },
  {
    name: 'v2 payload - field rename recipe',
    expression: `{
      "first_name": response.first_name ? response.first_name : response.firstname,
      "remote_data": response
    }`,
    input: { response: { first_name: 'Ada' } },
    expected: { first_name: 'Ada', remote_data: { first_name: 'Ada' } }
  },
  {
    name: 'Unix timestamp - date recipe',
    expression: `(
      $toISO := function($v) {
        $type($v) = "number" ? $fromMillis($v < 10000000000 ? $v * 1000 : $v) : $v
      };
      { "created_at": $toISO(response.created_at) }
    )`,
    input: { response: { created_at: 1768213980 } },
    expected: { created_at: '2026-01-12T10:33:00.000Z' }
  }
]
 
for (const fixture of fixtures) {
  const result = await jsonata(fixture.expression).evaluate(fixture.input)
  const pass = JSON.stringify(result) === JSON.stringify(fixture.expected)
  console.log(`${pass ? '✓' : '✗'} ${fixture.name}`)
}

Three practices that prevent bad mappings from reaching production:

Test both old and new payloads. Every recipe must pass against both the pre-change and post-change response shape. This is your regression safety net during the provider's migration window.
Test the empty case. What happens when a field is null, undefined, or missing entirely? Your mapping should produce null or undefined in the output - never throw.
Test remote_data preservation. Verify that the original payload is always preserved under remote_data, untouched. This is your escape hatch when the mapping does not yet cover a field that moved.
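These practices are easier to enforce when the harness counts failures, treats a thrown error as a failed fixture, and compares results structurally rather than by JSON.stringify key order. The following is one possible sketch: runFixtures and deepEqual are hypothetical helpers, and evaluate is an assumed async (expression, input) function, for example a thin wrapper around jsonata(expression).evaluate(input).

```javascript
// Key-order-insensitive structural equality for plain JSON values.
function deepEqual(a, b) {
  if (a === b) return true
  if (typeof a !== 'object' || typeof b !== 'object' || a === null || b === null) return false
  if (Array.isArray(a) !== Array.isArray(b)) return false
  const keysA = Object.keys(a)
  const keysB = Object.keys(b)
  if (keysA.length !== keysB.length) return false
  return keysA.every(k => Object.prototype.hasOwnProperty.call(b, k) && deepEqual(a[k], b[k]))
}

// evaluate is an assumed async (expression, input) => result function.
async function runFixtures(fixtures, evaluate) {
  let failures = 0
  for (const fixture of fixtures) {
    let pass = false
    try {
      pass = deepEqual(await evaluate(fixture.expression, fixture.input), fixture.expected)
    } catch (err) {
      pass = false // a mapping that throws counts as a failure
    }
    if (!pass) failures++
    console.log(`${pass ? '✓' : '✗'} ${fixture.name}`)
  }
  return failures
}
```

In CI, the returned count can drive the exit status, for instance by setting process.exitCode to 1 when it is greater than zero.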

Applying Mapping Patches Safely in Production

With declarative mappings stored as configuration, the deployment workflow for a breaking change looks fundamentally different from code-first approaches.

Code-first workflow:

Discover the break (usually from a customer alert or failed sync)
Identify the affected adapter file
Write a code fix, open a PR, wait for review
Run CI, merge, deploy to production
Hope you did not break other integrations in the same release

Declarative mapping workflow:

Discover the break
Write the fixed JSONata expression
Test it against old and new payloads in the JSONata Exerciser
Update the mapping config (one database row)
The fix is live immediately for all affected accounts

Truto's 3-level override hierarchy makes this even safer. You do not have to update the global mapping right away. Instead, roll out incrementally:

Test on one account first. Apply the fixed mapping as an account-level override on a single affected customer. Verify the fix works against real traffic.
Promote to environment level. Once validated, apply the mapping at the environment level so all accounts using that integration in that environment pick it up.
Promote to platform level. Update the base platform mapping. At this point, you can also clean up the fallback branches for deprecated field names if the provider has fully sunset the old version.

If the fix causes an unexpected issue at any level, roll it back by removing the override. The previous mapping takes effect instantly. No revert commits, no emergency deploys.

This workflow means a breaking change that used to be a multi-day fire drill - discovery, triage, code, review, deploy, monitor - becomes a configuration update with zero deployment risk. For the full details on override mechanics, see our guide on per-customer API mappings.

Why Declarative Mapping Beats Code-First Integration

Treating API integration as a data problem rather than a code problem provides massive architectural advantages. By removing integration-specific code from your repository, you eliminate the deployment bottleneck. The payoff is operational, not aesthetic.

Concern	Code-First Adapter	Declarative Mapping
Add a new vendor	New file, PR, CI, deploy	Insert a row of config
Custom field for one customer	Branch in shared code	Per-account override
Fix a date parsing bug	Affects only one adapter	Fix the engine, all integrations benefit
Non-engineer can edit	No	Yes (solutions engineers, PMs)
Rollback a bad mapping	Revert + redeploy	Update one row

When a customer requests support for a custom field in their specific Salesforce instance, you do not need to alter your core codebase. Truto utilizes a 3-Level Override Hierarchy to handle these edge cases:

Platform Level: The baseline JSONata mapping that applies to all customers.
Environment Level: Mappings customized for a specific staging or production environment.
Account Level: Mappings customized for a single, specific integrated account.

If one enterprise customer needs a custom industry_vertical field mapped from their CRM (a scenario we detail in our guide to mapping custom objects), a product manager or solutions engineer can update that specific account's JSONata mapping via the API or dashboard. The change takes effect immediately. No pull requests. No CI/CD pipelines. No risk of breaking other customers. Read more about this in our guide on per-customer API mappings.

Customizing Unified API Data Models Per Customer Without Code

Short answer: Store your mapping as JSONata configuration in three layered rows - platform, environment, account - and let a generic engine deep-merge them at request time. A customer's custom field, a per-environment sandbox quirk, or a polymorphic endpoint route all become one-line config edits instead of code deploys.

This is the pattern that turns unified APIs from a demo trick into something enterprise customers actually adopt. Every large customer eventually asks for a custom field, a renamed enum value, or a different vendor endpoint. If those requests need engineering time, your integration platform becomes a bottleneck. If they resolve to configuration edits by a solutions engineer or PM, the platform scales.

The three levels apply in order, with each layer deep-merged on top of the previous:

flowchart TB
    A["Platform Base Mapping<br/>default for every customer"] --> B["Environment Override<br/>applies to one environment"]
    B --> C["Account Override<br/>applies to one integrated account"]
    C --> D["Resolved Mapping"]
    D --> E["Runtime Engine Evaluates JSONata"]

Two rules govern the merge:

Objects merge key-by-key. Add a single new response field in the override; every other field in the base mapping is preserved.
Arrays overwrite entirely. If you override a before or related_resources array, you must repeat every entry from the base that you want to keep.
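The two merge rules can be sketched in a few lines of Python. This is a minimal illustration, not Truto's engine: the function names `deep_merge` and `resolve_mapping`, and the `pagination` keys in the sample rows, are assumptions made up for the example.

```python
from copy import deepcopy
from typing import Any, Optional


def deep_merge(base: Any, override: Any) -> Any:
    """Merge override onto base: dicts merge key-by-key, everything else replaces."""
    if isinstance(base, dict) and isinstance(override, dict):
        merged = dict(base)
        for key, value in override.items():
            merged[key] = deep_merge(base[key], value) if key in base else deepcopy(value)
        return merged
    # Lists, strings (JSONata expressions), and scalars: the override wins wholesale.
    return deepcopy(override)


def resolve_mapping(
    platform: dict,
    environment: Optional[dict] = None,
    account: Optional[dict] = None,
) -> dict:
    """Apply the layers in order: platform, then environment, then account."""
    resolved = deepcopy(platform)
    for layer in (environment, account):
        if layer:  # a missing row simply contributes nothing
            resolved = deep_merge(resolved, layer)
    return resolved


# Hypothetical rows; only "before" and "pagination" are overridden.
platform_row = {
    "resource": "contacts",
    "method": "list",
    "before": ["step1", "step2"],
    "pagination": {"type": "cursor", "page_size": 50},
}
account_row = {
    "before": ["step3"],
    "pagination": {"page_size": 200},
}
resolved = resolve_mapping(platform_row, account=account_row)
```

Here `resolved["pagination"]` keeps `type` from the base and takes `page_size` from the override, while `resolved["before"]` is just `["step3"]`, which is the array rule in action.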

Complete JSONata Example: Base Mapping

Here is a base mapping for a Salesforce Contact resource stored at the platform level. It covers every customer that installs the Salesforce integration on the CRM unified model. The mapping row in the database holds four JSONata expressions plus a resource name:

Resource and method:

{
  "resource": "contacts",
  "method": "list"
}

Query mapping (unified filters into a SOQL string):

(
  $conditions := [];
  $conditions := query.updated_after
    ? $append($conditions, "LastModifiedDate >= " & query.updated_after)
    : $conditions;
  {
    "q": "SELECT Id, FirstName, LastName, Email, Phone, AccountId, CreatedDate FROM Contact"
         & ($count($conditions) > 0 ? " WHERE " & $join($conditions, " AND ") : "")
         & " LIMIT " & (query.limit ? query.limit : "50")
  }
)
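For readers more at home in a general-purpose language, here is a rough Python equivalent of the query mapping above. It is a sketch for comparison only: `build_soql` is a hypothetical name, and it assumes the unified filter values have already been validated before they are interpolated into the SOQL string.

```python
from typing import Any, Dict, List

BASE_SELECT = (
    "SELECT Id, FirstName, LastName, Email, Phone, AccountId, CreatedDate FROM Contact"
)


def build_soql(query: Dict[str, Any]) -> Dict[str, str]:
    """Translate unified filters into the provider query, mirroring the JSONata."""
    conditions: List[str] = []
    if query.get("updated_after"):
        # SOQL datetime literals are unquoted, so the value is appended as-is.
        conditions.append(f"LastModifiedDate >= {query['updated_after']}")

    soql = BASE_SELECT
    if conditions:
        soql += " WHERE " + " AND ".join(conditions)
    # Same default as the mapping: 50 when no limit is supplied.
    soql += f" LIMIT {query.get('limit') or 50}"
    return {"q": soql}


default_query = build_soql({})
filtered_query = build_soql({"updated_after": "2026-01-01T00:00:00Z", "limit": 10})
```

With no filters the result ends in `FROM Contact LIMIT 50`; with both filters it ends in `WHERE LastModifiedDate >= 2026-01-01T00:00:00Z LIMIT 10`.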

Response mapping (Salesforce fields into unified fields):

records.{
  "id": $string(Id),
  "first_name": FirstName,
  "last_name": LastName,
  "name": $join([FirstName, LastName], ' '),
  "email": Email,
  "phone": Phone,
  "account_id": AccountId,
  "created_at": CreatedDate,
  "remote_data": $
}

Given a raw Salesforce response like:

{
  "records": [
    {
      "Id": "003xx000004TmiQAAS",
      "FirstName": "Ada",
      "LastName": "Lovelace",
      "Email": "ada@analytical.engine",
      "Phone": "+1-555-0100",
      "AccountId": "001xx0000001AAA",
      "CreatedDate": "2026-01-12T10:33:00.000+0000"
    }
  ]
}

The engine emits the unified shape:

{
  "result": [
    {
      "id": "003xx000004TmiQAAS",
      "first_name": "Ada",
      "last_name": "Lovelace",
      "name": "Ada Lovelace",
      "email": "ada@analytical.engine",
      "phone": "+1-555-0100",
      "account_id": "001xx0000001AAA",
      "created_at": "2026-01-12T10:33:00.000+0000",
      "remote_data": { "Id": "003xx000004TmiQAAS", "...": "..." }
    }
  ]
}

This is one database row - not a code file. Every customer using the Salesforce integration on the CRM model gets this mapping by default.

Environment Override: Example and Effect

Say your Staging environment points at a Salesforce sandbox that runs an older API version. The sandbox returns a LegacyPhone field instead of Phone, and the SOQL query needs an extra ArchivedDate = null predicate so archived contacts are hidden. You want this behavior for Staging only - Production should keep using the base mapping untouched.

Store an environment-level override with only the two fields that need to change:

Environment override query mapping:

(
  $conditions := ["ArchivedDate = null"];
  $conditions := query.updated_after
    ? $append($conditions, "LastModifiedDate >= " & query.updated_after)
    : $conditions;
  {
    "q": "SELECT Id, FirstName, LastName, Email, LegacyPhone, AccountId, CreatedDate FROM Contact"
         & " WHERE " & $join($conditions, " AND ")
         & " LIMIT " & (query.limit ? query.limit : "50")
  }
)

Environment override response mapping:

records.{
  "id": $string(Id),
  "first_name": FirstName,
  "last_name": LastName,
  "name": $join([FirstName, LastName], ' '),
  "email": Email,
  "phone": LegacyPhone,
  "account_id": AccountId,
  "created_at": CreatedDate,
  "remote_data": $
}

Effect: In Staging, the engine resolves the mapping by merging the environment row on top of the base. query_mapping and response_mapping are strings, so the environment version fully replaces those two expressions; every other field on the mapping row (resource name, method, pagination config, before/after steps) is inherited from the base. In Production, the environment row does not exist, so the base mapping runs unchanged.

Rolling this back is a single database update - remove the override row and Staging falls back to the base. Environment overrides work well for sandbox-vs-production API differences, dedicated regional endpoints (EU vs US instances), and piloting a mapping revision on a staging environment before promoting it globally.

Account Override: Example Resolving Differing Field Names

Account-level overrides handle the messiest case: a single customer's specific instance of a SaaS platform uses custom fields, renamed picklist values, or non-standard treatment of standard fields. The classic scenario: two Salesforce customers, both on the same base mapping, but one uses a custom field Industry_Vertical__c and the other uses Vertical_Name__c for the same concept.

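Each account gets its own override stored on the integrated account record. Because SOQL returns only the fields named in the SELECT list, each override must also add its custom field to the query mapping's SELECT; only the response mappings are shown here.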

Customer A account override (uses Industry_Vertical__c):

records.{
  "id": $string(Id),
  "first_name": FirstName,
  "last_name": LastName,
  "name": $join([FirstName, LastName], ' '),
  "email": Email,
  "phone": Phone,
  "account_id": AccountId,
  "created_at": CreatedDate,
  "industry_vertical": Industry_Vertical__c,
  "remote_data": $
}

Customer B account override (uses Vertical_Name__c):

records.{
  "id": $string(Id),
  "first_name": FirstName,
  "last_name": LastName,
  "name": $join([FirstName, LastName], ' '),
  "email": Email,
  "phone": Phone,
  "account_id": AccountId,
  "created_at": CreatedDate,
  "industry_vertical": Vertical_Name__c,
  "remote_data": $
}

Both overrides produce the same unified industry_vertical field. Downstream code never branches on which Salesforce instance produced the record. Neither override affects the other customer's data or the platform base mapping.

Effect: Only Customer A and Customer B receive the enriched response with industry_vertical populated. Every other Salesforce account continues receiving the base mapping's output. Add a third customer with yet another field name (Sector__c) and it is another config row - no code, no deploy, no CI.

A cleaner pattern once you have three or more field-name variants: fold the fallback chain into the base mapping itself and stop shipping account overrides for every new customer.

Base mapping with fallback (preferred at scale):

records.{
  "id": $string(Id),
  "first_name": FirstName,
  "last_name": LastName,
  "name": $join([FirstName, LastName], ' '),
  "email": Email,
  "phone": Phone,
  "account_id": AccountId,
  "created_at": CreatedDate,
  "industry_vertical": $firstNonEmpty(
    Industry_Vertical__c,
    Vertical_Name__c,
    Sector__c
  ),
  "remote_data": $
}

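$firstNonEmpty picks the first non-empty value from the list. It is a custom function registered by the engine, not part of the standard JSONata function library, so a stock JSONata runtime needs it bound before this expression will evaluate. Adding a fourth field name for a fourth customer becomes a one-token edit to the base mapping - no override needed.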
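If you are building your own engine, the helper is small. The sketch below shows one plausible set of semantics in Python; the exact definition of "empty" (None, blank string, empty list or object) is an assumption for illustration, not a statement of how any particular engine implements it.

```python
from typing import Any, Optional


def is_empty(value: Any) -> bool:
    """Assumed definition of empty: None, blank string, or empty list/dict."""
    if value is None:
        return True
    if isinstance(value, str):
        return value.strip() == ""
    if isinstance(value, (list, dict)):
        return len(value) == 0
    return False  # 0 and False are real values, not "empty"


def first_non_empty(*candidates: Any) -> Optional[Any]:
    """Return the first candidate that is not empty, or None if all are."""
    for candidate in candidates:
        if not is_empty(candidate):
            return candidate
    return None


# A record from an org that uses Vertical_Name__c for the concept.
record = {"Vertical_Name__c": "Fintech", "Sector__c": ""}
industry_vertical = first_non_empty(
    record.get("Industry_Vertical__c"),
    record.get("Vertical_Name__c"),
    record.get("Sector__c"),
)
```

Treating 0 and False as real values is a deliberate choice here, so the helper stays safe for numeric and boolean fields.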

Tip

When to promote an account override to the base. If three or more accounts have overrides doing similar work, promote the pattern into the base mapping (usually as a $firstNonEmpty or $mapValues call). Account overrides should hold only logic unique to that account, not variations that several customers share.

Polymorphic Endpoint Example: NetSuite Contacts

NetSuite is the acid test for polymorphic mapping. A single unified "contact" in NetSuite is not one record type - it can be either a Vendor or a Customer, and the two live in completely different SuiteQL tables with different fields, subsidiary relationships, and currency handling. The unified model has to route to the right endpoint based on a request parameter, then normalize two structurally different responses into a single shape.

This is where the declarative pattern earns its keep. Handling this in code would mean an if (contactType === 'vendor') branch that grows every time NetSuite ships a new record variant.

Instead, the mapping uses dynamic resource resolution - the resource field is an array the engine matches against the request's query parameters:

Resource routing (base mapping):

{
  "resource": [
    { "resource": "vendor", "query_param": "contact_type", "query_param_value": "vendor" },
    { "resource": "customer", "query_param": "contact_type", "query_param_value": "customer" }
  ],
  "method": "list"
}

A call with ?contact_type=vendor hits NetSuite's vendor SuiteQL table; ?contact_type=customer hits the customer table. The engine handles the routing generically - no NetSuite-specific code exists.
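The generic routing step might look like the following Python sketch. The function name `resolve_resource` is hypothetical, and raising when nothing matches is an assumption; a real engine could just as well fall back to a default entry.

```python
from typing import Any, Dict, List, Union

ResourceSpec = Union[str, List[Dict[str, str]]]

ROUTES: ResourceSpec = [
    {"resource": "vendor", "query_param": "contact_type", "query_param_value": "vendor"},
    {"resource": "customer", "query_param": "contact_type", "query_param_value": "customer"},
]


def resolve_resource(resource: ResourceSpec, query: Dict[str, Any]) -> str:
    """Pick the provider resource for a request from a static or routed spec."""
    if isinstance(resource, str):
        return resource  # plain mappings such as "contacts" need no routing
    for entry in resource:
        if query.get(entry["query_param"]) == entry["query_param_value"]:
            return entry["resource"]
    # Assumed behavior: fail loudly instead of guessing a table.
    raise LookupError(f"no resource entry matches query parameters {query!r}")


vendor_resource = resolve_resource(ROUTES, {"contact_type": "vendor"})
customer_resource = resolve_resource(ROUTES, {"contact_type": "customer"})
```

Nothing in the function mentions NetSuite; adding a new variant is a new entry in the list, not a new branch.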

Query mapping (base) - context-aware SuiteQL builder:

(
  $entity := query.contact_type = "vendor" ? "vendor" : "customer";
  $subsidiaryJoin := context.multi_subsidiary = "true"
    ? " LEFT JOIN entitysubsidiary es ON es.entity = e.id"
    : "";
  $currencyJoin := context.multi_currency = "true"
    ? " LEFT JOIN currency c ON c.id = e.currency"
    : "";
  $currencySelect := context.multi_currency = "true"
    ? ", c.symbol as currency_code"
    : ", 'USD' as currency_code";
  $where := query.updated_at
    ? " WHERE e.lastmodifieddate >= '" & query.updated_at & "'"
    : "";
  {
    "q": "SELECT e.id, e.entityid, e.email, e.phone, e.altphone" & $currencySelect
         & " FROM " & $entity & " e"
         & $subsidiaryJoin
         & $currencyJoin
         & $where
  }
)

Response mapping (base) - unifies vendor and customer into one shape:

response.{
  "id": $string(id),
  "name": entityid,
  "email": email,
  "phone": $firstNonEmpty(phone, altphone),
  "currency_code": currency_code,
  "contact_type": %.query.contact_type,
  "remote_data": $
}

Two things are happening:

The resource array routes the request. The engine picks the matching entry, so ?contact_type=vendor and ?contact_type=customer produce different SuiteQL queries with zero branching in the engine.
The query and response mappings adapt using context. NetSuite accounts can have multi-subsidiary and multi-currency enabled or disabled. The mapping inspects context.multi_subsidiary and context.multi_currency (set during account install) and conditionally adjusts the SuiteQL JOINs and default values - USD when multi-currency is off, a proper JOIN to currency when it is on.
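To make the context-driven branches concrete, here is the same SuiteQL builder transcribed into Python. It is only a reading aid: `build_suiteql` is a hypothetical name, and the flags are compared against the string "true" because that is how the mapping above reads them.

```python
from typing import Any, Dict


def build_suiteql(query: Dict[str, Any], context: Dict[str, Any]) -> Dict[str, str]:
    """Assemble the SuiteQL statement from the request and the account context."""
    entity = "vendor" if query.get("contact_type") == "vendor" else "customer"
    multi_subsidiary = context.get("multi_subsidiary") == "true"
    multi_currency = context.get("multi_currency") == "true"

    sql = "SELECT e.id, e.entityid, e.email, e.phone, e.altphone"
    # Without multi-currency there is no currency table to join, so default to USD.
    sql += ", c.symbol as currency_code" if multi_currency else ", 'USD' as currency_code"
    sql += f" FROM {entity} e"
    if multi_subsidiary:
        sql += " LEFT JOIN entitysubsidiary es ON es.entity = e.id"
    if multi_currency:
        sql += " LEFT JOIN currency c ON c.id = e.currency"
    if query.get("updated_at"):
        sql += f" WHERE e.lastmodifieddate >= '{query['updated_at']}'"
    return {"q": sql}


simple = build_suiteql({"contact_type": "vendor"}, {"multi_currency": "false"})
full = build_suiteql(
    {"contact_type": "customer", "updated_at": "2026-01-01"},
    {"multi_subsidiary": "true", "multi_currency": "true"},
)
```

The first call yields a plain `FROM vendor e` query with the `'USD'` default; the second adds both JOINs and the WHERE clause against the customer table.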

Now suppose one enterprise customer categorizes vendors with a nonstandard field custentity_supplier_grade in their specific NetSuite instance. Add an account-level override for that single account:

Account override response mapping (Customer C only):

response.{
  "id": $string(id),
  "name": entityid,
  "email": email,
  "phone": $firstNonEmpty(phone, altphone),
  "currency_code": currency_code,
  "contact_type": %.query.contact_type,
  "supplier_grade": custentity_supplier_grade,
  "remote_data": $
}

The base mapping still governs query resolution, subsidiary and currency handling, resource routing, and every other customer. Only Customer C's account gets the supplier_grade field.

How the 3-level hierarchy is applied here:

Concern	Level	Why
Polymorphic `vendor` vs `customer` routing	Platform base	Same NetSuite behavior for every customer
Multi-currency / multi-subsidiary conditionals	Platform base (context-driven)	Account context flips the branch; no per-account edit needed
EU subsidiary requires a `TaxRegistrationCountry` filter	Environment override	Applies to one region's staging/production pair
Custom `custentity_supplier_grade` field	Account override	One customer's schema, no impact on others

A note on the polymorphic case: the same technique extends to APIs with more than two variants. Add a third entry to the resource array ("resource": "lead", "query_param_value": "lead") and the engine picks it up on the next request. Existing customers see no difference; new callers using ?contact_type=lead get routed to the new endpoint.

Common Gotchas and Debugging Tips

The declarative pattern is powerful, but new teams routinely hit the same handful of problems. Here is a practical checklist ordered by how often each one bites in production.

1. Arrays overwrite; objects merge. Deep-merge treats object keys as combinable and arrays as atomic. If your base mapping has before: [step1, step2] and your override has before: [step3], the final before is [step3] - not [step1, step2, step3]. When overriding an array field, include every entry you want to keep. This is the single most common override bug.

2. $ context shifts inside a path step. In an expression like records.{ "id": Id, "remote_data": $ }, the object constructor is evaluated once per element of records, so $ refers to that element, not the top-level payload. To reach parent context, use % (parent) or %.% (grandparent). A path that points at the wrong context silently produces undefined fields rather than throwing.

3. Falsy values break naive ternary fallbacks. field ? field : fallback treats false, 0, and "" as missing. Use $exists(field) ? field : fallback for boolean and numeric fields, or $firstNonEmpty(newField, oldField) when you want first-non-empty semantics.

4. null versus missing in create/update bodies. Sending { "phone": null } is not the same as omitting phone. Many providers interpret explicit null as "clear this field" while omission means "leave it unchanged." Guard optional fields with field ? { "phone": field } and let the deep-merge drop absent keys.

5. Overrides are scoped by method. An override on list does not automatically apply to get. If a custom field is exposed by both list and get endpoints, either override both explicitly, or use a mapping pattern where the get mapping references the list mapping by name so a single edit covers both.

6. Test JSONata expressions in isolation first. Every mapping is a pure function of its input. Paste the expression and a sample payload into the JSONata Exerciser to iterate on it. Do not iterate by running end-to-end requests - the feedback loop is too slow, and you cannot easily see the exact input the engine passed to the expression.

7. Preserve remote_data on every override. When you rewrite a response_mapping, it is easy to forget the remote_data: $ line. Losing that field breaks any downstream code relying on the escape hatch for vendor-specific fields you have not modeled yet.

8. Log the resolved mapping, not just the input. When debugging why a customer's response looks wrong, the first question is "which mapping actually ran?" Log the merged mapping (base + env + account) at request time so you can tell whether the override applied at all. A dry-run endpoint that returns the resolved mapping and the intermediate query/response for a given account is worth building early.

9. Type coercion is not automatic. If a provider returns an ID as an integer and you emit it without $string(), downstream code that expects a string will silently misbehave. Coerce IDs, dates, and booleans explicitly at the mapping boundary.

10. Watch context precedence in multi-tenant environments. Account override wins over environment override, which wins over the base. If an environment-level change is "not taking effect" for one customer, check whether an account override further down the merge chain is overwriting it - especially on array fields where the whole array is replaced (see gotcha #1).

11. Fail loudly on missing required fields. A JSONata expression that returns undefined for a required field will silently produce an incomplete record. Pair every override with a JSON Schema validation step downstream so missing required fields surface as a 4xx to the caller rather than a corrupted database row.

12. Prefer promoting patterns to the base over piling up overrides. If three or more accounts need the same override, that is a signal the base mapping should absorb the pattern (usually via $firstNonEmpty or $mapValues). Account overrides that duplicate common patterns become a maintenance liability the moment the base needs to change.
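Gotchas 3 and 4 have direct analogues in most host languages. The Python sketch below uses Python's own truthiness rules as a stand-in for JSONata's boolean casting, and a sentinel to separate "not supplied" from "explicitly null". All names are illustrative assumptions.

```python
import json
from typing import Any, Dict

_MISSING = object()  # sentinel meaning "the caller did not supply this field"


def truthy_fallback(value: Any, fallback: Any) -> Any:
    """Naive fallback: treats False, 0, and "" as if they were missing."""
    return value if value else fallback


def exists_fallback(value: Any, fallback: Any) -> Any:
    """Existence-based fallback: only None counts as missing."""
    return fallback if value is None else value


def build_update_body(phone: Any = _MISSING, email: Any = _MISSING) -> Dict[str, Any]:
    """Include a key only when the caller supplied it; None is kept on purpose."""
    supplied = {"phone": phone, "email": email}
    return {key: value for key, value in supplied.items() if value is not _MISSING}


leave_phone_alone = json.dumps(build_update_body(email="ada@analytical.engine"))
clear_phone = json.dumps(build_update_body(phone=None))
```

`truthy_fallback(0, 10)` returns 10 while `exists_fallback(0, 10)` returns 0, which is the whole of gotcha 3. For gotcha 4, `leave_phone_alone` has no `phone` key at all, while `clear_phone` serializes an explicit `null`.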

For deeper debugging workflows - including how to trace a failing JSONata expression back to its evaluation context - see our developer guide to mapping API data with JSONata.
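The "which mapping actually ran?" question from gotcha 8 can start with something as small as a provenance report. This Python sketch is an assumption about how you might structure it, not an existing endpoint: for every top-level key it records the last level that set it.

```python
from typing import Dict, Optional


def explain_resolution(
    platform: dict,
    environment: Optional[dict] = None,
    account: Optional[dict] = None,
) -> Dict[str, str]:
    """Map each top-level mapping key to the last level that supplied it."""
    provenance: Dict[str, str] = {}
    layers = (("platform", platform), ("environment", environment), ("account", account))
    for level_name, layer in layers:
        for key in layer or {}:
            provenance[key] = level_name  # later levels overwrite earlier ones
    return provenance


report = explain_resolution(
    {"query_mapping": "...", "response_mapping": "...", "before": []},
    environment={"query_mapping": "..."},
    account={"response_mapping": "..."},
)
```

For keys whose values are objects merged key-by-key, this top-level view is coarse; a fuller version would recurse and report nested paths.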

The Honest Trade-Offs

Declarative schema normalization is powerful, but it is not free. The real engineering costs include:

JSONata has a learning curve. Engineers used to writing TypeScript will write ugly expressions for the first week. You must pair it with a small expression library you reuse across integrations.
Debugging is different. You cannot drop a breakpoint in a JSONata expression. You need robust logging at the engine level showing the input payload, the evaluated expression, and the output.
Type safety is weaker. A bad mapping fails at runtime against the JSON Schema, not at compile time. Strong JSON Schema validation at both ends of the pipeline is non-negotiable.
Truly bespoke flows still need code-like escape hatches. Multi-step orchestration (e.g., fetch custom fields, then call the main endpoint, then enrich) needs a before/after step runner. JSONata alone is not enough for complex orchestration.
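As a small illustration of the runtime check that stands in for compile-time safety, here is a standard-library Python sketch that flags normalized records missing required fields. It covers only the `required` part of what a JSON Schema validator would do, and the field list is a made-up example.

```python
from typing import Any, Dict, List

REQUIRED_CONTACT_FIELDS = ["id", "email"]  # hypothetical canonical-schema subset


def missing_required(record: Dict[str, Any], required: List[str]) -> List[str]:
    """Return the required fields that are absent, None, or an empty string."""
    return [field for field in required if record.get(field) in (None, "")]


def validate_records(records: List[Dict[str, Any]], required: List[str]) -> Dict[int, List[str]]:
    """Map the index of each invalid record to its missing fields."""
    errors: Dict[int, List[str]] = {}
    for index, record in enumerate(records):
        missing = missing_required(record, required)
        if missing:
            errors[index] = missing
    return errors


errors = validate_records(
    [{"id": "003xx000004TmiQAAS", "email": "ada@analytical.engine"}, {"id": "003xx000004TmiRAAS"}],
    REQUIRED_CONTACT_FIELDS,
)
```

An empty `errors` dict means the batch can proceed; anything else should stop the request before an incomplete record is stored.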

Unified APIs in general are not a magic bullet either. If you only need one deep, high-fidelity integration to a single platform with custom business logic, a hand-rolled code connector is still the right answer. The architecture in this article wins when you need breadth (five, ten, or fifty connectors across CRM, HRIS, ATS, and accounting) and you do not want your engineering headcount to scale linearly with your integration catalog.

Where to Go From Here

You now have the building blocks: response mapping, query translation, error normalization, and rate limit management, all expressed as declarative JSONata. To productionize this pattern:

Define a canonical schema per category (CRM contacts, HRIS employees, ATS candidates) as JSON Schema, and validate every normalized output against it.
Store mappings as data, not code (for example, by publishing JSONata manifests). Version them, allow per-customer overrides, and audit changes.
Normalize rate-limit headers to the IETF spec and let callers own the retry policy.
Keep remote_data on every record so consumers can reach vendor-specific fields without a code deploy.
Treat your engine as the only code that changes. New connectors become database rows, not pull requests.

Declarative API mapping with JSONata transforms integration maintenance from an open-ended engineering drain into a predictable, scalable configuration task. You stop writing custom adapters and start shipping core product features.

If you want to skip the engine-building phase entirely, Truto runs exactly this architecture (declarative JSONata mappings, three-level overrides, IETF rate-limit headers, and a zero-data-retention proxy) across 200+ connectors today. Bring your own OAuth apps, define mappings as data, and your engineering team can go back to building the actual product.

