APIs API · 06 · 05

Query complexity defences: depth, cost, persisted queries

DataLoader fixes database trips. Depth limits, complexity scoring, persisted queries, and alias caps stop attackers from sending query shapes that kill the server before any resolver fires.

API Middle ◷ 14 min

Level

FoundationsJuniorMiddleSenior

DataLoader is live and the page loads in 80 ms. Then a security researcher sends { user { friends { friends { friends { ... } } } } } 12 levels deep. The server fires for 28 seconds and returns a 502. DataLoader did its job. The problem is the query shape itself — and DataLoader cannot help with that.

DataLoader is live, but a single crafted query can still bring the server down before any resolver fires. This lesson covers the defences that sit in front of execution — and the numbers that separate working production configs from defaults that invite abuse.

Depth limiting

Why does query depth matter so much if DataLoader already handles batching? Because at high depth the leaf count grows exponentially — DataLoader cannot batch its way out of 10^22 IDs. A depth limiter walks the query AST and rejects documents whose nesting level exceeds a configured maximum. Typical production limit: 7–10 levels. The check runs during the validation phase — before any resolver fires, before any database is touched.

{ user { friends { friends { friends { ... } } } } }
  depth 1   depth 2   depth 3   depth 4

At depth 12 with a branching factor of 100 (each user has 100 friends), the leaf count is 100^12. Even with DataLoader batching, that is 12 batched queries each on a set of 10^22 IDs. Depth limiting kills this at parse time.

List-depth is stricter than scalar depth. A 10-level query where some levels return scalars is different from a 10-level query where every level returns a list. Some libraries (e.g. graphile/depth-limit) provide separate maxListDepth (typically 3–4).

Complexity scoring

Complexity scoring assigns a cost to each field and sums across the AST. Two styles:

Static weights: each field has a @cost(value: Int!) directive. The analyser walks the AST, sums costs, multiplies list-field costs by their limit arguments. Reject if sum exceeds budget (typical: 1000–10000 units).
Multiplicative (GitHub model): cost(parent) = cost(parent_fields) + sum(child.limit × cost(child_fields)). A query that requests 100 items at each of 5 levels costs 100^5 by this formula — far over budget, rejected at AST parse time.

GitHub caps per-query cost at 1000 and publishes the cost in extensions.cost. Shopify Storefront caps at 1000 cost/query and 1000 cost/second/IP.

Persisted queries (trusted documents)

Persisted queries replace the inline query string with a SHA-256 hash. The client registers known queries at build time; at request time it sends only the hash and variables. The server executes the stored document.

This closes the entire inline-query attack surface: arbitrary client-supplied queries are impossible. Introspection-driven enumeration, complexity attacks, alias bombs, and depth bombs are blocked at the gate.

Tradeoff: every client deploy requires a registration step. Ad-hoc tooling (Postman, browser console) no longer works against production. Public APIs that cannot constrain clients (GitHub, Shopify) keep inline queries open but add complexity scoring instead.

The strategic fork: if you control every client, persisted queries close the inline attack surface outright; if you serve public clients like GitHub or Shopify, you keep inline open and lean on depth and complexity limits instead.

Alias bombs and operation batching

A single document can declare hundreds of top-level aliases for the same resolver:

q1: user(id: 1) { email }
q2: user(id: 2) { email }
...
q1000: user(id: 1000) { email }

This is one valid document, but it executes 1000 resolver calls. DataLoader collapses the database trips, but resolver-execution count is still the attacker’s leverage. Production caps: ≤20 root aliases per document, ≤5–10 operations per batch request.

Defence	What it stops	When it runs
Depth limit	Recursive/deep query bombs	Validation (before resolvers)
Complexity scoring	Cost-budget overruns	Validation
Persisted queries	All arbitrary inline queries	Before parsing
Alias cap	Alias bombs	Validation
Operation batch cap	Batch-request amplification	Before parsing
DataLoader	DB trip amplification	During resolution

GraphQL N+1 and defence: numbers

Typical depth limit: 7–10 levels
List-depth recommendation: 3–4
Typical complexity budget: 1000–10000 units
GitHub per-query cost cap: 1000
Shopify Storefront per-query cap: 1000 cost units
Alias-bomb cap (typical): ≤20 root aliases
Operation-batch cap (typical): ≤5–10 operations

Quiz

Which is the strongest single line of defence for a public GraphQL API against query-complexity attacks?

Order the steps

Order the safety checks a production GraphQL server runs on an incoming query:

1 Hash lookup: is this a known persisted query? If yes, accept and execute the stored document
2 Parse and validate the document against the schema
3 Depth analysis: reject if document depth exceeds the configured maximum
4 Complexity scoring: walk the AST, sum field costs, reject if budget exceeded
5 Authorise: check that the requesting client has the right scopes for the operation
6 Execute resolvers via DataLoader-batched fetchers

Quiz

An API team enables persisted queries but leaves the inline-query endpoint open for debugging. Why is this only marginally safer than no persisted queries?

Depth and complexity are checked at validation time; an over-budget query is rejected before any resolver or DB call runs.

Recall before you leave

01
What does complexity scoring do that depth limiting does not?
02
Persisted queries block complexity attacks. What is their operational tradeoff?

Recap

DataLoader fixes the N+1 problem within resolver execution. It does nothing about query shape. Depth limits (7–10 levels) reject recursive bombs at validation time. Complexity scoring (1000–10000 budget) rejects cost-overrun documents before any resolver fires. Persisted queries close the entire inline-document attack surface by allowing only pre-registered hashes. Alias caps (≤20) and operation-batch caps (≤5–10) stop amplification attacks that defeat naive per-request rate limits. Use all layers together; each one fails closed. Now when you ship a GraphQL API, walk through the table above and check which rows are enabled — missing depth limits and alias caps are the two most common gaps that turn up in production security reviews.

Practice

Start at the top. Tasks go easiest → hardest: recall a fact, apply it to a case, then a senior-level stretch. Open one, attempt it, then reveal.

recallapplystretch0 of 5 done

Connected lessons

builds on

DataLoader mechanics: tick-boundary batchingmiddle

unlocks

Senior GraphQL API: scheduling contract, tenant isolation, observabilitysenior

deepens into

Senior GraphQL API: scheduling contract, tenant isolation, observabilitysenior

appears again in204

Something unclear?

Ask a question about this lesson. Questions are anonymous and go straight to the author to make the lesson better.

Apply this

Put this lesson to work on a real build.

Distributed rate limiterBuild a token-bucket limiter that holds across many app instances by keeping the counter in Redis, not in process memory.