Observability OBS · 02 · 02

The production log schema: fields every line must carry

What a production-grade structured log line contains in 2026 — the OTel Logs Data Model fields, resource attributes, and why the schema is the API your on-call tooling expects.

OBS Middle ◷ 12 min

Level

FoundationsJuniorMiddleSenior

Two services on the same team call the field “status_code” in one and “http_status” in the other. Every dashboard that joins them breaks. The fix is not a regex — it is a shared schema enforced before the code ships.

What every production log line carries

A production-grade structured log line in 2026 has at minimum these fields, in roughly this priority order:

timestamp — ISO-8601 UTC with sub-second precision: 2026-04-12T14:02:11.482Z. The primary sort key for every backend.
level — string enum: TRACE / DEBUG / INFO / WARN / ERROR / FATAL. Controls what gets stored and who gets paged.
service.name — the OTel-canonical key identifying the emitter. This is the join key back to metrics and traces.
trace_id and span_id — when the log is emitted inside a traced request, these are the join keys to the tracing backend. Without them the log-to-trace pivot does not work.
message — a short human-readable summary. The only field humans skim; the rest is for machines.
Event-specific fields — a flat key-value tree using OTel Semantic Conventions: http.route, http.response.status_code, db.system, error.type, etc.
Resource attributes — set once at service start: host.name, cloud.region, service.version, deployment.environment. These describe the emitter, not the event.

Together these groups form a fixed envelope that never changes shape between services — which is exactly what makes cross-service dashboards and run-books possible without per-service translation.

Field group	Example fields	Set by	Frequency
Core	timestamp, level, message	Logger SDK	Every line
Service identity	service.name, service.version	Logger SDK + config	Every line
Trace context	trace_id, span_id, trace_flags	Active span mixin	Inside traced requests
Event-specific	http.route, http.response.status_code, error.type	Application code	Per event type
Resource	host.name, cloud.region, deployment.environment	Platform / config	Once at startup

The OTel Logs Data Model

The OpenTelemetry Logs specification (stable API late 2023, SDKs late 2024) formalises this split. A log record contains:

Timestamp — when the event actually occurred (set by the application).
ObservedTimestamp — when the collector saw the record. Under clock skew or backpressure these two diverge; senior teams alert on the divergence as a pipeline-health metric.
SeverityNumber — a numeric ladder (TRACE=1-4, DEBUG=5-8, INFO=9-12, WARN=13-16, ERROR=17-20, FATAL=21-24) so backends can compare severity across libraries that use different text labels.
Body — the human-readable message.
Attributes — the flat key-value map of event-specific data.
Resource — set once per emitter (service.name, host.name, cloud.region).
TraceId / SpanId / TraceFlags — W3C traceparent context inherited from the active span at emit time.

Adopting this shape — even on a non-OTel backend — buys forward compatibility: you can swap backends without rewriting instrumentation, and every dashboard and on-call run-book can speak the same field names.

Each text level owns a band of four numbers. Mapping them onto one ascending scale is what lets a backend run severity_number >= 9 to mean 'INFO and above' across libraries that disagree on the text label.

timestamp envelope · when

level envelope · severity

service.name envelope · who

trace_id envelope · correlation

span_id envelope · correlation

message envelope · human

context { http.route, status_code, error.type, … } typed fields · event

Every line is a fixed envelope (the same keys on every record) wrapping one typed context object (event-specific fields). The envelope is what queries, joins, and run-books rely on; context carries the per-event payload.

OTel Semantic Conventions: one field name across all services

The Semantic Conventions define the canonical names for common fields: http.route (not route, not path, not url), http.response.status_code (not status, not http_status), db.system (not db_type), error.type. Using these names means:

A dashboard querying http.response.status_code works uniformly across Node and Go services.
A backend alert rule referencing error.type:timeout works without per-service configuration.
A new service that onboards the schema inherits every existing query.

The cost of not following conventions shows up in incident response: when the checkout service calls the field status and the payment service calls it http_status, the join query fails and someone has to know the mapping under pressure at 03:00.

▸Why this works

The schema is not bureaucracy — it is load-bearing. Every query, every dashboard, every alert rule, every run-book step that references a field name assumes that field name is stable and consistent across services. Schema drift breaks them silently: the query returns fewer results, the alert fires on only some services, the run-book step requires manual translation. Centralising the schema in a per-org wrapper logger (one module each service imports) makes drift a build-time error instead of a 03:00 discovery.

Log schema numbers

OTel Logs API stability (GA): Late 2023
OTel Logs SDK stability (most langs): Late 2024
Typical structured log line size: ~0.5–2 KB
SeverityNumber range (TRACE to FATAL): 1–24
OTLP/Logs over gRPC, compressed: ~30–50% smaller than JSON

Quiz

A log line carries Timestamp = 14:00:00 and ObservedTimestamp = 14:05:00. What does this gap indicate?

Quiz

Why does the OTel Logs Data Model define a numeric SeverityNumber (1-24) in addition to the text SeverityText?

Order the steps

Order the OTel Logs Data Model fields from most fundamental to most event-specific:

1 Timestamp — when the event occurred
2 SeverityNumber — numeric severity (1-24)
3 Resource — per-emitter attributes (service.name, host.name)
4 TraceId / SpanId — W3C trace context
5 Body — human-readable message
6 Attributes — event-specific flat key-value (http.route, error.type, ...)

Recall before you leave

01
What are the two timestamps in the OTel Logs Data Model, and why do both matter?
02
Why should you use OTel Semantic Convention field names (http.route, http.response.status_code) instead of your own names?
03
Which fields should be set once at startup versus emitted on every log line?

Recap

A production log schema in 2026 follows the OTel Logs Data Model: Timestamp (event time), ObservedTimestamp (collector time), SeverityNumber (1-24) plus SeverityText, Body, Resource (per-emitter: service.name, host.name, cloud.region), Attributes (event-specific: http.route, error.type), and TraceId/SpanId (W3C trace context). Resource attributes are set once at startup; trace context is pulled from the active span at emit time; event-specific attributes are written in application code. Using OTel Semantic Convention field names uniformly across services is what makes cross-service queries, dashboards, and run-books work without per-service translation. The gap between Timestamp and ObservedTimestamp is a pipeline health signal — alert on p99 over 60 seconds. Now when you see a dashboard showing (ObservedTimestamp - Timestamp) climbing, you know whether to look at the collector or the application clock before assuming the data is wrong.

Practice

Start at the top. Tasks go easiest → hardest: recall a fact, apply it to a case, then a senior-level stretch. Open one, attempt it, then reveal.

recallapplystretch0 of 5 done

Connected lessons

builds on

Why structured logs exist: the diary vs the spreadsheetjunior

unlocks

Log levels and alert routingmiddle

deepens into

appears again in297

Something unclear?

Ask a question about this lesson. Questions are anonymous and go straight to the author to make the lesson better.