Data platform: free-recall review

Free-recall prompts spanning the whole data-engineering track. Answer each in your own words first, then reveal the model answer and compare.

Retrieval beats re-reading. For each prompt, reconstruct a full answer from memory before you open the model answer — the effort of pulling the whole track together is what fuses the separate stores into one mental model.

Goal

Reconstruct the track’s spine without looking back: why OLTP and OLAP split, where the transform runs, how Parquet prunes, what an MV trades, why the log is the source of truth, and how search and vectors divide relevance — and how the seams between them are designed.

Recall before you leave

01. Why can't one storage layout serve both the OLTP checkout path and OLAP revenue scans, and what is the standard two-store answer?
02. What does choosing ELT over ETL actually buy you, and what is the cost?
03. How does Parquet skip data, and what code mistake silently defeats it?
04. What does a materialized view trade, and how do you keep that tradeoff honest in production?
05. Why is the append-only event log the source of truth in event sourcing, and how does current state relate to it?
06. How do full-text search and vector search divide the relevance problem, and why do mature systems run both?

Recap

If you reconstructed each answer, you hold the track’s spine: OLTP and OLAP split because no single layout wins both access patterns; ELT buys replay by retaining the raw data; Parquet skips row groups using the min/max statistics in its footer, unless a function wrapped around the filtered column hides it from the reader; a materialized view (MV) accepts staleness in exchange for read speed and needs a declared freshness SLA; the event log is the source of truth and current state is a fold over it; and full-text search plus vector search divide lexical from semantic relevance. Above all, each store is correct for its own job. The system as a whole stays correct only when you design the contract, delivery guarantee, freshness, and reconciliation at every seam between them. When you next face a design review or an incident where each team’s store passes its own tests, reach for the seam first, and you’ll know which contract questions to ask.
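
To make "state is a fold over the log" concrete, here is a minimal sketch in Python. The `Event` type, the `deposited` and `withdrawn` event kinds, and the account-balance domain are all hypothetical, chosen only for illustration; a real event store would add persistence, ordering guarantees, and schema versioning.

```python
from dataclasses import dataclass
from functools import reduce


@dataclass(frozen=True)
class Event:
    """One immutable fact in the append-only log (hypothetical shape)."""
    kind: str    # "deposited" or "withdrawn" (illustrative event kinds)
    amount: int  # amount in cents


def apply_event(balance: int, event: Event) -> int:
    """Pure transition: previous state plus one event gives the next state."""
    if event.kind == "deposited":
        return balance + event.amount
    if event.kind == "withdrawn":
        return balance - event.amount
    raise ValueError(f"unknown event kind: {event.kind}")


def current_state(log: list[Event], initial: int = 0) -> int:
    """Current state is a left fold of apply_event over the whole log."""
    return reduce(apply_event, log, initial)


# The log is the source of truth; the balance is derived from it.
log = [
    Event("deposited", 1000),
    Event("withdrawn", 300),
    Event("deposited", 50),
]

balance_now = current_state(log)           # fold over every event
balance_after_two = current_state(log[:2])  # replay a prefix to see past state
```

Because `apply_event` is pure, replaying any prefix of the log reproduces the state as of that point, which is why the derived balance can be thrown away and rebuilt while the log cannot.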
