awesome-everything RU
↑ Back to the climb

Data Engineering

Vector search: multiple-choice review

Crux Multiple-choice synthesis across the vector-search unit — recall vs latency, HNSW vs IVF-PQ, distance metrics, post-filtering, and the silent-recall failure mode.
Your altitude — climbing toward senior
ZeroJuniorMiddleSenior
You are at senior altitude — in orbit
◷ 13 min

Six questions that cut across the whole unit. Each mirrors a decision you make staring at a RAG pipeline that returns ten ranked rows, fast, with no errors — and the wrong ten.

Goal

Confirm you can connect embeddings, ANN index choice, the recall–latency–memory triangle, distance metrics, and the production traps — the synthesis the lesson built toward.

Quiz

A RAG search returns ten ranked rows in 2ms with zero errors, but support reports the bot 'can't find' docs it clearly has. What is the most likely root cause, and how do you confirm it?

Quiz

A RAG service over ~5M chunks has continuous ingestion, needs high recall, and has RAM to spare. Which index is the right default and why?

Quiz

Raising hnsw.ef_search from 100 to 500 moves recall from ~85% to ~98% but latency from ~1ms to ~5ms. How should a senior engineer read this?

Quiz

A team stores normalized embeddings and debates cosine similarity vs inner (dot) product. What is the correct call?

Quiz

A tenant-scoped search adds WHERE tenant_id = ? on top of an HNSW query, and recall drops sharply for small tenants. Why, and what is the fix?

Quiz

Users search for the exact error string 'ECONNREFUSED' and pure vector search buries the matching doc beneath vaguely-related paragraphs. Best fix?

Recap

The through-line is one triangle — recall, latency, memory — and one habit: recall drops are silent. The candidate pool (ef_search / probes) sets recall vs latency, the index family sets memory and drift (HNSW for changing data, IVFFlat for static, IVF-PQ when it will not fit RAM), the metric must match how embeddings were trained, post-filtering wrecks selective filters (use iterative scan), and exact-token needs demand hybrid BM25 + vector with rank fusion. Above all, measure recall@k against an exact baseline or you are flying blind.

Continue the climb ↑Vector search: free-recall review
shortcuts expand
search
K
prev piece
k
next piece
j
cycle tier
t
this menu
?
sources3
expand
  1. 01
  2. 02
  3. 03

Trademarks belong to their respective owners. Editorial reference only.