
AI / LLM Integration

LLM cost budgets: free-recall review

Crux: Free-recall prompts across the LLM cost-budgets unit. Answer each in your own words first, then reveal the model answer and compare.

Retrieval beats re-reading. For each prompt, say or write a full answer from memory before you open the model answer — the effort of recall is what makes the cost model stick.

Goal

Reconstruct the unit’s spine — token asymmetry, where context accumulates, routing economics, prompt caching, and the in-process kill switch — without looking back at the lesson.

Recall before you leave
  1. Why is output the expensive half of an LLM bill, and what concrete levers attack it?
  2. A stateless model re-sends context every turn. Name the three things that inflate the re-sent payload and how each grows.
  3. When does model routing (cheap-first cascade) actually save money, and when does it backfire?
  4. Explain prompt caching: what gets discounted, by how much, and how do you structure a prompt to maximise the benefit?
  5. Why does an uncapped agent loop burn money superlinearly, and why can't a monthly provider cap stop it?
  6. List the LLM cost controls in priority order, cheapest first-line to last-resort, and say what each one bounds.
Recap

If you could reconstruct each answer from memory, you hold the unit’s spine. Output costs ~5x input, so cap it. The system prompt, history, and RAG are all re-sent every turn (fixed, linear, and multiplicative growth respectively). Routing saves money only when the escalation rate is low. Caching the stable prefix drops its price to 0.1x and pays off on the first hit. And because a runaway loop's spend grows superlinearly while a monthly provider cap works on a timescale of days, the real brake is an in-process budget plus a kill switch on cost velocity.
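The arithmetic behind the recap can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the unit's own code: the prices are made up (only the ~5x output ratio and the 0.1x cached-prefix multiplier come from the recap), any cache-write surcharge is ignored, and the names `call_cost`, `CostGuard`, `BudgetExceeded`, and `simulate_loop` are hypothetical.

```python
from collections import deque

# Assumed, illustrative prices in USD per million tokens (not real provider rates).
INPUT_PER_MTOK = 3.00
OUTPUT_PER_MTOK = 15.00   # ~5x input, as in the recap
CACHE_READ_MULT = 0.1     # cached stable prefix billed at 0.1x input


def call_cost(prefix_tokens, fresh_tokens, output_tokens, cached):
    """USD cost of one call; the stable prefix is discounted when cached."""
    prefix_rate = INPUT_PER_MTOK * (CACHE_READ_MULT if cached else 1.0)
    total = (prefix_tokens * prefix_rate
             + fresh_tokens * INPUT_PER_MTOK
             + output_tokens * OUTPUT_PER_MTOK)
    return total / 1_000_000


class BudgetExceeded(RuntimeError):
    """Raised by the in-process guard to stop the loop."""


class CostGuard:
    """Hypothetical in-process budget plus a kill switch on cost velocity."""

    def __init__(self, limit_usd, max_usd_per_window, window_s=60.0):
        self.limit_usd = limit_usd
        self.max_usd_per_window = max_usd_per_window
        self.window_s = window_s
        self.spent_usd = 0.0
        self._recent = deque()  # (timestamp, cost) pairs inside the window

    def charge(self, cost, now):
        self.spent_usd += cost
        self._recent.append((now, cost))
        # Drop charges that have aged out of the sliding window.
        while self._recent and self._recent[0][0] <= now - self.window_s:
            self._recent.popleft()
        if self.spent_usd > self.limit_usd:
            raise BudgetExceeded("total budget exceeded")
        if sum(c for _, c in self._recent) > self.max_usd_per_window:
            raise BudgetExceeded("cost velocity too high")


def simulate_loop(turns, guard, seconds_per_turn=1.0,
                  system_tokens=2000, turn_tokens=500, output_tokens=500):
    """Simulate a stateless loop that re-sends all earlier history each turn."""
    history_tokens = 0
    costs = []
    for turn in range(turns):
        cost = call_cost(system_tokens, history_tokens + turn_tokens,
                         output_tokens, cached=False)
        guard.charge(cost, now=turn * seconds_per_turn)
        costs.append(cost)
        # This turn's input and output both become next turn's history.
        history_tokens += turn_tokens + output_tokens
    return costs
```

With these assumed prices, a call with a 2,000-token prefix, 500 fresh input tokens, and 500 output tokens comes to $0.015 uncached and about $0.0096 with the prefix cached. In the simulated loop the per-call cost rises linearly with the re-sent history, so the cumulative spend rises faster than linearly: doubling the number of turns more than doubles the total. The guard sits inside the process so it can stop the loop on the very call that crosses either limit.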


Trademarks belong to their respective owners. Editorial reference only.