ContextEngine — Vault → System Prompt

Load order matters

backend/context/engine.py loads the vault in a fixed sequence: Pippa.md → instructions.md → core/* → index/*. Order matters because identity priority follows order. Files under index/ larger than 10KB get replaced with on-demand pointers — Pippa reads them via the Read tool when needed instead of bloating every system prompt.

RAG injection

The system prompt also includes RAG context (next lesson). RAG runs before the system prompt is finalized — the retrieved chunks become a section of the system prompt, not a user message.

Cache hint

The vault content is huge. The Claude SDK uses prefix caching — same vault content across sessions, same prefix, cached. Tokens billed once, reused on every session start. The cache hint is implicit; the structure is what makes it work.

ContextEngine — Vault → System Prompt

Load order matters

RAG injection

Cache hint

Progress

Comments 0