25.5 Maintenance: updating the index and handling deletions

On this page

Goal: keep the system correct as docs change

RAG systems degrade over time if you don’t maintain the index.

Maintenance is not optional because:

Stale indexes create “confidently wrong with citations”

The system will cite outdated chunks and still look trustworthy. This is why maintenance needs process and monitoring.

Use hashes and versioning to drive incremental updates:

Incremental update flow:

Deletions matter for correctness, privacy, and compliance.

Decisions to make:

Hard delete vs tombstone: do you remove chunk text entirely or keep a minimal record?
Propagation: how quickly must deletions stop influencing answers?
Audit: do you need to prove deletion happened?

Practical safeguards:

delete from vector index and chunk store (or mark inactive),
ensure retrieval filters exclude inactive chunks,
invalidate caches that might re-serve deleted content,
run a “deletion smoke test” that queries for the deleted doc and confirms it is not retrievable.

Embedding model changes are like schema migrations:

Never mix embeddings from different models in the same similarity space unless you know they’re compatible.

Monitor signals that indicate index health:

Daily/weekly: ingest updates and run incremental indexing.
After big doc changes: re-run eval set and spot-check citations.
After embedding/ranking changes: run retrieval metrics + answer faithfulness review.
On deletion request: delete chunks, invalidate caches, run deletion verification queries.
Incident response: if answers are wrong, inspect logs: retrieved chunks, versions, prompts.