Making local LLMs more reliable with a deterministic “context compiler”

Hugging Face Forums [Unofficial] March 18, 2026

Source

I’ve been experimenting with running LLMs locally, and kept running into a common issue:

constraints and corrections drift out of the prompt over time

Example:

This gets worse with smaller models or limited context windows.

So I built a small deterministic tool called a context compiler.

Instead of relying only on the transcript, it extracts structured state like:

Then that state is injected into the prompt every turn, so important constraints don’t get lost.

Key idea:

I added a set of demos comparing:

The interesting part is that better prompting improves things, but the compiled state is what actually guarantees invariants.

Repo + demos:

github.com

Deterministic state engine for managing conversation state and constraints in LLM applications.