Apr 8, 2026
Reading and Writing Rule-Compliance Representations in LLMs
We found an internal rule-mismatch signal in Qwen3-30B-A3B that forms at layer 24, becomes cleanly readable by layer 40, and is causally load-bearing for policy compliance decisions.