ast-guard: Deterministic Reward Hacking Detection - Paper & Control Experiments | Manifund