Probehalter, an evaluation harness for tool-abuse risks in LLM agents | Manifund