Rogue tool calls
The agent invents a customer ID, a SKU, an email address, or a refund amount and calls a real tool with it. The model is confident, the arguments parse, the tool fires.
Validate every tool argument against a schema and a live data check before execution. Block calls referencing entities that don't exist. Require approval for destructive verbs (delete, refund, deploy).
