docs: update model guidance

2026-01-06 23:48:25 +01:00
parent c920ee1166
commit edfc71a47e
2 changed files with 5 additions and 0 deletions
--- a/docs/security.md
+++ b/docs/security.md
@@ -75,6 +75,7 @@ Even with strong system prompts, **prompt injection is not solved**. What helps
 - Prefer mention gating in groups; avoid “always-on” bots in public rooms.
 - Treat links and pasted instructions as hostile by default.
 - Run sensitive tool execution in a sandbox; keep secrets out of the agent’s reachable filesystem.
+- **Model choice matters:** we recommend Anthropic Opus 4.5 because it’s quite good at recognizing prompt injections (see [“A step forward on safety”](https://www.anthropic.com/news/claude-opus-4-5)). Using weaker models increases risk.

 ## Lessons Learned (The Hard Way)