Quoting Model Card Addendum: Claude 3.5 Haiku and Upgraded Sonnet

from blog Simon Willison's Weblog, | ↗ original
We enhanced the ability of the upgraded Claude 3.5 Sonnet and Claude 3.5 Haiku to recognize and resist prompt injection attempts. Prompt injection is an attack where a malicious user feeds instructions to a model that attempt to change its originally intended behavior. Both models are now better able to recognize adversarial prompts from a user...