#prompt-vulnerability

[ follow ]
Artificial intelligence
fromComputerworld
1 day ago

Get poetic in prompts and AI will break its guardrails

25 frontier proprietary and open-weight models yielded high attack-success rates when prompted in verse, showing AI can break guardrails and reveal harmful instructions.
[ Load more ]