23 November 2025

LLM's get fooled by poets

Italian boffins discovered that trying to sidestep the guards that limit what an AI system will do for you (hacking...) works an order of magnitude better if you phrase your request as a poem.

The approach works accorss all popular models, though some are more vulnerable then others.

This suggest that prompting systems using non standard language will give you a better success rate when trying to abuse AI systems.

more... 

No comments:

Post a Comment