Jailbreaking Techniques

A refusal is a learned behaviour, not an enforced rule, which is exactly why it can be steered around. Personas, fictional framing, encoding tricks, low-resource languages, and the multi-turn crescendo that quietly walks a model past its own guardrails: how each one works, why it works, and how to test for it.
