Jailbreak Taxonomy


We classify the methods based on the following two criteria:

Jailbreak Taxonomy Jailbreak Method Require White-Box Access? Modify the Original Question?
Human-Based AIM
Human-Based Devmoderanti
Human-Based Devmodev2
Obfuscation-Based Base64
Obfuscation-Based Combination
Obfuscation-Based Zulu
Optimization-Based AutoDAN
Optimization-Based GCG
Optimization-Based COLD
Optimization-Based GPTfuzz
Optimization-Based PAIR
Optimization-Based TAP
Optimization-Based Masterkey
Parameter-Based Generation Exploitation