rogue-security/prompt-injection-jailbreak-sentinel-v2 Text Classification • 0.6B • Updated Mar 11 • 17.7k • 30
Necent/distilbert-base-uncased-detected-jailbreak Text Classification • 67M • Updated May 29, 2025 • 70
GuardrailsAI/prompt-saturation-attack-detector Text Classification • 4.39M • Updated Nov 14, 2024 • 45.7k • • 2
gincioks/cerberus-proventra-mdeberta-v3-base-v1.0-onnx Text Classification • Updated Jun 15, 2025 • 2
intelliway/deberta-v3-base-prompt-injection-v2-mapa Text Classification • 0.2B • Updated Jul 3, 2025 • 4
llm-semantic-router/mmbert-jailbreak-detector-merged Text Classification • 0.3B • Updated Jan 21 • 179
llm-semantic-router/mlcommons-safety-classifier-level1-binary Text Classification • Updated Jan 22 • 15
llm-semantic-router/mmbert32k-jailbreak-detector-merged Text Classification • 0.3B • Updated Mar 6 • 4.72k
satyamg1620/mmbert32k-jailbreak-detector-healthcare-merged Text Classification • 0.3B • Updated Feb 15 • 3