qualifire/prompt-injection-jailbreak-sentinel-v2 Text Classification • 0.6B • Updated Sep 28, 2025 • 18.1k • 26
nvidia/Aegis-AI-Content-Safety-LlamaGuard-Defensive-1.0 Text Classification • Updated Sep 22, 2025 • 1.81k • 25
nvidia/Aegis-AI-Content-Safety-LlamaGuard-Permissive-1.0 Text Classification • Updated Sep 22, 2025 • 75 • 18
ShieldGemma Collection ShieldGemma is a family of models for text and image content moderation. • 4 items • Updated Jul 10, 2025 • 11