Safety at Scale: A Comprehensive Survey of Large Model Safety Paper • 2502.05206 • Published Feb 2 • 3
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law Paper • 2507.18576 • Published Jul 24 • 8
Simulated Ensemble Attack: Transferring Jailbreaks Across Fine-tuned Vision-Language Models Paper • 2508.01741 • Published Aug 3 • 1
Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes? Paper • 2506.14805 • Published Jun 3 • 3
Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs Paper • 2511.12710 • Published Nov 16 • 37