OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation
IntermediateXiaojun Jia, Jie Liao et al.Dec 6arXiv
OmniSafeBench-MM is a one-stop, open-source test bench that fairly compares how multimodal AI models get tricked (jailbroken) and how well different defenses stop that.
#multimodal large language models#jailbreak attacks#safety alignment