MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models
BeginnerZhongxi Wang, Yueqian Lin et al.Mar 3arXiv
MUSE is a new open-source platform that tests how safely AI models behave when you talk to them with text, sound, pictures, and video, not just text.
#MUSE#multimodal safety evaluation#red-teaming