DeepSight: An All-in-One LM Safety Toolkit
IntermediateBo Zhang, Jiaxuan Guo et al.Feb 12arXiv
DeepSight is a free, all-in-one safety toolkit that both tests how models behave (DeepSafe) and peeks inside how they think (DeepScan).
#LLM safety evaluation#multimodal safety#frontier AI risks