COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
IntermediateDasol Choi, DongGeon Lee et al.Jan 5arXiv
COMPASS is a new framework that turns a companyβs rules into thousands of smart test questions to check if chatbots follow those rules.
#policy alignment#allowlist denylist#enterprise AI safety