FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments
Intermediate | Zhi Yang, Runguo Li et al. | Jan 9 | arXiv
FinVault is a new benchmark that checks whether AI agents for finance stay safe while actually carrying out tasks in an execution environment, not just chatting.
#financial AI agents #execution-grounded benchmarking #sandboxed environments