BizFinBench.v2: A Unified Dual-Mode Bilingual Benchmark for Expert-Level Financial Capability Alignment
Intermediate · Xin Guo, Rongjunchen Zhang et al. · Jan 10 · arXiv
This paper introduces BizFinBench.v2, a large bilingual (Chinese–English) benchmark that evaluates how well AI models handle financial tasks, built on real business data from China and the U.S.
#BizFinBench.v2 #financial benchmark #bilingual evaluation