MMFCTUB: Multi-Modal Financial Credit Table Understanding Benchmark
By: Cui Yakun, Yanting Zhang, Zhu Lei and more
Potential Business Impact:
Helps computers understand financial tables for loans.
The advent of multi-modal large language models (MLLMs) has spurred research into their application across various table understanding tasks. However, their performance in credit table understanding (CTU) for financial credit review remains largely unexplored because of three barriers: low data consistency, high annotation costs stemming from domain-specific knowledge and complex calculations, and gaps between benchmark evaluation paradigms and real-world scenarios. To address these challenges, we introduce MMFCTUB (Multi-Modal Financial Credit Table Understanding Benchmark), a practical benchmark encompassing more than 7,600 high-quality CTU samples across 5 table types. MMFCTUB employs a minimally supervised pipeline that adheres to inter-table constraints and maintains consistency of data distributions. The benchmark uses capacity-driven questions and a mask-and-recovery strategy to evaluate models' cross-table structure perception, domain knowledge utilization, and numerical calculation capabilities. Using MMFCTUB, we conduct comprehensive evaluations of both proprietary and open-source MLLMs, revealing their strengths and limitations in CTU tasks. MMFCTUB serves as a valuable resource for the research community, facilitating rigorous evaluation of MLLMs in the domain of CTU.
Similar Papers
CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model
Computation and Language
Helps computers understand money charts and numbers.
FCMBench: A Comprehensive Financial Credit Multimodal Benchmark for Real-world Applications
CV and Pattern Recognition
Helps AI understand loan papers for better decisions.
FCMBench: A Comprehensive Financial Credit Multimodal Benchmark for Real-world Applications
CV and Pattern Recognition
Tests AI for loan applications.