Bank Statement Parsing Benchmark

Parsing Challenges

36 real-world formatting problems seen across 77+ real banks and 15 benchmark statements.

Last updated: March 17, 2026
36 challenges
Layout & StructureBank statements only

Separate Credit & Debit Columns

Statements use separate Credit and Debit columns in some formats, while others use a single Amount column with signs. Treating one layout like the other leads to wrong amounts or flipped signs. We automatically detect which layout is used and normalize the output.

How it looks

date
description
debit
credit
balance
15 Jan 2025
ONLINE PURCHASE - AMAZON.COM Order #112-7834
45.99
4,954.01
15 Jan 2025
SALARY DEPOSIT - ACME CORP January payroll
3,500.00
8,454.01
17 Jan 2025
UTILITY PAYMENT - ELECTRIC CO Account #98765
127.50
8,326.51
18 Jan 2025
ATM WITHDRAWAL
200.00
8,126.51
19 Jan 2025
TRANSFER FROM SAVINGS Monthly transfer
500.00
8,626.51
↑ Scroll ↓

In 10 synthetic statements

🇸🇬 bsb-001🇭🇰 bsb-004🇨🇦 bsb-005🇲🇽 bsb-006🇦🇺 bsb-008🇬🇧 bsb-009🇮🇳 bsb-010🇭🇰 bsb-011🇰🇿 bsb-013🇹🇭 bsb-014

Often appears with

Test your parser against these challenges

Download the benchmark dataset and see how your parser handles real-world formatting problems.

See how commercial converters score on our benchmark: Accuracy Benchmarks