Benchmark / Challenges

Parsing Challenges

34 real-world formatting problems across 15 benchmark statements.

34 challenges
Layout & StructureBank statements only

Separate Credit & Debit Columns

Statements use separate Credit and Debit columns in some formats, while others use a single Amount column with signs. Treating one layout like the other leads to wrong amounts or flipped signs. We automatically detect which layout is used and normalize the output.

How it looks

15 Jan 2025
ONLINE PURCHASE - AMAZON.COM
45.99
4,954.01
15 Jan 2025
SALARY DEPOSIT - ACME CORP
3,500.00
8,454.01
17 Jan 2025
UTILITY PAYMENT - ELECTRIC CO
127.50
8,326.51
18 Jan 2025
ATM WITHDRAWAL
200.00
8,126.51
19 Jan 2025
TRANSFER FROM SAVINGS
500.00
8,626.51

In 10 statements

🇸🇬 bsb-001🇭🇰 bsb-004🇨🇦 bsb-005🇲🇽 bsb-006🇦🇺 bsb-008🇬🇧 bsb-009🇮🇳 bsb-010🇭🇰 bsb-011🇰🇿 bsb-013🇹🇭 bsb-014

Often appears with

Running Balance Cross-CheckInconsistent Date FormatsMulti-Line Transaction DescriptionsMultiple Dates per TransactionMissing Year in DatesPayment Method/Rails Information

Test your parser against these challenges

Download the benchmark dataset and see how your parser handles real-world formatting problems.