Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models
Lars Hillebrand, Armin Berger, Tobias Deu{\ss}er, Tim Dilmaghani,, Mohamed Khaled, Bernd Kliem, R\"udiger Loitz, Maren Pielka, David Leonhard,, Christian Bauckhage, Rafet Sifa

TL;DR
This paper introduces ZeroShotALI, a zero-shot text matching system for financial auditing that combines a BERT-based retrieval step with an LLM filtering step, reducing the need for fine-tuning and annotated data.
Contribution
It presents a novel two-step approach using a domain-optimized transformer and a large language model for improved zero-shot document matching in financial audits.
Findings
Significant performance improvements over existing methods
Effective zero-shot document retrieval without extensive fine-tuning
Combines BERT-based retrieval with LLM filtering for accuracy
Abstract
Auditing financial documents is a very tedious and time-consuming process. As of today, it can already be simplified by employing AI-based solutions to recommend relevant text passages from a report for each legal requirement of rigorous accounting standards. However, these methods need to be fine-tuned regularly, and they require abundant annotated data, which is often lacking in industrial environments. Hence, we present ZeroShotALI, a novel recommender system that leverages a state-of-the-art large language model (LLM) in conjunction with a domain-specifically optimized transformer-based text-matching solution. We find that a two-step approach of first retrieving a number of best matching document sections per legal requirement with a custom BERT-based model and second filtering these selections using an LLM yields significant performance improvements over existing approaches.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Topic Modeling · Advanced Text Analysis Techniques
