BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law
Juvenal Domingos J\'unior, Augusto Faria, E. Seiti de Oliveira, Erick de Brito, Matheus Teotonio, Andre Assump\c{c}\~ao, Diedre Carmo, Roberto Lotufo, Jayr Pereira

TL;DR
This paper introduces BR-TaxQA-R, a new dataset for question answering on Brazilian tax law, and evaluates a retrieval-augmented generation system that outperforms commercial tools in relevance but highlights the importance of human review.
Contribution
The paper presents a novel dataset for legal question answering in Brazilian tax law and benchmarks a RAG system, comparing it with commercial AI models.
Findings
Our RAG pipeline outperforms commercial tools in Response Relevancy.
Commercial models score higher in Factual Correctness and fluency.
Human expert evaluation is crucial for legal validity in AI-generated answers.
Abstract
This paper presents BR-TaxQA-R, a novel dataset designed to support question answering with references in the context of Brazilian personal income tax law. The dataset contains 715 questions from the 2024 official Q\&A document published by Brazil's Internal Revenue Service, enriched with statutory norms and administrative rulings from the Conselho Administrativo de Recursos Fiscais (CARF). We implement a Retrieval-Augmented Generation (RAG) pipeline using OpenAI embeddings for searching and GPT-4o-mini for answer generation. We compare different text segmentation strategies and benchmark our system against commercial tools such as ChatGPT and Perplexity.ai using RAGAS-based metrics. Results show that our custom RAG pipeline outperforms commercial systems in Response Relevancy, indicating stronger alignment with user queries, while commercial models achieve higher scores in Factual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Law · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Attention Dropout · Softmax · WordPiece · Weight Decay · Dropout · Adam · Linear Layer
