KemenkeuGPT: Leveraging a Large Language Model on Indonesia's Government Financial Data and Regulations to Enhance Decision Making
Gilang Fajar Febrian, Grazziela Figueredo

TL;DR
This paper presents KemenkeuGPT, a large language model tailored for Indonesia's government financial data, demonstrating improved accuracy and potential to support policymaking through retrieval-augmented generation and fine-tuning.
Contribution
The study develops and evaluates KemenkeuGPT, the first LLM specifically designed for Indonesia's financial data, integrating retrieval-augmented generation and human feedback for enhanced decision support.
Findings
Model accuracy improved from 35% to 61%.
KemenkeuGPT achieved 44% correctness and 73% faithfulness in RAGAS evaluation.
Expert feedback indicates potential for decision-making support.
Abstract
Data is crucial for evidence-based policymaking and enhancing public services, including those at the Ministry of Finance of the Republic of Indonesia. However, the complexity and dynamic nature of governmental financial data and regulations can hinder decision-making. This study investigates the potential of Large Language Models (LLMs) to address these challenges, focusing on Indonesia's financial data and regulations. While LLMs are effective in the financial sector, their use in the public sector in Indonesia is unexplored. This study undertakes an iterative process to develop KemenkeuGPT using the LangChain with Retrieval-Augmented Generation (RAG), prompt engineering and fine-tuning. The dataset from 2003 to 2023 was collected from the Ministry of Finance, Statistics Indonesia and the International Monetary Fund (IMF). Surveys and interviews with Ministry officials informed,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining and Machine Learning Applications · Economic Growth and Fiscal Policies · Financial Reporting and XBRL
MethodsBalanced Selection
