LAPIS: Language Model-Augmented Police Investigation System
Heedou Kim, Dain Kim, Jiwoo Lee, Chanwoong Yoon, Donghee Choi, Mogan, Gim, Jaewoo Kang

TL;DR
LAPIS is an AI system that assists police in legal reasoning during investigations by combining a fine-tuned Korean language model with a specialized knowledgebase, outperforming GPT-4 in providing legal guidance.
Contribution
The paper introduces a novel AI-assisted investigation system that integrates a fine-tuned Korean language model with a domain-specific knowledgebase for legal reasoning.
Findings
LAPIS provides reliable legal guidance surpassing GPT-4.
The system demonstrates strong reasoning ability with legally correct conclusions.
Manual curation improves dataset quality and system performance.
Abstract
Crime situations are race against time. An AI-assisted criminal investigation system, providing prompt but precise legal counsel is in need for police officers. We introduce LAPIS (Language Model Augmented Police Investigation System), an automated system that assists police officers to perform rational and legal investigative actions. We constructed a finetuning dataset and retrieval knowledgebase specialized in crime investigation legal reasoning task. We extended the dataset's quality by incorporating manual curation efforts done by a group of domain experts. We then finetuned the pretrained weights of a smaller Korean language model to the newly constructed dataset and integrated it with the crime investigation knowledgebase retrieval approach. Experimental results show LAPIS' potential in providing reliable legal guidance for police officers, even better than the proprietary GPT-4…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Digital and Cyber Forensics
MethodsAttention Is All You Need · Linear Layer · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Softmax · Absolute Position Encodings · Dense Connections
