NOWJ1@ALQAC 2023: Enhancing Legal Task Performance with Classic   Statistical Models and Pre-trained Language Models

Tan-Minh Nguyen; Xuan-Hoa Nguyen; Ngoc-Duy Mai; Minh-Quan Hoang,; Van-Huan Nguyen; Hoang-Viet Nguyen; Ha-Thanh Nguyen; Thi-Hai-Yen Vuong

arXiv:2309.09070·cs.CL·September 19, 2023·2 cites

NOWJ1@ALQAC 2023: Enhancing Legal Task Performance with Classic Statistical Models and Pre-trained Language Models

Tan-Minh Nguyen, Xuan-Hoa Nguyen, Ngoc-Duy Mai, Minh-Quan Hoang,, Van-Huan Nguyen, Hoang-Viet Nguyen, Ha-Thanh Nguyen, Thi-Hai-Yen Vuong

PDF

Open Access

TL;DR

This paper presents the NOWJ1 team's approach for ALQAC 2023, combining classical statistical models and pre-trained language models to improve legal document retrieval and question answering tasks, showing promising results.

Contribution

The paper introduces an integrated method using classical statistical models and PLMs for legal tasks, with specific techniques for document retrieval and question answering.

Findings

01

Effective pre-processing for input limitations

02

Successful application of learning-to-rank methods

03

Promising experimental results in legal QA

Abstract

This paper describes the NOWJ1 Team's approach for the Automated Legal Question Answering Competition (ALQAC) 2023, which focuses on enhancing legal task performance by integrating classical statistical models and Pre-trained Language Models (PLMs). For the document retrieval task, we implement a pre-processing step to overcome input limitations and apply learning-to-rank methods to consolidate features from various models. The question-answering task is split into two sub-tasks: sentence classification and answer extraction. We incorporate state-of-the-art models to develop distinct systems for each sub-task, utilizing both classic statistical models and pre-trained Language Models. Experimental results demonstrate the promising potential of our proposed methodology in the competition.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Topic Modeling · Natural Language Processing Techniques