Towards Full-Fledged Argument Search: A Framework for Extracting and Clustering Arguments from Unstructured Text
Michael F\"arber, Anna Steyer

TL;DR
This paper presents a comprehensive framework for argument search that integrates argument-query matching, multi-sentence argument identification, and topic-aware clustering, improving retrieval and organization of arguments from unstructured texts.
Contribution
It introduces a novel combination of keyword search with topic clustering, a sentence-level sequence-labeling method for argument identification, and an argument clustering approach for better argument organization.
Findings
Density-based clustering algorithms are effective for argument-query matching.
The BiLSTM-based sequence-labeling approach achieves a macro F1 score of 0.71.
Fine-grained argument clustering by subtopics remains challenging but promising.
Abstract
Argument search aims at identifying arguments in natural language texts. In the past, this task has been addressed by a combination of keyword search and argument identification on the sentence- or document-level. However, existing frameworks often address only specific components of argument search and do not address the following aspects: (1) argument-query matching: identifying arguments that frame the topic slightly differently than the actual search query; (2) argument identification: identifying arguments that consist of multiple sentences; (3) argument clustering: selecting retrieved arguments by topical aspects. In this paper, we propose a framework for addressing these shortcomings. We suggest (1) to combine the keyword search with precomputed topic clusters for argument-query matching, (2) to apply a novel approach based on sentence-level sequence-labeling for argument…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research
