A Study of Factuality, Objectivity and Relevance: Three Desiderata in Large-Scale Information Retrieval?
Christina Lioma, Birger Larsen, Wei Lu, Yong Huang

TL;DR
This paper investigates how factuality and objectivity detection, enabled by NLP, can enhance large-scale information retrieval, showing that factuality significantly improves retrieval precision especially for uncurated web data.
Contribution
It introduces the first study of factuality and objectivity in IR, demonstrating their impact on retrieval effectiveness and analyzing their relation to document relevance.
Findings
Factuality correlates positively with document relevance.
Factuality improves retrieval precision by over 10%.
Objectivity has mixed effects on retrieval performance.
Abstract
Much of the information processed by Information Retrieval (IR) systems is unreliable, biased, and generally untrustworthy [1], [2], [3]. Yet, factuality & objectivity detection is not a standard component of IR systems, even though it has been possible in Natural Language Processing (NLP) in the last decade. Motivated by this, we ask if and how factuality & objectivity detection may benefit IR. We answer this in two parts. First, we use state-of-the-art NLP to compute the probability of document factuality & objectivity in two TREC collections, and analyse its relation to document relevance. We find that factuality is strongly and positively correlated to document relevance, but objectivity is not. Second, we study the impact of factuality & objectivity to retrieval effectiveness by treating them as query independent features that we combine with a competitive language modelling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Text and Document Classification Technologies
