Online Forum Thread Retrieval using Pseudo Cluster Selection and Voting   Techniques

Ameer Tawfik Albaham; Naomie Salim

arXiv:1212.5590·cs.IR·December 27, 2012

Online Forum Thread Retrieval using Pseudo Cluster Selection and Voting Techniques

Ameer Tawfik Albaham, Naomie Salim

PDF

TL;DR

This paper introduces a combined model for online forum thread retrieval that integrates pseudo cluster selection and voting techniques, improving retrieval accuracy by focusing on message scoring and aggregation methods.

Contribution

It presents a novel combination of existing thread retrieval approaches, enhancing effectiveness through joint focus on input scoring and aggregation strategies.

Findings

01

Some combined models outperform baseline methods statistically.

02

The integrated approach improves thread retrieval accuracy.

03

Focus on input and aggregation enhances retrieval performance.

Abstract

Online forums facilitate knowledge seeking and sharing on the Web. However, the shared knowledge is not fully utilized due to information overload. Thread retrieval is one method to overcome information overload. In this paper, we propose a model that combines two existing approaches: the Pseudo Cluster Selection and the Voting Techniques. In both, a retrieval system first scores a list of messages and then ranks threads by aggregating their scored messages. They differ on what and how to aggregate. The pseudo cluster selection focuses on input, while voting techniques focus on the aggregation method. Our combined models focus on the input and the aggregation methods. The result shows that some combined models are statistically superior to baseline methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.