Learning from Labeled Features for Document Filtering
Lanbo Zhang, Yi Zhang, Qianli Xing

TL;DR
This paper introduces a new method for document filtering that leverages user prior knowledge about important features, improving filtering accuracy by incorporating feedback on faceted features in semi-structured documents.
Contribution
The paper proposes a novel user profile learning algorithm that effectively integrates user feedback on features, enhancing document filtering performance over existing methods.
Findings
User feedback on faceted features improves filtering accuracy.
The proposed algorithm outperforms several existing methods.
User studies confirm the usefulness of feature-based feedback.
Abstract
Existing document filtering systems learn user profiles based on user relevance feedback on documents. In some cases, users may have prior knowledge about what features are important. For example, a Spanish speaker may only want news written in Spanish, and thus a relevant document should contain the feature "Language: Spanish"; a researcher focusing on HIV knows an article with the medical subject "Subject: AIDS" is very likely to be relevant to him/her. Semi-structured documents with rich metadata are increasingly prevalent on the Internet. Motivated by the well-adopted faceted search interface in e-commerce, we study the exploitation of user prior knowledge on faceted features for semi-structured document filtering. We envision two faceted feedback mechanisms, and propose a novel user profile learning algorithm that can incorporate user feedback on features. To evaluate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Text and Document Classification Technologies · Web Data Mining and Analysis
