Science Concierge: A fast content-based recommendation system for   scientific publications

Titipat Achakulvisut; Daniel E. Acuna; Tulakan Ruangrong; Konrad; Kording

arXiv:1604.01070·cs.IR·September 28, 2016

Science Concierge: A fast content-based recommendation system for scientific publications

Titipat Achakulvisut, Daniel E. Acuna, Tulakan Ruangrong, Konrad, Kording

PDF

2 Repos

TL;DR

This paper introduces a fast, content-based recommendation system for scientific publications, utilizing an algorithm and open-source Python library that outperforms keyword-based suggestions, aiding researchers in navigating large scholarly datasets.

Contribution

The authors developed a novel content-based recommendation algorithm and an open-source Python library tailored for scientific publications, demonstrating improved accuracy over keyword methods.

Findings

01

The system significantly outperforms keyword-based suggestions.

02

It achieves high correlation with human judgments.

03

The library is adaptable and suitable for real-time recommendations.

Abstract

Finding relevant publications is important for scientists who have to cope with exponentially increasing numbers of scholarly material. Algorithms can help with this task as they help for music, movie, and product recommendations. However, we know little about the performance of these algorithms with scholarly material. Here, we develop an algorithm, and an accompanying Python library, that implements a recommendation system based on the content of articles. Design principles are to adapt to new content, provide near-real time suggestions, and be open source. We tested the library on 15K posters from the Society of Neuroscience Conference 2015. Human curated topics are used to cross validate parameters in the algorithm and produce a similarity metric that maximally correlates with human judgments. We show that our algorithm significantly outperformed suggestions based on keywords. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.