Towards Creating a Standardized Collection of Simple and Targeted Experiments to Analyze Core Aspects of the Recommender Systems Problem
Andrea Barraza-Urbina

TL;DR
This paper advocates for a standardized set of simple, targeted experiments in Recommender Systems to better analyze algorithm strengths and weaknesses across different scenarios, complementing traditional complex evaluations.
Contribution
It proposes creating a collection of simple, targeted experiments for RS, enabling detailed performance analysis and better understanding of algorithm capabilities.
Findings
Standardized experiments can reveal specific strengths and weaknesses.
Targeted tests improve understanding of algorithm performance across scenarios.
Proposed framework for defining and sharing simple experiments.
Abstract
Imagine you are a teacher attempting to assess a student's level in a particular subject. If you design a test with only hard questions, and the student fails, this mostly proves that the student does not understand the more advanced material. A more insightful exam would include different types of questions varying in difficulty to truly understand the student's weaknesses and strengths from different perspectives. In the field of Recommender Systems (RS), more often than not, we design evaluations to measure an algorithm's ability to optimize goals in complex scenarios, representative of the real-world challenges the system would most probably face. Nevertheless, this paper posits that testing an algorithm's ability to address both simple and complex tasks/problems would offer a more detailed view of performance to help identify, at a more granular level, the weaknesses and strengths…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Machine Learning and Data Classification
MethodsTest
