Function-based Labels for Complementary Recommendation: Definition, Annotation, and LLM-as-a-Judge
Chihiro Yamasaki, Kai Sugahara, Yuma Nagi, Kazushi Okamoto

TL;DR
This paper introduces Function-Based Labels (FBLs), a new human-annotated framework for defining and inferring complementary relationships between items, improving accuracy and consistency over traditional methods and leveraging LLMs as effective judges.
Contribution
It proposes FBLs as a novel, human-annotated definition of complementary relationships, and demonstrates their effectiveness for machine learning inference and LLM-based annotation.
Findings
ML models achieve macro-F1 scores around 0.82 using FBLs.
LLMs like gpt-4o-mini show high consistency (0.989) and accuracy (0.849) in labeling.
FBLs enable more accurate and automated complementary recommendation labeling.
Abstract
Complementary recommendations enhance the user experience by suggesting items that are frequently purchased together while serving different functions from the query item. Inferring or evaluating whether two items have a complementary relationship requires complementary relationship labels; however, defining these labels is challenging because of the inherent ambiguity of such relationships. Complementary labels based on user historical behavior logs attempt to capture these relationships, but often produce inconsistent and unreliable results. Recent efforts have introduced large language models (LLMs) to infer these relationships. However, these approaches provide a binary classification without a nuanced understanding of complementary relationships. In this study, we address these challenges by introducing Function-Based Labels (FBLs), a novel definition of complementary relationships…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
