Using Interventions to Improve Out-of-Distribution Generalization of Text-Matching Recommendation Systems
Parikshit Bansal, Yashoteja Prabhu, Emre Kiciman, Amit Sharma

TL;DR
This paper investigates the limitations of fine-tuning language models for text-matching recommendation systems in out-of-distribution scenarios and proposes an intervention-based regularizer to improve their generalization capabilities.
Contribution
It introduces an intervention-based regularizer that constrains token importance, enhancing OOD generalization in text-matching recommendation systems beyond standard fine-tuning methods.
Findings
Regularizer improves OOD and in-distribution accuracy
Fine-tuning can reduce model robustness to data shifts
Intervention-based approach mitigates spurious correlations
Abstract
Given a user's input text, text-matching recommender systems output relevant items by comparing the input text to available items' description, such as product-to-product recommendation on e-commerce platforms. As users' interests and item inventory are expected to change, it is important for a text-matching system to generalize to data shifts, a task known as out-of-distribution (OOD) generalization. However, we find that the popular approach of fine-tuning a large, base language model on paired item relevance data (e.g., user clicks) can be counter-productive for OOD generalization. For a product recommendation task, fine-tuning obtains worse accuracy than the base model when recommending items in a new category or for a future time period. To explain this generalization failure, we consider an intervention-based importance metric, which shows that a fine-tuned model captures spurious…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Advanced Graph Neural Networks
MethodsBalanced Selection
