On the Scaling of Robustness and Effectiveness in Dense Retrieval
Yu-An Liu, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng

TL;DR
This paper investigates how robustness and effectiveness in dense retrieval models scale with resources, revealing distinct patterns and proposing Pareto-efficient optimization strategies to balance both aspects without extra costs.
Contribution
It demonstrates that robustness follows scaling laws similar to effectiveness and introduces Pareto training to optimize joint robustness and effectiveness efficiently.
Findings
Robustness follows a scaling law similar to effectiveness.
Robustness and effectiveness have different scaling patterns, leading to resource trade-offs.
Pareto training achieves balanced robustness and effectiveness without additional resources.
Abstract
Robustness and Effectiveness are critical aspects of developing dense retrieval models for real-world applications. It is known that there is a trade-off between the two. Recent work has addressed scaling laws of effectiveness in dense retrieval, revealing a power-law relationship between effectiveness and the size of models and data. Does robustness follow scaling laws too? If so, can scaling improve both robustness and effectiveness together, or do they remain locked in a trade-off? To answer these questions, we conduct a comprehensive experimental study. We find that:(i) Robustness, including out-of-distribution and adversarial robustness, also follows a scaling law.(ii) Robustness and effectiveness exhibit different scaling patterns, leading to significant resource costs when jointly improving both. Given these findings, we shift to the third factor that affects model performance,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
