Quality analysis and evaluation prediction of RAG retrieval based on machine learning algorithms
Ruoxin Zhang, Zhizhao Wen, Chao Wang, Chenchen Tang, Puyang Xu, Yifan Jiang

TL;DR
This paper proposes a machine learning-based approach to evaluate and predict the quality of retrieval modules in RAG systems, aiming to enhance retrieval relevance and overall generation accuracy.
Contribution
It introduces an XGBoost regression model optimized with particle swarm optimization for assessing retrieval quality, addressing limitations in processing tabular features.
Findings
Document relevance significantly impacts answer quality (correlation 0.66).
Tradeoff identified between semantic similarity, redundancy, and diversity.
VMD PSO BiLSTM model outperforms other models with lower error metrics.
Abstract
With the rapid evolution of large language models, retrieval enhanced generation technology has been widely used due to its ability to integrate external knowledge to improve output accuracy. However, the performance of the system is highly dependent on the quality of the retrieval module. If the retrieval results have low relevance to user needs or contain noisy information, it will directly lead to distortion of the generated content. In response to the performance bottleneck of existing models in processing tabular features, this paper proposes an XGBoost machine learning regression model based on feature engineering and particle swarm optimization. Correlation analysis shows that answer_quality is positively correlated with doc_delevance by 0.66, indicating that document relevance has a significant positive effect on answer quality, and improving document relevance may enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Expert finding and Q&A systems · Advanced Technologies in Various Fields
