Retrieval-Augmented Generation Systems for Intellectual Property via Synthetic Multi-Angle Fine-tuning
Runtao Ren, Jian Ma, Jianxi Luo

TL;DR
This paper introduces MQG-RFM, a lightweight, retrieval-augmented generation framework that improves patent-related question-answering accuracy by simulating diverse user queries and fine-tuning retrieval models, demonstrating significant empirical gains.
Contribution
The paper presents a novel, scalable method combining prompt-engineered query generation and hard negative mining to enhance retrieval robustness without complex architecture changes.
Findings
185.62% improvement in retrieval accuracy on Patent Consultation dataset
262.26% improvement on Novel Patent Technology Report dataset
14.22% and 53.58% improvements in generation quality
Abstract
Retrieval-Augmented Generation (RAG) systems in the Intellectual Property (IP) field often struggle with diverse user queries, including colloquial expressions, spelling errors, and ambiguous terminology, leading to inaccurate retrieval and suboptimal responses. To address this challenge, we propose Multi-Angle Question Generation and Retrieval Fine-Tuning Method (MQG-RFM), a novel framework that leverages large language models (LLMs) to simulate varied user inquiries and fine-tunes retrieval models to align semantically equivalent but linguistically diverse questions. Unlike complex architectural modifications, MQG-RFM adopts a lightweight Data-to-Tune paradigm, combining prompt-engineered query generation with hard negative mining to enhance retrieval robustness without costly infrastructure changes. Experimental results on a Taiwan patent Q&A dataset show 185.62% improvement in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvancements in Photolithography Techniques
