TL;DR
PatRe is a benchmark that models the full patent examination process, including Office Action generation and applicant rebuttal, and treats examination as an interactive, multi-turn process.
Contribution
This work presents the first benchmark capturing the entire patent examination lifecycle. It comprises 480 real-world cases and supports both oracle and retrieval-simulated evaluation settings.
Findings
Performance varies between proprietary and open-source LLMs.
Model performance differs significantly between examiner analysis and applicant rebuttal.
Current models have limitations in complex legal reasoning and technical novelty judgment.
Abstract
Patent examination is a complex, multi-stage process requiring both technical expertise and legal reasoning, increasingly challenged by rising application volumes. Prior benchmarks predominantly view patent examination as discriminative classification or static extraction, failing to capture its inherently interactive and iterative nature, similar to the peer review and rebuttal process in academic publishing. In this paper, we introduce PatRe, the first benchmark that models the full patent examination lifecycle, including Office Action generation and applicant rebuttal. PatRe comprises 480 real-world cases and supports both oracle and retrieval-simulated evaluation settings. Our benchmark reframes patent examination as a dynamic, multi-turn process of justification and response. Extensive experiments across various LLMs reveal critical insights into model performance, including…
