TL;DR
PANORAMA introduces a comprehensive dataset of patent examination records with decision trails, enabling step-by-step evaluation of NLP models' capabilities in understanding patent review processes and rationales.
Contribution
This work provides the first dataset capturing full decision trails in patent examination, facilitating detailed benchmarking of NLP models at each review step.
Findings
LLMs effectively retrieve relevant prior art.
LLMs struggle to assess patent claim novelty.
The dataset enables nuanced evaluation of NLP in patent review.
Abstract
Patent examination remains an ongoing challenge in the NLP literature even after the advent of large language models (LLMs), as it requires an extensive yet nuanced human judgment on whether a submitted claim meets the statutory standards of novelty and non-obviousness against previously granted claims -- prior art -- in expert domains. Previous NLP studies have approached this challenge as a prediction task (e.g., forecasting grant outcomes) with high-level proxies such as similarity metrics or classifiers trained on historical labels. However, this approach often overlooks the step-by-step evaluations that examiners must make with profound information, including rationales for the decisions provided in office actions documents, which also makes it harder to measure the current state of techniques in patent review processes. To fill this gap, we construct PANORAMA, a dataset of 8,143…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
