Improving Patent Mining and Relevance Classification using Transformers

Th\'eo Ding; Walter Vermeiren; Sylvie Ranwez; Binbin Xu

arXiv:2105.03979·cs.CL·July 5, 2021

Improving Patent Mining and Relevance Classification using Transformers

Th\'eo Ding, Walter Vermeiren, Sylvie Ranwez, Binbin Xu

PDF

Open Access

TL;DR

This paper demonstrates how fine-tuning pre-trained transformer models can effectively improve patent relevance classification, reducing workload while maintaining high accuracy in patent analysis.

Contribution

It introduces a novel approach combining state-of-the-art NLP techniques to enhance patent filtering and classification efficiency.

Findings

01

Improved classification accuracy with transformer models

02

Reduced manual review workload

03

Maintained high recall and precision metrics

Abstract

Patent analysis and mining are time-consuming and costly processes for companies, but nevertheless essential if they are willing to remain competitive. To face the overload induced by numerous patents, the idea is to automatically filter them, bringing only few to read to experts. This paper reports a successful application of fine-tuning and retraining on pre-trained deep Natural Language Processing models on patent classification. The solution that we propose combines several state-of-the-art treatments to achieve our goal - decrease the workload while preserving recall and precision metrics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntellectual Property and Patents · Machine Learning in Materials Science · Topic Modeling