PatentBERT: Patent Classification with Fine-Tuning a pre-trained BERT Model
Jieh-Sheng Lee, Jieh Hsiang

TL;DR
This paper introduces PatentBERT, a fine-tuned pre-trained BERT model that achieves state-of-the-art performance in patent classification, especially using patent claims alone, on a large USPTO dataset.
Contribution
It presents a novel BERT-based fine-tuning approach for patent classification, a large patent dataset, and demonstrates claims-only sufficiency for classification.
Findings
BERT-based model outperforms CNN with word embeddings
Patent claims alone are sufficient for accurate classification
Provides a large USPTO-3M dataset for future research
Abstract
In this work we focus on fine-tuning a pre-trained BERT model and applying it to patent classification. When applied to large datasets of over two millions patents, our approach outperforms the state of the art by an approach using CNN with word embeddings. In addition, we focus on patent claims without other parts in patent documents. Our contributions include: (1) a new state-of-the-art method based on pre-trained BERT model and fine-tuning for patent classification, (2) a large dataset USPTO-3M at the CPC subclass level with SQL statements that can be used by future researchers, (3) showing that patent claims alone are sufficient for classification task, in contrast to conventional wisdom.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Intellectual Property and Patents · Topic Modeling
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
