PATopics: An automatic framework to extract useful information from pharmaceutical patents documents
Pablo Cecilio, Ant\^onio Perreira, Juliana Santos Rosa Viegas,, Washington Cunha, Felipe Viegas, Elisa Tuler, Fabiana Testa Moura de Carvalho, Vicentini, Leonardo Rocha

TL;DR
PATopics is an automated framework that extracts, summarizes, and groups pharmaceutical patent information, significantly reducing manual effort and aiding various stakeholders in understanding complex patent documents.
Contribution
It introduces a novel automated system for extracting and summarizing pharmaceutical patent data, enhancing patent management and analysis capabilities.
Findings
Effective in analyzing 4,832 patents across 809 molecules
Facilitates patent grouping based on similarity
Proves useful for researchers, chemists, and companies
Abstract
Pharmaceutical patents play an important role by protecting the innovation from copies but also drive researchers to innovate, create new products, and promote disruptive innovations focusing on collective health. The study of patent management usually refers to an exhaustive manual search. This happens, because patent documents are complex with a lot of details regarding the claims and methodology/results explanation of the invention. To mitigate the manual search, we proposed PATopics, a framework specially designed to extract relevant information for Pharmaceutical patents. PATopics is composed of four building blocks that extract textual information from the patents, build relevant topics that are capable of summarizing the patents, correlate these topics with useful patent characteristics and then, summarize the information in a friendly web interface to final users. The general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies
