Data augmentation on graphs for table type classification

Davide del Bimbo; Andrea Gemelli; Simone Marinai

arXiv:2208.11210·cs.CV·February 21, 2023·1 cites

Data augmentation on graphs for table type classification

Davide del Bimbo, Andrea Gemelli, Simone Marinai

PDF

Open Access 1 Repo

TL;DR

This paper introduces a graph neural network approach for classifying table types in scientific documents, utilizing data augmentation techniques to improve performance on limited annotated datasets.

Contribution

It presents a novel graph-based method for table classification and proposes data augmentation techniques tailored for graph-structured table data.

Findings

01

Promising preliminary results with the proposed method.

02

Effective data augmentation improves classification performance.

03

Utilizes table structure for message passing in GNNs.

Abstract

Tables are widely used in documents because of their compact and structured representation of information. In particular, in scientific papers, tables can sum up novel discoveries and summarize experimental results, making the research comparable and easily understandable by scholars. Since the layout of tables is highly variable, it would be useful to interpret their content and classify them into categories. This could be helpful to directly extract information from scientific papers, for instance comparing performance of some models given their paper result tables. In this work, we address the classification of tables using a Graph Neural Network, exploiting the table structure for the message passing algorithm in use. We evaluate our model on a subset of the Tab2Know dataset. Since it contains few examples manually annotated, we propose data augmentation techniques directly on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ailab-unifi/da-graphtab
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Text and Document Classification Technologies · Machine Learning and Data Classification

MethodsGraph Neural Network