CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes

Raeid Saqur; Ameet Deshpande

arXiv:2009.09154 · cs.CL · October 5, 2020

TL;DR

The paper introduces a graph parser library for the CLEVR dataset that extracts object attributes and relationships, enabling geometric learning and improving tasks like language grounding and interpretability in vision-language models.

Contribution

It provides an extensible, easy-to-integrate graph parser library for CLEVR that facilitates structural representations for geometric learning and downstream applications.

Findings

01. Enables structural graph representations for CLEVR scenes
02. Supports seamless integration with GNN libraries
03. Accelerates research in language-grounded visual reasoning

Abstract

The CLEVR dataset has been used extensively for language-grounded visual reasoning in the Machine Learning (ML) and Natural Language Processing (NLP) domains. We present a graph parser library for CLEVR that provides functionality for extracting object-centric attributes and relationships, and for constructing structural graph representations for both modalities. Structural, order-invariant representations enable geometric learning and can aid downstream tasks such as grounding language to vision, robotics, compositionality, interpretability, and computational grammar construction. We provide three extensible main components (parser, embedder, and visualizer) that can be tailored to suit specific learning setups. We also provide out-of-the-box functionality for seamless integration with popular deep graph neural network (GNN) libraries. Additionally, we discuss downstream usage and…
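
To make the idea of a structural graph representation concrete, the following is a minimal sketch of turning one CLEVR-style scene description into nodes and labeled edges. It does not use the clevr-parser API; the function name `scene_to_graph` and the toy scene are hypothetical. It assumes the scene layout of the public CLEVR scene files (an `objects` list with `size`, `color`, `material`, and `shape` fields, and a `relationships` mapping from relation name to one index list per object), and it assumes that `relationships[r][i]` lists the objects standing in relation `r` to object `i`.

```python
# Illustrative sketch only: this is NOT the clevr-parser API.
# Field names and relation semantics are assumptions about the CLEVR scene format.

def scene_to_graph(scene):
    """Return (nodes, edge_index, edge_labels) for one scene dictionary.

    nodes       -- one attribute tuple per object
    edge_index  -- [sources, targets], two parallel lists of object indices
    edge_labels -- relation name for each edge, parallel to edge_index
    """
    attrs = ("size", "color", "material", "shape")
    nodes = [tuple(obj[a] for a in attrs) for obj in scene["objects"]]

    sources, targets, labels = [], [], []
    # Iterate relations in sorted order so the output is deterministic.
    for relation in sorted(scene["relationships"]):
        per_object = scene["relationships"][relation]
        for i, related in enumerate(per_object):
            for j in related:
                # Assumed meaning: object j is `relation` of object i.
                sources.append(i)
                targets.append(j)
                labels.append(relation)
    return nodes, [sources, targets], labels


# Hypothetical two-object scene: a red cube (index 0) to the left of a blue sphere (index 1).
toy_scene = {
    "objects": [
        {"size": "large", "color": "red", "material": "rubber", "shape": "cube"},
        {"size": "small", "color": "blue", "material": "metal", "shape": "sphere"},
    ],
    "relationships": {
        "left": [[], [0]],
        "right": [[1], []],
        "front": [[], []],
        "behind": [[], []],
    },
}

nodes, edge_index, edge_labels = scene_to_graph(toy_scene)
print(nodes)
print(edge_index, edge_labels)
```

The two-row source/target layout mirrors the coordinate-style edge list that several GNN libraries accept, so a structure like this could be wrapped in a tensor with little extra work. Attribute tuples would still need to be embedded as numeric features, which is presumably the role of the embedder component the abstract mentions.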

Code & Models

Repositories

raeidsaqur/clevr-parser (Official)

Taxonomy

Methods: Graph Neural Network