Characterizing Datasets for Social Visual Question Answering, and the   New TinySocial Dataset

Zhanwen Chen; Shiyao Li; Roxanne Rashedi; Xiaoman Zi; Morgan; Elrod-Erickson; Bryan Hollis; Angela Maliakal; Xinyu Shen; Simeng Zhao,; Maithilee Kunda

arXiv:2010.11997·cs.HC·October 26, 2020

Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset

Zhanwen Chen, Shiyao Li, Roxanne Rashedi, Xiaoman Zi, Morgan, Elrod-Erickson, Bryan Hollis, Angela Maliakal, Xinyu Shen, Simeng Zhao,, Maithilee Kunda

PDF

Open Access

TL;DR

This paper introduces methods for creating and characterizing social VQA datasets, presents two new datasets (TinySocial-Crowd and TinySocial-InHouse), and discusses their potential to improve AI explainability and social reasoning assessments.

Contribution

It proposes novel rubrics for dataset characterization and introduces two new social VQA datasets, advancing research in social reasoning for AI and humans.

Findings

01

Comparison of crowdsourced and in-house dataset creation methods

02

Development of rubrics for video and question content characterization

03

Introduction of TinySocial datasets for social VQA research

Abstract

Modern social intelligence includes the ability to watch videos and answer questions about social and theory-of-mind-related content, e.g., for a scene in Harry Potter, "Is the father really upset about the boys flying the car?" Social visual question answering (social VQA) is emerging as a valuable methodology for studying social reasoning in both humans (e.g., children with autism) and AI agents. However, this problem space spans enormous variations in both videos and questions. We discuss methods for creating and characterizing social VQA datasets, including 1) crowdsourcing versus in-house authoring, including sample comparisons of two new datasets that we created (TinySocial-Crowd and TinySocial-InHouse) and the previously existing Social-IQ dataset; 2) a new rubric for characterizing the difficulty and content of a given video; and 3) a new rubric for characterizing question…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition