Play and Learn: Using Video Games to Train Computer Vision Models

Alireza Shafaei; James J. Little; Mark Schmidt

arXiv:1608.01745·cs.CV·August 16, 2016·40 cites

Play and Learn: Using Video Games to Train Computer Vision Models

Alireza Shafaei, James J. Little, Mark Schmidt

PDF

Open Access

TL;DR

This paper investigates whether synthetic RGB images from video games can effectively train computer vision models for tasks like segmentation and depth estimation, showing promising results comparable to real-world data.

Contribution

It demonstrates that synthetic data from video games can be used to train deep neural networks effectively, with simple domain adaptation improving real-world performance.

Findings

01

Synthetic images can improve model performance on real data.

02

Models trained on synthetic data achieve similar accuracy to those trained on real data.

03

Simple domain adaptation enhances the effectiveness of synthetic data.

Abstract

Video games are a compelling source of annotated data as they can readily provide fine-grained groundtruth for diverse tasks. However, it is not clear whether the synthetically generated data has enough resemblance to the real-world images to improve the performance of computer vision models in practice. We present experiments assessing the effectiveness on real-world data of systems trained on synthetic RGB images that are extracted from a video game. We collected over 60000 synthetic samples from a modern video game with similar conditions to the real-world CamVid and Cityscapes datasets. We provide several experiments to demonstrate that the synthetically generated RGB images can be used to improve the performance of deep neural networks on both image segmentation and depth estimation. These results show that a convolutional network trained on synthetic data achieves a similar test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning