Play and Learn: Using Video Games to Train Computer Vision Models
Alireza Shafaei, James J. Little, Mark Schmidt

TL;DR
This paper investigates whether synthetic RGB images from video games can effectively train computer vision models for tasks like segmentation and depth estimation, showing promising results comparable to real-world data.
Contribution
It demonstrates that synthetic data from video games can be used to train deep neural networks effectively, with simple domain adaptation improving real-world performance.
Findings
Synthetic images can improve model performance on real data.
Models trained on synthetic data achieve similar accuracy to those trained on real data.
Simple domain adaptation enhances the effectiveness of synthetic data.
Abstract
Video games are a compelling source of annotated data as they can readily provide fine-grained groundtruth for diverse tasks. However, it is not clear whether the synthetically generated data has enough resemblance to the real-world images to improve the performance of computer vision models in practice. We present experiments assessing the effectiveness on real-world data of systems trained on synthetic RGB images that are extracted from a video game. We collected over 60000 synthetic samples from a modern video game with similar conditions to the real-world CamVid and Cityscapes datasets. We provide several experiments to demonstrate that the synthetically generated RGB images can be used to improve the performance of deep neural networks on both image segmentation and depth estimation. These results show that a convolutional network trained on synthetic data achieves a similar test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
