TL;DR
The paper introduces the YouTube AV 50K dataset, comprising over 50,000 comments on autonomous vehicle videos, to facilitate opinion mining and sentiment analysis related to self-driving cars.
Contribution
It provides a new, publicly available dataset of YouTube comments on autonomous vehicles, including creation details, data format, and potential applications, along with a case study on public reaction to a self-driving car fatality.
Findings
The dataset enables analysis of public attitudes toward autonomous vehicles.
It helps understand reactions to specific incidents like the self-driving car fatality.
The dataset supports future research in opinion mining and sentiment analysis.
Abstract
With one billion monthly viewers, and millions of users discussing and sharing opinions, comments below YouTube videos are rich sources of data for opinion mining and sentiment analysis. We introduce the YouTube AV 50K dataset, a freely-available collections of more than 50,000 YouTube comments and metadata below autonomous vehicle (AV)-related videos. We describe its creation process, its content and data format, and discuss its possible usages. Especially, we do a case study of the first self-driving car fatality to evaluate the dataset, and show how we can use this dataset to better understand public attitudes toward self-driving cars and public reactions to the accident. Future developments of the dataset are also discussed.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
