Analysing the Public Discourse around OpenAI's Text-To-Video Model 'Sora' using Topic Modeling
Vatsal Vinay Parikh

TL;DR
This study analyzes Reddit comments to uncover public perceptions and dominant themes surrounding OpenAI's new text-to-video model Sora, revealing insights into societal impacts, concerns, and creative uses of the technology.
Contribution
The study applies topic modeling to online discourse to systematically identify key narratives and public sentiment about Sora, providing a framework for analyzing perceptions of emerging AI models.
Findings
Four main discussion topics were identified: AI impact, public opinion, artistic use, and media applications.
Public discourse highlights concerns about ethics, employment, and societal influence.
Creative and media applications of Sora are prominent in online discussions.
Abstract
The recent introduction of OpenAI's text-to-video model Sora has sparked widespread public discourse across online communities. This study aims to uncover the dominant themes and narratives surrounding Sora by conducting topic modeling analysis on a corpus of 1,827 Reddit comments from five relevant subreddits (r/OpenAI, r/technology, r/singularity, r/vfx, and r/ChatGPT). The comments were collected over a two-month period following Sora's announcement in February 2024. After preprocessing the data, Latent Dirichlet Allocation (LDA) was employed to extract four key topics: 1) AI Impact and Trends in Sora Discussions, 2) Public Opinion and Concerns about Sora, 3) Artistic Expression and Video Creation with Sora, and 4) Sora's Applications in Media and Entertainment. Visualizations including word clouds, bar charts, and t-SNE clustering provided insights into the importance of topic…
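The pipeline the abstract describes (preprocess comments, then fit LDA with four topics) can be sketched as follows. This is a minimal illustration, not the author's implementation: the abstract does not say which library or hyperparameters were used, so the sketch fits LDA by collapsed Gibbs sampling in plain Python. The comments in COMMENTS, the STOPWORDS set, the function names, and the alpha and beta values are all assumptions invented for the example.

```python
import random
from collections import Counter

# Hypothetical stand-ins: these comments and stopwords are invented, not taken from the study's corpus.
COMMENTS = [
    "Sora will change how studios make video and film",
    "I worry about jobs for vfx artists and editors",
    "The ethics of fake video worry me",
    "Artists can create short film scenes from a prompt",
    "AI video models keep improving every year",
    "Media and entertainment studios will adopt AI video",
]
STOPWORDS = {"i", "the", "a", "and", "of", "for", "from", "me", "will", "how", "can", "about"}


def tokenize(text, stopwords):
    """Lowercase, strip punctuation, keep alphabetic non-stopword tokens."""
    words = [w.strip(".,!?'\"()").lower() for w in text.split()]
    return [w for w in words if w.isalpha() and w not in stopwords]


def fit_lda(docs, num_topics=4, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Fit LDA by collapsed Gibbs sampling; returns (topic_word, doc_topic, vocab)."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    vocab_size = len(vocab)
    doc_topic = [[0] * num_topics for _ in docs]         # tokens per topic in each document
    topic_word = [Counter() for _ in range(num_topics)]  # word counts per topic
    topic_total = [0] * num_topics                       # total tokens per topic
    assign = []
    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(num_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assign.append(zs)
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assign[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                # Resample its topic from the conditional given all other assignments.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    return topic_word, doc_topic, vocab


def top_words(topic_word, n=5):
    """Return up to n most frequent words for each topic."""
    return [[w for w, c in counts.most_common(n) if c > 0] for counts in topic_word]


docs = [tokenize(c, STOPWORDS) for c in COMMENTS]
topic_word, doc_topic, vocab = fit_lda(docs, num_topics=4, iters=100)
for k, words in enumerate(top_words(topic_word, n=3)):
    print(f"topic {k}: {words}")
```

On a corpus this small the resulting topics are not meaningful; the point is only to show the shape of the computation. With a real corpus of comments, topic labels such as those listed in the abstract would be assigned by a person inspecting each topic's top words.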
Taxonomy
Topics: Diverse Approaches in Healthcare and Education Studies · Computational and Text Analysis Methods · Technology and Data Analysis
