The Data Science of Hollywood: Using Emotional Arcs of Movies to Drive Business Model Innovation in Entertainment Industries
Marco Del Vecchio, Alexander Kharlamov, Glenn Parry, Ganna Pogrebna

TL;DR
This paper uses natural language processing to analyze emotional arcs in movies, revealing how specific emotional trajectories correlate with commercial success and audience engagement, informing business strategies in entertainment.
Contribution
It introduces a novel methodology to map and cluster emotional arcs in movies, linking these patterns to success metrics and offering insights for content creation and business innovation.
Findings
Man in a Hole shape correlates with highest box office revenue.
Emotional arcs influence viewer engagement more than genre or budget.
Successful movies often generate extensive discussions regardless of their ratings.
Abstract
Much of business literature addresses the issues of consumer-centric design: how can businesses design customized services and products which accurately reflect consumer preferences? This paper uses data science natural language processing methodology to explore whether and to what extent emotions shape consumer preferences for media and entertainment content. Using a unique filtered dataset of 6,174 movie scripts, we generate a mapping of screen content to capture the emotional trajectory of each motion picture. We then combine the obtained mappings into clusters which represent groupings of consumer emotional journeys. These clusters are used to predict overall success parameters of the movies including box office revenues, viewer satisfaction levels (captured by IMDb ratings), awards, as well as the number of viewers' and critics' reviews. We find that like books all movie stories…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
