News-Based Group Modeling and Forecasting
Wenbin Zhang, Steven Skiena

TL;DR
This paper analyzes news group dynamics using NLP-derived data, revealing fame distribution patterns and developing models to forecast future entity fame and popularity within various news groups.
Contribution
It introduces a large-scale NLP-based framework for modeling and forecasting news group fame distributions, including a practical HMM-based news generation model.
Findings
Fame distributions follow log-normal and power-law patterns.
Future fame distribution exhibits a power-law tail.
Forecasting methods can predict future entity fame and probability of becoming very famous.
Abstract
In this paper, we study news group modeling and forecasting methods using quantitative data generated by our large-scale natural language processing (NLP) text analysis system. A news group is a set of news entities, like top U.S. cities, governors, senators, golfers, or movie actors. Our fame distribution analysis of news groups shows that log-normal and power-law distributions generally could describe news groups in many aspects. We use several real news groups including cities, politicians, and CS professors, to evaluate our news group models in terms of time series data distribution analysis, group-fame probability analysis, and fame-changing analysis over long time. We also build a practical news generation model using a HMM (Hidden Markov Model) based approach. Most importantly, our analysis shows the future entity fame distribution has a power-law tail. That is, only a small…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Text Analysis Techniques · Opinion Dynamics and Social Influence
