# Large-Scale Joint Topic, Sentiment & User Preference Analysis for Online   Reviews

**Authors:** Xinli Yu, Zheng Chen, Wei-Shih Yang, Xiaohua Hu, Erjia Yan

arXiv: 1901.04993 · 2019-01-17

## TL;DR

This paper introduces scalable variational inference methods for a joint topic, sentiment, and user preference review model, enabling analysis of millions of reviews with improved speed and accuracy.

## Contribution

It reconstructs the TSPRA model using variational inference, develops scalable algorithms, and introduces an online version to monitor sentiment and preference dynamics over time.

## Key findings

- svTSPRA converges faster than TSPRA.
- The algorithms process millions of reviews efficiently.
- User preferences are shown to be independent of sentiment.

## Abstract

This paper presents a non-trivial reconstruction of a previous joint topic-sentiment-preference review model TSPRA with stick-breaking representation under the framework of variational inference (VI) and stochastic variational inference (SVI). TSPRA is a Gibbs Sampling based model that solves topics, word sentiments and user preferences altogether and has been shown to achieve good performance, but for large data set it can only learn from a relatively small sample. We develop the variational models vTSPRA and svTSPRA to improve the time use, and our new approach is capable of processing millions of reviews. We rebuild the generative process, improve the rating regression, solve and present the coordinate-ascent updates of variational parameters, and show the time complexity of each iteration is theoretically linear to the corpus size, and the experiments on Amazon data sets show it converges faster than TSPRA and attains better results given the same amount of time. In addition, we tune svTSPRA into an online algorithm ovTSPRA that can monitor oscillations of sentiment and preference overtime. Some interesting fluctuations are captured and possible explanations are provided. The results give strong visual evidence that user preference is better treated as an independent factor from sentiment.

---
Source: https://tomesphere.com/paper/1901.04993