Multi-GPU Distributed Parallel Bayesian Differential Topic Modelling
Aaron Q Li

TL;DR
This paper introduces a multi-GPU distributed framework for Bayesian differential topic modeling, significantly accelerating the sampling process while maintaining accuracy, enabling real-time analysis of complex models on standard hardware.
Contribution
The paper presents the first scalable multi-GPU parallel implementation of differential topic models like SPDP, achieving substantial speed-ups over traditional methods.
Findings
Speed increased by about 50 times on a single GPU
Speed scales sublinearly with multiple GPUs
Accuracy remains nearly unchanged despite acceleration
Abstract
There is an explosion of data, documents, and other content, and people require tools to analyze and interpret these, tools to turn the content into information and knowledge. Topic modeling have been developed to solve these problems. Topic models such as LDA [Blei et. al. 2003] allow salient patterns in data to be extracted automatically. When analyzing texts, these patterns are called topics. Among numerous extensions of LDA, few of them can reliably analyze multiple groups of documents and extract topic similarities. Recently, the introduction of differential topic modeling (SPDP) [Chen et. al. 2012] performs uniformly better than many topic models in a discriminative setting. There is also a need to improve the sampling speed for topic models. While some effort has been made for distributed algorithms, there is no work currently done using graphical processing units (GPU). Note…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Text Analysis Techniques · Topic Modeling
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Linear Discriminant Analysis
