From Symbols to Embeddings: A Tale of Two Representations in Computational Social Science
Huimin Chen, Cheng Yang, Xuanming Zhang, Zhiyuan Liu, Maosong Sun,, Jianbin Jin

TL;DR
This paper reviews data representations in Computational Social Science, comparing symbol-based and embedding-based schemes, highlighting the rising prominence of embeddings, and discussing future challenges in effectively representing social data.
Contribution
It provides a comprehensive review of data representations in CSS, categorizing methods into symbol-based and embedding-based, and analyzes trends and challenges in their application.
Findings
Embedding-based representations are increasingly popular over the last decade.
Symbol-based representations are still used but are being complemented or replaced by embeddings.
The review covers over 400 research articles from top CSS venues.
Abstract
Computational Social Science (CSS), aiming at utilizing computational methods to address social science problems, is a recent emerging and fast-developing field. The study of CSS is data-driven and significantly benefits from the availability of online user-generated contents and social networks, which contain rich text and network data for investigation. However, these large-scale and multi-modal data also present researchers with a great challenge: how to represent data effectively to mine the meanings we want in CSS? To explore the answer, we give a thorough review of data representations in CSS for both text and network. Specifically, we summarize existing representations into two schemes, namely symbol-based and embedding-based representations, and introduce a series of typical methods for each scheme. Afterwards, we present the applications of the above representations based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Opinion Dynamics and Social Influence · Topic Modeling
