# Contributive Social Capital Extraction From Different Types of Online   Data Sources

**Authors:** Sebastian Schams, Georg Groh

arXiv: 1902.07636 · 2019-02-21

## TL;DR

This paper explores methods to infer contributive social capital from various online data sources, focusing on properties like expertise and trustworthiness, and compares different algorithms and data types for effective extraction.

## Contribution

It introduces a comprehensive analysis of how to extract contributive social capital from diverse online platforms using various features and machine learning techniques.

## Key findings

- Algorithms based on individual features like followers and centrality measures are effective.
- Different data sources require tailored algorithms for accurate social capital inference.
- The study provides a framework for evaluating and comparing social capital extraction methods.

## Abstract

It is a recurring problem of online communication that the properties of unknown people are hard to assess. This may lead to various issues such as the spread of `fake news' from untrustworthy sources. In sociology the sum of (social) resources available to a person through their social network is often described as social capital. In this article, we look at social capital from a different angle. Instead of evaluating the advantage that people have because of their membership in a certain group, we investigate various ways to infer the social capital a person adds or may add to the network, their contributive social capital (CSC). As there is no consensus in the literature on what the social capital of a person exactly consists of, we look at various related properties: expertise, reputation, trustworthiness, and influence. The analysis of these features is investigated for five different sources of online data: microblogging (e.g., Twitter), social networking platforms (e.g., Facebook), direct communication (e.g., email), scientometrics, and threaded discussion boards (e.g., Reddit). In each field we discuss recent publications and put a focus on the data sources used, the algorithms implemented, and the performance evaluation. The findings are compared and set in context to contributive social capital extraction. The analysis algorithms are based on individual features (e.g., followers on Twitter), ratios thereof, or a person's centrality measures (e.g., PageRank). The machine learning approaches, such as straightforward classifiers (e.g., support vector machines) use ground truths that are connected to social capital. The discussion of these methods is intended to facilitate research on the topic by identifying relevant data sources and the best suited algorithms, and by providing tested methods for the evaluation of findings.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.07636/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/1902.07636/full.md

---
Source: https://tomesphere.com/paper/1902.07636