# Memetic search for overlapping topics based on a local evaluation of   link communities

**Authors:** Frank Havemann, Jochen Gl\"aser, and Michael Heinz

arXiv: 1702.02808 · 2017-04-26

## TL;DR

This paper introduces a memetic algorithm for detecting overlapping thematic structures in citation networks by clustering links using a local evaluation method, addressing the challenge of pervasive overlaps in scientific literature.

## Contribution

It proposes a novel link clustering approach with a memetic algorithm to identify overlapping communities based on local minima in a cost landscape, validated on large citation networks.

## Key findings

- Effective detection of overlapping topics in large citation networks
- Demonstrated the method on networks with up to 100,000 papers
- Identified communities with Pervasive overlaps in scientific literature

## Abstract

In spite of recent advances in field delineation methods, bibliometricians still don't know the extent to which their topic detection algorithms reconstruct `ground truths', i.e. thematic structures in the scientific literature. In this paper, we demonstrate a new approach to the delineation of thematic structures that attempts to match the algorithm to theoretically derived and empirically observed properties all thematic structures have in common. We cluster citation links rather than publication nodes, use predominantly local information and search for communities of links starting from seed subgraphs in order to allow for pervasive overlaps of topics. We evaluate sets of links with a new cost function and assume that local minima in the cost landscape correspond to link communities. Because this cost landscape has many local minima we define a valid community as the community with the lowest minimum within a certain range. Since finding all valid communities is impossible for large networks, we designed a memetic algorithm that combines probabilistic evolutionary strategies with deterministic local searches. We apply our approach to a network of about 15,000 Astronomy & Astrophysics papers published 2010 and their cited sources, and to a network of about 100,000 Astronomy & Astrophysics papers (published 2003--2010) which are linked through direct citations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1702.02808/full.md

## Figures

20 figures with captions in the complete paper: https://tomesphere.com/paper/1702.02808/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/1702.02808/full.md

---
Source: https://tomesphere.com/paper/1702.02808