TopicENA: Enabling Epistemic Network Analysis at Scale through Automated Topic-Based Coding
Owen H.T. Lu, Tiffany T.Y. Hsu

TL;DR
TopicENA combines automated topic modeling with epistemic network analysis to enable scalable, interpretable analysis of large text corpora by replacing manual coding with machine-generated topics.
Contribution
This paper introduces TopicENA, integrating BERTopic with ENA to automate concept coding and improve scalability for large-scale text analysis.
Findings
Coarse topics are better for large datasets.
Topic inclusion thresholds should be adjusted based on quality.
TopicENA successfully scales to larger datasets.
Abstract
Epistemic Network Analysis (ENA) is a method for investigating the relational structure of concepts in text by representing co-occurring concepts as networks. Traditional ENA, however, relies heavily on manual expert coding, which limits its scalability and real-world applicability to large text corpora. Topic modeling provides an automated approach to extracting concept-level representations from text and can serve as an alternative to manual coding. To tackle this limitation, the present study merges BERTopic with ENA and introduces TopicENA, a topic-based epistemic network analysis framework. TopicENA substitutes manual concept coding with automatically generated topics while maintaining ENA's capacity for modeling structural associations among concepts. To explain the impact of modeling choices on TopicENA outcomes, three analysis cases are presented. The first case assesses the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Mental Health Research Topics · Advanced Graph Neural Networks
