A Computational Framework to Identify Self-Aspects in Text

Jaya Caporusso; Matthew Purver; Senja Pollak

arXiv:2507.13115·cs.CL·July 18, 2025

A Computational Framework to Identify Self-Aspects in Text

Jaya Caporusso, Matthew Purver, Senja Pollak

PDF

Open Access 1 Video

TL;DR

This paper proposes a computational framework to identify Self-aspects in text, aiming to advance NLP analysis of the multifaceted Self concept across disciplines like psychology and phenomenology.

Contribution

It introduces an ontology and annotated dataset for Self-aspects, and evaluates various models for their effectiveness in identifying these aspects in text.

Findings

01

Development of an ontology and annotated dataset for Self-aspects

02

Evaluation of discriminative, generative, and embedding models for accuracy and interpretability

03

Application of top models in mental health and phenomenology case studies

Abstract

This Ph.D. proposal introduces a plan to develop a computational framework to identify Self-aspects in text. The Self is a multifaceted construct and it is reflected in language. While it is described across disciplines like cognitive science and phenomenology, it remains underexplored in natural language processing (NLP). Many of the aspects of the Self align with psychological and other well-researched phenomena (e.g., those related to mental health), highlighting the need for systematic NLP-based analysis. In line with this, we plan to introduce an ontology of Self-aspects and a gold-standard annotated dataset. Using this foundation, we will develop and evaluate conventional discriminative models, generative large language models, and embedding-based retrieval approaches against four main criteria: interpretability, ground-truth adherence, accuracy, and computational efficiency.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Computational Framework to Identify Self-Aspects in Text· underline

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques