Interpretable Recognition of Cognitive Distortions in Natural Language Texts

Anton Kolonin; Anna Arinicheva

arXiv:2511.05969·cs.CL·November 11, 2025

Interpretable Recognition of Cognitive Distortions in Natural Language Texts

Anton Kolonin, Anna Arinicheva

PDF

Open Access

TL;DR

This paper introduces an interpretable AI method for detecting cognitive distortions in texts, improving accuracy over existing approaches and providing transparent, robust models for psychological applications.

Contribution

It presents a novel multi-factor classification approach using weighted structured patterns that considers heterarchical relationships, enhancing detection of cognitive distortions.

Findings

01

Significant F1 score improvements on two datasets

02

Models and code made publicly available

03

Enhanced interpretability and robustness of detection algorithms

Abstract

We propose a new approach to multi-factor classification of natural language texts based on weighted structured patterns such as N-grams, taking into account the heterarchical relationships between them, applied to solve such a socially impactful problem as the automation of detection of specific cognitive distortions in psychological care, relying on an interpretable, robust and transparent artificial intelligence model. The proposed recognition and learning algorithms improve the current state of the art in this field. The improvement is tested on two publicly available datasets, with significant improvements over literature-known F1 scores for the task, with optimal hyper-parameters determined, having code and models available for future use by the community.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Authorship Attribution and Profiling · Text Readability and Simplification