# Exploratory structural equation modeling and the curse of dimensionality

**Authors:** Tra T. Le, Jeroen K. Vermunt, Nicola Ballhausen, Katrijn Van Deun

PMC · DOI: 10.3758/s13428-026-02960-y · Behavior Research Methods · 2026-03-11

## TL;DR

This paper introduces a new statistical method to handle complex models with many variables and small sample sizes in behavioral science research.

## Contribution

A novel two-stage regularized approach for exploratory structural equation modeling is proposed to address high-dimensional data challenges.

## Key findings

- The proposed method outperforms existing approaches in recovering measurement model structures in both low and high-dimensional settings.
- The method successfully estimates factor scores and addresses measurement model indeterminacy through regularization techniques.
- The approach is demonstrated on empirical datasets and is available as an R package.

## Abstract

The next-generation approach to research in the behavioral sciences is based on intensive collections of data and complex models characterized by many parameters for a limited sample size. This introduces new challenges for traditional latent-variable methods, as they are found to fail or yield unstable solutions when the number of variables is large relative to the sample size. To tackle this issue, we propose a two-stage regularized approach for exploratory structural equation modeling. In the first stage, we introduce a novel (exploratory) approximate factor analysis technique that not only estimates the measurement model but also the factor scores; indeterminacy of the measurement model is addressed by imposing simple structure through regularizing techniques (LASSO penalty and cardinality constraint). The factor scores can then be used to estimate the structural model in the second stage. An extensive simulation shows that the proposed method outperforms other approaches in recovering the underlying simple structure of the measurement model in both low-dimension high-sample-size and high-dimension low-sample-size settings. The use of the method is demonstrated on two empirical datasets. An implementation of the proposed method in the R software is publicly available: https://github.com/trale97/regularizedESEM.

## Full-text entities

- **Genes:** FMR1 (fragile X messenger ribonucleoprotein 1) [NCBI Gene 2332] {aka FMRP, FRAXA, POF, POF1}
- **Diseases:** obesity (MESH:D009765), HDLSS (MESH:D009800), Autism (MESH:D001321), fragile X (MESH:D005600)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12979312/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12979312/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12979312/full.md

---
Source: https://tomesphere.com/paper/PMC12979312