# Early warning strategies for corporate operational risk: A study by an improved random forest algorithm using FCM clustering

**Authors:** Xini Fang

PMC · DOI: 10.1371/journal.pone.0318491 · PLOS One · 2025-03-11

## TL;DR

This study improves corporate risk early warning by combining FCM clustering with a Random Forest model, achieving better accuracy and performance.

## Contribution

The novel integration of FCM clustering with an optimized Random Forest model enhances risk prediction accuracy and stability.

## Key findings

- The improved model achieved an F1 score of 87.26%, outperforming traditional RF by 6.45%.
- The model's AUC reached 91.20%, showing strong classification performance.
- The model's accuracy, precision, and recall improved by 4.45%, 4.81%, and 3.83%, respectively.

## Abstract

To enhance the accuracy and response speed of the risk early warning system, this study develops a novel early warning system that combines the Fuzzy C-Means (FCM) clustering algorithm and the Random Forest (RF) model. Firstly, based on operational risk theory, market risk, research and development risk, financial risk, and human resource risk are selected as the primary indicators for enterprise risk assessment. Secondly, the Criteria Importance Through Intercriteria Correlation (CRITIC) weight method is employed to determine the importance of these risk indicators, thereby enhancing the model’s prediction ability and stability. Following this, the FCM clustering algorithm is utilized for pre-processing sample data to improve the efficiency and accuracy of data classification. Finally, an improved RF model is constructed by optimizing the parameters of the RF algorithm. The data selected is mainly from RESSET/DB, covering the issuance, trading, and rating data of fixed-income products such as bonds, government bonds, and corporate bonds, and provides basic information, net value, position, and performance data of funds. The experimental results show that the model achieves an F1 score of 87.26%, an accuracy of 87.95%, an Area under the Curve (AUC) of 91.20%, a precision of 89.29%, and a recall of 87.48%. They are respectively 6.45%, 4.45%, 5.09%, 4.81%, and 3.83% higher than the traditional RF model. In this study, an improved RF model based on FCM clustering is successfully constructed, and the accuracy of risk early warning models and their ability to handle complex data are significantly improved.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11896059/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11896059/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC11896059/full.md

---
Source: https://tomesphere.com/paper/PMC11896059