A Feature Importance Analysis for Soft-Sensing-Based Predictions in a Chemical Sulphonation Process
Enrique Garcia-Ceja, {\AA}smund Hugo, Brice Morin, Per-Olav Hansen,, Espen Martinsen, An Ngoc Lam, {\O}ystein Haugen

TL;DR
This study analyzes feature importance in predicting product quality in a chemical sulphonation process using soft-sensing, revealing that only the top three variables are needed for effective predictions, with random forests performing best overall.
Contribution
It introduces a feature importance analysis for soft-sensing in chemical processes, identifying key variables for accurate product quality prediction.
Findings
Top 3 variables suffice for satisfactory predictions.
Random forest outperforms other models with all variables.
Feature importance ranking guides efficient model design.
Abstract
In this paper we present the results of a feature importance analysis of a chemical sulphonation process. The task consists of predicting the neutralization number (NT), which is a metric that characterizes the product quality of active detergents. The prediction is based on a dataset of environmental measurements, sampled from an industrial chemical process. We used a soft-sensing approach, that is, predicting a variable of interest based on other process variables, instead of directly sensing the variable of interest. Reasons for doing so range from expensive sensory hardware to harsh environments, e.g., inside a chemical reactor. The aim of this study was to explore and detect which variables are the most relevant for predicting product quality, and to what degree of precision. We trained regression models based on linear regression, regression tree and random forest. A random forest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
