Sparse deep neural networks for modeling aluminum electrolysis dynamics
Erlend Torje Berg Lundby, Adil Rasheed, Ivar Johan Halvorsen, Jan, Tommy Gravdahl

TL;DR
This paper shows that applying sparse regularization to deep neural networks modeling aluminum electrolysis improves interpretability, stability, and generalization, especially with limited data, by reducing model complexity.
Contribution
It demonstrates the effectiveness of sparse regularization in creating more interpretable and stable neural network models for nonlinear industrial processes.
Findings
Sparse models are more interpretable.
Sparse models generalize better with small data.
Training ensembles yields consistent model structures.
Abstract
Deep neural networks have become very popular in modeling complex nonlinear processes due to their extraordinary ability to fit arbitrary nonlinear functions from data with minimal expert intervention. However, they are almost always overparameterized and challenging to interpret due to their internal complexity. Furthermore, the optimization process to find the learned model parameters can be unstable due to the process getting stuck in local minima. In this work, we demonstrate the value of sparse regularization techniques to significantly reduce the model complexity. We demonstrate this for the case of an aluminium extraction process, which is highly nonlinear system with many interrelated subprocesses. We trained a densely connected deep neural network to model the process and then compared the effects of sparsity promoting l1 regularization on generalizability, interpretability,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolten salt chemistry and electrochemical processes
MethodsL1 Regularization
