Online learning of neural networks based on a model-free control algorithm
Lo\"ic Michel

TL;DR
This paper proposes a novel model-free control law for online training of neural networks, framing weight tuning as a feedback control problem, demonstrated through promising numerical results and classification examples.
Contribution
It introduces a new model-free control approach for online neural network training, offering an alternative to traditional methods.
Findings
Effective online weight adjustment demonstrated
Numerical results show promising learning dynamics
Classifier example confirms approach viability
Abstract
We explore the possibilities of using a model-free-based control law in order to train artificial neural networks. In the supervised learning context, we consider the problem of tuning the synaptic weights as a feedback control tracking problem where the control algorithm adjusts the weights online according to the input-output training data set of the neural network. Numerical results illustrate the dynamical learning process and an example of classifier that show very promising properties of our proposed approach.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Control Systems and Identification · Extremum Seeking Control Systems
11institutetext: École centrale de Nantes-LS2N, UMR 6004 CNRS, Nantes, France 11email: [email protected]
Online learning of neural networks based on a model-free control algorithm
Loïc MICHEL
Abstract
We explore the possibilities of using a model-free-based control law in order to train artificial neural networks. In the supervised learning context, we consider the problem of tuning the synaptic weights as a feedback control tracking problem where the control algorithm adjusts the weights online according to the input-output training data set of the neural network. Numerical results illustrate the dynamical learning process and an example of classifier that show very promising properties of our proposed approach.
Keywords:
advanced optimization techniques advances in machine learning model-free control
1 Introduction
Training a neural network consists in tuning its internal weights in order to learn a mapping function from inputs to outputs and eventually examine what the model predicts [1]. Besides classical tuning techniques (see e.g. [2] and a survey in [3] that presents tuning methods to model complex manufacturing processes), some connections between adaptive control and optimization methods have been pointed out recently in [4, 5] that highlight a certain equivalence between using tools from the adaptive control field and solving problems in the machine learning field. In this line of thinking, the motivation of this work is to propose a strategy to tune neural networks using the so-called model-free control algorithm in the context of supervised learning.
The model-free control methodology, originally proposed by [6], has been designed to control a priori any ”unknown” dynamical system in a ”robust” manner, and can be considered as an alternative to standard PI and PID control [7] as it does not need any prior knowledge of the plant to control. Its usefulness has been demonstrated through successful applications111See e.g. the references in [6, 8, 9] and the references therein for an overview of the applications., and in particular, an application dedicated to the supply chain management [9] has been recently proposed. A derivative-free-based version of this control algorithm has been proposed by the author in [10], for which some interesting capabilities of online optimization have been highlighted.
At the intersection between control, optimization and machine learning, in this work, we consider the training of a neural network as a tracking control problem, where the proposed ”para-model” control technique [10] is experimented as a derivative-free learning algorithm to tune the weights of the network in order to fit online the training data.
The paper is organized as follow. Section 2 reviews the para-model approach. In Section 3, a preliminary example illustrates how a model-free-based distributed control could be implemented in order to control multiple systems. Section 4 presents the application of the para-model control to train a simple neural network and numerical results are presented in Section 5 to illustrate the dynamical evolution of the learning process as well as an example of classifier. Section 6 gives some concluding remarks.
2 Principle of the para-model control
Consider a nonlinear SISO dynamical system to control
[TABLE]
where is the function describing the behavior of a nonlinear system and is the state vector; the para-model control is an application whose purpose is to control the output of (1) following an output reference . In simulation, the system (1) is controlled in its ”original formulation” without any modification or linearization.
For any discrete moment , one defines the discrete controller as an integrator associated to a numerical series such as symbolically
[TABLE]
with the recursive term
[TABLE]
where is the output (or tracking) reference trajectory; and are real positive tuning gains; is the tracking error; is an initialization function where and are real positive constants; practically, the integral part is discretized using e.g. Riemann sums.
Define the set of the -parameters of the controller as the set of the tuning coefficients 222An interesting property that has been observed with para-model control throughout the overall applications is the relative flexibility of the -parameters to obtain good tracking performances while ”prototyping” a new process to control. In particular, we highlight the case of the experimental validation [11] for which no mathematical representative model of the nonlinear process was available and the control has been tested under several working conditions using indeed the -parameters adjusted for the corresponding simplified simulation.. The implementation of the control scheme is depicted in Fig. 1 where is the proposed para-model controller.
In the next section, an example is presented to illustrate how model-free-based distributed control can be implemented in order to introduce the methodology to train neural networks by controlling the corresponding neural weights.
3 Example of distributed model-free-based control : an amazing way to solve
To illustrate the properties of the proposed para-model algorithm, consider the following linear system to solve
[TABLE]
where we denote the solution of (3). Considering the controlled sub-system derived from (3)
[TABLE]
the goal is to solve the system (3) as a tracking problem in such manner that in the sub-system (4), the controlled tracks . Hence, if is kept ”close” to , then the controlled is ”close” to the solution .
Each variable of (4) is driven by an autonomous controller, with respect to the tracking reference such as ideally in a finite time. The associated control law , that is associated to each variable , reads
[TABLE]
where the set of parameters is associated to the th controller.
Figure 2 illustrates the evolution of the controlled versus the iterations that converges to the solution .
4 Application to the training of neural networks
4.1 Problem statement
In the context of supervised learning, let us consider a neural network described as a ”black-box” model
[TABLE]
that is composed of inputs ; an output ; synaptic weights and a sigmoid activation function of the form that defines the output of each neuron (node).
Given training data and associated respectively to the inputs and to the output of , we assume that the algorithm (2) updates each synaptic weight such as
[TABLE]
and therefore, allows ”configuring” the neural network (updates of the for all ) in such manner that asymptotically, the output remains ”as close as possible” to . Since the neural network does not include any internal dynamic, a filter is associated to each in order to include a dynamic regarding the proper use of the controllers (Fig. 1).
Remark 1:
Depending on the expected closed loop transient dynamic, a possible choice of the -parameters is to consider e.g. a decrease of the control amplification gains according to the th node i.e. in order to obtain a good dynamic response regarding possible changes of the model and the rejection of external disturbances, like changes in the training data set.
4.2 Simple example of training
To illustrate our proposed training strategy, consider a three-node network333Such small network is still mathematically interesting to investigate [12]., depicted in Fig. 3 including two inputs and and an output .
The strategy (7) is applied to calculate online the weights given the training values and (the latter corresponds to the output reference). A first order filter (with a small time constant) is added to include a dynamic to each controller.
5 Numerical results
To present some preliminary properties, the following test bench have been performed considering the initial set of training data , and . The -parameters have not been optimized regarding the transient responses and the are bounded such as for all . All are initialized to zero.
Evolution of online modifications of the network topology and the training data
In formula (2), set , , and including a first order filter with a time constant of s; the simulation time-step is s. Figure 4 shows respectively the evolution of the weights and the controlled output , when the network is subjected to an arbitrary change of its topology (the weight is for example forced to zero at an arbitrary time) as well as arbitrary changes of the training data.
As a result, a great tracking of the output has been observed despite the different changes of the training data as well as the topology of the network, which is referred to as the ”Dropout” concept in e.g. [13, 14].
A classifier example
Consider training the three-node network as a classifier with the following data training set
[TABLE]
where is the boolean test of .
The following table illustrates a simple classification test and the resulting average of all output values defines the output partition of the classifier (i.e. classify the particular input values that produce a ”0” in output and vice versa).
[TABLE]
The set of data is properly classified according to the boolean comparison with . Remark that since the proposed control-based training algorithm deals with dynamical systems and sweeps the training data through low pass filtering, the partition of the classifier via corresponds indeed to the ’filtered’ averaged value of the output training data.
6 Conclusion and perspectives
This paper presented an application of the model-free-based control methodology in the field of artificial neural networks. Encouraging results show promising tracking performances taking into account online modifications of the training data set as well as modifications of the topology of the studied network. Further works will include the formalization of our proposed approach (based e.g. on the implicit framework proposed in [15]), as well as as investigations regarding the application of our proposed algorithm to large scale neural networks including specific networks used e.g. in decision support systems [16].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. Montavon, W. Samek, and K.-R. Müller. Methods for interpreting and understanding deep neural networks. Digital Signal Processing , 73:1–15, 2018.
- 2[2] C. C. Aggarwal. Neural Networks and Deep Learning . Springer, 2018.
- 3[3] W. Sukthomya and J. Tannock. The training of neural networks to model manufacturing processes. Journal of Intelligent Manufacturing , 16:39–51, 02 2005.
- 4[4] J. E. Gaudio, T. E. Gibson, A. M. Annaswamy, M. A. Bolender, and E. Lavretsky. Connections between adaptive control and optimization in machine learning. In 2019 IEEE 58th Conference on Decision and Control (CDC) , pages 4563–4568, 2019.
- 5[5] N. Matni, A. Proutiere, A. Rantzer, and S. Tu. From self-tuning regulators to reinforcement learning and back again. In 2019 IEEE 58th Conference on Decision and Control (CDC) , pages 3724–3740, 2019.
- 6[6] M. Fliess and C. Join. Model-free control. International Journal of Control , 86(12):2228–2252, 2013.
- 7[7] M. Fliess and C. Join. An alternative to proportional-integral and proportional-integral-derivative regulators: Intelligent proportional-derivative regulators. Int J Robust Nonlinear Control , pages 1–13, 2021.
- 8[8] O. Bara, M. Fliess, C. Join, J. Day, and S. M. Djouadi. Toward a model-free feedback control synthesis for treating acute inflammation. Journal of Theoretical Biology , 448:26 – 37, 2018.
