Optimal and Sub-optimal Feedback Controls for Biogas Production

Antoine Haddon (DIM, MISTEA); Hector Ramirez (DIM); Alain Rapaport (MISTEA)

arXiv:1906.02945·math.OC·June 10, 2019·J. Optim. Theory Appl.

Open Access

TL;DR

This paper investigates optimal and sub-optimal feedback control strategies to maximize biogas production in continuous bio-processes, providing explicit solutions and bounds for different horizon scenarios and illustrating results with specific growth functions.

Contribution

It introduces explicit feedback controls for biogas production optimization over infinite and finite horizons, including bounds on sub-optimality and applications to specific growth models.

Findings

1. Identified optimal controls for infinite horizon problems with averaged and discounted rewards.

2. Developed explicit, time-independent feedback controls for finite horizon problems.

3. Provided bounds on the sub-optimality of proposed controllers.

Abstract

We revisit the optimal control problem of maximizing biogas production in continuous bio-processes in two directions: 1. over an infinite horizon, 2. with sub-optimal controllers independent of the time horizon. For the first point, we identify a set of optimal controls for the problems with an averaged reward and with a discounted reward when the discount factor goes to 0 and we show that the value functions of both problems are equal. For the finite horizon problem, our approach relies on a framing of the value function by considering a different reward for which the optimal solution has an explicit optimal feedback that is time-independent. In particular, we show that this technique allows us to provide explicit bounds on the sub-optimality of the proposed controllers. The various strategies are finally illustrated on Haldane and Contois growth functions.

Figures: 9

Equations: 247

$$\dot{s}=u\,(s_{in}-s)-\frac{1}{Y}\,\mu(s,x)\,x$$

$$\dot{x}=\mu(s,x)\,x-u\,x$$

$$\int_{t_{0}}^{T}\mu\big(s(t),x(t)\big)\,x(t)\,dt$$

$$\mu(0,x)=0\quad\mbox{and}\quad\mu(s,x)>0\mbox{ for }s>0.$$

$${\cal U}(t_{0},T)=\Big\{u(\cdot)\in L^{\infty}(t_{0},T;\mathbb{R}):u(t)\in[0,u_{\max}]\mbox{ for }t\in[t_{0},T]\Big\}$$

$${\cal D}:=[0,s_{in})\times(0,\infty)$$

$$\zeta=(s,z)\quad\mbox{with}\quad z=\frac{x}{s_{in}-s},$$

$$\dot{\zeta}=\left[\begin{array}{c}\dot{s}\\ \dot{z}\end{array}\right]=f(\zeta,u):=\left[\begin{array}{c}\Big(u-\mu\big(s,(s_{in}-s)z\big)z\Big)(s_{in}-s)\\ \mu\big(s,(s_{in}-s)z\big)(1-z)z\end{array}\right].$$

$$\int_{t_{0}}^{T}\phi\big(s_{t_{0},\xi,u}(t),z_{t_{0},\xi,u}(t)\big)z_{t_{0},\xi,u}(t)\,dt$$

$$\phi(s,z)=\mu\big(s,(s_{in}-s)z\big)(s_{in}-s)$$

$$\overline{\phi}(z)=\max_{s\in(0,s_{in})}\phi(s,z).$$

$${\cal L}(\xi)=[0,s_{in}]\times[\min(z_{0},1),\max(z_{0},1)].$$

$$\min(z_{0},1)\leqslant z_{t_{0},\xi,u}(t)\leqslant\max(z_{0},1)$$

$$\max_{(s,z)\in{\cal L}(\xi)}\mu\big(s,(s_{in}-s)z\big)z<u_{\max}.$$

$$\psi_{s^{*}}(s,z)=\left|\begin{array}{ll}0&\mbox{if }s>s^{*},\\ \mu\big(s^{*},(s_{in}-s^{*})z\big)\,z&\mbox{if }s=s^{*},\\ u_{\max}&\mbox{if }s<s^{*}.\end{array}\right.$$

$$\dot{s}_{t_{0},\xi,\psi_{s^{*}}}(t)=-\mu\big(s,(s_{in}-s)z\big)z(s_{in}-s)\leqslant k_{-}<0,\quad\forall\,t\in I$$

$$\dot{s}_{t_{0},\xi,\psi_{s^{*}}}(t)=\left[u_{\max}-\mu\big(s,(s_{in}-s)z\big)z\right](s_{in}-s)\geqslant k_{+}>0,\quad\forall\,t\in I$$

$$\int_{t_{0}}^{T}u(t)\,dt\xrightarrow[T\to\infty]{}\infty.$$

$$\lim_{t\to\infty}z_{0,\xi,u}(t)=1$$

$$\lim_{\delta\to 0}\int_{0}^{\infty}\delta e^{-\delta t}z_{0,\xi,u}(t)\,dt=\lim_{T\to\infty}\frac{1}{T}\int_{0}^{T}z_{0,\xi,u}(t)\,dt=1.$$

$$\lim_{t\to+\infty}s_{0,\xi,u}(t)=0.$$

$$z(t)=\frac{z_{0}\,e^{\int_{t_{0}}^{t}\mu(s(\tau),x(\tau))\,d\tau}}{1+z_{0}\Big(e^{\int_{t_{0}}^{t}\mu(s(\tau),x(\tau))\,d\tau}-1\Big)}$$

$$x(t)=x(t_{0})\,e^{\int_{t_{0}}^{t}\big(\mu(s(\tau),x(\tau))-u(\tau)\big)\,d\tau}.$$

$$t\mapsto\int_{t_{0}}^{t}\mu\big(s(\tau),x(\tau)\big)\,d\tau,\quad t\geqslant t_{0}$$

$$\frac{d}{dt}\big(s(t)+x(t)\big)=u(t)\big(s_{in}-s(t)-x(t)\big)$$

$$s(t)+x(t)=s_{in}+\big(s(t_{0})+x(t_{0})-s_{in}\big)e^{-\int_{t_{0}}^{t}u(\tau)\,d\tau}$$

$$\mu\big(s(t),x(t)\big)>\mu(s_{in},0)/2>0$$

$$\lim_{\delta\to 0}\int_{0}^{\infty}\delta e^{-\delta t}z_{0,\xi,u}(t)\,dt=\lim_{T\to\infty}\frac{1}{T}\int_{0}^{T}z_{0,\xi,u}(t)\,dt$$

$$|z_{0,\xi,u}(t)-1|<\tilde{\varepsilon}.$$

$$\left|\frac{1}{T}\int_{0}^{T}z_{0,\xi,u}(t)\,dt-1\right|<\frac{t_{\tilde{\varepsilon}}}{T}|z_{0}-1|+\Big(1-\frac{t_{\tilde{\varepsilon}}}{T}\Big)\tilde{\varepsilon}$$

Taxonomy

Topics: Advanced Control Systems Optimization · Climate Change Policy and Economics · Anaerobic Digestion and Biogas Production

Full text

Optimal and Sub-optimal Feedback Controls for Biogas Production

Antoine Haddon (1, 2), Héctor Ramírez (1), Alain Rapaport (2)

(1) Mathematical Engineering Department and Center for Mathematical Modelling (CNRS UMI 2807), Universidad de Chile, Beauchef 851, Santiago, Chile

(2) MISTEA, Université Montpellier, INRA, Montpellier SupAgro, 2 pl. Viala, 34060 Montpellier, France

Abstract

We revisit the optimal control problem of maximizing biogas production in continuous bio-processes in two directions:

1. over an infinite horizon,
2. with sub-optimal controllers independent of the time horizon.

For the first point, we identify a set of optimal controls for the problems with an averaged reward and with a discounted reward when the discount factor goes to 0 and we show that the value functions of both problems are equal. For the finite horizon problem, our approach relies on a framing of the value function by considering a different reward for which the optimal solution has an explicit optimal feedback that is time-independent. In particular, we show that this technique allows us to provide explicit bounds on the sub-optimality of the proposed controllers. The various strategies are finally illustrated on Haldane and Contois growth functions.

Keywords: Optimal control, Chemostat model, Singular arc, Sub-optimality, Infinite horizon.

1 Introduction

Anaerobic digestion is a biological process in which organic matter is transformed by microbial species into biogas (such as methane and carbon dioxide). Such transformations have long been used in wastewater treatment plants to purify water [1]. Valorizing biogas production while treating wastewater has recently received great attention as a way of producing valuable energy and limiting the carbon footprint of the process [2]. As biogas is a final product of the biological reaction, its total production measures the performance of the biological transformation. Therefore, there is a strong interest in determining control strategies that maximize biogas production.

With continuous-stirred bioreactors, two kinds of anaerobic models are usually considered for control purposes in the literature: the one-step model, which corresponds to the classical chemostat model [3], and the two-step model that has been proposed by Bernard et al. [4].

Although these models have only a few dynamic variables, it has been shown that they are capable of reproducing the qualitative behavior of the anaerobic digestion process [5]. Furthermore, in the two-step model, the second reaction is the most limiting one due to inhibition by the substrate, so a one-step model can be used to focus on the second reaction. In particular, a common assumption is that the first step is fast, in which case the two reactions can be reduced to a single one through a slow-fast approximation, and the one-step model then provides a good representation of the biogas production.

The control variable is typically the input flow rate (or equivalently the dilution rate, since the volume of the reactor is constant in continuous operating mode). Several works have already considered the static optimization problem of maximizing the output flow rate of biogas at steady state, and various control strategies have been proposed to stabilize the processes at these nominal states (see for instance [6, 7, 8, 9, 10, 11, 12]).

There has been comparatively little work on the dynamic optimization problem over the transients, even though bio-processes are often not initialized at their optimal nominal state. Although the optimal control problem, which consists in maximizing biogas production over a given time interval, was posed a long time ago [13], it is still unsolved today (even for the one-step model). Let us mention two attempts to solve this problem approximately or partially. Sbarciog et al. [14] have considered the two-step anaerobic model and proposed a strategy for maximizing biogas production as an optimal control to drive the system in finite time to a neighborhood of the optimal steady state, with additive penalty terms in the criterion. In [15], Ghouali et al. give a complete solution of the original optimal control problem for the one-step model, but only for a particular subset of initial conditions which belong to an invariant manifold of the system (see also [16]). The dynamics can then be reduced to a scalar one and the authors show that the optimal solution exhibits a singular arc with a “most rapid approach path” optimal strategy. Let us underline that optimal control problems over a fixed time horizon generally possess a time-dependent optimal synthesis, while the duration of process operation is often poorly known. However, the scalar reduced problem exhibits the remarkable feature of having an optimal synthesis independent of the terminal time, which makes it quite attractive from an application viewpoint.

The purpose of the present article is to propose new control strategies for the one-step model, as time-independent feedbacks for general initial conditions,

- either considering an infinite horizon,
- or considering sub-optimal controllers for the finite horizon.

For the infinite horizon (see for instance the book [17]), we consider the limit of the discounted criterion (when the discount factor tends to zero) and the average cost. We study optimal strategies and compare their related optimal costs. This study extends the preliminary results presented in the conference paper [18] and considers a large class of growth functions, which can in particular be density-dependent (such as the Contois law) or not (such as the Monod or Haldane law). Our work for the finite horizon exploits and extends an approximation technique presented in [19]. This consists, for a given initial condition, in framing the optimal solution by considering a different reward for which the optimal solution can be determined exactly and which possesses the property of having a time-independent optimal synthesis (i.e. whatever the time horizon, finite or infinite). This technique moreover has the advantage of providing bounds on the sub-optimality of the controllers. The results are again obtained for a large class of growth functions and we show that density-dependent growth functions lead to more sophisticated feedback laws.

The paper is organized as follows. Section 2 specifies dynamics, control, criterion and hypotheses, and gives some preliminary results about controllability and asymptotic behavior of solutions. Sections 3 and 4 study the optimal solutions, respectively for the infinite and finite time horizons. Finally, Section 5 illustrates our results on various growth functions.

2 Preliminaries

In this work, we consider the classical chemostat model [3]. This represents a well-mixed continuously fed bioreactor in which a substrate of concentration $s$ is treated (and then transformed into biogas) by a population of microorganisms of concentration $x$:

$$\dot{s}=u\,(s_{in}-s)-\frac{1}{Y}\,\mu(s,x)\,x, \tag{1}$$

$$\dot{x}=\mu(s,x)\,x-u\,x. \tag{2}$$

We denote $s_{in}>0$ the inflow concentration of substrate, $Y$ the yield coefficient, $\mu(\cdot,\cdot)$ the specific growth rate and $u$ the dilution rate, which is the control.

The biogas flowrate is assumed proportional to the growth rate so that the biogas produced during a time interval $[t_{0},T]$ is proportional to

$$\int_{t_{0}}^{T}\mu\big(s(t),x(t)\big)\,x(t)\,dt$$

and, without loss of generality, we will suppose that the proportionality coefficient as well as the yield coefficient are equal to 1.
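To make the model concrete, the following minimal sketch integrates (1)-(2) with $Y=1$ under a constant dilution rate and accumulates the biogas criterion. The Monod growth law, its parameter values, the explicit Euler scheme and all function names are illustrative assumptions, not taken from the paper.

```python
def monod(s, x, mu_max=1.0, k=1.0):
    """Monod growth rate (illustrative parameters); it does not depend on x."""
    return mu_max * s / (k + s)


def simulate_chemostat(s0, x0, u, s_in=10.0, t_end=50.0, dt=1e-3, mu=monod):
    """Explicit Euler integration of the chemostat with Y = 1 and constant u.

    Returns the final state (s, x) and the accumulated biogas, i.e. the
    integral of mu(s, x) * x over [0, t_end].
    """
    s, x, biogas = s0, x0, 0.0
    for _ in range(int(round(t_end / dt))):
        growth = mu(s, x) * x          # instantaneous biogas flow rate
        ds = u * (s_in - s) - growth   # substrate balance
        dx = growth - u * x            # biomass balance
        biogas += growth * dt
        s += ds * dt
        x += dx * dt
    return s, x, biogas
```

For instance, `simulate_chemostat(s0=2.0, x0=1.0, u=0.3)` settles near the steady state where $\mu(s)=u$ and $s+x=s_{in}$, which is the static operating point that the dynamic problem studied here goes beyond.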

We will consider the following class of growth functions :

Assumption 1.

We suppose that $\mu:\mathbb{R}_{+}\times\mathbb{R}_{+}\rightarrow\mathbb{R}_{+}$ is a Lipschitz continuous function that satisfies, for all $x>0$

$$\mu(0,x)=0\quad\mbox{and}\quad\mu(s,x)>0\mbox{ for }s>0.$$

We suppose as well that $x\mapsto\mu(s,x)$ is non increasing, which models crowding effects, and $x\mapsto\mu(s,x)x$ is non decreasing, which models the fact that having more biomass provides at least the same growth.

A typical instance of this class is the Contois growth function, defined later in (38), but note that this class of functions also contains growth functions that depend only on the substrate concentration, such as the Monod (36) and the Haldane (37) functions.
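As a rough numerical companion to Assumption 1, the sketch below writes the three growth laws in their standard textbook forms and checks the sign and monotonicity conditions on a grid. The parameter values, the function names and the grid-based check are assumptions made for illustration; the Lipschitz condition is not tested.

```python
def monod(s, x, mu_max=1.0, k=1.0):
    return mu_max * s / (k + s)


def haldane(s, x, mu_max=1.0, k=1.0, k_i=5.0):
    return mu_max * s / (k + s + s * s / k_i)


def contois(s, x, mu_max=1.0, k=1.0):
    # Convention mu(0, x) = 0, which also covers the 0/0 case s = x = 0.
    if s == 0.0:
        return 0.0
    return mu_max * s / (k * x + s)


def satisfies_assumption_1(mu, s_grid, x_grid, tol=1e-12):
    """Grid check of Assumption 1 (x_grid must be positive and increasing)."""
    for s in s_grid:
        for x in x_grid:
            if s == 0.0 and mu(s, x) != 0.0:
                return False
            if s > 0.0 and mu(s, x) <= 0.0:
                return False
        for x_lo, x_hi in zip(x_grid, x_grid[1:]):
            # x -> mu(s, x) must be non increasing (crowding effect)
            if mu(s, x_hi) > mu(s, x_lo) + tol:
                return False
            # x -> mu(s, x) * x must be non decreasing
            if mu(s, x_hi) * x_hi < mu(s, x_lo) * x_lo - tol:
                return False
    return True
```

A grid check of this kind can only refute the assumption for a candidate growth law, never prove it; it is meant as a quick sanity test before running simulations.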

We will study the problem of maximizing the accumulated biogas for controls in the following set of admissible controls

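$${\cal U}(t_{0},T)=\Big\{u(\cdot)\in L^{\infty}(t_{0},T;\mathbb{R}):u(t)\in[0,u_{\max}]\mbox{ for }t\in[t_{0},T]\Big\}$$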

with $t_{0}\in\mathbb{R}$ and $T\in\mathbb{R}\cup\{+\infty\}$ , and where $u_{max}>0$ is a given parameter that represents the maximal dilution rate. We will consider initial conditions taken in the invariant set

$${\cal D}:=[0,s_{in})\times(0,\infty)$$

which corresponds to the most common operating conditions. Notice that for initial conditions in ${\cal D}$ , any solution of (1)-(2) cannot reach $s=s_{in}$ in finite time and stays non negative. Therefore the set ${\cal D}$ is (forward) invariant.

2.1 Properties of the Dynamics

On the invariant domain ${\cal D}$ , we introduce the change of variables

$$\zeta=(s,z)\quad\mbox{with}\quad z=\frac{x}{s_{in}-s},$$

under which the dynamics become

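$$\dot{\zeta}=\left[\begin{array}{c}\dot{s}\\ \dot{z}\end{array}\right]=f(\zeta,u):=\left[\begin{array}{c}\Big(u-\mu\big(s,(s_{in}-s)z\big)z\Big)(s_{in}-s)\\ \mu\big(s,(s_{in}-s)z\big)(1-z)z\end{array}\right]. \tag{7}$$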

We will denote $s_{t_{0},\xi,u}(\cdot)$ and $z_{t_{0},\xi,u}(\cdot)$ the solution of (7), with initial condition $\xi=(s_{0},z_{0})=(s(t_{0}),z(t_{0}))\in{\cal D}$ and control $u(\cdot)\in{\cal U}(t_{0},T)$ . The cumulated biogas production becomes

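$$\int_{t_{0}}^{T}\phi\big(s_{t_{0},\xi,u}(t),z_{t_{0},\xi,u}(t)\big)z_{t_{0},\xi,u}(t)\,dt$$

with

$$\phi(s,z)=\mu\big(s,(s_{in}-s)z\big)(s_{in}-s)$$

and we will denote

$$\overline{\phi}(z)=\max_{s\in(0,s_{in})}\phi(s,z).$$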

We can now establish an important property of the controlled dynamics.

Lemma 1.

The trajectories of the system (7) for a given initial condition $\xi=(s_{0},z_{0})\in\cal D$ , for all admissible controls, remain in the set

$${\cal L}(\xi)=[0,s_{in}]\times[\min(z_{0},1),\max(z_{0},1)]. \tag{11}$$

Proof.

From Assumption 1 we have that $\mu(\cdot,\cdot)\geqslant 0$ and since the solutions $z(\cdot)$ satisfy (7), we then have the following

$$\min(z_{0},1)\leqslant z_{t_{0},\xi,u}(t)\leqslant\max(z_{0},1)$$

for all $t\geqslant 0$ , for any admissible control $u(\cdot)$ . ∎
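Lemma 1 can be probed numerically. The sketch below integrates the dynamics (7) under a square-wave dilution rate and records the visited states, so that the bounds on $z$ can be checked along the path. The Contois law, its parameters, the switching control and the Euler scheme are all assumptions chosen for illustration.

```python
def contois(s, x, mu_max=1.0, k=1.0):
    # Convention mu(0, x) = 0, which also covers the 0/0 case.
    return 0.0 if s == 0.0 else mu_max * s / (k * x + s)


def square_wave(t, u_max=2.0, period=3.0):
    """Hypothetical admissible control switching between u_max and 0."""
    return u_max if (t % period) < period / 2 else 0.0


def simulate_sz(s0, z0, control, s_in=10.0, t_end=40.0, dt=1e-3, mu=contois):
    """Euler integration of (7); returns the list of visited (s, z) states."""
    s, z, t = s0, z0, 0.0
    path = [(s, z)]
    for _ in range(int(round(t_end / dt))):
        rate = mu(s, (s_in - s) * z)
        ds = (control(t) - rate * z) * (s_in - s)
        dz = rate * (1.0 - z) * z
        s, z, t = s + ds * dt, z + dz * dt, t + dt
        path.append((s, z))
    return path
```

Because $\dot{z}$ does not involve $u$ directly, the control can only change how fast $z$ moves towards 1 (through $s$), never the side of 1 on which it stays.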

In the following, we consider initial conditions that guarantee the controllability of the $s$ variable.

Assumption 2.

We suppose that the initial condition $\xi\in\cal D$ is such that

$$\max_{(s,z)\in{\cal L}(\xi)}\mu\big(s,(s_{in}-s)z\big)z<u_{\max}.$$

In practice, for a given initial condition it is possible to choose $u_{\max}$ such that the previous inequality is satisfied.
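One way to pick such a $u_{\max}$ is to estimate the left-hand side of the inequality on a grid over ${\cal L}(\xi)$ and take any strictly larger value. The sketch below does this for a Contois law; the growth law, its parameters, the grid resolution and the function name are assumptions for illustration only.

```python
def contois(s, x, mu_max=1.0, k=1.0):
    # Convention mu(0, x) = 0, which also covers the 0/0 case.
    return 0.0 if s == 0.0 else mu_max * s / (k * x + s)


def assumption_2_bound(z0, s_in=10.0, mu=contois, n_s=400, n_z=50):
    """Grid estimate of the maximum of mu(s, (s_in - s) z) * z over L(xi)."""
    z_lo, z_hi = min(z0, 1.0), max(z0, 1.0)
    best = 0.0
    for i in range(n_s + 1):
        s = s_in * i / n_s
        for j in range(n_z + 1):
            z = z_lo + (z_hi - z_lo) * j / n_z
            best = max(best, mu(s, (s_in - s) * z) * z)
    return best
```

Since a grid maximum can underestimate the true maximum, a safety margin on top of the returned value is advisable when choosing $u_{\max}$.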

We now define a class of feedbacks, that will play an important role, and that are based on the notion of most rapid approach path, a well known concept in the theory of optimal control, see for example [20, 21].

Definition 1.

For $(s,z)\in{\cal L}(\xi)$ , we define the most rapid approach feedback to a given substrate level $s^{*}\in[0,s_{in})$ , as

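$$\psi_{s^{*}}(s,z)=\left|\begin{array}{ll}0&\mbox{if }s>s^{*},\\ \mu\big(s^{*},(s_{in}-s^{*})z\big)\,z&\mbox{if }s=s^{*},\\ u_{\max}&\mbox{if }s<s^{*}.\end{array}\right. \tag{12}$$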

Clearly, with Assumption 2 this feedback is well defined, so that, associated with this control, for every initial condition $\xi\in\cal D$ , there exists a unique absolutely continuous solution for the dynamics (7).

Lemma 2.

For any $\xi\in\cal D$ satisfying Assumption 2, a given substrate level $s^{*}\in(0,s_{in})$ is reachable in finite time with the feedback $\psi_{s^{*}}$ .

Proof.

First, using the monotonicity properties of $\mu(\cdot,\cdot)$ of Assumption 1, it is clear that $\psi_{s^{*}}$ is admissible provided Assumption 2 is satisfied.

To show that $s^{*}$ is reachable in finite time, it is enough to note that when $s_{t_{0},\xi,\psi_{s^{*}}}(t)>s^{*}$ , for $t$ in a given open interval $I$ , we have

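$$\dot{s}_{t_{0},\xi,\psi_{s^{*}}}(t)=-\mu\big(s,(s_{in}-s)z\big)z(s_{in}-s)\leqslant k_{-}<0,\quad\forall\,t\in I$$

with $k_{-}=-\min_{s\in(s^{*},s_{in})}\mu\big{(}s,(s_{in}-s)\min(z_{0},1)\big{)}\min(z_{0},1)(s_{in}-s^{*})$ . This ensures that $s^{*}$ is always reachable in finite time from $s_{0}>s^{*}$ .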

Analogously, if $s_{t_{0},\xi,\psi_{s^{*}}}(t)<s^{*}$ , for $t\in I$ , we have from Assumption 2

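$$\dot{s}_{t_{0},\xi,\psi_{s^{*}}}(t)=\left[u_{\max}-\mu\big(s,(s_{in}-s)z\big)z\right](s_{in}-s)\geqslant k_{+}>0,\quad\forall\,t\in I$$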

with $k_{+}=\left[u_{\max}-\max_{s\in(0,s^{*})}\mu\big{(}s,(s_{in}-s)\max(z_{0},1)\big{)}\max(z_{0},1)\right](s_{in}-s^{*})$ . Then $s^{*}$ is reachable from $s_{0}<s^{*}$ , again in finite time.

∎
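The most rapid approach feedback of Definition 1 and the finite-time reachability of Lemma 2 can be illustrated as follows. This is a sketch under stated assumptions: a Haldane law with made-up parameters, an explicit Euler scheme, and a small band $|s-s^{*}|\leqslant$ `tol` standing in for the exact equality $s=s^{*}$, which a discrete-time simulation cannot hit exactly.

```python
def haldane(s, x, mu_max=1.0, k=1.0, k_i=5.0):
    return mu_max * s / (k + s + s * s / k_i)


def mrap_feedback(s, z, s_star, u_max, s_in=10.0, mu=haldane, tol=1e-3):
    """Most rapid approach feedback to s_star, with a numerical tolerance band."""
    if s > s_star + tol:
        return 0.0      # stop feeding so that the substrate is consumed
    if s < s_star - tol:
        return u_max    # feed at the maximal rate
    return mu(s_star, (s_in - s_star) * z) * z  # hold s at s_star


def hitting_time(s0, z0, s_star, u_max, s_in=10.0, mu=haldane,
                 dt=1e-4, t_max=100.0, tol=1e-3):
    """First time the Euler trajectory of (7) enters the band around s_star.

    Returns None if the band is not reached before t_max.
    """
    s, z, t = s0, z0, 0.0
    while t < t_max:
        if abs(s - s_star) <= tol:
            return t
        u = mrap_feedback(s, z, s_star, u_max, s_in, mu, tol)
        rate = mu(s, (s_in - s) * z)
        s, z = (s + (u - rate * z) * (s_in - s) * dt,
                z + rate * (1.0 - z) * z * dt)
        t += dt
    return None
```

The step size and the band width should be chosen together: with the defaults and $u_{\max}\leqslant 2$, one Euler step moves $s$ by less than the width of the band, so the trajectory cannot jump over it.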

Remark 1.

It should be pointed out that there is a similarity with the turnpike property [22, 23] when using the controller (12). The turnpike property has received great attention in the literature (see for instance [24, 20, 21, 25]), and recent results give sufficient optimality conditions [26, 27]. However, we shall show in the next sections that the value $s^{*}$ , which determines the turnpike, has to depend on the initial condition (except for the very particular case when the initial condition belongs to the invariant set $\{z=1\}$ that has been solved in [15]). So, we are not in the usual framework of a single turnpike [26, 27] or isolated turnpikes [28], and the results of the literature do not apply.

For the problem on an infinite horizon, we will consider persistently exciting controls, which are defined as satisfying

$$\int_{t_{0}}^{T}u(t)\,dt\xrightarrow[T\to\infty]{}\infty.$$

As the next Lemma shows, the trajectories associated with these controls are such that $z_{t_{0},\xi,u}(t)$ converges to 1, which is essential in our approach. Furthermore, for non persistently exciting controls, $s_{t_{0},\xi,u}(t)$ converges to 0 and thus the biogas production also converges to 0. As a consequence, the controls that maximize biogas production are necessarily persistently exciting controls.

Lemma 3.

For all initial conditions $\xi\in\cal D$ and for all persistently exciting controls $u(\cdot)\in{\cal U}(0,\infty)$ , we have

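$$\lim_{t\to\infty}z_{0,\xi,u}(t)=1$$

and

$$\lim_{\delta\to 0}\int_{0}^{\infty}\delta e^{-\delta t}z_{0,\xi,u}(t)\,dt=\lim_{T\to\infty}\frac{1}{T}\int_{0}^{T}z_{0,\xi,u}(t)\,dt=1.$$

Moreover, for non persistently exciting controls, we have

$$\lim_{t\to+\infty}s_{0,\xi,u}(t)=0.$$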

Proof.

From equation (7), the solution $z(\cdot)=z_{0,\xi,u}(\cdot)$ can be written as follows

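$$z(t)=\frac{z_{0}\,e^{\int_{t_{0}}^{t}\mu(s(\tau),x(\tau))\,d\tau}}{1+z_{0}\Big(e^{\int_{t_{0}}^{t}\mu(s(\tau),x(\tau))\,d\tau}-1\Big)} \tag{13}$$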

where $s(\cdot)=s_{0,\xi,u}(\cdot)$ , $x(\cdot)=x_{0,\xi,u}(\cdot)$ . From equation (2), the solution $x(\cdot)$ is such that

$$x(t)=x(t_{0})\,e^{\int_{t_{0}}^{t}\big(\mu(s(\tau),x(\tau))-u(\tau)\big)\,d\tau}.$$

Therefore, if the integral function

$$t\mapsto\int_{t_{0}}^{t}\mu\big(s(\tau),x(\tau)\big)\,d\tau,\quad t\geqslant t_{0} \tag{14}$$

is bounded, then $x(t)$ must converge asymptotically to $0$ when $t$ goes to $+\infty$ and $u(\cdot)$ is a persistently exciting control. Moreover, from equations (1), (2) we have

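$$\frac{d}{dt}\big(s(t)+x(t)\big)=u(t)\big(s_{in}-s(t)-x(t)\big)$$

so that

$$s(t)+x(t)=s_{in}+\big(s(t_{0})+x(t_{0})-s_{in}\big)e^{-\int_{t_{0}}^{t}u(\tau)\,d\tau}$$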

and then $s(t)$ must converge to $s_{in}$ when $t$ goes to $+\infty$ . Consequently, by continuity of the function $\mu$ , there exists $T>t_{0}$ such that

$$\mu\big(s(t),x(t)\big)>\mu(s_{in},0)/2>0$$

for any $t>T$ , which implies that the integral defined in (14) goes to $+\infty$ when $t$ goes to $+\infty$ , which is a contradiction. We deduce that this integral cannot be bounded and from equation (13) that $z(t)$ converges to $1$ when $t$ goes to $+\infty$ .

A proof of the equality of limits of the integrals

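$$\lim_{\delta\to 0}\int_{0}^{\infty}\delta e^{-\delta t}z_{0,\xi,u}(t)\,dt=\lim_{T\to\infty}\frac{1}{T}\int_{0}^{T}z_{0,\xi,u}(t)\,dt$$

can be found in [29, Lemma 3.5]. For the value of the limits we use the fact that $z_{0,\xi,u}(t)$ converges to 1: for all $\tilde{\varepsilon}>0$ , there exists a time $t_{\tilde{\varepsilon}}$ such that, for all $t\geqslant t_{\tilde{\varepsilon}}$ ,

$$|z_{0,\xi,u}(t)-1|<\tilde{\varepsilon}.$$

Then, for all $T\geqslant\max(t_{\tilde{\varepsilon}},t_{\tilde{\varepsilon}}/\tilde{\varepsilon})$ ,

$$\left|\frac{1}{T}\int_{0}^{T}z_{0,\xi,u}(t)\,dt-1\right|<\frac{t_{\tilde{\varepsilon}}}{T}|z_{0}-1|+\Big(1-\frac{t_{\tilde{\varepsilon}}}{T}\Big)\tilde{\varepsilon}.$$

With this, for all $\varepsilon>0$ , we can take $\tilde{\varepsilon}=\varepsilon/(|z_{0}-1|+1)$ and then we have, for $T\geqslant\max(t_{\tilde{\varepsilon}},t_{\tilde{\varepsilon}}/\tilde{\varepsilon})$ ,

$$\left|\frac{1}{T}\int_{0}^{T}z_{0,\xi,u}(t)\,dt-1\right|<\varepsilon.$$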

Finally, we prove that for non persistently exciting controls, $s_{0,\xi,u}(t)$ converges to 0. To this end, suppose that $u(\cdot)$ is an admissible control with a finite integral and define, for all $t\geqslant 0$ ,

[TABLE]

and

[TABLE]

Then

[TABLE]

and since $\varphi(t)$ is bounded, we can deduce that $\varphi(t)$ converges as $t$ goes to infinity. Note as well that $\varphi^{\prime}$ is absolutely continuous and thus uniformly continuous. We can therefore use Barbalat’s Lemma [30, Lemma 4.2] to get that $\varphi^{\prime}(t)$ converges to 0. Then, as $z_{0,\xi,u}(t)$ cannot reach 0 (Lemma 1), we have that $\phi\big{(}s_{0,\xi,u}(t),z_{0,\xi,u}(t)\big{)}$ must converge to 0 and by continuity we conclude that $s_{0,\xi,u}(t)$ converges to 0.

∎
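The closed form (13) is the solution of the logistic equation $\dot{z}=\mu\,(1-z)\,z$ written in terms of the integral (14), and it can be compared against a direct simulation of (7). The sketch below does so for a constant (hence persistently exciting) dilution rate and also returns the time average of $z$. The Monod law, its parameters, the Euler scheme and the function names are illustrative assumptions.

```python
import math


def monod(s, x, mu_max=1.0, k=1.0):
    return mu_max * s / (k + s)


def z_closed_form(z0, m):
    """Logistic solution of dz/dt = mu (1 - z) z, with m the integral of mu."""
    return z0 * math.exp(m) / (1.0 + z0 * (math.exp(m) - 1.0))


def compare(s0, z0, u, s_in=10.0, t_end=30.0, dt=1e-3, mu=monod):
    """Integrate (7) with constant u while tracking m(t), the integral of mu.

    Returns (simulated z, closed-form z, time average of z) at t_end.
    """
    s, z, m, z_sum = s0, z0, 0.0, 0.0
    for _ in range(int(round(t_end / dt))):
        rate = mu(s, (s_in - s) * z)
        z_sum += z * dt
        s, z, m = (s + (u - rate * z) * (s_in - s) * dt,
                   z + rate * (1.0 - z) * z * dt,
                   m + rate * dt)
    return z, z_closed_form(z0, m), z_sum / t_end
```

With `compare(2.0, 0.3, 0.5)`, both values of $z$ are numerically indistinguishable from 1 at the final time, while the time average is still visibly below 1 because of the transient; the gap shrinks like $1/T$ as the horizon grows. For very long horizons the exponential in `z_closed_form` can overflow, so the formula should then be rewritten in terms of $e^{-m}$.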

3 Infinite Horizon and Average Reward

In this section, we study the problem of maximizing biogas production over an infinite horizon. Since the dynamics (7) are autonomous, without loss of generality, we can assume here that $t_{0}=0$ and we will then denote $s_{\xi,u}(\cdot)$ and $z_{\xi,u}(\cdot)$ solutions of (7).

We start by defining the average biogas production during a time interval $[0,T]$ as

[TABLE]

and we consider the inferior and superior limits as $T$ goes to infinity

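$$\underline{J}^{\infty}(\xi,u(\cdot)):=\liminf_{T\to\infty}\frac{1}{T}\int_{0}^{T}\phi\big(s_{\xi,u}(t),z_{\xi,u}(t)\big)z_{\xi,u}(t)\,dt, \tag{16}$$

$$\overline{J}^{\infty}(\xi,u(\cdot)):=\limsup_{T\to\infty}\frac{1}{T}\int_{0}^{T}\phi\big(s_{\xi,u}(t),z_{\xi,u}(t)\big)z_{\xi,u}(t)\,dt. \tag{17}$$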

The optimal control problems in consideration here consist in maximizing these functionals with respect to the dilution rate $u(\cdot)\in{\cal U}(0,\infty)$ , for any initial condition $\xi\in{\cal D}$ . More precisely, the value functions of these optimal control problems are

[TABLE]

We need to consider the inferior and superior limits here as there exist controls for which the rewards (16) and (17) may differ. Indeed, this is the case for certain oscillating controls, as can be seen in the example in the Appendix. Nevertheless, we will show that the value functions (18) and (19) are in fact equal. Moreover, we will connect these problems to the problem with a discounted reward when the discount factor goes to 0, as in [31], and we will identify a set of controls that are optimal for all three problems.

To this end, we now define the following discounted reward, for a discount rate $\delta>0$

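$$J_{\delta}(\xi,u(\cdot)):=\delta\int_{0}^{\infty}e^{-\delta t}\phi\big(s_{\xi,u}(t),z_{\xi,u}(t)\big)z_{\xi,u}(t)\,dt. \tag{20}$$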

This type of cost function is often used in problems related to economics for which the term $e^{-\delta t}$ represents a discount rate or a preference for the present [17]. In our setting, the use of this discounted reward can be seen as a preference for earlier rather than later production. Here, the integral is rescaled with the discount factor $\delta$ in order to guarantee that, when we take the limit as $\delta$ goes to 0, the reward remains finite.
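As a hedged illustration of the role of the rescaling by $\delta$, consider a hypothetical instantaneous production rate $g(t)=g_{\infty}+a\,e^{-\lambda t}$ with $\lambda>0$ (this profile is an assumption chosen to allow a closed-form computation, not a trajectory from the paper). Then

$$\delta\int_{0}^{\infty}e^{-\delta t}g(t)\,dt=g_{\infty}+\frac{a\,\delta}{\delta+\lambda}\xrightarrow[\delta\to 0]{}g_{\infty},\qquad\frac{1}{T}\int_{0}^{T}g(t)\,dt=g_{\infty}+\frac{a\,(1-e^{-\lambda T})}{\lambda T}\xrightarrow[T\to\infty]{}g_{\infty}.$$

In this example both criteria forget the transient term and retain only the asymptotic rate $g_{\infty}$, whereas the unscaled integral $\int_{0}^{\infty}e^{-\delta t}g(t)\,dt=g_{\infty}/\delta+a/(\delta+\lambda)$ diverges as $\delta$ goes to 0 whenever $g_{\infty}\neq 0$.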

The value function of the optimal control problem for a given $\delta$ is then

$$V_{\delta}(\xi):=\sup_{u(\cdot)\in{\cal U}(0,\infty)}J_{\delta}(\xi,u(\cdot)). \tag{21}$$

Note that both average rewards (16) and (17), as well as the discounted reward (20), are well defined as the following Lemma shows.

Lemma 4.

For all $\xi\in\cal D$ , for all admissible controls $u(\cdot)\in{\cal U}(0,\infty)$ and for all $\delta>0$ , the rewards $\underline{J}^{\infty}(\xi,u(\cdot))$ , $\overline{J}^{\infty}(\xi,u(\cdot))$ and $J_{\delta}(\xi,u(\cdot))$ are uniformly bounded.

Proof.

From the monotonicity properties of Assumption 1, we have that the function $z\mapsto\phi(s,z)$ is non increasing for all $s>0$ . Thus, for all $t\geqslant 0$

[TABLE]

The uniform boundedness of the rewards then follows from Lemma 1.

∎

3.1 Relation Between Average and Discounted Biogas Production Problems

We now show how the average and discounted biogas production problems are related when the discount factor $\delta$ goes to 0.

In the following, we will consider the discounted reward (20) as a function of the trajectory $\zeta(\cdot)=\big{(}s_{\xi,u}(\cdot),z_{\xi,u}(\cdot)\big{)}$ instead of the control and with a slight abuse of notation, we will denote it as $J_{\delta}(\zeta(\cdot))$ . Define the set valued map

[TABLE]

and consider the set of all forward trajectories of (7) with initial condition $\xi$

[TABLE]

where ${\cal AC}([0,\infty),{\cal L}(\xi))$ denotes the set of absolutely continuous functions from $[0,\infty)$ to ${\cal L}(\xi)$ . We recall from the Filippov Selection Theorem (see for instance [32]) that the optimal control problem (21) is equivalent to the optimization problem on $\mathcal{S}(\xi)$ ,

[TABLE]

We now specify the topology that we will use to study the limit of the discounted biogas production problem when the discount factor $\delta$ goes to 0.

Definition 2.

For $b>0$ , we denote by $L^{1}\big{(}0,\infty;\mathbb{R}^{2},e^{-bt}dt\big{)}$ the weighted Lebesgue space of measurable functions $y(\cdot)$ from $[0,\infty)$ to $\mathbb{R}^{2}$ such that

[TABLE]

and we denote $W^{1,1}\big{(}0,\infty;\mathbb{R}^{2},e^{-bt}dt\big{)}$ the weighted Sobolev space of measurable functions $y(\cdot)$ satisfying

[TABLE]

We consider the topology on $W^{1,1}\big{(}0,\infty;\mathbb{R}^{2},e^{-bt}dt\big{)}$ for which a sequence $y_{n}(\cdot)$ converges to $y(\cdot)$ if and only if

- $y_{n}(\cdot)$ converges uniformly to $y(\cdot)$ on compact intervals,
- $\dot{y}_{n}(\cdot)$ converges weakly to $\dot{y}(\cdot)$ in $L^{1}\big{(}0,\infty;\mathbb{R}^{2},e^{-bt}dt\big{)}$ .

Now, we define the notion of $\Gamma-$ limit in our context (see [33] for further details).

Definition 3.

For a given initial condition $\xi\in\cal D$ and trajectory $\zeta(\cdot)\in{\cal S}(\xi)$ , the $\Gamma-$ lower limit and $\Gamma-$ upper limit of $J_{\delta}(\cdot)$ are

[TABLE]

Here, we denote ${\cal N}(\zeta(\cdot))$ the set of all open neighborhoods of $\zeta(\cdot)$ of the topology on $W^{1,1}\big{(}0,\infty;\mathbb{R}^{2},e^{-bt}dt\big{)}$ given in Definition 2. If both of these limits coincide, then the $\Gamma-$ limit of $J_{\delta}(\cdot)$ is

[TABLE]

We now show that this $\Gamma-$ limit is well defined, as well as the associated optimal control problem.

Proposition 1.

For all $\xi\in\cal D$ and for all trajectories $\zeta(\cdot)\in{\cal S}(\xi)$ , the $\Gamma-$ limit of $J_{\delta}(\cdot)$ exists and we denote it as

[TABLE]

In addition, for all $\delta>0$ , the suprema are attained

[TABLE]

and these maxima converge as $\delta$ goes to 0, pointwise in $\xi$ ,

[TABLE]

Finally, if $\zeta_{\delta}(\cdot)$ is an optimal trajectory for (21), i.e. if $V_{\delta}(\xi)=J_{\delta}(\zeta_{\delta}(\cdot))$ , and if $\zeta_{\delta}(\cdot)$ converges to $\zeta_{0}(\cdot)$ in ${\cal S}(\xi)$ , then $\zeta_{0}(\cdot)$ is an optimal control for (22) and

[TABLE]

Proof.

First, we show that for all trajectories $\zeta(\cdot)=\big{(}s(t),z(t)\big{)}\in{\cal S}(\xi)$ and for $\delta$ small enough, $\delta\mapsto J_{\delta}(\zeta(\cdot))$ is increasing. We can write this function as

[TABLE]

with $g(t):=\phi\big{(}s(t),z(t)\big{)}z(t)$ , which is bounded and positive (Lemma 1),

[TABLE]

Then, we have

[TABLE]

so that for $T>0$ ,

[TABLE]

Now, since $e^{-\delta T}=1-\delta T+o(\delta)$ , there exists $\bar{\delta}>0$ such that, for all $\delta<\bar{\delta}$ , $\frac{m}{\delta}\big{(}1-e^{-\delta T})>\frac{mT}{2}$ . Then, taking $T>\frac{2M}{m\delta}$ , we conclude that $\delta\mapsto J_{\delta}(\zeta(\cdot))$ is increasing for $\delta<\bar{\delta}$ .

Next, recall that for all initial conditions and all trajectories, $\delta\mapsto J_{\delta}(\zeta(\cdot))$ is uniformly bounded (Lemma 4). Finally, since $J_{\delta}(\cdot)$ is continuous with respect to $\zeta(\cdot)$ , we can use [33, Proposition 5.7] to get the $\Gamma-$ convergence as $\delta$ goes to 0.

To show that the suprema are attained and that they converge, it is sufficient to show that there exists a countably compact set on which the suprema are attained for all $\delta$ [33, Theorem 7.4]. The set $\mathcal{S}(\xi)$ is clearly independent of $\delta$ so that we now need to show that $\mathcal{S}(\xi)$ is countably compact for the topology on $W^{1,1}\big{(}0,\infty;\mathbb{R}^{2},e^{-bt}dt\big{)}$ given in Definition 2. However, since $W^{1,1}\big{(}0,\infty;\mathbb{R}^{2},e^{-bt}dt\big{)}$ is a metric space, any compact set is countably compact, so we only need to prove the compactness of $\mathcal{S}(\xi)$ .

For each $\xi\in\cal D$ we set

[TABLE]

where $P_{\cal L(\xi)}$ is the projection on the convex set $\cal L(\xi)$ . Then $F_{\xi}$ has linear growth, so that we can define

[TABLE]

where $||F_{\xi}(\zeta)||:=\sup_{\eta\in F_{\xi}(\zeta)}||\eta||$ . Note that $F$ is upper semi-continuous and has compact non-empty convex images (such a map is known as a Marchaud map [34]). With this, the set $\mathcal{S}(\xi)$ is the set of absolutely continuous solutions of the differential inclusion

[TABLE]

We can therefore use [34, Theorem 3.5.2] to establish that $\mathcal{S}(\xi)$ is compact in $W^{1,1}\big{(}0,\infty;\mathbb{R}^{2},e^{-bt}dt\big{)}$ for $b>c$ .

∎

We now relate the average and discounted biogas production problems.

Proposition 2.

For all $\xi\in\cal D$ we have

[TABLE]

Proof.

We adapt here results of [31] given for minimization problems to maximization problems by changing the sign of the reward. We give here the main steps for the first inequality, the second is obtained similarly. First, [31, Lemma 3.3] gives

[TABLE]

and then [31, Corollary 3.5] states that for all $T>0$ , all $\varepsilon>0$ we have

[TABLE]

for all small $\delta$ . Taking the limit as $T\rightarrow\infty$ and $\delta\rightarrow 0$ gives the result.

∎

3.2 Solution of Optimal Control Problems

We now solve the optimal control problems (18) and (19) and show that their value functions are equal to the limit (22) of the discounted problem. We start by determining an upper bound for the value functions and then we will exhibit controls that attain this bound.

Proposition 3.

For all initial conditions $\xi\in\cal D$

[TABLE]

Proof.

With the monotonicity properties of $\mu(\cdot,\cdot)$ of Assumption 1, we have that $z\mapsto\phi(s,z)$ is non increasing and $z\mapsto\phi(s,z)z$ is non decreasing. This implies that

[TABLE]

and

[TABLE]

First, we consider the case when $z_{0}\leqslant 1$ . For any control $u(\cdot)$ , we have

[TABLE]

Taking the upper limit as $T$ goes to infinity and the supremum with respect to $u(\cdot)$ we get the result.

Next, for $z_{0}\geqslant 1$ , we have

[TABLE]

Using Lemma 3 we get that $\overline{J}^{\infty}(\xi,u(\cdot))\leqslant\overline{\phi}(1)$ and we conclude taking the supremum with respect to $u(\cdot)$ .

∎

Note that the existence of a maximum of $s\mapsto\phi(s,1)=\mu(s,s_{in}-s)(s_{in}-s)$ on $(0,s_{in})$ follows from Assumption 1. We will denote a substrate level at which such a maximum is attained as

$$\bar{s}\in\operatorname*{arg\,max}_{s\in(0,s_{in})}\phi(s,1).$$

Proposition 4.

For any initial condition $\xi\in\cal D$ , any control $\overline{u}(\cdot)\in{\cal U}(0,\infty)$ that drives the system asymptotically to the state $(\bar{s},1)$ is optimal for problems (18), (19) and (22). We then have

[TABLE]

Proof.

The continuity of $\phi$ implies that for all $\varepsilon>0$ , there exists a time $t_{\varepsilon}\geqslant 0$ such that, for all $t\geqslant t_{\varepsilon}$ ,

[TABLE]

Since $s_{\xi,\bar{u}}(\cdot)$ and $z_{\xi,\bar{u}}(\cdot)$ take values in the compact set $\cal L(\xi)$ (11), there is a constant $M_{\xi}>0$ such that, for all $t\geqslant 0$ ,

[TABLE]

Then, for all $T\geqslant t_{\varepsilon}$ , from (26) and (27)

[TABLE]

and we have

[TABLE]

Using Propositions 2 and 3, we get the equality of value functions (25) and deduce the optimality of $\overline{u}(\cdot)$ for both average biogas production problems (18) and (19). We proceed similarly to get

[TABLE]

and we have

[TABLE]

Then, Proposition 2 implies that $\overline{u}(\cdot)$ is also optimal for problem (22).

∎

With Lemma 3, we know that all persistently exciting admissible controls make $z(\cdot)$ converge to 1, and from Lemma 2, we know that the feedback $\psi_{s^{*}}$ defined in (12) with $s^{*}=\bar{s}$ guarantees that $s(\cdot)$ reaches $\bar{s}$ . Then, from the previous Proposition we have the following result.

Proposition 5.

For any initial condition $\xi\in\cal D$ satisfying Assumption 2, the most rapid approach feedback to $\bar{s}$ , defined in (12) and denoted $\psi_{\bar{s}}$ , is optimal for both average production problems (18) and (19) and for the limit (22) of the discounted production problem.

Clearly, there is not a unique optimal control for the infinite horizon problems that we have considered. For example, in the case of a growth function that depends only on the substrate and that is monotone (such as the Monod growth function), the constant control $u=\mu(\bar{s})$ can also drive the system to the state $(\bar{s},1)$ . Nonetheless, for the control $\psi_{\bar{s}}$ , we are able to state in the next section an estimation of the sub-optimality for the finite horizon problem.
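
The behaviour described in Propositions 4 and 5 can be checked numerically. The sketch below is only an illustration under stated assumptions: the dynamics (7) and the feedback (12) are not restated in this section, so the code assumes the reduced form $\dot{s}=(u-\mu z)(s_{in}-s)$, $\dot{z}=\mu z(1-z)$ and a feedback that applies $u=0$ above the target, $u=u_{\max}$ below it and $u=\mu z$ on it. The Haldane parameters, `S_IN`, `U_MAX`, the initial condition, the explicit Euler step and the helper names are all hypothetical choices.

```python
import math

# Hypothetical parameter values, chosen only for illustration.
S_IN, U_MAX = 5.0, 1.0
MU_BAR, K_S, K_I = 1.0, 1.0, 2.0

def mu(s):
    """Haldane growth rate (depends on the substrate only)."""
    return MU_BAR * s / (K_S + s + s * s / K_I)

def phi(s):
    """phi(s, z) = mu(s) (s_in - s); independent of z for Haldane."""
    return mu(s) * (S_IN - s)

def argmax_unimodal(f, lo, hi, tol=1e-10):
    """Golden-section search for the maximizer of a unimodal function."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    while hi - lo > tol:
        c = hi - g * (hi - lo)
        d = lo + g * (hi - lo)
        if f(c) > f(d):
            hi = d
        else:
            lo = c
    return 0.5 * (lo + hi)

def average_reward(s0, z0, s_target, horizon, dt=0.01):
    """Explicit Euler simulation of the assumed dynamics under the most
    rapid approach feedback; returns (average reward, final s, final z)."""
    s, z, total = s0, z0, 0.0
    for _ in range(int(round(horizon / dt))):
        m = mu(s)
        if s > s_target:
            u = 0.0
        elif s < s_target:
            u = U_MAX
        else:
            u = min(m * z, U_MAX)   # singular control: holds s at the target
        total += phi(s) * z * dt    # biogas flow rate mu(s) x = phi(s) z
        s_new = s + dt * (u - m * z) * (S_IN - s)
        if (s - s_target) * (s_new - s_target) < 0.0:
            s_new = s_target        # stop at the target instead of overshooting
        z += dt * m * z * (1.0 - z)
        s = s_new
    return total / horizon, s, z

s_bar = argmax_unimodal(phi, 0.0, S_IN)
avg, s_end, z_end = average_reward(3.0, 0.5, s_bar, horizon=400.0)
print(f"s_bar = {s_bar:.4f}, phi(s_bar) = {phi(s_bar):.4f}, average = {avg:.4f}")
```

Under these assumptions the simulated average reward stays below $\phi(\bar{s},1)$ and approaches it as the horizon grows, since the transient contributes a bounded loss that is divided by $T$.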

4 Finite Horizon and Sub-optimal Controls

We now examine the problem of maximizing biogas production over a finite horizon for a time interval $[t_{0},T]$ where $T$ is fixed. For this we consider the following reward

[TABLE]

where we recall that $\big{(}s_{t_{0},\xi,u}(\cdot),z_{t_{0},\xi,u}(\cdot)\big{)}$ is the solution of (7) with control $u(\cdot)\in{\cal U}(t_{0},T)$ and initial condition $\xi\in{\cal D}$ . The optimal control problem consists in maximizing this functional with respect to the dilution rate, so that the associated value function is

[TABLE]

We also consider auxiliary optimal control problems, which consist in maximizing the reward, for a given $z_{1}\in[\min(z_{0},1),\max(z_{0},1)]$ ,

[TABLE]

for the same dynamics (7). The value functions of these auxiliary problems are then defined as

[TABLE]

The resolution of these auxiliary problems will be presented in Section 4.1.

We now show that the value functions of the original problem (29) and the auxiliary problems (31) are related.

Proposition 6.

For all $\xi\in\cal D$ , $t_{0}<T$ and any $z_{1}\in[\min(z_{0},1),\max(z_{0},1)]$ , we have the following frame for the value function $V$ of the original problem

[TABLE]

Proof.

We start with the case $z_{0}\leqslant 1$ . For a given control $u(\cdot)\in{\cal U}(t_{0},T)$ , we define the following time

[TABLE]

which is well defined since $z_{t_{0},\xi,u}(\cdot)$ is monotone. Then, for $t_{0}\leqslant t\leqslant t_{1}$ we have $z_{0}\leqslant z_{t_{0},\xi,u}(t)\leqslant z_{1}\leqslant 1$ and with the monotonicity properties of $\mu(\cdot,\cdot)$ of Assumption 1 we have

[TABLE]

Next, for $t_{1}\leqslant t\leqslant T$ we have $z_{0}\leqslant z_{1}\leqslant z_{t_{0},\xi,u}(t)\leqslant 1$ and

[TABLE]

Combining these inequalities we get

[TABLE]

Now, since $z_{0}\leqslant z_{1}\leqslant 1$ we have

[TABLE]

For the case $z_{0}\geqslant 1$ , we proceed in a similar way to get

[TABLE]

We conclude by taking the supremum over all admissible controls.

∎

The interest of the previous frame on the value function is that it allows us to find controls for which we have an estimate of the sub-optimality for the original problem.

Proposition 7.

For all $\xi\in\cal D$ and all $t_{0}<T$ , any optimal control $u^{\star}_{z_{1}}(\cdot)$ for the reward $J_{z_{1}}(t_{0},\xi,\cdot)$ guarantees a (sub-optimal) value for the original criterion $J(t_{0},\xi,\cdot)$ that satisfies

[TABLE]

and we have the following estimation of the value function $V$

[TABLE]

Proof.

From the proof of Proposition 6, for any control $u(\cdot)\in{\cal U}(t_{0},T)$ , we have

[TABLE]

Evaluating this for any optimal control $u^{\star}_{z_{1}}(\cdot)$ for the reward $J_{z_{1}}(t_{0},\xi,\cdot)$ gives the sub-optimality frame (33). The sub-optimality estimation (34) then follows from (32) and (33).

∎

4.1 Resolution of Auxiliary Problems

In order to obtain sub-optimal controls for problem (29) we now need to solve the auxiliary problem (31) for a given $z_{1}\in[\min(z_{0},1),\max(z_{0},1)]$ . The optimal control of this auxiliary problem is an autonomous feedback, even though the horizon is fixed and finite. It is similar to the optimal feedback for the infinite horizon problem $\psi_{\bar{s}}$ , defined in (12), and it drives the system towards a maximizer of $s\mapsto\phi(s,z_{1})$ but now, this maximizing substrate level depends on $z_{1}$ .

We first need an assumption on the uniqueness of a maximum of $\phi(\cdot,z_{1})$ .

Assumption 3.

For each $z_{1}\geqslant 0$ , the function $s\mapsto\phi(s,z_{1})$ admits a unique maximum on $(0,s_{in})$ , and we denote the substrate level at which this maximum is attained as

$$\bar{s}(z_{1}):=\operatorname*{arg\,max}_{s\in(0,s_{in})}\phi(s,z_{1}).\qquad(35)$$

Note that this implies that $s\mapsto\phi(s,z_{1})$ is increasing on $(0,\bar{s}(z_{1})]$ and decreasing on $[\bar{s}(z_{1}),s_{in})$ .

Proposition 8.

For all $\xi\in\cal D$ satisfying Assumption 2 and all $t_{0}<T$ , the most rapid approach feedback to $\bar{s}(z_{1})$ , defined in (12) and denoted $\psi_{\bar{s}(z_{1})}$ , is optimal for the auxiliary problem (31).

Proof.

We start with the case $s_{0}\geqslant\bar{s}(z_{1})$ . With the control $u=0$ , the solution of (7) is such that $s_{t_{0},\xi,0}(\cdot)$ is monotonic and non increasing. Therefore there exists a time $t_{min}$ , possibly larger than $T$ , such that $s_{t_{0},\xi,0}(t_{min})=\bar{s}(z_{1})$ and then the solution with the feedback (12) is, with $t_{*}=\min(t_{min},T)$

[TABLE]

Next, for all $u\in[0,u_{max}]$ and for all $(s,z)\in{\cal L}(\xi)$ ,

[TABLE]

By the theorem of comparison of solutions of scalar differential equations, this implies that $s_{t_{0},\xi,0}(t)\leqslant s_{t_{0},\xi,u}(t)$ , up to time $t_{*}$ , for all controls $u(\cdot)\in{\cal U}(t_{0},T)$ . Since $s\mapsto\phi(s,z_{1})$ is decreasing on $[\bar{s}(z_{1}),s_{in})$ , we have

[TABLE]

Finally, as $s\mapsto\phi(s,z_{1})$ reaches its maximum at $\bar{s}(z_{1})$ we get

[TABLE]

We now consider the case $s_{0}<\bar{s}(z_{1})$ . From Assumption 2, the feedback is admissible and we have

[TABLE]

Thus, with the control $u=u_{\max}$ , the solution of (7) is such that $s_{t_{0},\xi,u_{\max}}(\cdot)$ is monotone and non decreasing. Therefore, there exists a time $t_{\max}$ , possibly larger than $T$ , such that $s_{t_{0},\xi,u_{\max}}(t_{\max})=\bar{s}(z_{1})$ and then the solution with the feedback (12) is, with $t_{*}=\min(t_{\max},T)$

[TABLE]

Next, for all $u\in[0,u_{max}]$ and for all $(s,z)\in{\cal L}(\xi)$

[TABLE]

and this implies that $s_{t_{0},\xi,u_{\max}}(t)\geqslant s_{t_{0},\xi,u}(t)$ , up to time $t_{*}$ , for all controls $u(\cdot)\in{\cal U}(t_{0},T)$ . Since $s\mapsto\phi(s,z_{1})$ is increasing on $(0,\bar{s}(z_{1})]$ , we have

[TABLE]

Finally, since $s\mapsto\phi(s,z_{1})$ reaches its maximum at $\bar{s}(z_{1})$ , we get

[TABLE]

∎
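
The statement of Proposition 8 can be illustrated numerically. The following is a rough sketch, not the authors' computation: it assumes that the auxiliary reward has the form $\int_{t_{0}}^{T}\phi(s(t),z_{1})\,dt$, that the dynamics (7) read $\dot{s}=(u-\mu z)(s_{in}-s)$, $\dot{z}=\mu z(1-z)$, and that the feedback (12) switches between $0$, $u_{\max}$ and the singular value $\mu z$. It uses a Contois growth rate so that the maximizer depends on $z_{1}$; all parameter values, the initial condition and the function names are hypothetical.

```python
import math

# Hypothetical parameter values, chosen only for illustration.
S_IN, U_MAX, MU_MAX, K_S = 5.0, 2.0, 1.0, 1.0

def mu(s, x):
    """Contois growth rate."""
    return MU_MAX * s / (K_S * x + s)

def phi(s, z):
    """phi(s, z) = mu(s, (s_in - s) z) (s_in - s)."""
    return mu(s, (S_IN - s) * z) * (S_IN - s)

def argmax_unimodal(f, lo, hi, tol=1e-10):
    """Golden-section search for the maximizer of a unimodal function."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    while hi - lo > tol:
        c = hi - g * (hi - lo)
        d = lo + g * (hi - lo)
        if f(c) > f(d):
            hi = d
        else:
            lo = c
    return 0.5 * (lo + hi)

def aux_reward(s0, z0, z1, horizon, u_const=None, dt=0.001):
    """Integral of phi(s(t), z1) over [0, horizon] (assumed form of J_{z1}).
    With u_const=None the most rapid approach feedback to s_bar(z1) is
    applied; otherwise the constant control u_const is used."""
    target = argmax_unimodal(lambda s: phi(s, z1), 1e-9, S_IN - 1e-9)
    s, z, total = s0, z0, 0.0
    for _ in range(int(round(horizon / dt))):
        m = mu(s, (S_IN - s) * z)
        if u_const is not None:
            u = u_const
        elif s > target:
            u = 0.0
        elif s < target:
            u = U_MAX
        else:
            u = min(m * z, U_MAX)   # singular control: holds s at the target
        total += phi(s, z1) * dt
        s_new = s + dt * (u - m * z) * (S_IN - s)
        if u_const is None and (s - target) * (s_new - target) < 0.0:
            s_new = target          # stop at the target instead of overshooting
        z += dt * m * z * (1.0 - z)
        s = s_new
    return total

s0, z0, z1, T = 4.0, 0.4, 0.7, 5.0
mrap = aux_reward(s0, z0, z1, T)
for u in (0.0, 0.2, 0.5, 1.0, 2.0):
    r = aux_reward(s0, z0, z1, T, u_const=u)
    print(f"constant u = {u:.1f}: {r:.4f}   feedback: {mrap:.4f}")
```

The comparison only covers a handful of constant controls, so it is a sanity check of the proposition rather than evidence of optimality over all admissible controls.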

5 Application to Particular Growth Functions

The controls that we have considered up to now are all most rapid approach feedbacks to $\bar{s}(z_{1})$ , with $z_{1}\in[\min(z_{0},1),\max(z_{0},1)]$ , and this leads to the question of which is best in terms of biogas production. It turns out that it depends on the initial conditions and the horizon considered.

Indeed, we know that for an infinite horizon, the feedback $\psi_{\bar{s}(z_{1})}$ with $z_{1}=1$ is optimal, so we can expect that when the horizon is large, the best of the considered feedbacks is obtained for $z_{1}$ close to 1. On the other hand, when the horizon is small, the feedback $\psi_{\bar{s}(z_{0})}$ would seem to be the best option: this strategy consists in remaining close to the maximum of the biogas flow rate corresponding to the initial condition, whereas another feedback could drive the system away, towards another maximizing state that cannot be reached in time.

In this section, we apply our main results to the most common growth functions and explore with numerical simulations the question of determining the best feedback $\psi_{\bar{s}(z_{1})}$ for a given initial condition and final time. In particular, we will work with the Monod function

$$\mu_{M}(s)=\frac{\mu_{max}\,s}{K_{s}+s},$$

the Haldane function

$$\mu_{H}(s)=\frac{\bar{\mu}\,s}{K_{s}+s+s^{2}/K_{i}},$$

and the Contois function

$$\mu_{C}(s,x)=\frac{\mu_{max}\,s}{K_{s}\,x+s},$$

where $\mu_{max}$ , $\bar{\mu}$ , $K_{s}$ and $K_{i}$ are positive numbers. We shall see later that these functions satisfy our assumptions (Lemma 5).
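
For readers who want to experiment, the three growth laws can be written down in a few lines. This is a minimal sketch that assumes the standard textbook forms of the Monod, Haldane and Contois functions and the relation $\phi(s,z)=\mu(s,(s_{in}-s)z)(s_{in}-s)$; the default parameter values, the value of `s_in` and the function names are hypothetical.

```python
def monod(s, mu_max=1.0, k_s=1.0):
    """Monod growth rate mu_M(s)."""
    return mu_max * s / (k_s + s)

def haldane(s, mu_bar=1.0, k_s=1.0, k_i=2.0):
    """Haldane growth rate mu_H(s), inhibited at high substrate levels."""
    return mu_bar * s / (k_s + s + s * s / k_i)

def contois(s, x, mu_max=1.0, k_s=1.0):
    """Contois growth rate mu_C(s, x), which also depends on the biomass x."""
    return mu_max * s / (k_s * x + s)

def phi(mu, s, z, s_in=5.0):
    """phi(s, z) = mu(s, (s_in - s) z) (s_in - s) for a growth rate mu(s, x)."""
    return mu(s, (s_in - s) * z) * (s_in - s)

def phi_monod(s, z):
    # A substrate-only law simply ignores the biomass argument.
    return phi(lambda s_, x_: monod(s_), s, z)

def phi_contois(s, z):
    return phi(contois, s, z)

print(phi_monod(2.0, 0.5), phi_monod(2.0, 1.0))      # no dependence on z
print(phi_contois(2.0, 0.5), phi_contois(2.0, 1.0))  # decreases as z increases
```

With these definitions, `phi_monod` does not change with `z`, whereas `phi_contois` decreases in `z` while `phi_contois(s, z) * z` increases, which is the monotonicity pattern used throughout the proofs.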

First, note that the Monod and Haldane functions only depend on the substrate, so that in this case, the maximizers $\bar{s}(z_{1})$ , defined in (35), are all equal to $\bar{s}(1)=\bar{s}$ , for all $z_{1}\in[\min(z_{0},1),\max(z_{0},1)]$ . We illustrate the associated feedback $\psi_{\bar{s}}$ for a Haldane function with a graph of the state space trajectories in Figure 1. The case of a Monod function leads to a similar dynamical behavior and the only major difference is the value of $\bar{s}$ .

From now on we will only consider the Contois growth function, for which we plot the trajectories in state space obtained with the feedback $\psi_{\bar{s}(z_{0})}$ in Figure 2.

To determine which of the feedbacks $\psi_{\bar{s}(z_{1})}$ is the best, we now compute the associated reward for a range of values of $z_{1}\in[\min(z_{0},1),\max(z_{0},1)]$ and of final times for a given initial condition. In order to easily identify the maximum of $J(\xi,\psi_{\bar{s}(z_{1})}(\cdot))$ with respect to $z_{1}$ , we normalize the average reward (15) by computing

[TABLE]

where the minimum and maximum are taken for $y\in[\min(z_{0},1),\max(z_{0},1)]$ . Hence, for each final time $T$ , the maximum reward is achieved for $z_{1}$ such that $J_{N}(T,z_{1})=1$ and the minimum when $J_{N}(T,z_{1})=0$ .
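
The normalization can be reproduced with a short script. The sketch below is illustrative only: it assumes the reduced dynamics $\dot{s}=(u-\mu z)(s_{in}-s)$, $\dot{z}=\mu z(1-z)$, a reward rate $\phi(s,z)z$, a Contois growth rate, and a normalization of the form $(J-\min_{y}J)/(\max_{y}J-\min_{y}J)$. The parameter values, the grid of $z_{1}$, the final times and the function names are hypothetical, and the numbers it prints are not those of the figures.

```python
import math

# Hypothetical parameter values, chosen only for illustration.
S_IN, U_MAX, MU_MAX, K_S = 5.0, 2.0, 1.0, 1.0

def mu(s, x):
    """Contois growth rate."""
    return MU_MAX * s / (K_S * x + s)

def phi(s, z):
    return mu(s, (S_IN - s) * z) * (S_IN - s)

def argmax_unimodal(f, lo, hi, tol=1e-10):
    """Golden-section search for the maximizer of a unimodal function."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    while hi - lo > tol:
        c = hi - g * (hi - lo)
        d = lo + g * (hi - lo)
        if f(c) > f(d):
            hi = d
        else:
            lo = c
    return 0.5 * (lo + hi)

def reward(s0, z0, z1, horizon, dt=0.005):
    """Biogas produced on [0, horizon] under the most rapid approach
    feedback to s_bar(z1), using an explicit Euler scheme."""
    target = argmax_unimodal(lambda s: phi(s, z1), 1e-9, S_IN - 1e-9)
    s, z, total = s0, z0, 0.0
    for _ in range(int(round(horizon / dt))):
        m = mu(s, (S_IN - s) * z)
        u = 0.0 if s > target else U_MAX if s < target else min(m * z, U_MAX)
        total += m * (S_IN - s) * z * dt          # reward rate phi(s, z) z
        s_new = s + dt * (u - m * z) * (S_IN - s)
        if (s - target) * (s_new - target) < 0.0:
            s_new = target                        # do not overshoot the target
        z += dt * m * z * (1.0 - z)
        s = s_new
    return total

def normalize(values):
    """Map rewards to [0, 1]: 0 for the worst z1, 1 for the best z1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

s0, z0 = 1.0, 0.3
z1_grid = [z0 + k * (1.0 - z0) / 7 for k in range(8)]
table = {}
for T in (1.0, 5.0, 20.0, 60.0):
    j_n = normalize([reward(s0, z0, z1, T) for z1 in z1_grid])
    table[T] = j_n
    best = z1_grid[max(range(len(j_n)), key=j_n.__getitem__)]
    print(f"T = {T:5.1f}: best z1 on the grid = {best:.2f}")
```

Dividing the rewards by $T$ before normalizing would not change the result, since the normalization is invariant under positive scaling.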

Figure 3 shows a case when $z_{0}<1$ and Figure 4 is an example of $z_{0}>1$ . We can see clearly that for small final times, the maximum is attained for a value of $z_{1}$ close to $z_{0}$ and that for $z_{1}=1$ the reward is the smallest. However, as the final time increases, the value of $z_{1}$ for which the reward is maximum approaches 1, and with the feedback $\psi_{\bar{s}(z_{0})}$ the reward is the smallest. In particular, we can see that the best of the feedbacks $\psi_{\bar{s}(z_{1})}$ depends on the final time.

This leads us to consider a new feedback that keeps the system in the set of maximizers

[TABLE]

We therefore introduce the following most rapid approach feedback to $\overline{\cal S}$

[TABLE]

where $\bar{u}(s,z)$ is the feedback that keeps the system in the set $\overline{\cal S}$ , that we compute by differentiating with respect to time the equation $s(t)=\bar{s}(z(t))$ .
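
This computation can be sketched as follows, under two assumptions that are not restated in this section: that the dynamics (7) take the reduced form $\dot{s}=(u-\mu z)(s_{in}-s)$, $\dot{z}=\mu z(1-z)$ with $\mu=\mu(s,(s_{in}-s)z)$, and that $z\mapsto\bar{s}(z)$ is differentiable. Differentiating $s(t)=\bar{s}(z(t))$ with respect to time gives

$$\dot{s}=\bar{s}'(z)\,\dot{z}\quad\Longrightarrow\quad(\bar{u}-\mu z)(s_{in}-s)=\bar{s}'(z)\,\mu z(1-z),$$

so that, on the set where $s=\bar{s}(z)$ ,

$$\bar{u}(s,z)=\mu\big(s,(s_{in}-s)z\big)\,z\left(1+\frac{\bar{s}'(z)\,(1-z)}{s_{in}-s}\right).$$

At $z=1$ this reduces to $\bar{u}=\mu(\bar{s},s_{in}-\bar{s})$ , the constant control that holds the system at $(\bar{s},1)$ . The expression is only admissible where it takes values in $[0,u_{\max}]$ .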

We first illustrate this feedback in Figure 5 where we show the states as functions of time and the open loop realizations of the feedbacks $\psi_{\bar{s}(z_{0})}$ , $\psi_{\bar{s}(1)}$ and $\psi_{\overline{\cal S}}$ . Next, in Figure 6 we compare the reward of the feedback $\psi_{\overline{\cal S}}$ to the others and we can notice that the reward associated with the feedback $\psi_{\overline{\cal S}}$ is always one of the best, although for any given final time it is possible to do better with a feedback $\psi_{\bar{s}(z_{1})}$ for the right $z_{1}$ .

Note also that the feedback $\psi_{\overline{\cal S}}$ will drive the system asymptotically towards the state $(s,z)=(\bar{s},1)$ so that it is also optimal for the infinite horizon problems (18), (19) and (22).

In Figure 8, we show the difference between the rewards of the feedbacks $\psi_{\bar{s}(1)}$ and $\psi_{\bar{s}(z_{0})}$ as a function of the initial condition for various final times.

From this, we see that the best feedback changes depending on the initial condition and the horizon considered.

The sub-optimality estimation (33) is affected similarly, as this bound depends on the initial condition and in particular, the distance to the set $\{z=1\}$ has a major impact on the sub-optimality of the considered feedbacks. In addition, the growth function has an influence on our estimation, through $W_{z_{1}}(\cdot)$ , and we illustrate this in Figure 9 by plotting this value function for the Haldane and the Contois growth function. Observe that, for the Contois growth function, $W_{z_{1}}(\cdot)$ varies significantly with the initial biomass and thus the sub-optimality bound as well. This can be attributed to the dependence of the Contois growth function on biomass concentration and this effect is not seen with the Haldane growth function, which depends only on the substrate.

We finish this section with a Lemma that shows that the considered growth functions satisfy our assumptions.

Lemma 5.

For all positive $\mu_{max}$ , $\bar{\mu}$ , $K_{s}$ and $K_{i}$ the Monod, Haldane and Contois growth functions satisfy Assumptions 1 and 3.

Proof.

Notice that the function $\phi$ with the Monod or Haldane function does not depend on $z$ . Let us show that the function $\mu_{M}$ is increasing and strictly concave

[TABLE]

Now, since the function $\phi(\cdot,1)$ is non-negative on $[0,s_{in}]$ and vanishes at 0 and $s_{in}$ it admits a maximum on $(0,s_{in})$ . One has

[TABLE]

The function $\phi(\cdot,1)$ is thus strictly concave on $(0,s_{in})$ , which provides the uniqueness of its maximum.

For the Haldane function, we have

[TABLE]

so that $\frac{d}{ds}\phi(0,1)>0$ and $\frac{d}{ds}\phi(s_{in},1)<0$ . Since $\frac{d}{ds}\phi(\cdot,1)$ is continuous, it must have an odd number of zeroes in the interval $(0,s_{in})$ . But the equation $\frac{d}{ds}\phi(s,1)=0$ admits at most 2 solutions, and $\phi(0,1)=\phi(s_{in},1)=0$ ; therefore $\phi(\cdot,1)$ has a unique maximum.

For the Contois function, notice that $\mu_{C}(s,x)=\mu_{M}(s/x)$ so that, for $z_{1}\in[\min(z_{0},1),\max(z_{0},1)]$

[TABLE]

and since $s\mapsto\frac{s}{(s_{in}-s)z_{1}}$ is an increasing function, $\phi(\cdot,z_{1})$ is also strictly concave.

∎
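
As a numerical sanity check of the uniqueness claims, the maximizers can be compared with closed-form candidates. The formulas used below are not taken from the text: they come from setting $\frac{d}{ds}\phi=0$ by hand for the standard Monod and Contois forms, which gives $\bar{s}=\sqrt{K_{s}(K_{s}+s_{in})}-K_{s}$ for Monod and $\bar{s}(z_{1})=s_{in}\sqrt{K_{s}z_{1}}/(1+\sqrt{K_{s}z_{1}})$ for Contois. The parameter values and function names are hypothetical.

```python
import math

# Hypothetical parameter values, chosen only for illustration.
S_IN, MU_MAX, K_S = 5.0, 1.0, 1.0

def phi_monod(s):
    """phi(s, 1) for the Monod growth rate."""
    return MU_MAX * s / (K_S + s) * (S_IN - s)

def phi_contois(s, z1):
    """phi(s, z1) for the Contois growth rate, with x = (s_in - s) z1."""
    x = (S_IN - s) * z1
    return MU_MAX * s / (K_S * x + s) * (S_IN - s)

def argmax_unimodal(f, lo, hi, tol=1e-10):
    """Golden-section search for the maximizer of a unimodal function."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    while hi - lo > tol:
        c = hi - g * (hi - lo)
        d = lo + g * (hi - lo)
        if f(c) > f(d):
            hi = d
        else:
            lo = c
    return 0.5 * (lo + hi)

def s_bar_monod():
    """Closed-form candidate (own derivation): root of s^2 + 2 K_s s - K_s s_in."""
    return math.sqrt(K_S * (K_S + S_IN)) - K_S

def s_bar_contois(z1):
    """Closed-form candidate (own derivation) for the Contois maximizer."""
    r = math.sqrt(K_S * z1)
    return S_IN * r / (1.0 + r)

numeric_monod = argmax_unimodal(phi_monod, 1e-9, S_IN - 1e-9)
print(f"Monod:   numeric {numeric_monod:.6f}   closed form {s_bar_monod():.6f}")
for z1 in (0.3, 0.7, 1.0, 1.5):
    numeric = argmax_unimodal(lambda s: phi_contois(s, z1), 1e-9, S_IN - 1e-9)
    print(f"Contois: z1 = {z1:.1f}   numeric {numeric:.6f}   closed form {s_bar_contois(z1):.6f}")
```

For the Contois function the maximizer increases with $z_{1}$ , which is why the feedbacks $\psi_{\bar{s}(z_{1})}$ differ from one another in that case.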

6 Conclusions

In this work, we have proposed a novel approach to obtain autonomous sub-optimal feedbacks for the open problem of maximizing biogas production in the chemostat model out of equilibrium. These controllers generalize the “most-rapid approach path” feedback control that is known to be optimal when the initial condition belongs to a certain manifold. Indeed, we obtain a family of feedback controls of similar structure, for which we are able to give bounds on the sub-optimality. This last point deserves to be underlined, as it is usually difficult to evaluate a priori the performance of sub-optimal controllers without having to determine or compute the optimal solution. This family also gives practitioners the flexibility to choose a controller depending on the time horizon, to simply pick one when the finite horizon is poorly known (as each controller guarantees a sub-optimality bound), or to adjust it when the horizon is changed. For the infinite horizon, we show that each controller guarantees the same optimal averaged reward.

This methodology, based on a framing of the dynamics, could be investigated for a larger class of dynamics, such as the two-step model, and will be the subject of future work.

Acknowledgments

The first and second authors were supported by FONDECYT grant 1160567, and by Basal Program CMM-AFB 170001 from CONICYT-Chile. The first author was supported by a doctoral fellowship CONICYT-PFCHA/Doctorado Nacional/2017-21170249. The third author was supported by the LabEx NUMEV incorporated into the I-Site MUSE.

Appendix: A Particular Example

We construct here a control $u(\cdot)$ for which the average rewards (16) and (17) do not coincide. For this, let us consider an initial condition $\xi=(s_{0},z_{0})=(\varepsilon,1)$ , with $\varepsilon\in(0,s_{in})$ fixed. The set $\{(s,1)\in\mathbb{R}^{2}_{+}:s\in[0,s_{in}]\}$ is clearly invariant for the dynamics (7) and therefore the chosen initial condition ensures that the trajectories $(s_{\xi,u}(\cdot),z_{\xi,u}(\cdot))$ remain in this set.

Now consider the following two paths:

(A) Starting at $\xi:=(\varepsilon,1)$ , use the control $u=u_{\max}$ to reach a prescribed level of substrate $s^{*}\in(\varepsilon,s_{in})$ in finite time. Then, apply the control $u=0$ to return to $\xi$ in finite time, which is possible by Assumption 2. Denote this control by $u_{*}$ , and let $t_{*}$ be the (finite) time necessary to follow this path and $I_{*}$ be the biogas produced by this path.

(B) Starting at $\xi:=(\varepsilon,1)$ , use $u=\mu(\varepsilon,s_{in}-\varepsilon)$ to stay at $(s=\varepsilon,z=1)$ for any time interval.

Then, define the control $u(\cdot)$ as follows:

• For $t\in[0,t_{*}]$ , set $u(t)=\mu(\varepsilon,s_{in}-\varepsilon)$ so that the biogas production for this period is $I_{\varepsilon}:=t_{*}\phi(\varepsilon,1)$ .

• For $t\in(2^{2k}t_{*},2^{2k+1}t_{*}]$ , with $k\in\mathbb{N}$ , set $u=u_{*}$ in order to follow the path (A) repeatedly $2^{2k}$ times. For each of these intervals the biogas production is $2^{2k}I_{*}$ .

• For $t\in(2^{2k+1}t_{*},2^{2k+2}t_{*}]$ , with $k\in\mathbb{N}$ , set $u=\mu(\varepsilon,s_{in}-\varepsilon)$ . For each of these intervals the biogas production is $2^{2k+1}I_{\varepsilon}$ .

Thus, when we apply control $u(\cdot)$ up to a time $2^{2N}t_{*}$ , for a given $N\geqslant 1$ , the average biogas production is computed as follows

[TABLE]

which yields

[TABLE]

We have used here the fact that the sum $s_{N}=\sum_{j=1}^{N}2^{-2j}$ converges to $1/3$ . Indeed, this follows from the identity

[TABLE]

However, for the same control $u(\cdot)$ , the average biogas production is, up to time $2^{2N+1}t_{*}$ , computed as follows

[TABLE]

which yields

[TABLE]

Since $s^{*}>\varepsilon$ , it follows that $I_{*}>I_{\varepsilon}$ , and consequently, $L_{\infty}>K_{\infty}$ . We thus obtain that

[TABLE]
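
The oscillation of the averages can be verified with exact arithmetic. The sketch below follows the interval-by-interval construction of $u(\cdot)$ described above; the values chosen for $I_{*}$ , $I_{\varepsilon}$ and $t_{*}$ are hypothetical, and the two limits it compares against, $(I_{*}+2I_{\varepsilon})/(3t_{*})$ along the times $2^{2N}t_{*}$ and $(2I_{*}+I_{\varepsilon})/(3t_{*})$ along the times $2^{2N+1}t_{*}$ , are obtained here by summing the geometric series directly and presumably correspond to $K_{\infty}$ and $L_{\infty}$ .

```python
from fractions import Fraction

def average_production(m, i_star, i_eps, t_star=Fraction(1)):
    """Average biogas production of u(.) over [0, 2**m * t_star]."""
    total = Fraction(i_eps)          # first interval [0, t_star]: stay at eps
    for j in range(m):               # interval (2**j t_star, 2**(j+1) t_star]
        # Even j: path (A) repeated 2**j times; odd j: stay at eps.
        gain = i_star if j % 2 == 0 else i_eps
        total += 2**j * Fraction(gain)
    return total / (2**m * t_star)

# Hypothetical productions per unit path, with I_* > I_eps.
I_STAR, I_EPS = 3, 1
k_limit = Fraction(I_STAR + 2 * I_EPS, 3)   # limit along even exponents
l_limit = Fraction(2 * I_STAR + I_EPS, 3)   # limit along odd exponents

for n in (1, 2, 5, 10, 20):
    even = average_production(2 * n, I_STAR, I_EPS)
    odd = average_production(2 * n + 1, I_STAR, I_EPS)
    print(f"N = {n:2d}: average at 2^(2N) t_* = {float(even):.6f}, "
          f"at 2^(2N+1) t_* = {float(odd):.6f}")
print(f"limits: {float(k_limit):.6f} and {float(l_limit):.6f}")
```

Because the two subsequences of averages converge to different values whenever $I_{*}\neq I_{\varepsilon}$ , the lower and upper limits of the average reward differ for this control.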

Bibliography

[1] Russell, D.: Practical wastewater treatment. Wiley (2006)
[2] Rehl, T., Muller, J.: CO2 abatement costs of greenhouse gas (GHG) mitigation by different biogas conversion pathways. Journal of Environmental Management 114(15), 13–25 (2013)
[3] Harmand, J., Lobry, C., Rapaport, A., Sari, T.: The Chemostat: Mathematical Theory of Microorganisms Cultures. Wiley, Chemical Engineering Series, Chemostat and Bioprocesses Set 1 (2017)
[4] Bernard, O., Hadj-Sadok, Z., Dochain, D., Genovesi, A., Steyer, J.P.: Dynamical model development and parameter identification for an anaerobic wastewater treatment process. Biotechnology and Bioengineering 75, 424–438 (2001)
[5] Bernard, O., Chachuat, B., Hélias, A., Rodriguez, J.: Can we assess the model complexity for a bioprocess: theory and example of the anaerobic digestion process. Water Science and Technology 52(1), 85–92 (2006)
[6] Steyer, J.P., Buffière, P., Rolland, D., Moletta, R.: Advanced control of anaerobic digestion processes through disturbances modeling. Water Research 33(9), 2059–2068 (1999)
[7] Rodríguez, J., Ruiz, G., Molina, F., Roca, E., Lema, J.: A hydrogen-based variable-gain controller for anaerobic digestion processes. Water Science and Technology 54(2), 57–62 (2006)
[8] Dimitrova, N., Krastanov, M.: Nonlinear stabilizing control of an uncertain bioprocess. Int. J. Appl. Math. Comput. Sci 19(3), 441–454 (2009)
