Dynamic Programming Method for Best Piecewise Linear Approximation for Vector Field of Nonlinear Boundary Value Problems on the Interval [0, 1]
Duggirala Meher Krishna, Duggirala Ravi

TL;DR
This paper introduces a dynamic programming approach for optimally approximating vector fields in nonlinear boundary value problems, ensuring boundary condition adherence and numerical stability, with convergence guarantees under finer discretization.
Contribution
It presents a novel dynamic programming method for piecewise linear approximation of vector fields in boundary value problems, emphasizing stability and convergence.
Findings
Method guarantees convergence to true solution with finer discretization
Ensures boundary conditions are maintained during approximation
Provides a stable computational framework for nonlinear BVPs
Abstract
An important problem that arises in many engineering applications is the boundary value problem for ordinary differential equations. There have been many computational methods proposed for dealing with this problem. The convergence of the iterative schemes to a true solution, when one such exists, and their numerical stability are the central issues discussed in the literature. In this paper, we discuss a method for approximating the vector field, maintaining the boundary conditions and numerical stability. If a true solution exists, a subsequence of solutions convergent to one such can be produced, by finer discretization of the solution space.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Methods in Computational Mathematics · Advanced Optimization Algorithms Research · Numerical methods for differential equations
Dynamic Programming Method for Best Piecewise Linear Approximation for Vector Field of Nonlinear Boundary Value Problems on the Interval
**Duggirala Meher Krishna
Gayatri Vidya Parishad College of Engineering (Autonomous)
Madhurawada, VISAKHAPATNAM – 530 048, Andhra Pradesh, India
E-mail : [email protected]
and
Duggirala Ravi
Gayatri Vidya Parishad College of Engineering (Autonomous)
Madhurawada, VISAKHAPATNAM – 530 048, Andhra Pradesh, India
E-mail : [email protected]; [email protected];
[email protected]; [email protected] **
Abstract
An important problem that arises in many engineering applications is the boundary value problem for ordinary differential equations. There have been many computational methods proposed for dealing with this problem. The convergence of the iterative schemes to a true solution, when one such exists, and their numerical stability are the central issues discussed in the literature. In this paper, we discuss a method for approximating the vector field, maintaining the boundary conditions and numerical stability. If a true solution exists, finer discretization of the solution space converges to one such.
1 Introduction
During the last few decades there has been a remarkable growth of interest in problems associated with solving linear and nonlinear ordinary differential equations satisfying boundary conditions. For many of the nonlinear boundary value problems that occur in engineering and applied sciences, it is difficult to obtain a solution analytically. For a nonlinear boundary value problem, the difficulty lies in establishing the existence of a solution mathematically, though in some cases multiple solutions exist. Approximation of the solution space of a given differential equation has gained importance as it speeds up or helps in solving the problem efficiently. These approximation methods can be put into two classes: (i) those in which a solution is approximated numerically at a number of discrete points of the domain, and (ii) those in which a solution is approximated by a finite number of terms of a sequence of functions. The approach in (ii) is called a weighted residual method. In most of the numerical methods described in the literature, we may have to add a sufficient number of some undetermined variables with implicitly assumed conditions at one end of the domain, and adjust the additional variables until the required conditions are satisfied at the other end to obtain the solution of the boundary value problem [[2] – [4], [7], [10] – [13]], and further approximate the derivatives of the dependent variables with forward, backward or central difference operators defined at the grid points [9]. In the first part of a numerical scheme as just mentioned, the convergence may be very slow, and in the second part, convergence and stability of the particular difference scheme may depend on the selection of the approximations used for the derivatives involved in the differential equation and the boundary conditions. It may also be the case that the chosen difference method is not numerically stable, resulting in chaos phenomenon creeping into the iterative schemes, at places where matrix inversions are utilized, without the solver being explicitly aware of its entry. In some cases, suitable regularization and relaxation conditions, involving more variables, may have to be added to the constraints formulated in the previous steps. In the weighted residual methods, difference equations are generated using approximation methods with piecewise polynomial solutions [9].
Among the most popular and successful techniques for solving boundary value problems with nonlinearities is Galerkin procedure. In this approach, the solution of the ordinary differential equation is expressed as a linear combination of certain basis functions, and the coefficients of the basis functions are determined by requiring that the residual be orthogonal to each of the basis functions. The difficulty lies in the selection of basis functions to obtain the desired solution, that can take care of the boundary conditions simultaneously. In recent times, the concept of piecewise linear approximation of the differential equation gained momentum [[5], [7], [9] – [10]] The two point boundary value problems are approximated by piecewise linear ones which have analytical solutions and reduced to finding the slope of the solution at the left boundary, so that the boundary conditions at the right end of the interval are satisfied. This approach results in a complex system of non-linear algebraic equations. Some more recent and highly efficient algorithms [6], for solving these complex systems of differential and algebraic equations (DAE) can be used for computational purpose.
The motivation for the present work is the necessity of a new method that is applicable for most or all general continuous vector fields and general boundary conditions. The objective of our study is to find efficiently a solution, if exists, by an algorithm, such that the algorithm is able to detect and report when there is no solution, if an error term does not fall below a threshold, despite using various approximation schemes with several basis functions. We propose a new method based on dynamic programming for solving boundary value problems in one variable. The dynamic programming based formulation is adapted for obtaining an optimal approximation for the vector field for genera of two-point boundary value problem, which is usually formulated as an optimal control problem in the literature [1]. For improving an initial approximation, repeated application of the dynamic programming algorithm with refined discretization of the parameter space can be used. A modified Newton-Raphson method for improving an approximate solution along with updating the initial value is also discussed. These aspects are newly introduced in this work. Possible extensions to formulation of solution methods for optimal control problem and for boundary value problems of partial differential equations defined on compact simply connected convex domains are briefly described.
2 Best Piecewise Linear Approximations for Vector Fields of Two-point Boundary Value Problems
In this section, we consider the boundary value problem
[TABLE]
where , , and . The vectors are written as columns of appropriate dimensions. The objective of obtaining a piecewise linear approximation is stated as follows:
Let, for some fixed integer , be fixed node points of the time interval . 2. 2.
Then, we obtain optimal values for the parameters of form , , where and are and matrices consisting of undetermined linear combination of some fixed basis functions defined on , and is column vector of undetermined constants, such that the curve satisfying
[TABLE]
subject to the boundary and continuity conditions
[TABLE]
minimizes the error defined by
[TABLE]
where the integrand can be chosen to be any nonnegative continuous function defined on and equal to 0 at the origin.
We first consider a special case of (2.1), in which the boundary conditions are variables seperable (i.e. of the form and , where ). Later, we shall indicate the modifications needed to extend the method to the general case of boundary conditions described in (2.1).
2.1 Piecewise Linear Approximation for the Special Case
In this section, we consider the boundary value problem
[TABLE]
where , and , where is the space of continuous functions from into and is the set of real numbers. The objective being described in the sequel is to find optimal values for the parameters , , for which the error functional in (2.3) is minimized by the solution satisfying (2) subject to the boundary and continuity conditions
[TABLE]
The solution for (2) in the interval , , is
[TABLE]
which is expressible explicitly in terms of the parameter values. Substituting the expression for in (2.3), the value of can be found for any prescribed values of the parameters. If the parameter space is discretized, the values of can be tabulated for various values of the parameters. However, using the additive property of the error functional , it is possible to formulate a dynamic programming method for obtaining the tables quickly and efficiently. For this purpose, we define a -step error functional , for , by the relation and for ,
[TABLE]
Thus in (2.8) is actually in (2.3). The functional depends only on the parameters . To make this dependency explicit, we sometimes write for ; similarly, is sometimes written as . Now let for , be the set of parameters satisfying
[TABLE]
where if , the parameter is interpreted as , and in the interval is given by (2.6) with . Let
[TABLE]
Now we define a finite sequence of functions on the parameter space as follows: let , and for ,
[TABLE]
Then, observing that for each fixed ,
[TABLE]
we can conclude that
[TABLE]
The value of gives the minimum value of over the parameter space constrained by the boundary and continuity conditions (2.5).
Now we describe a tabulation procedure for computation of optimal parameters as follows:
Initially compute , and set
[TABLE] 2. 2.
Now for , compute , and set
[TABLE]
At the end of the procedure, we obtain a table for for various values of (which is ). The optimal parameters are found by backtracking as follows:
First find . 2. 2.
Then for , choose as an element of .
The tuple thus obtained are optimal parameters with respect to the error functional (2.3) subject to the boundary and continuity conditions in (2.5).
2.2 Piecewise Linear Approximation for the General Case
Now we describe a generalization of the method of the previous section to the general case of boundary conditions in (2.2). Let for , be the set of parameters satisfying
[TABLE]
Let be as in (2.10), but with as in (2.13). Now the parameter space constrained by (2.2) is given by , where . However, is now defined on :
[TABLE]
so that
[TABLE]
for . Now
[TABLE]
The computation of the forward tables is as follows:
Initially compute for various values of . 2. 2.
Now for , compute from (2.15), and set
[TABLE]
From the table thus computed, the optimal values of the parameters are extracted by backtracking as follows:
First find from
[TABLE] 2. 2.
Then for , choose as an element of .
The tuple thus obtained minimizes the error functional (2.3) subject to the boundary and continuity conditions in (2.2).
3 Improving the Initial Approximation
In this section, we describe a method similar to Newton-Raphson method for improving an initial approximation for a BVP or in particular, for an initial value problem. The correction of the approximation consists of two parts: in the first part, a correction term with 0 initial value for improving the solution in the interior is obtained, and the second part finds optimal correction term in the initial value so that the updated solution satisfies the boundary conditions more accurately. Alternatively, it is also possible to achieve the same by either Picard’s successive approximation (with suitable correction term in the initial value) or repeated application of the dynamic programming method described in the last section with finer discretization of the parameter space restricted to a tube-like set around the initial approximate solution.
3.1 Improving an Approximate Solution with Intial Value Fixed
We assume that the vector field is differentiable with continuous derivatives of upto as high an order (upto two) as necessary. Let be an initial approximate solution satisfying
[TABLE]
The objective is to formulate an efficient method for improving the approximation in (3.1) by successive iterations. Let for ,
[TABLE]
At step , the correction term must be found such that
[TABLE]
Subtracting (3.2) from (3.3), we find that must satisfy
[TABLE]
Now using the Taylor series approximation with respect to the first variable (i.e. ) upto the first order for we find
[TABLE]
Letting and using (3.1) in (3.4), an approximate correction term is found by solving
[TABLE]
The solution of (3.6) is given by
[TABLE]
The iteration converges fast (at almost quadratic rate) provided the initial approximate solution (3.1) is sufficiently close to the exact solution. If the matrix is uniformly boundedly invertible in the interval , then can also be taken as
[TABLE]
which is the well-known method for solving for from for each . This method also modifies the initial condition. If the initial value is required to be updated independently, then we have to find from (3.6), with solution given by (3.7). It may be observed when either of (3.7) and (3.8) converges, , almost everywhere (a.e.), for , which implies, if convergent, either iteration leads to the final solution satisfying a.e., for .
3.2 Improving the Initial Value
Suppose that the initial value of approximate correction term is (to be determined). Then the correction term is given by
[TABLE]
where is as in (3.1) and in the following line. Then the parameter is found based on an optimality criterion. The objective can be formulated as follows:
[TABLE]
If the initial approximation is sufficiently accurate at , then must be small. Thus we can expect that the minimum in (3.10) is attained for . The update value can be chosen to be proportional to the gradient of at 0. Specifically, we can choose , for a small positive number , resulting in an easy updation of the initial value. The multiplier can be found by binary search method over an interval of the form , for some sufficiently large positive constant . Alternately it is also possible to find by solving , which gives the iterative formula and for ,
[TABLE]
where is the Hessian matrix of . The method is fast and does not require a seperate search for the multiplier constant as in the case of gradient descent method.
3.3 Combining Both: Improving an Initial Approximation by Successive Iterations
In this section we briefly describe a gradient descent method for imporving an initial solution of the boundary value problem (2.1). For this purpose, we assume that is continuously differentiable having continuous derivatives upto Hessian. The update in approximation of the -th iteration is given by (3.9) with initial condition , and satisfying (3.4). The initial value of is determined by solving the boundary condition. Specifically let and . The initial value is , and the final value is . The updated values must be found such that
[TABLE]
Now
[TABLE]
Substituting the values of and from (3.13) and (3.14) into (3.12), we find
[TABLE]
As (3.15) is to be solved for the vector from only one equation, we propose first a gradient descent method for minimization of . Further, if is small, we can evaluate the gradient of in (3.15) with respect to for . Thus is chosen such that
[TABLE]
where and its partial derivatives and are evaluated for and \theta_{N}=\theta_{N}^{(k)}+\int_{0}^{1}\textit{\Large e}^{{\bigg{[}}\int_{s}^{1}A_{k}(v)\,\,\mathrm{d}v{\bigg{]}}}\cdot\,{\mathbf{b}}_{k}(s)\,\,\mathrm{d}s, and is a small multiplier that can be found, for example, by binary search method in the interval for some constant . However, it is important to constrain to be close to [math], since must be restricted such that never leaves a tube-like set that can be determined for convergence of the Newton-Raphson method. Alternately, it is also possible to find such that , where is as in (3.15). In this case, the update in is found by the following iteration: and for ,
[TABLE]
where is the Hessian matrix of with as in (3.15), and . The method is fast and converges to the true boundary values provided the initial approximation is sufficiently accurate.
4 Summary of Boundary Value Problem and its Extension to Optimal Control Problem and Multidimensional Cases
In this section, we summarize the dynamic programming method, with an error function using uniform metric. The dynamic programming method can be extended to a vary large class of metrics. We bring out the essential characteristic that is needed for formulation of a dynamic programming based discrete optimization method. A special formulation for the optimal control problem is given.
Towards the end of the section, we briefly indicate how to extend the dynamic programming method for the partial differential equations defined in the interior of a simply connected compact, preferably convex, domain, with a regular boundary together with prescribed conditions on it that a solution must satisfy. It is assumed that the boundary is prescribed by a covering as in an atlas. Applications of this method for solving partial differential equations can be found in remote sensing, spectroscopy and tomography, in which regions of physical matter of different permeating, penetrating, reflexivity or resistivity properties, that can affect a flow field, are estimated by the modeling parameters, using measurements taken at the surface and some interior points, where the measurements at these interior points may be assumed or default values, and comparing the reconstruction with another model, which is assumed to be free from anomalies.
4.1 Summary of Dynamic Programming Method for the Solution to a Boundary Value Problem
The method described in Section 2 can be applied with any nonnegative error functional for which the -step error functional can be evaluated based on and . In particular, if for some function defined on , the following holds
[TABLE]
and is specified, then the dynamic programming method can be still applied with the function in stead of .
One of the important error functionals is with respect to uniform metric, which takes the form
[TABLE]
The -step error funcitonal for (4.2) is given by
[TABLE]
where . The error functional (4.2) is best suited for obtaining an initial approximation, followed by the Newton-Raphson method of iterative improvement. A bound on the error in the initial approximation for convergence of the Newton-Raphson mothod for computing the solution of algebraic or analytic equations can be explicitly found. Now as the formulation for the differential equation that the error correction term satisfies in Sections 3.1 and 3.2 is obtained by treating the variable fixed, the same bound works for convergence of (or ).
Further, with the error functional given by (4.2) the dynamic programming method can be applied repeatedly until the desired precision is achived using finer and finer discretizations and restricting the search to only the tube-like set around the previously obtained optimal parameters. Decisions concerning which parameters (not necessarily optimal with respect to ) to retain in subsequent iterations can be made based on quantitative measures such as stiffness at the parameter value for identifying the tube set locally. A measure similar to stiffness is the spectrum of the matrix defined in (3.1), which indicates as the iteration progresses which components move inward and which outward of the tube. The spectrum of indicates the torsion and oscillatory properties of the solution. These measures can be evaluated at the parameter value from the vector field without requiring complete or even part of the actual solution. Thus besides the tables for , auxiliary tables containing information regarding stiffness or other measures can be used for choosing the parameter values in the dynamic programming algorithm. The usefulness of the auxiliary tables is especially significant when using larger discretization step-sizes. The auxiliary information can expedite the search method by eliminating unwanted parameter values and retaining only those parameter values that could actually produce the true solution, so that in the subsequent iterations, closer approximations (i.e., with smaller error) are produced. The basic abstract model for the error functional in (4.1) can be recast in such a way that allowances for parameter dependences in the accumulating function and dependence of the -step cost function on a future state to reach, which is described by the model parameter , for , are explicated, as follows:
[TABLE]
and is specified. The model parameter sequence in the arguments of the -function in (4.4) allows the designer to take into consideration costs incurred due to lag or drag involved in the course from an initial or past state up to until the current state, described by the model parameter , to reach the next state described by the model parameter , for .
4.2 Extension to the Solution of Optimal Control Problem
In this section, we describe a method to solve the optimal control problem
[TABLE]
where , , and is a cost functional that can be written as sum of cost functionals over the intervals , . Taking , on , , the dynamic programming method and improvements of approximations described in the previous section can be utilized to find an approximate optimal solution for the control function . For other applications, for instance, in [14], the optimal control based formulation is used for estimation of sets reachable from an initial state.
4.3 Extension for Boundary Value Problems for Partial Differential Equations Defined over a Simply Connected Compact Domain of the Euclidean Space
In this subsection, we describe a method to solve a boundary value problem of a partial differential equation, defined over a simply connected compact domain, which is a subset of , for some positive integer . For simplicity of description, the domain is further assumed to be convex, although this assumption is not always necessary. The boundary is assumed to be sufficiently regular and specified by smooth surfaces, parameterized as in an atlas, such as in the case of a sphere. The boundary is then propagated inward by, for example, computing the Euclidean distance from a point inside the domain. This approach is called boundary propagation or front propagation [8]. A change of coordinates, that is consistent with the description of the domain as consisting of concentric surfaces diffeomorphic to the initial boundary of the domain, is performed in the given partial differential equation. These surfaces are parametrically described in such a way that some geometric attribute remains constant for each surface, and hence they are called level sets of the propagating boundary. If the initial boundary is convex, with its unit normal pointing outward, then the boundary can be propagated inward by collecting points obtained by subtracting vectors normal to the boundary at the initial points on the boundary with small multipliers as absolute value from those points on the initial boundary, as in the gradient descent method, with varying multipliers. But this propagation may not eventually end in a single point, and, depending on the absolute values chosen as multipliers of the surface normal vectors, various shapes can be realized. In order to construct a set of concentric surfaces ending in a point, the Euclidean distance is computed from a point, which is termed as the center of the domain, to various points on the boundary, whose normal always points outwards, by assumption. A criterion for an interior point to be the center is stated as follows: let the distance of an interior point to the boundary be defined to be the maximum distance from it to any point on the boundary, and let the center be chosen to be the interior point that results in a minimum distance to the boundary among all interior points. This method of choosing the center is appropriate with compact convex regular boundaries with simply connected interior domains. Then, from the center, the distance to a point on the boundary is multiplied by a scale, as in the case of projective coordinate system, but the scale parameter is chosen to be a real number between 0 and 1. The level sets are the surfaces corresponding to the same scale. Now, appropriate dynamic programming tables are constructed that approximate a solution and its partial derivatives with respect to the changed coordinate system. In this case, the objective is to find a solution that agrees at the center of the domain, when approached from various directions, with minimum error.
5 Conclusions
In this paper, we have presented a dynamic programming based formulation for obtaining a best piecewise linear approximation for continuous vector fields with the solution constrained by arbitrary boundary conditions. The method attempts obtain a fast but reasonably accurate approximate solution. The method also assumes a discrete space. The parameters to be determined are of the form used to approximate the vector field in the interval , for . The components and contain undetermined linear combinations of some fixed basis functions (such as polynomials) defined on the interval . The objective function for minimization of error can be chosen to be the standard -norm or it can also be chosen from among a large class of error functionas including the uniform metric. The particular aspect of the error functional that allows a formulation of the dynamic programming method is usually a recurrence relation that an associated -step functional satisfies. In the proposed method, the -step functionals satisfy a one-step recurrence relation. It is possible to formulate dynamic programming based methods also for error functionals satisfying more general recurrence relations, involving difficult parameter dependencies. A method for improving an initial approximation by successive iterations is also presented. The proposed method also updates the boundary values. The correction function in the interior of the interval is found by Newton-Raphson method. The instant when to switch from the dynamic programming method to successive iterations method for improvement of the current solution can be determined based on the width of the convergence set for the Newton-Raphson method. The instant when to switch from the dynamic programming method to successive iterations method for improvement of the current solution can be determined based on the width of the convergence set for the Newton-Raphson method, when it is preferred instead of gradient descent method, for this purpose. It is also possible to consider taking a convex combination of formulated updations, for choosing the actual update. If any one of the updation formula always produces a more accurate solution than the other, then the convex combination degenerates into binary exclusive combination.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bellman, Richard, and Robert E. Kalaba, Dynamic programming and modern control theory , Academic Press, New York, 1965, pp. 195–209
- 2[2] J. W. Daniel, A road map of methods for approximating solutions of two point boundary value problems , in “Codes for boundary value problems in ordinary differential equations”, Bart Childs, Melvin R. Scott, James W. Daniel, Eugene D. Denman, Paul Nelson (Eds.), Proceedings of a Working Conference May 14-17, 1978, LNCS Vol. 76, Springer Verlag, 1979
- 3[3] Bruce A. Finlayson, The method of weighted residual and variational principals , Academic Press, 1972
- 4[4] Leslie Fox, The numerical solution of two-point boundary problems in ordinary differential equations . Dover Publications, 1990
- 5[5] C. M. Garcia-López, and J. I. Ramos, A piecewise-linearized method for ordinary differential equations: two-point boundary-value problems , International Journal of Numerical Methods in Fluids, vol. 22, Issue 11, pp.1089-1102
- 6[6] Alan.C. Hindmarsh, Peter N. Brown, Keith E. Grant, Steven L. Lee, Radu Serban, Dan E. Shumaker and Carol S. Woodward, SUNDIALS: Suite of nonlinear and differential/algebraic equation solvers , ACM Transactions on Mathematical Software, Special issue on Advanced Compu Tational Software (ACTS), Vol. 31, Issue 3, September 2005
- 7[7] Herbert B. Keller, Numerical solution of two point boundary value problems Vol. 24. Society for Industrial Mathematics, 1987
- 8[8] R. Malladi, J. A. Sethian and B. C, Vemuri, Shape modeling with front propagation: a level set approach , IEEE Trans. on PAMI, vol 17(2), 1995, pp. 158-175
