An (R, S) Based Heuristic Model for the Stochastic Joint Replenishment Problem
Mengyuan Xiang, Roberto Rossi, S. Armagan Tarim

TL;DR
This paper develops a heuristic model based on an (R, S) policy for the stochastic joint replenishment problem, improving decision-making under demand uncertainty with a static-dynamic policy approach.
Contribution
It extends a MILP model to approximate the optimal (σ, S) control policy for the JRP, demonstrating its effectiveness through computational experiments.
Findings
The proposed approach outperforms existing methods in computational tests.
The model effectively balances replenishment timing and order quantities under uncertainty.
The heuristic provides a practical solution for complex inventory coordination problems.
Abstract
This paper considers the periodic-review stochastic joint replenishment problem (JRP) under Bookbinder and Tan's static-dynamic uncertainty control policy. According to a static-dynamic uncertainty control rule, the decision maker fixes timing of replenishments once and for all at the beginning of the planning horizon, the inventory position is then raised to a predefined order-up-to-position at the beginning of each replenishment period. In this policy, freezing the replenishment times ameliorates the inherent difficulties pertinent to replenishment coordination of multiple products, whereas dynamic order quantities facilitate dealing with uncertain demands. We adapt and extend an earlier mixed integer linear programming (MILP) model for computing static-dynamic uncertainty policy parameters, and demonstrate that the same can be used to approximate the optimal control rule for the JRP,…
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 40 | 40 | 40 | 40 | 40 | 40 | 40 | 40 | 40 | 40 |
| 2 | 5 | 64 | 29 | 54 | 70 | 50 | 54 | 45 | 13 | 50 |
| 3 | 40 | 55 | 72 | 86 | 78 | 51 | 42 | 38 | 30 | 26 |
| 4 | 41 | 58 | 75 | 63 | 40 | 35 | 33 | 18 | 29 | 39 |
| 5 | 45 | 40 | 22 | 31 | 38 | 46 | 59 | 62 | 46 | 40 |
| items | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10 | 10 | 20 | 20 | 40 | 20 | 40 | 40 | 60 | 60 | 80 | 80 | |
| 40 | 35 | 40 | 40 | 40 | 20 | 20 | 20 | 28 | 20 | 20 | 20 | |
| 0.2 | 0.5 | 0.2 | 0.1 | 0.2 | 1.5 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
| Average cost improvement | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 50 | 30 | 2 | 936.94 | -0.91 | -0.84 | -0.33 | 4.38 | 0.68 | 0.79 | 2.14 |
| 100 | 30 | 2 | 990.50 | -0.05 | -0.45 | 0.75 | 2.57 | 1.77 | 4.39 | 6.81 |
| 150 | 30 | 2 | 1046.56 | -0.24 | -1.01 | -0.35 | 0.52 | 0.65 | 5.68 | 8.36 |
| 200 | 30 | 2 | 1072.97 | 1.32 | 0.47 | 1.11 | 1.34 | 2.12 | 8.34 | 12.31 |
| 100 | 30 | 6 | 1639.75 | -0.23 | -1.52 | -1.02 | 2.15 | 0.00 | 1.24 | 3.31 |
| 150 | 30 | 6 | 1707.05 | 0.64 | -0.60 | -0.07 | 1.46 | 0.95 | 2.34 | 6.68 |
| 200 | 30 | 6 | 1766.38 | 1.16 | 0.08 | 0.65 | 1.17 | 1.67 | 3.08 | 9.04 |
| 150 | 30 | 20 | 2718.47 | 0.77 | 4.32 | -1.26 | 1.27 | -0.21 | -0.59 | 6.20 |
| 200 | 30 | 20 | 2812.52 | -3.23 | 0.14 | -0.72 | 0.77 | 0.34 | 0.25 | 8.34 |
| Average cost improvement | -0.09 | 0.07 | -0.14 | 1.74 | 0.89 | 2.84 | 7.02 | |||
| Average cost improvement | |||||||||
| 20 | 10 | 2 | 772.25 | -0.03 | 0.48 | 0.76 | 8.30 | 1.79 | 1.80 |
| 50 | 10 | 2 | 813.94 | -0.48 | 0.12 | 0.62 | 0.47 | 1.64 | 1.74 |
| 100 | 10 | 2 | 861.05 | 0.23 | 0.70 | 1.17 | 3.68 | 2.20 | 2.38 |
| 200 | 10 | 2 | 932.86 | 1.62 | 1.83 | 2.38 | 2.88 | 3.42 | 3.73 |
| 500 | 10 | 2 | 1131.42 | 0.14 | 0.14 | 0.59 | 0.18 | 1.60 | 2.12 |
| 20 | 10 | 6 | 1166.06 | 0.85 | 2.84 | 0.01 | 7.99 | 1.08 | 1.04 |
| 50 | 10 | 6 | 1222.82 | -0.15 | 1.83 | 0.62 | 5.53 | 1.68 | 1.73 |
| 100 | 10 | 6 | 1283.92 | 1.33 | 2.50 | 1.26 | 4.49 | 2.34 | 2.46 |
| 200 | 10 | 6 | 1413.72 | 0.30 | 1.23 | 1.02 | 1.82 | 2.10 | 2.33 |
| 500 | 10 | 6 | 1658.48 | 2.26 | 2.20 | 2.52 | 2.30 | 3.59 | 4.03 |
| 50 | 10 | 10 | 1420.63 | 1.57 | 5.30 | -0.03 | 5.88 | 1.07 | 1.07 |
| 100 | 10 | 10 | 1497.96 | 1.67 | 4.28 | 0.75 | 4.37 | 1.87 | 1.93 |
| 200 | 10 | 10 | 1637.27 | 0.66 | 2.18 | 1.15 | 2.16 | 2.28 | 2.44 |
| 500 | 10 | 10 | 1935.07 | 1.60 | 1.60 | 1.79 | 1.60 | 2.90 | 3.27 |
| 100 | 50 | 2 | 1043.31 | -1.95 | -0.79 | -0.23 | 1.98 | 0.78 | 0.92 |
| 200 | 50 | 2 | 1132.61 | -1.29 | -0.48 | 0.30 | 0.50 | 1.31 | 1.97 |
| 500 | 50 | 2 | 1327.95 | 0.08 | 0.08 | 0.82 | 0.13 | 1.83 | 2.30 |
| 100 | 50 | 6 | 1794.60 | -1.37 | -2.65 | -2.09 | 0.94 | -1.09 | -0.97 |
| 200 | 50 | 6 | 1938.25 | -0.27 | -1.56 | -0.89 | -0.05 | 0.13 | 0.34 |
| 500 | 50 | 6 | 2244.01 | -0.27 | -0.27 | 0.43 | -0.26 | 1.44 | 1.87 |
| 200 | 50 | 10 | 2448.79 | -3.83 | -2.11 | -1.55 | -0.75 | -0.53 | -0.34 |
| 500 | 50 | 10 | 2796.29 | 0.35 | 0.35 | 0.97 | 0.35 | 2.00 | 2.40 |
| 200 | 100 | 2 | 1200.38 | -1.61 | -0.94 | -0.11 | -0.01 | 0.90 | 1.13 |
| 500 | 100 | 2 | 1406.67 | -0.76 | -0.83 | 0.16 | 0.16 | 1.17 | 1.60 |
| 200 | 100 | 6 | 2106.78 | 0.44 | -1.23 | -0.48 | 0.94 | 0.54 | 0.73 |
| 500 | 100 | 6 | 2449.51 | -0.88 | -0.88 | -0.07 | -0.07 | 0.94 | 1.33 |
| 200 | 100 | 10 | 2728.08 | -3.41 | -1.90 | -1.29 | -0.49 | -0.27 | -0.10 |
| 500 | 100 | 10 | 3108.05 | 0.22 | 0.22 | 0.94 | 0.94 | 1.96 | 2.33 |
| 500 | 200 | 2 | 1470.29 | -0.90 | -0.90 | 0.05 | 0.05 | 1.05 | 1.45 |
| 500 | 200 | 6 | 2620.77 | -0.91 | -0.91 | 0.08 | 0.08 | 1.09 | 1.45 |
| 500 | 200 | 10 | 3421.28 | -0.94 | -0.94 | -0.04 | -0.04 | 0.97 | 1.30 |
| Average cost improvement | -0.19 | 0.37 | 0.37 | 1.81 | 1.41 | 1.67 | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSupply Chain and Inventory Management · Scheduling and Optimization Algorithms · Sustainable Supply Chain Management
An Based Heuristic Model for the Stochastic Joint Replenishment Problem
Mengyuan Xiang
Roberto Rossi
S. Armagan Tarim
Business School, University of Edinburgh, Edinburgh, United Kingdom
Cork University Business School, University College Cork, Cork, Ireland
Abstract
This paper considers the periodic-review stochastic joint replenishment problem (JRP) under Bookbinder and Tan’s static-dynamic uncertainty control policy. According to a static-dynamic uncertainty control rule, the decision maker fixes timing of replenishments once and for all at the beginning of the planning horizon, the inventory position is then raised to a predefined order-up-to-position at the beginning of each replenishment period. In this policy, freezing the replenishment times ameliorates the inherent difficulties pertinent to replenishment coordination of multiple products, whereas dynamic order quantities facilitate dealing with uncertain demands. We adapt and extend an earlier mixed integer linear programming (MILP) model for computing static-dynamic uncertainty policy parameters, and demonstrate that the same can be used to approximate the optimal control rule for the JRP, also known as policy. An extensive computational study illustrates the effectiveness of our approach when compared to alternative approaches in the literature.
keywords:
inventory , stochastic joint replenishment , policy , mixed integer linear programming
1 Introduction
The Joint Replenishment Problem (JRP) occurs when several items are ordered from the same supplier, or several products have the same means of transportation, or several products are processed on the same piece of equipment [Salameh et al., 2014]. Every time an order is placed, the group fixed ordering cost is incurred regardless the number of items replenished; in addition there are also item-specific fixed and variable ordering costs that are charged whenever an item is included in a replenishment order. The goal of the JRP is to determine the optimal inventory replenishment plan that minimises the cost of replenishing multiple items.
The problem of controlling inventory of a multi-item system under joint replenishment has been receiving considerable attention for the past several decades. Literature on JRP can be roughly categorised into deterministic and stochastic based on the nature of demand. In the deterministic joint replenishment inventory system, demand for each individual item is assumed to be constant over an infinite time horizon and replenishments are made at equally spaced time intervals; the problem is to determine the length of replenishment cycles and the frequency of replenishing individual items, e.g., [Goyal & Belton, 1979, Kaspi & Rosenblatt, 1991, Viswanathan, 1996, Wildeman et al., 1997, Hariga, 1994, Goyal & Deshmukh, 1993, Boctor et al., 2004, Nilsson et al., 2007]. In the stochastic joint replenishment inventory system, the demand for each individual item is unknown but follows certain types of distributions; the problem is to decide the optimal parameters of a given inventory policy, e.g., [Balintfy, 1964, Atkins & Iyogun, 1988, Renberg & Planche, 1967, Kalpakam & Arivarignan, 1993, Viswanathan, 1997, Nielsen & Larsen, 2005, Özkaya et al., 2006]. Most literature still presents applications to constant and dynamic deterministic demands; however, the study regarding stochastic demand has received increasing attention due to its practical relevance [Bastos et al., 2017]. This work belongs to the growing literature on the stochastic joint replenishment.
This paper applies the static-dynamic strategy, proposed by Bookbinder & Tan [1988] for tackling single-item lot-sizing problems, in the context of a JRP system. The static-dynamic strategy, known as , features two control parameters: , timing of replenishment, and , order-up-to-position. At each review period, the decision maker places an order so as to increase the inventory position (i.e. net inventory level + outstanding orders) to a given order-up-to-position. In the context of the JRP system, a periodic-review policy is adopted for each item. The policy is an appealing strategy since it eases the coordination between supply chain players [Kilic & Tarim, 2011], and facilitates managing joint replenishment [Silver et al., 1998]. Additionally, the policy comes with the advantage of being able to tackle nonstationary demand which has not been addressed yet in the literature.
Our goal is to tackle the periodic-review nonstationary JRP under policy. We first present a mixed-integer linear programming (MILP) model for computing policy parameters that minimise the expected total cost comprising group fixed ordering costs, item-specific fixed ordering costs, holding costs, and penalty costs over the planning horizon. Our model generalises the discussion in [Rossi et al., 2015], which presented an MILP model for approximating optimal policy parameters for single-item lot-sizing problems. We further show that our MILP model can be used to approximate the policy, which is known to be optimal for the JRP [Liu & Esogbue, 2012]. Under this policy, decision makers order up to if opening inventory positions fall in (, represents the number of items) at the beginning of each time period. Numerical experiments illustrate the effectiveness of our policy and the corresponding MILP model.
We contribute to the literature on the stochastic JPR as follows.
We present, for the first time in the literature, a mathematical programming (MP) model for tackling the nonstationary stochastic JRP.
- 2.
We reformulate this MP model as an MILP model that can be solved using off-the-shelf solvers.
- 3.
We demonstrate that our MILP model can be used to approximate the policy, which is optimal for the JRP.
- 4.
In an extensive computational study based on existing test beds, drawn from the literature, we demonstrate the effectiveness of our models when compared to alternative approaches in the literature.
This rest of this paper is organised as follows. Section 2 surveys relevant literature. Section 3 describes problem settings. Section 4 presents an MILP model for computing policy parameters. Section 5 extends the MILP model for approximating the optimal policy parameters. An extensive computational study is conducted in Section 6. We draw conclusions in Section 7.
2 Literature review
The problem of controlling the inventory of a multi-item system under joint replenishment has received increasing attention over the past several decades. For a thorough review of literature readers may refer to [Silver & Peterson, 1985, Goyal & Satir, 1989, Van Eijs et al., 1992, Khouja & Goyal, 2008, Bastos et al., 2017]. In this section, we focus our attention on existing policies for tackling stochastic JRPs. In particular, we survey control policies that have been considered in the literature.
** policy**. The landmark study of Scarf [1960] proved the optimality of (,) policies for the single-item inventory problem; since then, there have been attempts to generalise this result to multi-item inventory systems. Johnson [1967] characterised the optimal policy for the stationary case and introduced the policy, in which and ; in this policy one orders up to if inventory levels and , otherwise one does not order. Kalin [1980] showed that, when and , there exists such that the optimal policy is to order up to , this policy is named policy. Ohno et al. [1994] proposed an algorithm for computing an optimal ordering policy for a periodic-view multi-item inventory system. Ohno & Ishigaki [2001] further proposed a policy iteration method to compute an exact optimal policy by leaving properties of the optimal policy for continuous-time inventory problems with compound Poisson demands. Gallego & Sethi [2005] gave the general definition of -convexity in , which encompasses both the joint ordering and individual ordering case; it derived an optimal policy for the two-item deterministic inventory problem with a joint ordering cost. However, the computation of the optimal policies is still a difficult task.
** policy.** Several works on stochastic JRPs have focused on computing policies, introduced by Balintfy [1964]. This policy features three control parameters: , reorder point; , can-order level; , order-up-to-position. Under this policy, When the inventory position of item crosses , a replenishment order is triggered to raise its inventory position to ; meanwhile, any other item with an inventory position at or below its can-order point, , is also included in the replenishment, raising its inventory position to . Under the assumption of Poisson-distributed demands, Ignall [1969] proved that the policy is not optimal even for two-item problems. Silver [1974] proposed the decomposition method to approximate policy parameters, where the multi-item problem is decomposed into several single-item problems. This approximation technique was followed by [Melchiors, 2002, Johansen & Melchiors, 2003]. Kayiş et al. [2008] modelled the two-item JRP problem as a semi-Markov decision model, and proposed an enumerative approach to approximate policies. In addition, [Schaack & Silver, 1972, Thompstone & Silver, 1975, Silver, 1981, Federgruen et al., 1984] studied JRPs with compound Poisson-distributed demands.
** policy.** Atkins & Iyogun [1988] proposed two periodic-review -type policies, namely periodic policy and modified periodic policy , which differ only in the way the ordering periods are determined. Under this policy, every periods, the inventory position of item is raised to . Numerical experiments demonstrate that the policy performs consistently better than the policy, and that the policy generally outperforms the the policy, except for problems involving small values of group fixed ordering cost.
** policy.** This policy was first proposed by Renberg & Planche [1967]. Under this policy, whenever the total inventory position drops to the group reorder point, an order is placed to raise inventory position of each item to item-specific order-up-to-position . The combined order quantity is , and the group reorder point is reached when the combined usage reaches . Pantumsinchai [1992] evaluated the computational performance of the policy by comparing it against the policy, policy and policy on the basis of long-run total average costs. Computational experiments showed that the policy consistently outperforms the policy on the test instances, and both and policy perform better as the group ordering cost increases. The study showed that the policy is appropriate for items for which the stock-out costs are low and the major set-up cost is high relative to the minor set-up cost.
** policy.** This policy was proposed by Viswanathan [1997] for periodic-review inventory systems, in which inventory position of each item is reviewed at every fixed and constant time interval. At each review time, the policy is applied to each item, so that any item with inventory position at or below is order up to . For a fixed review period, the algorithm of Zheng & Federgruen [1991] is adopted to compute the optimal policy parameters. Computational studies indicated that although the proposed policy requires more computational effort, it generally dominates the policy, and dominates policy, and policy for most test instances.
** policy.** Nielsen & Larsen [2005] combined features of policy and policy, and proposed the policy. By operating under this policy, the total inventory position is continuously reviewed while the item-specific inventory positions are reviewed only when the total consumption since the last order reaches . Then every item with inventory position less than or equal to its respective reorder point is order to . An analytical solution is derived by using the Markov decision theory in Nielsen & Larsen [2005]. Computational study demonstrated that the policy outperforms ) policy, and dominates policy in of test instances on the data set of Atkins & Iyogun [1988].
** policy.** This continuous-review policy was proposed by Özkaya et al. [2006]. Decision makers raise the inventory position of each item to its order-up-to-position whenever a total of demands accumulated or time units have elapsed, whichever occurs first. This policy is a hybrid of the continuous review policy, proposed by Renberg & Planche [1967], and the periodic review policy, proposed by Atkins & Iyogun [1988]. Thus, it features benefits of two separate policies. The comprehensive numerical study indicates that the proposed policy dominates the policy, policy, policy, and policy in of instances.
** policy** was proposed by Bookbinder & Tan [1988] for controlling single-item inventory systems. This policy requires decision makers to place an order at each replenishment period to increase the inventory position to the order-up-to-position . It is an appealing policy since it eases the coordination between supply chain players [Kilic & Tarim, 2011], and facilitates managing joint replenishment [Silver et al., 1998]. [Tarim & Kingsman, 2004, 2006] formulated a mixed integer programming (MIP) model for computing optimal policy parameters. Tarim et al. [2011] relaxed the MIP model, and solved it as a shortest path problem which does not require the use of a mathematical programming solver. In addition, Özen et al. [2012] introduced a DP-based algorithm for solving small-size problems, and an approximation heuristic and a relaxation heuristic for tackling larger-size problems. Rossi et al. [2015] generalised the discussions above and developed a unified MILP model for approximating the policy by adopting the piecewise linear approximation technique in Rossi et al. [2014]. Recently, Tunc et al. [2018] presented an extended MIP model that blends heuristic methods originally introduced by Tunc et al. [2014] and Rossi et al. [2015]. As a result, this formulation features the computational efficiency of Tunc et al. [2014] and the modelling variety of Rossi et al. [2015]. Although various efficient modelling methods for computing policy parameters have been proposed, all existing works focus on a single-item inventory system.
The stochastic JRP is an open research area for the development of more efficient computational methods and control policies. The main purpose of this work is to apply the policy to a multi-item inventory system. In the context of the JRP, we apply the periodic-review policy, originally proposed by Bookbinder & Tan [1988] for tackling single-item lot sizing problems, to JRPs under stochastic demand and fixed lead time; a periodic-review policy is adopted for each item. Note that when the stochastic demand is stationary, the policy is the same as the policy proposed by Atkins & Iyogun [1988], in which every periods one raises the inventory position of item to the order-up-to-position . However, the policy can also deal with non-stationary stochastic demands; a setting that was not addressed in Atkins & Iyogun [1988]. In what follows, we introduce a novel MILP approach for approximating policies under non-stationary stochastic demands for multi-item inventory systems. Nonlinear costs are approximated by leveraging the technique originally introduced in [Rossi et al., 2014].
3 Problem description
Consider a periodic-review -item inventory management system over a -period planning horizon. We assume that demand of item , , in period , is a random variable with a known probability density function; all are assumed to be mutually independent.
We further assume that replenishments are issued at the beginning of each time period. There is a group fixed ordering cost , which is incurred whenever a replenishment is issued regardless the number of items replenished. Moreover, there is an item-specific fixed ordering cost , which is incurred whenever item is replenished regardless the quantity of the replenishment.
We define as the quantity of item ordered in period , which is placed and received immediately. Then, the ordering cost of item in period with ordering quantity can be written as,
[TABLE]
Let denote the ordering cost of period with ordering quantity vector . has the following structure
[TABLE]
A penalty cost is incurred for each unit of item of backorder demand per period, and a holding cost is charged for each unit of item carried from one period to the next. The immediate penalty and holding cost of period can be expressed as
[TABLE]
where vector is the inventory level immediately after replenishments are received at the beginning of period , and “E” denotes the expectation operator.
Let denote the net inventory level of item at the end of period , which is also the opening inventory level of period , and denote the expected total cost of the optimal plan over period , given opening inventory level at the beginning of period . Then, can be written as,
[TABLE]
where , and
[TABLE]
represents the boundary condition.
Example. We consider an instance with two items in which the group fixed ordering cost is , and for each item, the item-specific ordering cost is 0, the holding cost is , and the stock-out penalty cost is . We control the inventory for two items over a planning horizon of periods. We assume that the demand of item in period follows a Poisson distribution with rate ; where . The expected total cost, i.e. , of an optimal policy, given initial inventory level , can be obtained via stochastic dynamic programming (SDP) and is equal to . In Fig. 1 we plot for and .
4 An MILP model for approximating non-stationary stochastic policies
In this section, we formulate the stochastic JRP problem under the policy as an MILP model. Under the policy, the replenishment periods and associated order-up-to-positions are fixed at the beginning of the planning horizon, while actual order quantities are decided at the beginning of each replenishment period. Note that in the context of the JRP, a periodic-review policy is adopted for each item. We next introduce a stochastic programming formulation (Section 4.1) and then approximate it by an MILP model (Section 4.2).
4.1 A stochastic program
Consider the periodic-review -item -period JRP described in Section 3. We introduce binary variables and , , and ; takes value if a group order is made in period , otherwise [math]; is set to if item is replenished in period .
Under the policy, we reformulate the stochastic dynamic programming model in Section 3 as the stochastic program in Fig. 2.
The objective is to find the optimal replenishment plan so as to minimise the expected ordering costs, penalty costs, and holding costs of items over the -period planning horizon. Constraints (7) imply that if at least an item is ordered, then a group replenishment is issued. Constraints (8) are inventory conservation constraints: the inventory level at the end of period is equal to the initial inventory level, plus all orders received before the end of period , minus demands raised up to period . Constraints (9)- (12) state domains of , , , and .
4.2 MILP model for approximating policies
By leveraging the piecewise approximation approach in [Rossi et al., 2014], the stochastic programming formulation in Fig. 2 can be approximated by an MILP model.
We introduce the first order loss function
[TABLE]
and its complementary function
[TABLE]
where is a random variable with a known probability density function, and is a scalar variable.
Consider a single replenishment cycle of item over periods , where the only replenishment is placed at the beginning of period with order-up-to-level , and the initial inventory level is . Thus, , , must equal to the order-up-to-level , minus the demand convolution over periods , i.e.: . We rewrite the expected excess back-orders and on-hand stocks as and , by means of the first order loss function and its complementary function.
We introduce a binary variable , , , , which is set to one if the most recent replenishment of item up to period was issued in period , where — if no replenishment occurs before or at period , then we let , this allows us to properly account for demand variance from the beginning of the planning horizon. We observe that if , the closing inventory level of period must equal to the order-up-to-level of period , minus the demand convolution over periods , i.e. . Then, the expected excess back-orders and on-hand stocks of period can be written by means of the first order loss function and its complementary function, , and . Additionally, since period must be the only most recent order received up to period , the following constraints must be satisfied.
[TABLE]
In what follows, let “” denote the expectation operator. We introduce decision variables and to represent expected excess back-orders and on-hand stocks. The stochastic program in Fig 2 can be approximated by the MINLP model in Fig. 3.
The objective function (15) minimizes the expected group fixed ordering costs, item-specific fixed ordering costs, holding costs, and penalty costs of -item over the -period planning horizon. Constraints (16) imply an individual item can only be included in a group replenishment if that replenishment is made. Constraints (17) - (18) are inventory balance constraints. Constraints (19) - (20) ensure the most recent order before period was issued in period . Nonlinear constraints (21) - (22) represent the expected on-hand stocks and back-orders of item over the planning horizon. Note that the order-up-to-position of item in period can be expressed by the expected closing inventory level and expected demand convolution, i.e., . Constraints (23) - (25) indicate domains of , , and .
By solving the model in Fig. 3, the optimal replenishment plan including group replenishment periods , and item-specific replenishment periods , and the item-specific order-up-to-positions are obtained, for , and . The MINLP model in Fig. 3 can be readily approximated by an MILP model by using the approach discussed in [Rossi et al., 2014, 2015] to piecewise linearise loss functions in constraints (21) and (22). Note that the MINLP model in Fig. 3 can be extended to explore the fixed lead time settings. Details are discussed in A
Example. We demonstrate the modelling strategy underpinning the MILP model on a -item -period example. It is assumed that demands follow a Poisson distribution with rates presented in Table 1. The initial inventory level is taken as zero, and item-specific lead time . Other parameters are: , , , and . We employ eleven segments in the piecewise-linear approximations of and (for , and ).
The resulting expected total cost is . Replenishment plans of each item are presented in Fig. 4. Items , and are replenished in periods , , , and ; while item and are replenished only in periods , , and , since orders in period could not be received by the end of the planning horizon. Additionally, since item is expected to be ordered every two periods with the same order-up-to-position by the nature of stationary demand, while it is ordered up to a higher position in period to cover demands in the next periods in order to coordinate with other items.
5 MILP model for approximating the optimal () policies
Since the landmark study of Scarf [1960] which proved the optimality for the single-item inventory system, there have been several attempts to prove the optimality for multi-item inventory systems, e.g.: [Johnson, 1967, Kalin, 1980, Ohno & Ishigaki, 2001, Gallego & Sethi, 2005]. In this section we demonstrate that the MILP model proposed in Section 4.2 can be used to approximate the optimal replenishment plan under policy for the JRP.
Definition 5.1**.**
Function is -convex if
[TABLE]
where , , and is defined as follows,
[TABLE]
where , , and for all .
Gallego & Sethi [2005] showed the optimal policy for the joint setup cost case by studying the function
[TABLE]
Consider a continuous -convex function , then it has global minimum at . Define set , and set . Lemma 5.1 shows that the optimal replenishment plan is to order up to if opening inventory levels and ; and not to order, otherwise.
Lemma 5.1** (Gallego & Sethi [2005]).**
If is continuous -convex, continuous and coercive, then
,
- 2.
.
We next show that the MILP model in Fig. 3 can be adjusted to approximate set and .
Due to the complexity of , it is impractical to derive a closed form expression for it. To address this difficulty, we propose a strategy to determine whether a given initial inventory level vector belongs to . By solving our modified MILP model over the planning horizon , we observe the minimised expected total cost , order-up-to-levels , and the first period replenishment decision . If , then ; otherwise, . Therefore, our MILP model can be used to determine whether given initial inventory levels . Moreover, by repeating this procedure, one can approximate the optimal replenishment strategy for every period .
Example. We illustrate the concept introduced on the -item -period example presented in Section 3. Assuming initial inventory levels , we plot the expected total cost contours, obtained via the modified MILP, in Fig. 5(a). Note that there are two similar minima, which is expected, since the ordering cost is relatively small and the demand variance is large. We plot set and obtained via the modified MILP model, and compare them with that obtained via the stochastic dynamic programming in Fig. 5(b). The optimal replenishment plan is to place an order whenever inventory levels fall in set , and not to place an order if fall in . We observe that set and obtained via the modified MILP model neatly approximate those obtained via the stochastic dynamic programming.
6 Computational Experiments
In this section we assess the cost performance of the policy by comparing its cost performance against policy [Özkaya et al., 2006], policy [Nielsen & Larsen, 2005], policy [Viswanathan, 1997], policy [Pantumsinchai, 1992], policy [Atkins & Iyogun, 1988], policy [Melchiors, 2002], and policy [Federgruen et al., 1984], on data sets of Atkins & Iyogun [1988] and Viswanathan [1997]. These data sets consider stationary demand over an infinite horizon. Unfortunately, computing policy parameters for infinite horizon JRPs via our MILP model is computationally expensive; however, since the demand is stationary, it is possible to derive an efficient shortest path reformulation, which we present in B and we use in our computational study.
Computational experiments are conducted by using IBM ILOG CPLEX Optimization Studio 12.7 and Matlab R2016a on a 3.20 GHz Intel Core i5-6500 CPU with 16.0 GB RAM, 64 bit machine.
Since the shortest path reformulation operates over a finite horizon, in order to compare the cost performance of the policy with the continuous-review , , and policies, we discretize each time period into small periods. We consider a planning horizon length of periods for a total of small periods. For each test instance, we first obtain the optimal replenishment plan by solving the shortest path reformulation presented in B. The computational time is limited to minutes, if a timeout occurs, the best solution available is adopted. Next, we simulate the expected average cost of each test instance via Monte Carlo Simulation (100,000 replications). Finally, we compare the average cost per small period against the average cost under existing policies.
The data set of Atkins & Iyogun [1988] assumes that the demand of each item follows a stationary Poisson distribution with rate , . The item-specific fixed ordering cost , expected demand , and lead time are displayed in Table 2. Items share the same penalty cost , holding cost , and group fixed ordering cost .
The data set of Atkins & Iyogun [1988] contains some unusual lot sizing instances; more specifically, instances for which the group as well as item fixed ordering costs become negligible in comparison to holding costs. In the lot-sizing literature the fixed ordering cost is commonly assumed to be greater than the holding cost [see Axsäter, 2010, p. 62, Property 2]; moreover, the penalty cost should not be smaller than the holding cost. Additionally, the fixed ordering cost should be greater than the penalty cost, otherwise, the inventory system tends to place orders in every period instead of penalising backorders. To focus on meaningful lot sizing instances — instances in which a trade off between fixed ordering and holding/penalty cost is sought — we filter test instances of the data set of Atkins & Iyogun [1988] by using the following conditions: . We also check the order frequency in each period and we discard instances in which orders are issued too frequently — i.e. instance in which a replenishment is issued more than twice per time period, as it turns out that for these instances order coordination is straightforward due to negligible item fixed ordering costs: if a group order is placed, all items are ordered. We present the filtered computational results in Table 3.
Let denote the percentage gap between the expected average cost of existing policies and that of the proposed policy, over the expected average cost of the policy. By definition, a positive represents the policy outperforms existing policies. Note that expected average costs under , , , , and policies are obtained from Özkaya et al. [2006], that of policy is obtained from Melchiors [2002], and that of policy is obtained from Viswanathan [1997].
We observe that the policy fully dominates all policies in of test instances; is the best policy in instances; is the best policy in instances; is the best policy in 1 instance. Moreover, the policy outperforms and policies, and no policy is dominant on all test instances. The average cost improvement increases with the increase of group fixed ordering cost, and decreases with the increase of holding cost compared with and policies. That means an increase in group fixed ordering cost or a decrease in holding cost improves the cost performance of policy. If we compare the policy with , , , , and policies, there is no obvious trend with respect to the group fixed ordering cost and holding cost. The policy performs better than , , , , and policies with average improvements of , , , , and , respectively; however, and policies perform slightly better than the policy with average improvements of and , respectively.
Viswanathan [1997] adopts the same experimental setup as Atkins & Iyogun [1988], except , , and .
We filter the computational results by using the same conditions previously adopted. We present computational results of the policy on the data set of Viswanathan [1997] in Table 4. We observe that the policy dominates of test instances; is the best policy in instances; is the best policy in instances; is the best policy in instances. There is once more no dominant policy on all test instances. Regarding the comparison with other policies, the average cost improvement decreases as the penalty cost increases; while there is no obvious trend with respect to the group fixed ordering cost, and penalty cost. On average, the policy performs better than , , , , and policies with average cost improvements of , , , , and ; while the policy performs slightly better than the policy with an average cost improvement of .
Even though the policy does not fully dominate alternative policies, it presents a key advantage: in contrast to all other policies in the literature, it is able to tackle stationary as well as nonstationary demand.
7 Conclusion
In this paper, we presented a mathematical programming approach for controlling the multi-item inventory system with joint replenishment under the policy. We first presented an MILP-based model for approximating optimal policies, which is built upon the piecewise-linear approximation technique proposed by [Rossi et al., 2014]. We further demonstrated that the MILP model can be used to approximate the policy.
We conducted an extensive computational study comprising instances. We first evaluated our approach on the data set of Atkins & Iyogun [1988]. This evaluation demonstrates that the policy fully dominates other competing policies in the literature in 2 out of test instances considered. The policy performs better than , , , , and policies with average improvements of , , , , and , respectively; however, and policies perform slightly better than the policy with average improvements of and . Computational experiments on the data set of Viswanathan [1997] indicate that is the best policy in out of test instances. The policy performs better than , , , , and policies with average cost improvements of , , , and ; while the policy performs slightly better than the policy with an average cost improvement . Most importantly, the policy comes with the additional advantage of being able to tackle stationary and nonstationary demand. Future research may focus on investigating the cost performance of policy in a rolling horizon setting.
Appendix A MILP model for approximating policies with fixed lead time
This section demonstrates that the MINLP model in Fig. 3 can be extended to compute near-optimal policy parameters for nonstationary JRPs with fixed lead time. Let denote the lead time of item , . We next separate our discussions into two parts.
The first part involves periods , , where no order is received. We assume that there is no outstanding order at the beginning of the planning horizon, and the system is forced to issue an order in period , then the inventory level must equal to the initial inventory level of item at the beginning of the planning horizon, minus the demand convolution over periods , i.e., , where is the demand convolution of item over periods , i.e., . We rewrite the expected back-orders and excess on-hand stocks using the first order loss function and its complementary function, and .
Additionally, since no order of item is received before , the expected inventory level of item at the end of period is equal to the expected inventory level at the end of period , minus expected demand in period , ,
[TABLE]
The second part involves periods , . Consider a single cycle of item over periods , in which a single order is received at the beginning of period , and the next order will be received at the beginning of period . Since the lead time of item is , the order that arrives in period must be issued in period with order-up-to-position . Thus, , , must equal to the order-up-to-position , minus the demand convolution over periods , i.e. .
We introduce a binary variable which is set to one if the most recent order received before period arrived in period , where , , , and ; and we introduce the following constraints, , ,
[TABLE]
Constraints (28) indicate that the most recent order received before period t arrived in period . Constraints (29) identify uniquely the period in which the most recent order received before period t has been received. Therefore, the inventory level , where , and represents the order-up-to-position of item in period . We write the back-orders and excess inventory as the first order loss function and its complementary, and .
In addition, constraints (17)-(18) in Fig. 3 can be reformulated as follows,
[TABLE]
We now present the MILP model for approximating policies with fixed lead time in Fig. 6.
The objective function (32) minimises the expected group fixed ordering costs, item-specific fixed ordering costs, penalty costs, and holding costs of -item over the -period planning horizon. Constraints (33) imply an individual item can only be included in a group replenishment if that replenishment is made. Constraints (34) - (35) assume that the first order is issued at the beginning of period , and there is no outstanding replenishment at the beginning of the planning horizon. Constraints (38) - (41) represent the expected back-orders and on-hand stocks of item over periods . Constraints (42) state all orders are received by the end of the planning horizon. Constraints (43) - (44) are inventory balance constraints. Constraints (45) - (48) ensure the most recent replenishment that has arrived before period was received in period . Constraints (51) - (54) represent the expected back-orders and on-hand stocks of item over periods . Constraints (55) - (59) indicate domains of binary variables , , and .
Appendix B Shortest path reformulation for approximating stationary stochastic policies
In this section we present an efficient shortest path reformulation for computing stationary policies.
Consider a network with nodes representing time periods, and arc between each pair of representing a possible decision to issue an order in period to satisfy demands in periods . Assigning a cost to this arc, solving the optimisation problem in Fig. 3 is equivalent to finding the shortest path between nodes and in the network . In the rest of this section, we first present how to compute the cost of each arc, and then present the shortest path reformulation.
Consider a replenishment cycle , where the only order is issued in period with order-up-to-position , and the next order is issued in period , for , , . We assume follows a Poisson distribution with rate . Then, can be calculated by Askin [1981],
[TABLE]
Note that the order-up-to-position actually accounts for demand variances over periods , which is reflected on the cumulative distribution function on the left-hand-side of Eq. (60).
Since the demand of item follows a Poisson distribution with rate , we could approximate the cost of the replenishment cycle by that of the cycle as shown in Fig. 7. As a result, the cycle cost can be calculated as follows,
[TABLE]
At the beginning of the planning horizon, the initial inventory level is . We check the cost of not issuing an order in period , , and update with if , for .
[TABLE]
Additionally, we introduce an auxiliary binary variable , which is equal to if an order is placed in period to satisfy demands in cycle , otherwise 0.
We now present the shortest path reformulation in Fig. 8. Let binary variable equal to if an order is issued in period to cover demands in periods , otherwise 0. The objective is to find the optimal replenishment plan that minimising the expected group fixed order costs, item-specific fixed order costs, holding costs and penalty costs over periods for items .
Recall that represents the item-specific first period replenishment decision, which is set to if an order is issued in period , otherwise [math]. Therefore, Constraints (64) guarantee the group fixed order cost in period is properly counted. Constraints (65) ensure that the group fixed order cost is encountered whenever any item is replenished in period . Constraints (66) ensure that there is no more than one outgoing arc from period . Constraints (67) are flow balance equations. Constraints (68) guarantee that period is included in a replenishment cycle. By solving the shortest path reformulation in Fig. 8, the group order decision and item-specific order decision are obtained111This can be obtained by adding constraints and , , to Fig. 8., for , .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Askin [1981] Askin, R. G. (1981). A procedure for production lot sizing with probabilistic dynamic demand. AIIE Transactions , 13 , 132–137.
- 2Atkins & Iyogun [1988] Atkins, D. R., & Iyogun, P. O. (1988). Periodic versus “can-order” policies for coordinated multi-item inventory systems. Management Science , 34 , 791–796.
- 3Axsäter [2010] Axsäter, S. (2010). Inventory Control (International Series in Operations Research & Management Science) . (2nd ed.). Springer.
- 4Balintfy [1964] Balintfy, J. L. (1964). On a basic class of multi-item inventory problems. Management Science , 10 , 287–297.
- 5Bastos et al. [2017] Bastos, L. d. S. L., Mendes, M. L., Nunes, D. R. d. L., Melo, A. C. S., & Carneiro, M. P. (2017). A systematic literature review on the joint replenishment problem solutions: 2006-2015. Production , 27 .
- 6Boctor et al. [2004] Boctor, F. F., Laporte, G., & Renaud, J. (2004). Models and algorithms for the dynamic-demand joint replenishment problem. International Journal of Production Research , 42 , 2667–2678.
- 7Bookbinder & Tan [1988] Bookbinder, J. H., & Tan, J. Y. (1988). Strategies for the probabilistic lot-sizing problem with service-level constraints. Management Science , 34 , 1096–1108.
- 8Federgruen et al. [1984] Federgruen, A., Groenevelt, H., & Tijms, H. C. (1984). Coordinated replenishments in a multi-item inventory system with compound poisson demands. Management Science , 30 , 344–357.
