Performance Evaluation of Optimal Radio Access Technology Selection Algorithms for LTE-WiFi Network
Arghyadip Roy, Prasanna Chaporkar, Abhay Karandikar

TL;DR
This paper proposes optimal algorithms for radio access technology selection in LTE-WiFi HetNets to maximize throughput and minimize voice user blocking, validated through simulations.
Contribution
It introduces threshold-based optimal policies and algorithms for user association and offloading in LTE-WiFi HetNets, enhancing system throughput and reducing blocking.
Findings
Algorithms improve total system throughput
Reduced voice user blocking probability
Outperforms benchmark algorithms in simulations
Abstract
A Heterogeneous Network (HetNet) comprises of multiple Radio Access Technologies (RATs) allowing a user to associate with a specific RAT and steer to other RATs in a seamless manner. To cope up with the unprecedented growth of data traffic, mobile data can be offloaded to Wireless Fidelity (WiFi) in a Long Term Evolution (LTE) based HetNet. In this paper, an optimal RAT selection problem is considered to maximize the total system throughput in an LTE-WiFi system with offload capability. Another formulation is also developed where maximizing the total system throughput is subject to a constraint on the voice user blocking probability. It is proved that the optimal policies for the association and offloading of voice/data users contain threshold structures. Based on the threshold structures, we propose algorithms for the association and offloading of users in LTE-WiFi HetNet. Simulation…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6| Parameter | Value |
|---|---|
| Maximum voice capacity | users |
| Maximum data capacity | users |
| Voice bit rate of a single user | kbps |
| Data bit rate of a single user | Mbps |
| Voice packet payload | bits |
| Data packet payload | bits |
| Tx power for BS and MS | dBm and dBm |
| Noise figure for BS and MS | dB and dB |
| Antenna height for BS and MS | m and m |
| Antenna parameter for BS and MS | Isotropic antenna |
| Path loss | , in kms |
| Parameter | Value |
|---|---|
| Channel bit rate | Mbps |
| UDP header | bits |
| Packet payload | bytes |
| Slot duration | s |
| Short inter-frame space (SIFS) | s |
| Distributed Coordination Function IFS (DIFS) | s |
| Minimum acceptable per-user throughput | Mbps |
| Tx power for AP | dBm |
| Noise figure for AP | dB |
| Antenna height for AP | m |
| Antenna parameter | Isotropic antenna |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MIMO Systems Optimization · Advanced Wireless Network Optimization · Wireless Networks and Protocols
Optimal Radio Access Technology Selection Algorithm for LTE-WiFi Network
Arghyadip Roy, Prasanna Chaporkar and Abhay Karandikar
Department of Electrical Engineering
Indian Institute of Technology Bombay, Mumbai, India, 400076
e-mail: arghyadip, chaporkar, karandi@ee.iitb.ac.in
Abstract
A Heterogeneous Network (HetNet) comprises of multiple Radio Access Technologies (RATs) allowing a user to associate with a specific RAT and steer to other RATs in a seamless manner. To cope up with the unprecedented growth of data traffic, mobile data can be offloaded to Wireless Fidelity (WiFi) in a Long Term Evolution (LTE) based HetNet. In this paper, an optimal RAT selection problem is considered to maximize the total system throughput in an LTE-WiFi system with offload capability. Another formulation is also developed where maximizing the total system throughput is subject to a constraint on the voice user blocking probability. It is proved that the optimal policies for the association and offloading of voice/data users contain threshold structures. Based on the threshold structures, we propose algorithms for the association and offloading of users in LTE-WiFi HetNet. Simulation results are presented to demonstrate the voice user blocking probability and the total system throughput performance of the proposed algorithms in comparison to another benchmark algorithm.
Index Terms:
User association, LTE-WiFi offloading, CMDP, Threshold policy.
I Introduction
To meet the ever-increasing Quality of Service (QoS) requirements of users, various Radio Access Technologies (RATs) have been standardized [1]. Each RAT has different characteristics regarding associated parameters like coverage and capacity. It has been predicted that by monthly global mobile data traffic will exceed exabytes [2]. This unprecedented growth in data traffic has become one of the serious challenges for cellular network operators. To address this issue, both from users’ and network providers’ point of view, it has become necessary that different RATs interwork with each other. A wireless network where different RATs are present, and users can be associated and moved seamlessly from one RAT to another, is called a Heterogeneous Network (HetNet). In this paper, our aim is to determine the optimal RAT selection policy in a HetNet 111The terminologies “RAT selection” and “association” has been used interchangeably throughout the paper..
Due to the complementary characteristics of Third Generation Partnership Project (3GPP) Long Term Evolution (LTE) Base Stations (BSs) providing ubiquitous coverage and IEEE 802.11 [3] based Wireless Local Area Network (WLAN) (also known as Wireless Fidelity (WiFi)) Access Points (APs) providing high bit rate capability in hot-spot areas, interworking between them [4] offers an interesting solution. In areas where both LTE and WiFi coverage are present, a user can be associated with either of them. Moreover, data users can be steered from one RAT to another to achieve load balancing. This proposal, known as mobile data offloading, has been introduced in GPP Release specifications [4]. Since WiFi operates in unlicensed spectrum and most of the commercially available user equipments already have a dedicated WLAN interface, this proposal has become popular both with network operators and handset manufacturers.
For efficient utilization of both LTE and WiFi networks, it is necessary to take appropriate association and offloading decisions. RAT selection and offloading decisions can be made either at the user side or the network side. In user-initiated RAT selection schemes, there is no cooperation between LTE and WiFi networks, and users decide which RAT should be selected based on certain criteria. Since users individually take selfish RAT selection decisions to maximize individual utility functions, this may not provide a globally optimum solution [5]-[8]. To address this issue, a network-initiated RAT selection algorithm, which optimizes different network parameters, becomes necessary.
In this paper, we investigate an optimal association policy for an LTE-WiFi HetNet, as illustrated in Fig.1. Network-initiated RAT selection and offloading decisions are taken by a centralized controller possessing an overall view of the network. We consider two types of users, viz., voice and data users, to be present inside the LTE-WiFi HetNet. We consider that voice users are always associated with LTE since unlike LTE, WiFi may not provide the required QoS for a voice user. However, data users can be associated with either LTE or WiFi. Offloading of data users from one RAT to another is considered at the time of association of voice users or departure of existing voice/data users. From a network operator’s perspective, total system throughput is an important system metric since the generated revenue may largely depend on the number of bytes transported by the operator. Moreover, data users experiencing high throughput are more likely to adhere to a network operator, thus facilitating the improvement of the customer base of the operator. Therefore, we aim to maximize the total system throughput and formulate this as a continuous-time Markov Decision Process (MDP) problem.
In the case of data users, although in low load condition, WiFi usually provides higher throughput than that of LTE, as the WiFi load increases, average per-user throughput in WiFi decreases rapidly [9]. Therefore, under high WiFi load, LTE may offer more throughput than WiFi to data users and thus may be preferable to data users for the association. However, voice and data users are allocated resources in LTE from a common resource block pool. The throughput requirement of LTE data users is usually more than that of the voice users. Therefore, maximization of the total system throughput may result in excessive blocking of voice users. The system may attempt to save LTE resources which can be allocated later to data users having greater contributions to the system throughput than that of voice users. It results in an inherent trade-off between the total system throughput and the blocking probability of voice users. We consider this problem within the formalism of Constrained Markov Decision Process (CMDP), which maximizes the total system throughput subject to a constraint on the voice user blocking probability.
It is proved that the associated optimal policies contain a threshold structure, where after a certain threshold on the number of WiFi data users, data users are served using LTE. The existence of a similar threshold for the blocking of voice users is also established. Based on the threshold based optimal policy, we propose two RAT selection algorithms for LTE-WiFi HetNet. Extensive simulations are performed in ns- (a discrete event network simulator) [10] to evaluate the performance of the proposed association algorithms. Using simulation results, performance gains of the proposed algorithms in comparison to another algorithm in the literature [11] are also evaluated.
I-A Related Work
The solutions which investigate RAT selection problem in a HetNet, can be broadly divided into two categories, viz., user-initiated [5]-[8] and network-initiated [12]-[22]. In [5], a user-initiated RAT selection algorithm based on Signal-to-Noise Ratio (SNR) and load information of individual RATs with the adaptation of hysteresis mechanism, is considered for LTE-WiFi HetNet. The performance of this scheme is compared with network-initiated cell-range extension schemes that use network-optimized Received Signal Strength Indicator (RSSI) bias value to steer users to other RATs. In [6], a distributed RAT selection algorithm is proposed based on the distance and peak rate obtained from different IEEE 802.11 [3] APs.
Few heuristic-based network-initiated RAT selection approaches are considered in [18]-[20]. While the algorithm proposed in [19] prefers WLAN over cellular regardless of the service type, the one proposed in [20] prefers cellular RAT for voice users and WLAN for data users. Among the other network-initiated RAT selection schemes, [12]-[17], [21]-[22] consider various optimization approaches. In [12], optimal RAT selection problem is addressed in a HetNet to optimize throughput, blocking probability, etc.. Since the associated algorithm scales exponentially with the system size, authors also propose a computationally efficient heuristic policy. In [13], the association resulting in maximum value for the sum of logarithms of throughputs is chosen as the optimal association among Wireless Stations (STAs) and APs. However, authors do not take into account user arrival and departure. RAT selection policies in wireless networks [23]-[25] are sometimes observed to contain certain threshold structures. A multi-class admission system is considered in [23], where it is demonstrated that if it is optimal to accept a user of a class, then it is optimal to accept a user of higher profit class too.
Offloading of data users from one RAT to another plays a major role in the capacity improvement of the system. Performance improvement achieved by on-the-spot offloading [26], a user-initiated WiFi offloading scheme, is analyzed in [11]. The basic idea behind on-the-spot offloading is to steer the mobile data users to WiFi, whenever WiFi is available. The user-initiated offloading scheme in [27] is based on the combined information of signal strength and network load of LTE/WLAN. However, being a greedy one, this algorithm fails to converge to a globally optimum solution. The network-initiated offloading approach in [28] computes the optimal fraction of traffic to be offloaded to WiFi such that the per-user throughput of the system is maximized and performs better than on-the-spot offloading [11]. However, the model in [28] does not incorporate voice users inside an LTE network.
I-B Our Contribution
In this paper, we investigate the optimal association policy in an LTE-WiFi HetNet. We consider a system where voice and data users can arrive or depart at any point in time. We introduce the possibility of data user offloading from one RAT to another at the time of association or departure of a user. We target to maximize the total system throughput. The problem is formulated within the framework of MDP. Another formulation is developed where we target to maximize the total system throughput, subject to a constraint on the voice user blocking probability, using CMDP. Threshold structures of optimal policies are established. We propose two algorithms based on the computed optimal policies and implement in ns-. 3GPP recommended parameters are used in the simulations. Since most of the practical offloading schemes offload data users to WiFi, performances of the proposed algorithms are compared with on-the-spot offloading algorithm [11].
The arrival of a new user in the LTE-WiFi system triggers the need for the optimal RAT selection. Also, with the arrival or departure of users, the active users in different RATs may need to get offloaded to other RATs. While few works in the literature have focused on RAT selection and offloading techniques, respectively, no existing literature, to the best of our knowledge, has addressed the issue of joint RAT selection and offloading for LTE-WiFi HetNet.
The rest of the paper is organized as follows. The system model is described in Section II. In Section III, the RAT selection problems are formulated within the framework of unconstrained and constrained continuous-time MDP, respectively. In Section IV, we derive the threshold structure of the optimal policy. Algorithms for the association of voice and data users in LTE-WiFi HetNet are proposed in Section V. Section VI presents simulation results. In Section VII, we conclude the paper.
II System Model
We consider a system where an LTE BS and a WiFi AP are present. As illustrated in Fig.1, we assume that both the BS and the AP are connected to a centralized controller by lossless links. We assume that the voice and data users are geographically located at any point in the LTE BS coverage area. Since data users outside the dual coverage area of the LTE BS and the WiFi AP always get associated with the LTE BS and no decision is involved in this case, without loss of generality, we take into consideration only those data users which are present inside the WiFi AP coverage area. We assume that there is a common resource pool in LTE for the voice users as well as the data users inside the WiFi AP coverage area. Data users inside the dual coverage area can be associated with the LTE BS or the WiFi AP. All the users are assumed to be stationary. Voice and data user arrivals follow Poisson processes with means and , respectively. Service times for voice and data user are exponentially distributed with means and , respectively. For justification behind these assumptions, see [29].
Remark 1**.**
*Although for brevity of notation, a single LTE BS and a single AP have been considered, the system model can be generalized to a single LTE BS and multiple APs with non-overlapping coverage areas. Moreover, considering that each point in a geographical area is mapped to a single LTE BS (the LTE BS with highest average signal strength, say), multiple BSs can also be included in the system model with slight modifications. *
II-A State Space
We model the system as a controlled continuous time stochastic process defined on a state space . Any state is represented as a -tuple where and represent the number of voice users in LTE, the number of data users in LTE and the number of data users in WiFi, respectively. The system state remains unchanged unless an existing user departs or a new user arrives in the system. The arrivals and departures in the system are referred to as events. Five types of events are possible, viz., () an arrival of a new voice user in the system, () an arrival of a new data user in the system, () a departure of an existing voice user from LTE, () a departure of an existing data user from LTE and () a departure of an existing data user from WiFi. Whenever an event occurs, the centralized controller takes an action, and based on the type of event and the action taken by the controller, a state transition may happen. Note that the transitions of happen only at event epochs and not otherwise. Thus, it suffices to observe the system state only at event epochs. A finite amount of reward and cost are associated with every state-action pair. Detailed descriptions of the action space, state transitions, reward and cost are provided in subsequent subsections.
Next, we elaborate on the structure of . We assume that if and where is the total number of common resource blocks reserved in LTE for voice and data users, and is the maximum number of users in WiFi, so that the per-user throughput in WiFi is greater than a threshold. The condition arises because we assume that in each LTE subframe, every admitted user is allocated one resource block. If this allocation is not possible, a new user is not admitted in the LTE system. Furthermore, note that WiFi throughput decays monotonically [9] as the number of WiFi users increases. We assume that each user gets more than a threshold value of average throughput (say Mbps), which leads to the bound on the maximum number of users that can be accommodated in the WiFi system.
Remark 2**.**
Although the allocation of multiple resource blocks is closer to the practical scenario, this complicates the system model while the methodology and approach adopted in this paper do not change.
II-B Action Space
The set of actions defines a set of possible association and offloading strategies in the event of arrival or departure of a user. Let the action space be denoted by . Depending on the arrival or departure, we have a set of actions as stated below.
[TABLE]
Remark 3**.**
In this paper, actions are chosen based on the system state and the event occurred. One way of representing this is embedding the event in the state space so that the action depends only on the system state. However, to avoid notational complications associated with this approach, we view the action as a function of the system state and the event.
Let the set of states (subset of ) in which action chosen based on an event is feasible be denoted by . Thus, in the case of voice user arrival, we have,
[TABLE]
For data user arrival,
[TABLE]
For voice user departure from LTE,
[TABLE]
For data user departure from LTE,
[TABLE]
For data user departure from WiFi,
[TABLE]
In the case of voice and data user arrivals, the set of all possible actions are and , respectively. However, when an event occurs, action is not feasible if the system state is not present in . In this paper, voice user blocking () is considered to be a feasible action in all the states, provided the system is not empty. We consider blocking as a feasible action for data users, only when capacity is reached for both the RATs. When a user departs from LTE or WiFi, the controller can choose either or . If after the departure of a user from LTE, is chosen, it offloads one data user from WiFi to LTE.
II-C State Transitions
Based on an event and an action chosen, from a state, the system moves deterministically to a different state. Assume that from the state , the system moves to the state under the event and chosen action . Values of and for different events (arrivals and departures of users) and action are tabulated in Table I.
Note that this table is exhaustive in all kinds of events and actions. However, in a state, we need to consider only those events and actions which are feasible in that state.
II-D Rewards and Costs
Let the reward and cost functions per unit time corresponding to a state , event and action be represented by and , respectively. Let and denote the bit rate of voice and data users in LTE, respectively. To keep the model simple and computationally tractable, we assume that the bit rate of data users (e.g. data services like interactive video conferencing) in LTE is constant. In general, a voice user generates constant bit rate (CBR) traffic, and hence we take to be a constant. corresponds to the per-user data throughput of users in WiFi. We assume full buffer traffic model [9] for WiFi. The calculation of is based on the contention-driven medium access of WiFi users. It is a function of the probabilistic transmission attempts of the users, corresponding success and collision probabilities, and slot times for successful transmission, idle slots and busy slots during collisions. The reward per unit time in a state under the occurrence of an event and an action chosen is defined as the total system throughput in that state under that event and the chosen action. For example, in the case of data user arrivals and , it can be expressed as
[TABLE]
The cost function considered here is as follows. Whenever the centralized controller blocks an incoming voice user, one unit cost is incurred per unit time. Otherwise it is zero. Thus,
[TABLE]
We consider blocking of data users only when both LTE and WiFi systems are full. Hence, we do not consider any cost on the blocking of data users.
III Problem Formulation And Solution Methodology
A decision rule describes the mapping regarding which action is to be chosen at different states and decision epochs . An association policy is a sequence of decision rules taken at different decision epochs. Our goal is to determine an association policy which maximizes the total system throughput. This can be formulated as a continuous-time unconstrained MDP problem. In this case, a pure optimal policy exists [30]. Since the contribution of data users to the total system throughput is more than that of the voice users, the optimal association policy may result in high blocking probability of voice users. Hence, to address the trade-off between the total system throughput and the voice user blocking probability, we consider the CMDP problem, where we target to maximize the total system throughput, subject to a constraint on the voice user blocking probability. In this case, a stationary randomized optimal policy exists [31]. A Randomized policy is a mixture of two pure policies with associated probabilities. Arrival and departure of users can occur at arbitrary points in time, which makes the problem continuous time in nature.
III-A Problem Formulation
Let be the set of all memoryless policies. To guarantee a unique stationary distribution, we assume that Markov chains associated with such policies are irreducible. Following the policy , let the average reward and cost of the system over infinite horizon be denoted by and , respectively. Let and be the total reward and cost of the system incurred up to time , respectively. For the unconstrained MDP problem, our objective is to maximize the total system throughput which can be described as follows.
[TABLE]
where denotes the expectation operator under the policy . However, for the CMDP problem, our objective is to maximize the total system throughput, subject to a constraint on the blocking probability of voice users. This can be described as follows.
[TABLE]
where denotes the constraint on the blocking probability of voice users. Our objective is to determine the optimal policy for both unconstrained and constrained MDP problem. Since the optimal policies are known to be stationary policies, the corresponding limits in Equation (1) and (2) exist.
III-B Conversion to Discrete-Time MDP
To compute the optimal policy, we can use the well-known Value Iteration Algorithm (VIA) [30]. However, before that, the continuous-time MDP has to be transformed into an equivalent discrete-time MDP using uniformization [30], so that both models have the same average expected reward and cost for a stationary policy.
Let represent the expected time until the next event, if action is chosen in state under the event . We need to choose a number , such that . The state space and the action space remain the same in the equivalent discrete-time model. Let , and represent the probabilities of the event, reward and cost in the transformed discrete-time model in state under the event and action , respectively. Thus, we have,
[TABLE]
is a function of rate of different events and . Note that, this discrete-time MDP has identical stationary policies to that of the continuous-time MDP.
III-C Lagrangian Approach
After conversion into an equivalent discrete-time MDP model, we use the Lagrangian approach [31] to solve the CMDP. For a fixed value of Lagrange Multiplier (LM) , the modified reward function of the CMDP is
[TABLE]
The dynamic programming equation below describes the necessary condition for optimality.
[TABLE]
where denotes the value function in state . Next, our aim is to determine the value of (, say) which maximizes the average expected reward, subject to a cost constraint. The value of can be determined using gradient descent algorithm, as discussed in [32]. In th iteration, we modify the value of from its previous iteration as,
[TABLE]
For a fixed value of , the unconstrained maximization problem can be solved using VIA, as described below.
[TABLE]
where is an estimate of the value function after th iteration. After determining , the next step is to determine the optimal policy for the CMDP problem. As discussed in [31], optimal policy for a CMDP problem is a mixture of two pure policies and , obtained by perturbation of by a small amount in both directions. Let the long-term average expected costs of the two pure policies be and , respectively. In the next step, we determine the value of the parameter such that
[TABLE]
Finally, we have a randomized optimal policy for the considered CMDP problem. At each decision epoch, policies and are chosen w.p. and , respectively.
Note that the iterations on LM described above are necessary only for the CMDP problem. In the case of unconstrained MDP, VIA can be employed to determine a pure optimal policy, after an equivalent discrete-time MDP model is obtained.
IV Structure of the Optimal Policy
The dynamic programming equations (Equation (3) and (5)) described in the previous section are exploited to establish the fact that the optimal policy is of threshold type. The optimality of threshold policy is established with the aid of some lemmas presented below. For the purpose of readability, we present the proofs of the lemmas in Appendices.
IV-A Optimal Policy for Data Users
In this section, we present structural properties on the optimal policy for the service of data users along with their physical interpretations. Let us denote the throughput increment in WiFi when the number of WiFi users increases from to by . Therefore, . We assume the following.
Assumption 1**.**
Let be such that , and where is a threshold such that if , the data rate improvement provided by a single data user in the LTE system is more than the improvement in total WiFi throughput as the number of WiFi data users is increased from to .
Remark 4**.**
Following the full buffer traffic model [9], initially increases with and then decreases. This behavior matches with Assumption 1.
The following two lemmas describe a threshold structure on the optimal policy for the service of data users. Specifically, up to a certain threshold on the total number of data users, data users are served using WiFi. After the threshold is crossed, data users are served using LTE.
Lemma 1**.**
For every and such that , if the total number of data users in the system is , then the optimal policy is to serve all data users using WiFi. In other words,.
Proof.
See Apppedix A. ∎
In Lemma 1, following Assumption 1, since for , the data rate improvement is more if an additional data user is served using WiFi rather than using LTE, it is optimal to serve the data users using WiFi.
Lemma 2**.**
For every and such that , if the total number of data users in the system is , then the optimal policy is to serve data users using WiFi and all other data users using LTE. In other words,.
Proof.
See Apppedix B. ∎
The physical significance of Lemma 2 is that for , the data rate improvement provided by a single data user in LTE is more than that of the WiFi (following Assumption 1), and hence it is optimal to serve up to data users using WiFi and serve the additional data users using LTE.
Following lemma is a direct consequence of how the system is modeled.
Lemma 3**.**
For every and such that , the optimal policy is to serve all the incoming data users using WiFi until , where an incoming data user is blocked.
IV-B Optimal Policy for Voice Users
In this section, we characterize the optimal policy for the arrival of voice users. We prove that the optimal policy is of threshold type. Let be the difference operator which is defined as . Similarly, we define the second difference operator as . Let be another difference operator defined as . We define the second difference operator as . Similarly, we define . In this section, the terminologies “increasing” and “decreasing” are used in the weak sense of “non-decreasing” and “non-increasing”, respectively. In each state, let the sum of arrival and service rates be denoted by . Thus, we have,
[TABLE]
Let us define . The lemma presented below describes the superiority of one action over the other for the service of incoming voice users. Specifically, up to a certain threshold on the total number of data users, (accept voice user in LTE with data user offload to WiFi) is better than (accept voice user in LTE). After the threshold is crossed, becomes better.
Lemma 4**.**
In the case of a voice user arrival in a state , where ,
- (i)
* is always better than if ,*
- (ii)
* is always better than if .*
Proof.
Proof is similar to the proof of Lemma 1. ∎
Similar to Lemma 1 and 2, following Assumption 1, since for , the data rate improvement is more if an additional data user is served using WiFi rather than using LTE, is better than . Therefore, when , the choice of optimal action is between (accept voice user in LTE with data user offload to WiFi) and (blocking). Similarly, for , the optimal action is either (accept voice user in LTE) or .
The following lemma describes that when capacity is not reached in LTE and a voice user arrives, a threshold structure is observed. Until a threshold on the number of voice/data users in LTE, (for ) or (for ) is preferred. After the threshold becomes optimal.
Lemma 5**.**
For every and such that and a voice user arrival,
- (i)
if the optimal action in state is , then the optimal action in state and in state is also ,
- (ii)
if the optimal action in state is (), then the optimal action in state and in state is also (*). *
Proof.
See Apppedix C. ∎
When the number of voice/data users in LTE is less, or is chosen as the optimal action in the event of a voice user arrival. When or crosses a certain threshold, the number of free resources for incoming voice users decreases. Therefore, the blocking probability of voice users increases. Thus, after a threshold on or , becomes optimal.
However, when , since is infeasible, optimal action is either or . The lemma presented below discusses the threshold nature of the optimal policy for voice user arrivals when .
Lemma 6**.**
For every and such that and a voice user arrival,
- (i)
if the optimal action in state is , then the optimal action in state is also ,
- (ii)
if the optimal action in state is , then the optimal action in state is also .
Proof.
See Apppedix D. ∎
The physical interpretation of the above lemma is that for states with , when is small, is preferred. However, when crosses a threshold, since becomes small, and consequently the total system throughput is small, may further lower the total system throughput. Therefore, blocking of voice users is chosen as the optimal action.
V Proposed Network-Initiated Association Algorithm
Based on the optimal policy computed by solving the unconstrained MDP and CMDP problem, respectively, in this section, we propose two network-initiated association algorithms for LTE-WiFi HetNet. The details of the unconstrained MDP-based algorithm is presented below.
The procedure CALC–OPT–POLICY–UC in Algorithm 1 computes the optimal policy by solving an unconstrained MDP problem using solution methodologies described in Section III. The calculated thresholds for the association of voice/data users are made available to the centralized controller connected to both the LTE BS and the WiFi AP. Since the centralized controller has an overall view of the whole system, the information regarding the numbers of active voice and data users in LTE and WiFi, are available to it. Whenever there is an arrival or a departure, the controller initiates the procedure POLICY–IMPL, as described in Algorithm 1. This procedure determines the state of the system based on the number of active users in LTE and WiFi networks and then chooses the appropriate action based on the corresponding thresholds.
Next, we describe the CMDP-based algorithm which addresses the issue of high blocking probability of voice users, which may be encountered in Algorithm 1. The algorithm is described in detail below.
Apart from the same set of input parameters as required by Algorithm 1, Algorithm 2 requires as an additional parameter to specify the constraint on the blocking probability of voice users. The procedure CALC–OPT–POLICY in Algorithm 2 computes the randomized optimal policy for the considered CMDP problem. First, the optimal policy is determined for a fixed value of , and then the value of is updated until becomes equal to . All other procedures are similar to the procedures described in Algorithm 1.
VI Numerical and Simulation Results
In this section, the algorithms proposed in the last section are implemented in ns- to observe the performance of the proposed algorithms. Performance of the proposed algorithms in terms of blocking probability of voice users and the total system throughput is compared to the performance of on-the-spot WiFi offloading algorithm [11]. In this algorithm [11], data user chooses LTE, only when there is no WiFi coverage. Therefore, in the considered system model, with on-the-spot offloading, data users always get associated with WiFi until WiFi capacity is exhausted. Voice users always get associated with LTE BS, and when LTE capacity is full, they are blocked.
VI-A Simulation Model and Evaluation Procedure
The simulated network model consists of a GPP LTE BS and an IEEE g WiFi AP. All users are taken to be stationary. The AP is approximately m away from the LTE BS, and data users are distributed uniformly within m radius of the WiFi AP. The WiFi AP is assumed to be deployed by the same cellular operator and hence trusted from the point of view of interworking. LTE and WiFi network parameters used in the simulation, as summarized in Table II and III, are based on 3GPP models [33]-[34] and saturation throughput [9] g WiFi model. Propagation delay in WiFi network is assumed to be negligible. We consider CBR traffic for voice and data users in LTE. The generation of a fixed rate uplink flow is implemented in ns- using an application developed by us, which works similar to the ON/OFF application. This application creates sockets between the sender and the receiver, and fixed sized packets are transmitted from the sender to the receiver at a constant bit rate.
VI-B Voice User Arrival Rate Variation
VI-B1 Voice User Blocking Probability Performance
Fig. 2(a) illustrates the variation of voice user blocking percentage of on-the-spot offloading [11], Algorithm 1 and 2 as a function of .
In on-the-spot offloading, voice users are blocked when LTE reaches the capacity. When is small, the voice user blocking probability is small. However, as increases, the probability of approaching the LTE capacity and hence the voice user blocking probability increases. The voice user blocking probability in Algorithm 1 is small when is small. However, as increases, voice user blocking probability values become marginally higher than the corresponding values for on-the-spot offloading. Algorithm 1 may introduce blocking of voice users even when LTE has not reached its capacity, i.e., for states with . Voice users have very less contribution to the total system throughput. Hence, voice users are blocked to save resources for data users which contribute significantly to the total system throughput. However, in Algorithm 2, the number of states with proactive blocking (blocking when ) is reduced due to a constraint on the voice user blocking probability. Additionally, when is small, the optimal action in states with becomes (accept voice user in LTE and data offload to WiFi). Voice user blocking probability contribution comes mainly from the states with , where is large (say states , etc.). Since a major fraction of voice user blocking occurs when and is large, the system becomes analogous to the on-the-spot offloading. Hence, the voice user blocking probability performance of Algorithm 2 is almost similar to on-the-spot offloading algorithm [11].
VI-B2 Total System Throughput Performance
Total system throughput performance comparison of different algorithms is illustrated in Fig. 2(b). In on-the-spot offloading, the average number of voice users in LTE increases with , while the average number of data users in WiFi remains constant. Thus, the total system throughput increases with . For Algorithm 1, with an increase in , the blocking probability of voice users increases. Therefore, the fraction of voice users in the system decreases, and the total system throughput increases. Besides, Algorithm 1 performs a significant amount of load balancing under (accept voice user in LTE with data user offload to WiFi) and (move data user to the RAT from where a user has departed). With higher , load balancing actions are chosen more frequently. Thus, with higher , Algorithm 1 exhibits greater improvement over on-the-spot offloading algorithm. The improvement in total system throughput varies from (for ) to (for ). In Fig. 2(b), we observe that Algorithm 2 also performs better than on-the-spot offloading. However, due to a constraint on the voice user blocking probability, performance improvement is lower than Algorithm 1. For lower values of (), the total system throughput of Algorithm 2 is same as that of Algorithm 1 as the optimal policy for the CMDP is same as that of the unconstrained MDP. On-the-spot offloading algorithm blocks voice users only when LTE reaches capacity. Typically, in Algorithm 2 also, voice user blocking occurs when the LTE is full with a large number of voice users. However, due to load balancing of data users, Algorithm 2 outperforms the on-the-spot offloading algorithm. With , the improvement in total system throughput is only and with , it becomes .
VI-C Data User Arrival Rate Variation
VI-C1 Voice User Blocking Probability Performance
In Fig. 2(c), for on-the-spot offloading, voice and data users are accepted in LTE and WiFi, respectively. Consequently, changes in do not affect the blocking probability performance of voice users in LTE. In the case of Algorithm 1, increase in associates more number of data users with LTE, since the optimal policy for data users is to associate with LTE after the number of WiFi data users crosses a certain threshold. Therefore, the number of free LTE resources for voice users reduces, eventually increasing the blocking probability of voice users. The voice user blocking probability of Algorithm 1 is worse than that of on-the-spot offloading and increases with . The blocking probability performance of Algorithm 2 is similar to that of the on-the-spot offloading. Since usually the voice users are blocked in the states where the only feasible action is blocking (say state ), the decision epochs where voice users are blocked are almost same as that of the on-the-spot offloading.
VI-C2 Total System Throughput Performance
In Fig. 3, total system throughputs for different algorithms are plotted as a function of .
In on-the-spot offloading, with an increase in , the number of WiFi data users increases, and this increases the total system throughput. However, for high , the effect of contention among data users reduces the rate of increment of the total system throughput. In Algorithm 1, as increases, more number of data users are served using LTE. Since the throughput contribution of data users is more than voice users, the blocking probability of voice users increases with . Thus, the fraction of voice users in the system reduces, effectively causing more improvement in the total system throughput. When , the improvement in system metric is , whereas for , the system metric almost doubles. In Fig. 3, the total system throughput values for Algorithm 2 are smaller than the corresponding values for Algorithm 1. The reduction in blocking probability of voice users comes at a price of the reduction in the total system throughput. Still, due to optimal association and load balancing decisions, Algorithm 2 reduces the effect of contention among data users in WiFi and hence performs better than on-the-spot offloading algorithm [11]. For example, with , the improvement in system metric is about and with , it becomes almost .
VII Conclusion
In this paper, we have formulated the optimal association problem in an LTE-WiFi HetNet as an MDP problem with an objective of maximizing the total system throughput. Constrained MDP formulation has also been presented, where maximizing the total system throughput is subject to a constraint on the blocking probability of voice users. Threshold structures on the association of voice and data users have been derived. Based on the structure of the optimal policies, we have proposed two algorithms for the association and offloading of voice/data users in an LTE-WiFi HetNet. Simulation results demonstrate that although the voice user blocking probability performance of Algorithm 1 is worse than that of on-the-spot offloading [11], Algorithm 2 performs as good as on-the-spot offloading. Moreover, the proposed algorithms perform better than on-the-spot offloading algorithm in improving the total system throughput. In future, the considered framework can be extended to consider the channel state between LTE BS/WiFi AP and users such that channel-aware association and offloading decisions can be taken.
Appendix A Proof of Lemma 1
Since the decisions of association and offloading are involved during the arrival and departure of users, proving this lemma is equivalent to proving the following statements.
- (a)
(Accept in WiFi) is optimal when there are less than data users in the system, and a data user arrives.
- (b)
(Do nothing) is optimal when there are less than or equal to data users in the system, and a voice user from LTE departs.
- (c)
(Do nothing) is optimal when there are less than or equal to data users in the system, and a data user from WiFi departs.
We prove the above statements by sample path argument. Suppose the system starts at time .
Proof of (a): We consider the scenario when the system is in the state , when a data user arrival occurs (after a time , say). Assume that the optimal policy does not associate this incoming data user with WiFi. Therefore, the optimal action must be (Accept in LTE). As the optimal policy is , we have , and , where is the set of all policies. Let us consider another policy (non-stationary in general) which takes in state . As illustrated in Fig. 4, let us assume that starting from the state and following the policy and , the system reaches the state and , respectively. The inter-arrival times and service times are same for both the sample paths as we have considered a Markovian system. Assume that from the state , based on the next event (after a time ) and the chosen action, the system makes a transition to the state according to the policy . Before reaching the state , the sample path followed by the policy has one less WiFi data user and one more LTE data user than that of the policy before it reaches the state . Suppose, the policy is such that in state , it takes the same action as that of policy and additionally offloads one data user from WiFi to LTE. Evidently, sample path followed by both the policies end up in the same state . We construct in such a manner that from the state onwards, both the policies take up same actions and follow the same sample path. Therefore, the difference of value functions of the state under the policy and is
[TABLE]
Since and , we have, . Clearly, this contradicts the original claim that is an optimal policy. Since the Markov chains induced by different policies are recurrent, the state is visited infinitely often and each time choice of upon a data user arrival provides more reward than action . Therefore, when there is no data user in the system, and one data user arrives, is optimal. In a similar manner, it can be proved that is optimal when a data user arrives and the system is in state , where .
Proof of (b) and (c):These can be proved using a similar sample path argument.
Appendix B Proof of Lemma 2
Similar to Lemma 1, proving this lemma is equivalent to proving the following statements.
- (a)
(Accept in LTE) is optimal when there are more than or equal to data users in the system, and one data user arrives.
- (b)
(Do nothing) is optimal when there are more than data users in the system, and a voice/data user from LTE departs.
- (c)
(Data offload to a RAT from where a user has departed) is optimal when there are more than data users in the system, and a data user from WiFi departs.
Proof of (a): From Lemma 1, we have, . We consider the scenario when the system is in the state , when a data user arrival occurs. Assume that the optimal policy does not associate this incoming data user with LTE. Consequently, the optimal action must be . As the optimal policy is , we have in every state . Let us consider another policy which chooses in state . As illustrated in Fig. 4, starting from the state and following the policy and , the system reaches the states and , respectively. From the state , based on an event, the system reaches the state . Suppose, in the state , the action followed by policy is such that it chooses the same action as that of policy and additionally offloads one data user from LTE to WiFi. Clearly, path followed by both the policies end up in the same state . We construct in such a way that from the state onwards, both of them follow the same path. Similar to the previous lemma, the difference of value functions under the policy and is
[TABLE]
Since , we have, . Clearly, this contradicts the original claim that is an optimal policy. Thus, is optimal when there are data users in WiFi, and one data user arrives. The same result can be extended for the case when there are data users in WiFi, more than or equal to one data user in LTE, and one data user arrives.
Statements (b) and (c) can be proved in a similar way.
Appendix C Proof of Lemma 5
To prove this lemma, we consider two cases, and . We prove the required for the first case. Proof of the second case follows in a similar manner. From Lemma 4, we know that for , is better than . Thus, for , the choice is between and . To prove this lemma, we first prove that the value function is concave in . In Lemma 1 and 2, we have already derived the structure of the optimal policy for data user arrival and departure of voice and data users. Now, for , with the aid of this, the optimality equation is as follows.
[TABLE]
Let the components in Equation (6) be denoted by and , respectively. We prove the concavity of component-wise. Start the VIA with . Hence, is concave in . Let us assume that . Equivalently, . Let us define the function as follows.
[TABLE]
By definition,
[TABLE]
Thus, we have,
[TABLE]
Let us define .
[TABLE]
[TABLE]
Since is concave in , is concave in .
Now, we need to prove that is concave in . In other words, we need to prove that . Let us assume that and are the maximizing actions in states and , respectively. Therefore,
[TABLE]
Let us take . To prove that is concave in , we need to prove that . There are four cases as described below.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Thus, it is proved that is concave in . Since this holds for every and every , is concave in .
Similarly, in the case of the second component, let .
Thus, Therefore, is concave in . Similarly, other components also can be proved to be concave in . Therefore, is concave in .
Let us define . In order to prove this lemma, we know that if state is blocking, then Due to concavity of , . Now, . As a consequence, Thus, it is proved that if state is blocking, then the state is also blocking.
To prove that if state is blocking, then the state is also blocking, we first need to prove that the value function is submodular in . In other words, we need to prove that . Similar to the previous proof, we prove the above statement component-wise. Let us assume that and are the maximizing actions in states and , respectively. Start the VIA with . Therefore, is submodular in . In other words, . We have,
[TABLE]
Now, we consider four possible cases.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Thus, it is proved that is submodular in .
Similarly, in the case of the second component, we have, . Therefore, we have, . Similarly, other components also can be proved to be submodular in . Therefore, the value function is submodular in .
Now, if state is blocking then we have, Again, we have, . Due to submodularity, we have . Thus, in the case of voice arrival, if is optimal in state , then in state also is optimal.
Proof of follows directly from the proof of part .
Appendix D Proof of Lemma 6
To prove this lemma, we consider two cases, and . We demonstrate the proof of the lemma for the first case. Proof of the second case follows in similar manner. To prove this lemma, we first need to prove that for , the difference of value functions is decreasing in . For and , the optimality equation can be described as
[TABLE]
Let us assume that .
Equivalently, .
In other words,
[TABLE]
We prove the above claim component-wise. Start the VIA with . Therefore, is decreasing in . We also have, . Now, Let us define and .
[TABLE]
Therefore, .
[TABLE]
Therefore, is decreasing in .
Now, we need to prove that is decreasing in . In other words, we need to prove that . Let us assume that and are the maximizing actions in states and , respectively. Therefore,
[TABLE]
Let us take . To prove that is decreasing in , we need to prove that . There are four cases as described below.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Thus, it is proved that is decreasing in . Since this holds for every and every value of , is decreasing in .
Let . Thus, Therefore, is decreasing in . Similarly, other components can be proved to be decreasing in . Therefore, is decreasing in .
Similar to Lemma 5, using this property it can be proved that if the optimal action for voice user arrival in state is blocking, then the optimal action in state is also blocking.
Proof of follows directly from the proof of part .
Acknowledgment
This work is funded by the Department of Electronics and Information Technology (DeitY), Government of India.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Rebecchi, M. D. de Amorim, V. Conan, A. Passarella, R. Bruno, and M. Conti, “Data Offloading Techniques in Cellular Networks: a Survey,” IEEE Communications Surveys & Tutorials, Vol. 17, No. 2, pp. 580-603, Nov 2014.
- 2[2] Cisco, Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 2016–2021, Mar 2017.
- 3[3] IEEE 802.11-2012, Part 11, “Wireless LAN Medium Access Control (MAC) and Physical Layer (PHY) Specifications,” Mar 2012.
- 4[4] 3GPP TS 23.234 v 12.0.0, “3GPP System to Wireless Local Area Network (WLAN) Interworking; System Description,” Available online at http://www.3gpp.org/Dyna Report/23234.htm , Sep 2014.
- 5[5] M. Gerasimenko, N. Himayat, S.P. Yeh, S. Talwar, S. Andreev, and Y. Koucheryavy, “Characterizing Performance of Load-aware Network Selection in Multi-radio (Wi Fi/LTE) Heterogeneous Networks,” in GLOBECOM Workshop, pp. 397-402, Dec 2013.
- 6[6] F. Moety, M. Ibrahim, S. Lahoud, and K. Khawam, “Distributed Heuristic Algorithms for RAT Selection in Wireless Heterogeneous Networks,” in proc. of WCNC, pp. 2220 - 2224, Apr 2012.
- 7[7] M. Yassin, M. Ibrahim, and S. Lahoud, “A Hybrid Approach for RAT Selection in Wireless Heterogeneous Networks,” in proc. of ICCIT, pp. 290-294, Jun 2013.
- 8[8] E. Aryafar, A. Keshavarz-Haddad, M. Wang, and M. Chiang, “RAT Selection Games in Het Nets,” in proc. of INFOCOM, pp. 998-1006, Apr 2013.
