Joint Computation Offloading and Resource Allocation in Cloud Based Wireless HetNets
Nguyen Ti Ti, Long Bao Le

TL;DR
This paper presents a novel joint computation offloading and resource allocation framework for cloud-based wireless HetNets, optimizing energy consumption and latency with scalable algorithms.
Contribution
It introduces a new approach to solve the complex MINLP problem in HetNets, including a bisection method and a low-complexity algorithm for scalable optimization.
Findings
Achieves about 55% energy savings compared to local computation.
Effectively minimizes maximum weighted energy consumption under latency constraints.
Provides scalable solutions suitable for practical wireless HetNets.
Abstract
In this paper, we study the joint computation offloading and resource allocation problem in the two-tier wireless heterogeneous network (HetNet). Our design aims to optimize the computation offloading to the cloud jointly with the subchannel allocation to minimize the maximum (min-max) weighted energy consumption subject to practical constraints on bandwidth, computing resource and allowable latency for the multi-user multitask computation system. To tackle this non-convex mixed integer non-linear problem (MINLP), we employ the bisection search method to solve it where we propose a novel approach to transform and verify the feasibility of the underlying problem in each iteration. In addition, we propose a low-complexity algorithm, which can decrease the number of binary optimization variables and enable more scalable computation offloading optimization in the practical wireless HetNets.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Joint Computation Offloading and Resource Allocation in Cloud Based Wireless HetNets
Nguyen Ti Ti and Long Bao Le T. T. Nguyen and L.B. Le are with INRS-EMT, University of Québec, Montréal, Québec, Canada. Emails: {titi.nguyen,long.le}@emt.inrs.ca.
Abstract
In this paper, we study the joint computation offloading and resource allocation problem in the two-tier wireless heterogeneous network (HetNet). Our design aims to optimize the computation offloading to the cloud jointly with the subchannel allocation to minimize the maximum (min-max) weighted energy consumption subject to practical constraints on bandwidth, computing resource and allowable latency for the multi-user multi-task computation system. To tackle this non-convex mixed integer non-linear problem (MINLP), we employ the bisection search method to solve it where we propose a novel approach to transform and verify the feasibility of the underlying problem in each iteration. In addition, we propose a low-complexity algorithm, which can decrease the number of binary optimization variables and enable more scalable computation offloading optimization in the practical wireless HetNets. Numerical studies confirm that the proposed design achieves the energy saving gains about 55% in comparison with the local computation scheme under the strict required latency of 0.1s.
Index Terms:
Mobile edge computing, energy saving, computing and resource allocation, HetNet, MINLP, and ILP.
I introduction
The number of global devices has increased drastically in recent years. Moreover, with the multi-task processing capacity, advanced mobile devices have led to proliferation of many computation-intensive applications covering different areas including entertainment, communication, social networking, e-health, image recognition, language processing, and gaming. These computation-extensive applications have demanded more powerful central processing unit (CPU) with higher clock frequency, which will result in significant increase in the mobile energy consumption [1]. Moreover, advancement in mobile battery technology is usually not sufficiently fast to keep up with practical applications’ requirements; therefore, the battery can become the bottleneck to improve the quality of experience (QoE) for mobile users. Consequently, reducing mobile energy consumption in power-hungry applications is of great importance and one very potential solution for this problem is to offload heavy computations tasks to the edge cloud servers using the so-called mobile cloud computing (MCC) technology.
Recent development of small-cell based wireless HetNets promises enormous benefits from both the network and mobile user perspectives. First, deployment of low-power small cells enables efficient reuse of the radio spectrum, which helps increase the spectral efficiency. Second, the close transmitter-receiver proximity allows small cell users to achieve high signal-to-noise ratio (SNR) even with low transmit power. This enables them to meet the low-latency requirements of many emerging applications. Finally, realization of the MCC in small-cell based wireless HetNets can lead to significant benefits such as prolonging battery lifetime and providing high-speed and ultra-low latency communications services in future 5G wireless systems.
Several MCC platforms have been proposed and developed in the literature such as MAUI [2], CloneCloud [3], ThinkAir [4] and a good survey for them with the corresponding computation offloading designs can be found in [5]. In particular, the tradeoff between transmission and computation energy was studied in [6], [7]. The joint computation task offloading scheduling and transmit power allocation of a single-user system was investigated in [8]. Moreover, the authors in [9] studied the multi-user radio resource management problem for the HetNet-MCC system, which always offloads the entire computation task to the cloud. Dynamic computation offloading policies based on Lyapunov optimization were developed in [1], [10]. These existing works, however, only consider the single-cell setting and many practical design aspects of the multi-cell MCC system such as dynamic computation offloading, joint multi-user resource allocation and computing resource assignment, and consideration of practical constraints on bandwidth, operating frequency and tolerable delay limits are not satisfactorily accounted for. Our current work aims to fill this gap in the literature.
In this paper, we study the joint optimization problem for computation offloading and resource allocation where computation tasks are either processed locally at the mobile or offloaded and processed in the cloud. Moreover, offloaded tasks require radio resources for transmissions of the involved data (i.e., programming states). Importantly, this design is conducted for the multi-task multi-user multi-cell setting, which has not been addressed in the literature to the best of our knowledge. The underlying joint computation task, subchannel, and operating clock speed assignment problem, which aims to minimum the maximum weighted consumed energy subject to delay and resource constraints is a non-convex and difficult MINLP problem. Therefore, we employ the bisection search method to solve it where we transform the underlying non-convex INLP problem into an ILP for feasibility verification in each iteration. We also propose a low-complexity algorithm which is based on the decoupled optimization of the resource assignments for the macrocell and small cells.
The remaining of this paper is organized as follows. Section II presents the system model, computation and transmission energy models and problem formulation. Section III describes the solution method to solve the studied problem. Section IV evaluates the performance of proposed algorithms. Finally, Section V concludes the work.
II System Model and Problem Formulation
We consider a two-tier wireless HetNet with small cells (SC), one macro cell (MC), and users (UE). We assume that each cell is serviced by one base station (BS) connected to a common cloud provider via a high-speed fiber cable. Moreover, the cloud is assumed to have unlimited computing resource to serve offloaded computation demands from multiple SUs simultaneously. We denote the set of all BSs as , where [math] denotes the MC BS, and denotes the set of SC BSs, and the set of all users is denoted as , where is the set of macro users (MUE) and is the set of SC users (SUE) associated with BS .
II-A Computation Offloading Model
We assume that UE has the set of independent tasks for his/her application and these tasks can be executed locally at the mobile device or offloaded and executed in the cloud independently over the time interval where is chosen to meet the delay requirement of the underlying application. Moreover, each task has the corresponding number of required CPU cycles (CPUs) and the number of transmission bits of (bits) (i.e., to transmit the involved programming states). We now introduce a binary offloading decision variable for each task as , which can be defined as follows:
[TABLE]
It is further assumed that the processor of UE has the DVFS (Dynamic Voltage and Frequency Scaling) capability so it can adjust the operating frequency clock speed (FCS) in the clock speed set (CPU/s) which corresponds to the underlying application requirements [1]. By introducing binary variables , the operating clock speed can be expressed as , where . Then, the local computation energy in Joule at UE can be expressed as
[TABLE]
where denote the coefficients specified in the CPU model [1]. For mobile devices such as Samsung Galaxy Note or Nexus, the value of is in range of [2.7-3]. In order to satisfy the application QoS, UE should complete its program execution within the delay if such program is executed locally at the mobile device. We assume that the cloud has the replicated version of the execution file of the offloading tasks, which can be, therefore, executed in the cloud in the offloading case.
II-B Transmission Model
We assume that the available spectrum is partitioned into non-overlapping bands, which are then allocated to the MC and SC tiers to avoid the cross-tier interference. Moreover, the spectrum allocated to small cells is assumed to be fully reused over these cells (i.e., the SC spectrum reuse factor is one). Moreover, the OFDMA scheme is assumed where the set of available sub-channels is denoted as .
Let denote the channel gain from UE to BS on subchannel . The transmit power of user on each subchannel is assumed to be the constant , where is the bandwidth of each subchannel, and the noise power density on subchannel at BS is . We represent the subchannel assignment by a binary variable , where if subchannel is assigned to UE , and , otherwise. Then, the transmission rate from UE to the BS can be expressed as
[TABLE]
where
The power required to transmit data related to the program states from each mobile user comprises two parts, namely circuit power (W/Hz) and transmit power (W/Hz), which can be expressed as . Then, the total required energy related to the transmission of UE can be computed as follows:
[TABLE]
where is the transmission time of the program states from the mobile to its BS, which can be calculated as
[TABLE]
The total latency experienced by an offloaded task comprises the time required for sending program states/bits to the cloud, the computation time in the cloud, and the time required for downloading the results to the mobile. However, cloud computation time is relatively small due to the high cloud computation power and the data related to computation results’ download has much smaller size compared to the offloading data in general. Therefore, we neglect the cloud energy consumption and data download transferred time. Moreover, to ensure the constrained latency, the transmit time is required to be smaller than the maximum delay, i.e., .
II-C Problem Formulation
We now present the formulation for the considered problem where our design objective is to minimize the maximum weighted users’ consumed energy. The energy weight, denoted as , represents the priorities or the battery/computation levels of different users [11]. Then, the joint computation offloading and resource allocation problem with latency, radio and computational resource constraints can be stated as
[TABLE]
[TABLE]
In this problem formulation, constraint (C1) captures the transmission latency requirements for offloading process. Constraint (C2) and (C3) represent the MC and SCs bandwidth sharing where each subchannel can be allocated to at most one MUE or one SUE in each SC. Constraint (C4) represents the delay requirements for local computation. Furthermore, constraint (C5) captures the binary offloading decisions while the remaining constraints express the computational capacity of mobile devices.
III Algorithm Development
The considered problem is indeed a non-convex INLP due to the integer optimization variables for allocating tasks, frequency clock speeds’ selection and subchannel assignments and due to the non-convexity of the objective function and constraint functions in (C1). Therefore, this problem is very difficult to solve. To have an insightful description, we first reformulate the min-max objective of as follows:
[TABLE]
III-A Proposed Algorithm (Optimal alg.)
We can now apply the bisection search method to find the optimal min-max users’ energy consumption for the reformulated problem . Specifically, the bisection search method iteratively updates an upper-bound and a lower-bound of the objective value of problem . In particular, in each iteration, we have to verify the feasibility of problem for a given value of based on which we can update and . If the set of constraints is feasible, then upper-bound of objective function will decrease, and inversely its lower-bound will increase. This algorithm will terminate when the difference between upper-bound and lower-bound values becomes sufficiently small. The proposed algorithm which can find the optimal solution of is given in Algorithm 1.
In order to verify the feasibility of problem , we take three major steps to transform all constraints of problem into the linear form. In the first step, we linearize the involved logarithmic functions in (C1) and (C8). In the second step, we attempt to determine whether UEs can locally process their tasks or not for a given value of . In the final step, we introduce some further auxiliary variables to transform the product-form of the obtained constraint functions into the desirable linear form. The obtained linear program after step three can then be solved effectively.
These steps are described in more details for a given value of in the following.
III-A1 Step one
We introduce some auxiliary binary variables as follows:
[TABLE]
The above expression means that the variable will be active if users transmit on the same subchannel . We have to now re-write in (4), which depends on given in (5). Toward this end, the transmission rate from SUE to BS is re-expressed in (7) which is needed in the expression of . We also need (9) to re-write in (4). Moreover, constraint (C) can be now rewritten as in (8).
III-A2 Step two
For a given , UEs will not offload their tasks if they can process all tasks locally. Therefore, to determine whether UEs offload or not, we find the minimum number of transmission bits of UE as when its computing energy is less than . If this value is equal to zero, UE can locally execute its application; therefore the transmission rate will be qual to zero. This problem is formulated as follows:
[TABLE]
To solve problem , we re-express one term in computation energy expression as follows:
[TABLE]
where for and for . The constraint (C) now is the sum of the product of two binary variables, which can be given as
[TABLE]
We now deal with the non-convex term in (12). In general, the product of binaries variables can be converted to the linear inequalities as suggested in [12]. Particularly, the -variable can be expressed equivalently as
[TABLE]
Then applying (13), we can transform the non-convex term to linear form of and . In addition, constraints (C) can be easily converted to a linear form as:
Using these expressions, the considered problem can be transformed into an ILP with optimization variables which can be solved effectively by using the interior-point method or the solver CVX-Gurobi [12].
III-A3 Step three - feasibility verification for problem
We now state some important results in the following two propositions, which correspond to two cases where the offloading decision variables are zero and one, respectively.
Proposition 1: If there exists a feasible solution for given and the optimal value of is 0, then the offloading decision variables are set equal to 1 and this will form a feasible solution.
Proof.
It is clear that the efficient optimization for in must allocate the smallest amount of radio resources to meet the fixed energy level . In fact, if any user, who has computing energy less than and has execution time satisfying the computing delay time, offloads its tasks to the cloud, it will occupy the radio resources of other users demanding for computation offloading, which results in the increase of transmit energy of those users. This proves the proposition. ∎
Proposition 2: If there exists a feasible solution for given and the optimal value of is positive, then the offloading decision variable is set equal to 0 if the optimal solution of equals to 0 and this will form a feasible solution.
Proof.
If the optimal value of is greater than 0, it means that the local computational energy must be less than since the total computation and transmission energy must be less than or equal to . Because the objective of is to minimize the number of transmission bits, UE will consume the least transmission energy with a given radio resource. Therefore, the task offloading decision variables must be set equal to 0 (offload to cloud) if the solution of task allocation of is equal to 0. Note that when the optimal value of is positive, the transmission rate must be greater than 0 to offload data to the cloud. Therefore, we can rewrite the fractional constraint functions of (C) into non-fractional form as (10). ∎
Using the results in Proposition 1, we can set the zero rate for users with equal to one for all tasks . For remaining users whose optimal are equal zero for at least one task according to Proposition 2, we solve problem to determine the computation offloading and resource allocation solution.
The remaining thing is to verify the feasibility of the resulting equivalent problem of the original problem . Applying the same technique as in (13) for UEs having transmission rate , we transform the product-form of (10) into linear-form of , , , , , , and , where , for SUEs, , and for MUEs.
Using these transformations, the set of constraints of problem can be converted to the linear form of , , , , , , and . Therefore, the feasibility verification of the transformed problem can be done effectively by a standard solver.
III-B Low-complexity Algorithm with Decoupled MC-SC Optimization (LC alg.)
We now present a low-complexity algorithm, which can perform well in large-scale wireless HetNets. In this algorithm, we first determine the minimum number of subchannels so that all MC UEs (MUEs) can meet the energy consumption level , which can be stated as follows:
[TABLE]
After solving problem by using the above transformations for MUEs, the set of remaining subchannels that SC UEs (SUEs) can use can be written as .
We can then allocate these remaining subchannels () by solving problem with only SUEs (i.e., we remove any terms related to MUEs in this problem). This problem can be solved by using the proposed Algorithm 1 for only SUEs with the available subchannels . The number of optimization variables in this algorithm now decreases times in compassion with the case where the joint subchannel allocation optimization for MC and SCs is conducted.
IV Numerical Results
The network setting used in our simulations is shown in Fig.1, where there are 12 MUEs in macrocell and 2 SUEs for each small cell. The MC and SC coverage radius are m and m, respectively. All UEs have 11 levels of operating clock frequency uniformly chosen in GHz. For convenience, we assume that all users has 3 tasks and total CPU requirement for each user is Gcycles. The maximum tolerable computing delay is set equal to for all users while the transmission delay is set randomly in . The number of transmission bits/task and CPU cycles/task are illustrated in Fig.2, which are used in scenario 1 (presented in Fig. 3) in which the ratio of is chosen randomly in (as in Fig. 2). The energy weight is set randomly in . The energy coefficients are set for all users as and [1].
We set the number of subchannels as , and the bandwidth per subchannel is kHz, and the noise power density equal to dBm/Hz. The transmission power is set equal to dBm/Hz and dBm/Hz for MUEs and SUEs, respectively and the circuit power . The subchannel gains are generated according to where is a random value generated according to the exponential distribution and denotes the pathloss defined according 3GPP technical report as (dB) for MUEs and (dB) for SUEs [13] where is the geographical distance between UE and BS (km). The stop condition of bisection search is set as .
Fig. 3 shows the min-max weighted energy required to execute the applications of all UEs, which is obtained by averaging the result over 15 system realizations under the proposed optimal (Optimal alg.) and low-complexity (LC alg.) and no computation offload (No offload). The ratio of bits per CPU cycle (BPC) in scenario 2 is 50 percent higher than that in scenario 1. These results show that the smaller the BPC, the smaller the consumed energy for all schemes. For the computation load of Gcycles per user, the min-max weighted energy without offloading is much higher than that due to the proposed schemes under both scenarios. In particular, the proposed optimal algorithm can reduce the energy about compared with the “No offload” scheme in scenario 1. Moreover, the energy consumption in the LC scheme is nearly equal to the global optimal solution due to the “Optimal alg.”.
The computation allocation using LC alg. for different users with s in one system realization is illustrated in Fig. 4. This figure shows that the computation load is distributed fairly equal between users thanks to min-max weighted energy design objective. Moreover, some UEs, having small BPC tasks or high SINR ratio, offload all tasks to the cloud. This figure also shows that the worst UE can offload 0.06 (Gcycles), then its FCS decreases about 30 percent leading to the decrease of computation energy by nearly 2.4 times.
Fig. 5 shows the min-max weighted energy consumption when the number of cycles per task is fixed by (Gcycles) while the BPC is set the same for all UEs. When this parameter is small, the performance gap in terms of min-max energy consumptions between the proposed offloading (“offload”) design and “No offload” scheme is quite large. This means that the proposed scheme can result in great energy reduction.
Fig. 6 shows the achievable min-max weighted energy for scenarios where each user has 2, 4 and 6 tasks and the total computation load is 0.2 Gcycles per user, maximum tolerable latency while the BPC is fixed at . It can be observed that the min-max weighted energy decreases quite drastically as the number of tasks increases. Moreover, if there are two tasks then users send at least one task to the cloud until subchannels while if there are 4 tasks then users can send at least one task to the cloud when . However, in all cases, if sufficient radio resources are available, the UE with largest weighted energy prefers to offload first to achieve the lowest min-max weighted energy. Therefore, the “No offload” have the worst performance in term of energy consumption comparing with the proposed schemes.
V Conclusion
In this paper, we have proposed a general framework for multi-task multi-user multi-cell computation offloading. Considering the practical discontinuity of operating frequency clock speed of real-world chipsets, and the partitioning of computation load into individual tasks, we have formulated the problem which minimizes the maximum weighted energy consumption while maintaining the application latency requirement. We have then developed the optimal and low-complexity algorithms to tackle this problem. Numerical results have confirmed the desirable performance of the proposed algorithms for wireless HetNets which can lead to great saving of the energy consumption.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Kwak, Y. Kim, J. Lee, and S. Chong, “Dream: Dynamic resource and task allocation for energy minimization in mobile cloud systems,” IEEE J. Sel. Areas Commun. , vol. 33, pp. 2510–2523, Dec. 2015.
- 2[2] E. Cuervo, A. Balasubramanian, D. Cho, A. Wolman, S. Saroiu, R. Chandra, and P. Bahl, “MAUI: Making smartphones last longer with code offload,” in Proc. ACM Mobi Sys , 2010, pp. 49–62.
- 3[3] B. Chun, S. Ihm, P. Maniatis, M. Naik, and A. Patti, “Clonecloud: Elastic execution between mobile device and cloud,” in Proc. ACM Euro Sys , 2011, pp. 301–314.
- 4[4] S. Kosta, A. Aucinas, H. Pan, R. Mortier, and X. Zhang, “Thinkair: Dynamic resource allocation and parallel execution in the cloud for mobile code offloading,” in Proc. IEEE INFOCOM , 2012, pp. 945–953.
- 5[5] S. Barbarossa, S. Sardellitti, and P. Di Lorenzo, “Communicating while computing: Distributed mobile cloud computing over 5G heterogeneous networks,” IEEE Sig. Process. Mag. , vol. 31, pp. 45–55, Nov. 2014.
- 6[6] N. T. Ti and L. B. Le, “Computation offloading leveraging computing resources from edge cloud and mobile peers,” IEEE ICC , pp. 1–6, 2017.
- 7[7] C. You, K. Huang, H. Chae, and B.-H. Kim, “Energy-efficient resource allocation for mobile-edge computation offloading,” IEEE Trans. Wireless Commun. , vol. 16, no. 3, pp. 1397–1411, 2017.
- 8[8] K. B. L. Yuyi Mao, Jun Zhang, “Joint task offloading scheduling and transmit power allocation for mobile-edge computing systems,” IEEE WCNC , pp. 1–6, 2017.
