Selection of Waveform Parameters Using Machine Learning for 5G and Beyond
Ahmet Yazar, H\"useyin Arslan

TL;DR
This paper introduces a machine learning-based method for selecting waveform parameters in 5G and beyond, aiming to enhance system flexibility and customization, supported by a novel dataset generation approach and simulation results.
Contribution
It presents a new ML-driven selection mechanism for waveform parameters and a simulation-based dataset generation methodology for flexible 5G systems.
Findings
ML-based parameter selection improves flexibility
Simulation results validate the proposed method
Dataset generation approach aids ML training
Abstract
Flexibility is one of the essential requirements in future cellular communications technologies. Providing customized communications solutions for each user and service type cannot be possible without the flexibility in 5G and beyond. Different optimizations need to be done for the flexibility related structures of 5G and beyond systems. In this paper, a novel machine learning (ML) based selection mechanism for the configurable waveform parameters is designed from the flexibility perspective. Moreover, a simulation based dataset generation methodology is proposed for ML systems. Results of computer simulations are presented using the generated dataset.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Selection of Waveform Parameters
Using Machine Learning for 5G and Beyond ††thanks: ©2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Ahmet Yazar1 and Hüseyin Arslan12
1Department of Electrical and Electronics Engineering, Istanbul Medipol University, Istanbul, 34810 Turkey
2Department of Electrical Engineering, University of South Florida, Tampa, FL 33620 USA
Email: {ayazar,huseyinarslan}@medipol.edu.tr
Abstract
Flexibility is one of the essential requirements in future cellular communications technologies. Providing customized communications solutions for each user and service type cannot be possible without the flexibility in 5G and beyond. Different optimizations need to be done for the flexibility related structures of 5G and beyond systems. In this paper, a novel machine learning (ML) based selection mechanism for the configurable waveform parameters is designed from the flexibility perspective. Moreover, a simulation based dataset generation methodology is proposed for ML systems. Results of computer simulations are presented using the generated dataset.
Index Terms:
5G and beyond, machine learning, multi-numerology, resource allocation, waveform.
I Introduction
5G systems come with various implementation flexibilities as a result of a large diversity of user requirements and service types along with wide variety of wireless channel conditions [1]. This flexibility is an important step for the future of communications systems. It is possible to see further flexibility trend in 6G radio networks [2, 3, 4, 5, 6]. New flexibilities inherently require new decision and selection mechanisms. Hence, there can be a need for different type of solutions together with the conventional ones while the flexibility is increasing.
From the flexibility perspective, ultra reliability, low latency, high security, high spectral efficiency, high energy efficiency, and low complexity are some example requirements of different service types as shown in Figure 1. As a first basic problem, meeting some of these requirements together for a user is not an easy task under various constraints. Therefore, there is a need for proper optimization. The second basic problem is related to provide complete satisfaction for all users simultaneously as summarized in Figure 2. It brings another optimization necessity because of the limited resources like in the knapsack problem. As can be seen from these basic problems, 5G and beyond flexibility comes with different optimization needs. There are several research opportunities for these optimizations in new generation communications systems through alternative solutions like Machine Learning (ML) that is one of the most popular recent trends not only for communications but also for various applications. In comparison with conventional methods, ML based solutions can be useful especially if there are many different relationships with the high number of parameters in a difficult problem.
ML can be employed for different optimization aims in wireless communications. Several ML paradigms and challenges for 5G systems are investigated in [7] and [8]. In [9], deep learning (DL) and wireless networks are discussed together in depth. Our paper tries to find an answer to the question of “how 5G and beyond base stations decide on the waveform parameters of each user in a cell using ML”. There were 500, 1000, and 1500 parameters in 2G, 3G, and 4G, respectively [10]. It is not difficult to except that 5G and beyond nodes will have more configurable parameters considering this trend. Related with these adjustable parameters, there is a large number of waveform parameters in 5G New Radio (NR). Most of these parameters are controlled by the base station. Conventional algorithms may not find the optimum parameters in such a rich set of options. The base station needs to provide the optimum waveform parameters for each user to solve two basic resource allocation problems that are given in Figure 2. ML based methods can play an important role at this point. In the literature, end-to-end physical layer of wireless systems is analyzed from the DL perspective in [11]. In another study, authors develop an Orthogonal Frequency Division Multiplexing (OFDM) receiver and equalizer with ML [12]. These are some of the example ML studies that are related with resource allocation and waveform design for 5G and beyond.
A supervised ML based method is developed with this paper for the second goal function in Figure 2 to provide a general structure before the selection of waveform parameters for each user. For example, our method decides on the efficient number of numerologies (frame parameters) that can be assigned to users. However, it does not make user-numerology association directly. It can be done with conventional methods like in [1]. Also, different scheduling algorithms like [13] can be applied to enhance reliability and spectral efficiency after the decision of general structure for configurable waveform parameters. In this paper, the main focus is on the general structure as a first step and it is done with ML methods. Contributions include designing a ML based selection mechanism for the configurable waveform parameters in 5G and beyond communications and introducing a simulation based dataset generation methodology for the objective of the waveform parameter selection.
The rest of the paper is organized as follows: Section II presents system model and assumptions. Waveform parameters and possible class labels for the dataset are described in Section III. Feature extraction and automatic class labelling processes are explained in Section IV. In Section V, simulation results of ML methods are provided. Finally, Section VI gives the conclusion.
II System Model and Assumptions
The proposed supervised ML based method is designed to use in the first step of the overall system model as shown in Figure 3(a). Decisions of the first step need to be employed while deciding on the user-based parameter assignments in BS. The first step of the model aims to increase overall system satisfaction.
The number of different service type element of the scenarios is three in 5G systems and they are listed as enhanced mobile broadband (eMBB), ultra reliable low latency communications (uRLLC), and massive machine type communications (mMTC). These service types and their requirements are used in the algorithms. Users can belong to one of 5G services and different Rayleigh fading channel models are used for each user. Therefore, the proposed ML system model can be assumed as channel-aware and service type-aware.
Dataset is generated using the results of three performance metrics in multi-numerology based 5G NR simulation. We assume that the 5G NR simulation gives perfect results. Performance metrics include signal to interference plus noise ratio (SINR), spectral efficiency, and flexibility. It is assumed that flexibility changes directly proportional to the number of numerologies. 5G NR numerology parameters are used according to 3GPP standard documents [14]. We allocate users with same numerologies contiguously in the frequency domain. It is assumed that each numerology block that consists of multiple carriers can be shared by multiple users.
III Waveform Parameters and Possible Class Labels for the Dataset
BSs make numerology assignment and resource allocation for each user according to user feedbacks. Additionally, BSs decide on the optional waveform processing techniques include windowing, filtering, guard utilization, and CP utilization. The difficulty of these decisions comes with the diversity of different requirements. Besides, the decisions need to be given considering all users together in a general structure as shown in Figure 3. In this section, various waveform parameters and then possible class labels for a dataset are provided.
III-A Waveform Parameters in 5G and Beyond
Long Term Evolution (LTE) employs Cyclic Prefix (CP) OFDM waveform with single numerology in downlink. CP-OFDM waveform with multiple numerologies is preferred in downlink of 5G NR. There are also optional waveform processing techniques like windowing and filtering in 5G NR. If it is assumed that there are options for multiple numerologies and options for waveform processing techniques, 5G NR has different options in the CP-OFDM based transmitter.
Multi-numerology structure of 5G NR provides different frame parameters for users in a cell. Four numerologies are defined for data transmission in 5G NR [15]. In the future, more numerology structures can be defined to increase flexibility. Multi-numerology structures bring good flexibility in 5G but there is only one adjustable parameter which is a subcarrier spacing [16]. The other numerology parameters depend on the subcarrier spacing CP duration, slot duration, maximum allowed bandwidth, and the number of slots per subframe are dependent to subcarrier spacing because of the simplicity and practicality. If it is assumed that users with same numerologies are located contiguously in the frequency domain and it is not mandatory to employ all of the numerologies, there can be numerology options constituted with four numerologies. If the number of waveform related independent adjustable parameters increases with beyond 5G, there are more options for multiple numerologies [1]. For example, adjustable CP duration and utilization is an important concept for beyond 5G. Additionally, using one common CP for different numerologies can also be another new concept in 5G beyond and it also changes the number of numerology options [17].
Waveform processing techniques of 5G transmitters include windowing usage, filtering usage, and guard utilization. All of these techniques are defined as optional in 3GPP [18]. There can be more optional techniques in 5G beyond systems. Multiple numerologies and the other non-orthogonality sources increase the necessity of waveform processing techniques [17]. These techniques require various variable parameters. For example, several prototype filters in the literature including rectangular, raised-cosine, Gaussian and so on are provided in [19]. Different filters and the related coefficients increase the number of options for waveform processing techniques.
III-B Possible Class Labels for the Dataset
As a summary, there are different type of numerology groups and waveform processing techniques. As mentioned, there are four types of numerologies but 64 numerology groups in 5G. For the waveform processing techniques, the same windowing, filtering and guard utilization can be employed for all users or different options can be preferred for each user. Guard bands can be used between numerologies in the frequency domain.
There are classes under these conditions. Each class gives a different set of waveform parameters. The class labels can be defined with a multidimensional lookup table like in Figure 4. More than 1000 classes can be considered for 5G cellular systems. The number of classes may increase exponentially for 5G beyond.
The number of class labels has direct effects on the difficulty of learning problem and computational complexity of training ML models. Additionally, if there are more class labels, a larger dataset is needed. Because of these restrictions, we only takes 10 classes in this paper for the sake of simplicity. It is assumed that there are four options about the numerology related parameters. Using four numerologies, three numerologies, two numerologies, and only one numerology are four basic different options. The other details of user-numerology association can be done in the next steps without ML. On the other side, it is assumed that there is only three guard band options as the waveform processing techniques. Windowing and filtering options are not included at this stage. The details of 10 class labels are given in Figure 4. There is not any guard band necessity in Class-10 because there is only one numerology for this class.
IV Feature Extraction and Automatic Class Labelling
IV-A Feature Extraction
It assumed that there are users in a cell. For all these users, independent random data is generated for different scenarios (e.g. thousands of random scenarios) based on the channel information in [20]. Scenarios are defined with the parameters of random maximum excess delay, random maximum Doppler effect, and random service type (eMBB, uRLLC, or mMTC). If there are input random parameters (e.g. I is three in our example), parameters are generated for each scenario. Therefore, the generated raw dataset includes rows that are equal to the number of different data. The raw dataset also includes columns. In the future cellular systems, there can be more type of services. In addition, there will be more different type of new channel models for 5G beyond, especially for millimeter wave systems. It means that needs to be increased with the future wireless communications.
Each data in the raw dataset includes parameters. is taken as three in our design. Feature extraction methods are chosen to make the design independent from the number of users. Seven features are extracted to make classification process efficiently. Details of features are provided in Figure 5. The number of features can change.
IV-B Simulation Based Automatic Class Labelling
In simulation based dataset generation, there is a need to analyze the results of one or more performance metrics while deciding on the class labels automatically. The target performance metrics should be selected carefully for each scenario. All options of the waveform parameter sets are tried one by one on the simulation setup and three performance metrics (SINR, spectral efficiency, and flexibility) are calculated in each time. Then, the most successful waveform parameters are determined as a class label as shown in Figure 6. Different amount of priorities and weights to the related performance metrics are given considering the service type majority. As an example, spectral efficiency metric has more priority for eMBB service but SINR metric has more priority for uRLLC service. If there are high number of users with all type of services, then the flexibility metric has also priority because the overall system needs to meet with many different requirements together.
After the automatic class labelling process, the number of samples in each class is counted. If there is a high variance for the number of samples in each class, some of the samples in the most frequent classes are removed from the dataset. Otherwise, ML training can be biased for the most frequent classes in the dataset.
V Simulation Results
In the dataset, there are 114420 samples (rows) with seven feature columns and one class label column corresponding to random scenarios. Class distributions are equal. The dataset is divided as training, validation, and testing with 80%, 20%, and 20% ratios, respectively. Automatic class labelling is employed using a multi-numerology based CP-OFDM simulator that is designed for 3GPP 5G NR standards.
MATLAB platform is preferred in the computer simulations. ’fitcecoc’ and ’patternnet’ functions of Statistics and Machine Learning Toolbox and Deep Learning Toolbox, respectively, are used while training various ML models. Hyperparameter optimizations are done with the same functions. Discriminant classifier, KNN clasifier, decision tree clasifier, naive Bayes classifier, and neural networks (NN) are employed during the simulations. The results of relatively more succesful models are presented in Figure 7. Confusion matrices show that success rates changes between 60% and 65% for 10 classes. Additionally, if neighbour classes in Figure 4 are grouped together, the success rates vary between 90% and 93% for the same classifier models. For example, if the decision for number of numerologies three or four, it can be acceptable while neighbour classes are grouped. However, confusion matrices in Figure 7 are provided without grouping neighbour classes. Receiver operating characteristic (ROC) curves in Figure 7(a) and Figure 7(b) shows the results without and with grouping neighbour classes, respectively.
As it can be seen from the simulation results, selection of waveform parameters is not an easy task. If the number of class labels (parameter sets) is increased, the problem will be more difficult. Modulation types and orders, the other waveform processing techniques like windowing and filtering parameters can be included for the class labels. In this case, there are more than 1000 class labels in a dataset. Hence, ML mechanisms may not be sufficient by itself under these conditions. Conventional methods need to be employed with ML mechanisms together to obtain better success results.
VI Conclusion
5G and beyond BSs need ML based decision units and optimization modules. This paper provides a detailed example to understand how can we use ML in 5G and beyond wireless systems. The proposed dataset generation methodology can be used to develop large datasets for better ML models in several studies. As a future work, the remaining waveform processing techniques like windowing and filtering will be included in our class labelling system. Also, different modulation types can be included with bit error rate (BER) performance metric. DL based testbed will be tested with a larger dataset. Moreover, the other steps like user-numerology association will be carried out with ML. As a concluding remark, feasibility of ML based systems should be analyzed to see that ML provides a better solution compared to the practical and effective conventional solutions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Yazar and H. Arslan, “A Flexibility Metric and Optimization Methods for Mixed Numerologies in 5G and Beyond,” in IEEE Access, vol. 6, pp. 3755-3764, Feb. 2018.
- 2[2] W. Saad, M. Bennis and M. Chen, “A Vision of 6G Wireless Systems: Applications, Trends, Technologies, and Open Research Problems,” Available online at https://arxiv.org/abs/1902.10265, Feb. 2019.
- 3[3] V. Raghavan and J. Li, “Evolution of Physical-Layer Communications Research in the Post-5G Era,” in IEEE Access, vol. 7, pp. 10392-10401, Jan. 2019.
- 4[4] F. Tariq, M. Khandaker, K.-K. Wong, M. Imran, M. Bennis, M. Debbah, “A Speculative Study on 6G,” Available online at https://arxiv.org/abs/1902.06700, Feb. 2019.
- 5[5] M. Giordani, M. Polese, M. Mezzavilla, S. Rangan, M. Zorzi, “Towards 6G Networks: Use Cases and Technologies,” Available online at https://arxiv.org/abs/1903.12216, Mar. 2019.
- 6[6] K. David and H. Berndt, “6G Vision and Requirements: Is There Any Need for Beyond 5G?,” in IEEE Vehicular Technology Magazine, vol. 13, no. 3, pp. 72-80, Sep. 2018.
- 7[7] C. Jiang, H. Zhang, Y. Ren, Z. Han, K. Chen and L. Hanzo, “Machine Learning Paradigms for Next-Generation Wireless Networks,” in IEEE Wireless Communications, vol. 24, no. 2, pp. 98-105, Apr. 2017.
- 8[8] R. Li et al., “Intelligent 5G: When Cellular Networks Meet Artificial Intelligence,” in IEEE Wireless Communications, vol. 24, no. 5, pp. 175-183, Oct. 2017.
