Deep Learning for Cross-Technology Communication Design
Anatolij Zubow, Piotr Gaw{\l}owicz, Suzan Bayhan

TL;DR
This paper introduces DeepCTC, a deep learning autoencoder approach for efficient cross-technology communication in OFDM systems, enabling simultaneous in-technology and CTC transmissions with flexible weighting and heterogeneous receiver support.
Contribution
It presents a novel DL-based autoencoder design for joint optimization of in-technology and CTC communication, outperforming classical methods in flexibility and efficiency.
Findings
Successful decoding of in-technology and CTC messages with low error rates
Supports heterogeneous OFDM receivers in CTC broadcasts
Demonstrates feasibility of deep learning for cross-technology communication
Abstract
Recently, it was shown that a communication system could be represented as a deep learning (DL) autoencoder. Inspired by this idea, we target the problem of OFDM-based wireless cross-technology communication (CTC) where both in-technology and CTC transmissions take place simultaneously. We propose DeepCTC, a DL-based autoencoder approach allowing us to exploit DL for joint optimization of transmitter and receivers for both in-technology as well as CTC communication in an end-to-end manner. Different from classical CTC designs, we can easily weight in-technology against CTC communication. Moreover, CTC broadcasts can be efficiently realized even in the presence of heterogeneous CTC receivers with diverse OFDM technologies. Our numerical analysis confirms the feasibility of DeepCTC as both in-technology and CTC messages can be decoded with sufficient low block error rate.
|
|
|
|
|
|
||||||||||||||||
| 802.11n/ac | 17.5 | 20 | 64 | 312.5 | 3.2 | ||||||||||||||||
| 802.11ax | 17.5 | 20 | 256 | 78.125 | 12.8 | ||||||||||||||||
| LTE-LAA/U | 18 | 30.72 | 2048 | 15 | 66.6 | ||||||||||||||||
| WiMAX | 18.4 | 22.4 | 2048 | 10.94 | 91.4 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWireless Signal Modulation Classification · Wireless Communication Security Techniques · Speech and Audio Processing
Deep Learning for Cross-Technology Communication Design
Anatolij Zubow, Piotr Gawłowicz, Suzan Bayhan
Technische Universität Berlin, Germany
{zubow, gawlowicz, bayhan}@tkn.tu-berlin.de
Abstract
Recently, it was shown that a communication system could be represented as a deep learning (DL) autoencoder. Inspired by this idea, we target the problem of OFDM-based wireless cross-technology communication (CTC) where both in-technology and CTC transmissions take place simultaneously. We propose DeepCTC, a DL-based autoencoder approach allowing us to exploit DL for joint optimization of transmitter and receivers for both in-technology as well as CTC communication in an end-to-end manner. Different from classical CTC designs, we can easily weight in-technology against CTC communication. Moreover, CTC broadcasts can be efficiently realized even in the presence of heterogeneous CTC receivers with diverse OFDM technologies. Our numerical analysis confirms the feasibility of DeepCTC as both in-technology and CTC messages can be decoded with sufficient low block error rate.
Index Terms:
Machine Learning, networking research, wireless, cross-technology communication
I Introduction
The latest advances in the area of wireless cross-technology communication (CTC) opened new ways to mitigate the coexistence problems in dense deployments of heterogeneous wireless technologies in the unlicensed spectrum [1, 2, 3]. CTC enables direct over-the-air communication between heterogeneous devices, e.g. LTE with WiFi [4] or WiFi with ZigBee [3], that is otherwise not possible due to incompatible communication layers. However, designing efficient, i.e. high-data rate, and generic CTC solutions is very challenging. Existing CTC proposals are either technology-specific and therefore costly, or generic but inefficient, hence, limiting the scope of possible coexistence applications, i.e., only medium or even only long-term cross technology management.
Machine learning (ML) in general and deep learning (DL) in particular is successfully applied in fields like computer vision. Despite being extremely challenging to describe images from the real world with traditional mathematical models for detection and classification of objects in images, an ML/DL based algorithm can easily recognize images given sufficient number of training examples. Similarly, efficient CTC is hard to realize using conventional rule-based design due to several reasons. First, due to the diversity of technologies and their incompatible communication layers, CTC solutions need to be designed for each pair of supported technology separately which is very costly as with different technologies CTC solutions need to be developed. Second, due to spectrum diversity even two nodes of same technology may require different CTC signals as they may use only partially overlapping spectrum bands, e.g. WiFi on two overlapping channels. Third, as the CTC signal is transmitted overlay, i.e. superimposed on the underlying in-technology transmission, it may result in mutual interference which need to be carefully controlled as otherwise certain QoS assertion cannot be met for the in-technology communication. Forth, in an environment with a myriad of diverse technologies, the support of an efficient CTC broadcast operation is crucial. Instead of transmitting the CTC data to each CTC receiver independently, e.g. in round-robin fashion, the direct utilization of the broadcast nature of the wireless channel is more efficient but requires the design of a CTC signal which can be decoded by all the heterogeneous receivers in vicinity. As today’s available models fall short of capturing all aspects of CTC, we expect ML/DL-based design to yield significant improvements in CTC. As recently shown in the literature, e.g. by O’Shea et al. [5], ML/DL-based design is promising as even a complicated communication system can be represented as a DL autoencoder. Moreover, it becomes practically relevant to use ML/DL as the computational resources of communication devices grow substantially and first ML-specialized ASIC technology has already become commercially available, e.g. Google TPU or Nvidia Jetson.
Contributions: Our key contributions are as follows:
- •
With propose DeepCTC and show that it is possible to learn full transmitter and receiver implementations in an OFDM-based communication network with in-technology and CTC communication competing for capacity. This is challenging as at the transmitter side the CTC signal needs to be superimposed on the in-technology signal resulting in mutual interference. Such a setup can be represented as a deep neural network (DNN) with two inputs and multiple outputs, where one output represents the decoded message at the in-technology receiver and the others are the decoded CTC message at each CTC receiver. The resulting DNN can be trained as an autoencoder. The learned transmitter and receiver implementations are jointly optimized towards certain performance metrics like block error rate (BLER).
- •
DeepCTC allows us to arbitrarily weight the in-technology against CTC communication, i.e. to achieve prioritization in terms of BLER.
- •
DeepCTC allows a transmitter to learn optimal CTC broadcasting strategy, which is paramount especially in coexistence settings with many CTC receivers with diverse OFDM technologies, e.g. LTE and WiFi.
II Related Work
Recently, the usage of ML/DL techniques for physical layer design of communication systems sparked the interest of the research community. O’Shea et al. [5] have shown that a communication system can be represented as a DL autoencoder. Hence, the communications system design can be seen as an end-to-end reconstruction task that seeks to jointly optimize transmitter and receiver components in a single process. The authors showed that the idea can be extended to networks of multiple transmitters and receivers, i.e. interference channel. In [6], this idea was extended to a single-user end-to-end MIMO communication system over a Rayleigh fading channel. Moreover, Borgerding et al. [7] showed that DL can be used to recover the sparse signal from noisy linear measurements in MIMO environments. Here the proposed scheme was able to outperform the traditional algorithms in estimating the massive-MIMO channel. Ye et al. [8] showed how DL can be used for channel estimation and signal detection in OFDM systems. In this paper, we extend the related work towards an OFDM-based CTC system with in-technology and CTC transmissions.
III Background Knowledge
This section provides a brief overview of OFDM and DL.
III-A OFDM Primer
Due to its high spectral efficiency and robustness to frequency-selective fading, many wireless technologies, e.g., 3GPP LTE and IEEE WiFi, use Orthogonal Frequency Division Multiplexing (OFDM). Moreover, it is likely that OFDM will remain the technology of choice for future wireless networks. The idea behind OFDM is threefold: (i) parallel transmissions over multiple narrow-band channels, (ii) orthogonal subcarriers and (iii) subcarrier level modulation and coding scheme (MCS) adaptation. First, an OFDM transmitter (TX) splits a high-rate data stream into many lower-rate parallel streams. Owing to a lower rate, the symbol duration can be longer compared to the single channel modulation schemes, which helps to reduce Inter-Symbol Interference (ISI) occurring in multipath environments. Second, the TX divides the wireless channel bandwidth into narrow-band subcarriers such that subcarriers are orthogonal to each other and thereby transmissions on these carriers do not interfere. Each data stream then is transmitted on one of these subcarriers. Third, the TX can apply a different MCS for each subcarrier to cope with the unfavorable channel conditions including frequency selective fading and narrow-band interference.
OFDM can be efficiently realized using Fast Fourier Transform (FFT). Specifically, a transmitter modulates incoming data bits into subcarriers. Then, it relies on inverse FFT to convert the frequency domain representation into the time domain and sends the signal over the air interface. An OFDM receiver reverses the steps performed at the transmitter, thus, it executes FFT on the received signal to convert it back to the frequency domain and demodulates subcarriers independently.
The operation of an OFDM transmitter can be abstracted as spreading data on a two-dimensional grid (in frequency (subcarrier) and time (symbol) domains) which we will refer to as OFDM time-frequency grid (OTFG) hereafter. Despite relying on the same modulation approach, current OFDM-based wireless networks use vastly diverse parameters for their operation. Table I presents typical parameters of various OFDM-based wireless technologies.
III-B Deep Learning (DL) Primer
Deep learning (DL) has been successfully applied in a wide range of areas with significant performance improvement, ranging from computer vision [9], natural language processing [10] to speech recognition [11]. The structure of the DNN architecture is given in Fig. 1. In order to improve the ability in representation or recognition deep neural networks (DNN) use an increased number of hidden layers. Each network layer consists of multiple neurons, each of which has an output that is a nonlinear function of a weighted sum of neurons of its preceding layer.
The widely used nonlinear functions are Sigmoid, tanh or ReLU. Hence, the output of the DNN n is a cascade of nonlinear transformation of input data I, mathematically expressed as [8]: where is the number of layers and denotes the weights of the DNN. The parameters of the model are the weights of the neurons, which need to be optimized before the network can be deployed. Usually, the optimal weights are learned on a training set, with known desired outputs.
Numerous DL tools are available that make it simple to build, train and use large NNs. Therefore, normally high-level programming languages like Python are used to hide the complexity of training routines using massively parallel GPU architecture. The most widely used ML tools are Caffe and TensorFlow111https://www.tensorflow.org/. Keras222https://keras.io/ provides a high-level API allowing to create and optimize even a very complex NN in just a few lines of code. Keras runs on top of TensorFlow that allows to represent numerical computation as a data flow graphs, i.e. nodes represent mathematical operations, while edges represent the multidimensional data arrays that flow between them. It is worth to mention, that with only a minor change of configuration parameters, the TensorFlow library is able to execute the same computing graphs on a single CPU and GPU as well as distributed clusters of them.
IV Cross-Technology Communication (CTC)
The traditional way to achieve bridging between different wireless technologies is to use multi-radio gateways. Such an indirect approach requires additional hardware for the gateway which is not only costly but also unnecessarily increases deployment complexity. Moreover, it is inefficient as the same message has to be sent multiple times, i.e. once on each technology, and may result in a bottleneck at the gateway. As a result, a new field of research termed as CTC has emerged. Pioneering works like [12, 13] showed that it is possible to exchange messages among heterogeneous wireless technologies despite their incompatible physical layer modulation - cmp. Fig 2 with Fig 3.
Known examples from literature are WEBee [3] which allows a WiFi device to talk directly to a ZigBee node and LtFi [4] which enables CTC between LTE and WiFi. We can categorize the prior CTC solutions into two as packet-level and physical-layer CTC solutions. Packet-level solutions convey the CTC message by modulating bits into either frame length [12, 14], gap or inter-frame spacing [13, 4], packet transmission power [15], timing of periodic beacon interval [16], prepending legacy packets with a customized preamble containing sequences of energy pulse [17],[18, 13, 19]. On the other hand, physical-layer solutions manipulate the signal to enable CTC. Some outstanding proposals in this category are WeBee [3], TwinBee [20] and LongBee [21] which target CTC between WiFi and ZigBee. In WeBee, a WiFi device emulates the ZigBee OQPSK signal by properly selecting the payload of WiFi frame. In a similar spirit, ULTRON [1] proposes to emulate a WiFi OFDM signal at the LTE-U transmitter.
In this paper, we focus on the problem of joint encoding of in-technology and CTC data for OFDM-based wireless technologies. Note that such a CTC modulation where the CTC signal is superimposed on top of the in-technology communication can be seen as a form of downlink superposition coding [22]. Note as different OTFGs are used at transmitter and CTC receiver, the cross-technology OFDM signal reception is not trivial. Specifically, orthogonal OFDM symbols of the transmitter interfere with each other in time and/or frequency in the heterogeneous receiver. As an example, consider the case of LTE and WiFi (802.11n) where an LTE OFDM symbol is longer in time and narrower in frequency as compared to WiFi (Table I). In Fig. 4, we illustrate the issue of cross-technology OFDM signal reception using an analogy from the image processing area.
V Design Goals and Overview of DeepCTC
As mentioned in § IV, our focus is on CTC modulations where the CTC message is superimposed on top of the underlying in-technology transmission. Such an approach creates an overhead on the underlying in-technology transmission either in increased bit error rate due to mutual interference or increased frame length as additional redundancy for forward error correction might be needed. Hence, such a trade-off needs to be considered when designing a CTC solution.
The second challenge is the fact that in a mixed environment a node is surrounded by different types of OFDM technologies which differ in their configuration, mainly in their OTFG, i.e. sizes of the FFT bins in both time and frequency. Hence, the transmitter has to account for that mismatch when encoding CTC messages for the receiver. This is especially challenging as for an efficient CTC broadcast, i.e. a CTC message needs to be received and decoded by all CTC receivers having different OTFG. Hence, we envision the following ML/DL approach. First, a node joining the radio spectrum performs scanning for different technologies in vicinity using methods like [23, 24]. After discovering the OTFG dimensions used by the technologies in vicinity, the node loads the corresponding pre-trained DeepCTC model from which it automatically derives the transmitter and receiver functionality. When a technology leaves the vicinity, the aforementioned process needs to be repeated by excluding the leaving technology. Note that in typical coexistence scenarios usually only the base stations of the networks, e.g., LTE eNB and WiFi AP, need to directly communicate for coordination of their spectrum sharing. Hence, as those entities are usually static and there is only a limited number of available technologies, it is feasible to preload the trained models on the devices.
VI System Model
An OFDM-based communication system with in-technology and CTC communication is shown in Fig. 5. It consists of a transmitter, channel(s) and multiple receivers. Moreover the receivers can be classified into in-technology and CTC receiver(s). The transmitter wants to communicate two types of messages: i) in-technology message and ii) CTC message to the in-technology and CTC receivers, respectively. Therefore, it selects one out of possible messages for in-technology and also one out of possible messages for CTC communication. The two messages are superimposed on each other, mapped to the transmitters OTFG and communicated to the receiver(s) making discrete uses of the channel333 is the size of the transmitter’s OTFG.. Hence the transmitter applies the transformation to the message and transformation to the message to generate the transmitted signal . In addition the transmitter imposes certain constraints on x like energy, power and amplitude constraints [5].
The communication rate of such a communication system is [bit per OTFG use], where , and , where , for in-technology and CTC communication, respectively. The channels are described as conditional PDFs , where denotes the received signal at the in-technology receiver. The signal for the CTC receivers passes the channel , where denotes the received signal at CTC receiver. Note that as the CTC receivers uses a different OTFG, has a different dimension as x. Upon reception, the in-technology receiver applies the transformation to produce the estimate of the transmitted message . A similar transformation is applied at the CTC receivers, i.e. to produce the estimate of the transmitted message . Recall that CTC receivers might use different values for and , i.e., different OTFG.
From the perspective of DL, such a communication system can be seen as a type of autoencoder [25]. The task of the autoencoder in the context of a communication system is to learn representations x of the transmitted messages that are robust with respect to the channel impairments mapping x to y (i.e., noise, fading, distortion, etc.), so that the transmitted message can be recovered with small probability of error [5].
In the case of CTC, the autoencoder is even more different. In addition to learning an intermediate representation robust to channel impairments, it has to learn how to superimpose the CTC signal on top of in-technology signal robust to mutual interference. Moreover, the autoencoder needs to learn that the in-technology and CTC receivers are diverse as they use OTFGs of different dimensions. Hence it has to learn an intermediate representation that allows both the in-technology and the CTC receivers to recover their respective messages with small probability of error.
Note that the proposed approach can be extended to CTC systems where the nodes use different but at least partially overlapping spectrum, e.g. two WiFi nodes operating on overlapping channels in 2.4 GHz band. Here, only the overlapping spectrum can be used for transmission of CTC messages.
VII DeepCTC: An Autoencoder for CTC
VII-A Model Building
The proposed autoencoder for CTC is shown in Fig. 6. The transmitter consists of a feedforward NN with multiple dense layers and a reshape layer for each input message, in-technology and CTC. The reshaping is needed in order to map the signal to the OTFG used by the transmitter. The outputs of the two reshape layers is merged using an add layer, i.e. superposition of the in-technology and CTC messages. The final layer is the normalization layer that ensures that the physical constraints on x are met. Note that the two inputs and to the transmitter are encoded as a one-hot vector. The OFDM channel is represented by a noise layer with fixed variance. Both the in-technology and CTC receivers are implemented as feedforward NNs.
The in-technology receiver is simpler. The first layer is a reshape layer which converts the parallel OFDM signal into a serial one. It is followed by multiple dense layers whereas the last one uses softmax activation whose output is a probability vector over all possible in-technology messages. Here the decoded message corresponds to the index of the element of p with the highest probability. The CTC receiver is slightly different as it contains two additional layers before the reshape layer, namely reducemean and repeatvector layers. Those two layers are needed to map transmitter’s OTFG on the one used by the CTC receiver, i.e. different OFDM parameters like FFT size.
VII-B Model Training
The models are trained by viewing OFDM (de-)modulation, i.e. (I)FFT, and the wireless channels as black boxes (see Fig. 5). The autoencoder can be trained end-to-end using stochastic gradient descent (SGD) on a set of all possible in-technology and CTC messages. Therefore, we take the individual cross-entropy loss functions at the in-technology and CTC receiver, respectively. We train the two coupled autoencoders with conflicting goals. Our approach consists of minimizing a weighted sum of both losses, i.e., for some . This is needed as minimizing alone () would result in the transmitter to transmit a weak and/or constant signal for that the in-technology receiver can simply subtract from . However, it would make the decoding of at the CTC receiver impossible. By selecting a proper , a proper weight can be assigned to in-technology and CTC transmission, hence allowing to control the trade-off between the decodability (BLER) of the two transmissions.
VIII Implementation
We have implemented the proposed CTC autoencoder using open source ML libraries like Tensorflow and Keras. Most of the needed NN layers are part of keras like Input, Dense, GaussianNoise, Reshape, RepeatVector and Add. Missing layers like ReduceMean where implemented using keras Lambda layer and Tensorflow. The model was trained with a data set of size whereas the testing data size was . After the training the autoencoder in AWGN channel we used the trained model to derive the the encoder used by the transmitter and the two decoders used by the in-technology and CTC receivers respectively.
IX Results
We have performed several experiments to demonstrate the feasibility of DL-based CTC design. This section summarizes the results obtained from numerical simulations.
IX-A Joint transmission of in-technology and CTC traffic
The proposed approach allows to weight the importance of in-technology communication against the superimposed CTC by controlling the value of . Fig. 7 shows the block error rate (BLER), i.e., and , of a communications system employing binary phase-shift keying (BPSK) modulation, , and an OTFG on the transmitter side of size . While the in-technology receiver uses the same 2D grid as the transmitter, the CTC receiver’s grid was , i.e. FFT size was larger.
We can observe in Fig. 7 that the CTC transmission fails with probability of 1 even for high SNR if . This is because the full preference is given to the in-technology communication. With a slightly smaller value , both communications can proceed simultaneously. Note the slight degradation of the in-technology communication of at most 1 dB. For and SNR of dB, our DeepCTC achieves almost error-free communication for both CTC and in-technology traffic.
IX-B Serving CTC broadcast
In case of multiple heterogeneous CTC receivers, efficient CTC broadcasting is paramount as otherwise a CTC transmitter has to send its CTC message to each CTC receiver in CTC unicast mode in a round-robin fashion. Instead, we can let the proposed autoencoder to learn efficient broadcast communication towards multiple heterogeneous CTC receivers. Therefore, in this experiment, we disable in-technology communication and send only CTC messages towards two CTC receivers having different FFT sizes, one with OTFG and the other with OTFG. The autoencoder was configured with a weight of 1 for both receivers and trained.
Fig. 8 shows the results. We see three curves for the three considered scenarios. Two curves represent the scenarios with homogeneous receivers, i.e. both receivers are configured with either or OTFG. The third curve represents the mixed environment where one receiver uses OTFG whereas the other OTFG. As we are interested in the broadcast performance, each curve shows maximum of the individual BLERs, i.e. where and are the BLER experienced by the two CTC receivers. We can see that DeepCTC is able to learn to encode the CTC signal in such a way that it can be decoded with high probability by two CTC receivers with different OTFG sizes - cmp. curves homogeneous B with heterogeneous A+B.
X Conclusions & Future Work
This paper shows that a cross-technology communication (CTC) system can be represented as a deep learning (DL) autoencoder. This approach allows us on one hand to jointly optimize the transmitter and receiver of both in-technology and CTC communication. On the other hand, efficient CTC broadcasting becomes possible. As future work, we plan to perform end-to-end learning over real channels and hardware. We expect DL to give significant improvements as technology, hardware, and channel together form a black-box whose input and output can be observed, but for which no exact analytic expression is known a priori [5]. Another goal is to perform transfer learning where the E2E CTC system trained on a statistical model is adopted to a real-world implementation [26].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] E. Chai, K. Sundaresan, M. A. Khojastepour, and S. Rangarajan, “LTE in unlicensed spectrum: Are we there yet?” in ACM Mobi Com , 2016.
- 2[2] A. Zubow, P. Gawlowicz, and S. Bayhan, “On Practical Coexistence Gaps in Space for LTE-U/Wi Fi Coexistence,” in European Wireless , 2018.
- 3[3] Z. Li and T. He, “WE Bee: Physical-Layer Cross-Technology Communication via Emulation,” in ACM Mobi Com , 2017.
- 4[4] P. Gawłowicz, A. Zubow, and A. Wolisz, “Enabling Cross-technology Communication between LTE Unlicensed and Wi Fi,” in IEEE INFOCOM , 2018.
- 5[5] T. O’Shea and J. Hoydis, “An introduction to deep learning for the physical layer,” IEEE Transactions on Cognitive Communications and Networking , vol. 3, no. 4, pp. 563–575, 2017.
- 6[6] T. J. O’Shea, T. Erpek, and T. C. Clancy, “Deep learning based mimo communications,” ar Xiv preprint ar Xiv:1707.07980 , 2017.
- 7[7] M. Borgerding, P. Schniter, and S. Rangan, “Amp-inspired deep networks for sparse linear inverse problems,” IEEE Transactions on Signal Processing , vol. 65, no. 16, pp. 4293–4308, 2017.
- 8[8] H. Ye, G. Y. Li, and B.-H. Juang, “Power of deep learning for channel estimation and signal detection in ofdm systems,” IEEE Wireless Communications Letters , vol. 7, no. 1, pp. 114–117, 2018.
