Improving Channel Charting with Representation-Constrained Autoencoders

Pengzhi Huang; Oscar Castañeda; Emre Gönültaş; Saïd Medjkouh; Olav Tirkkonen; Tom Goldstein; Christoph Studer

arXiv:1908.02878·eess.SP·August 9, 2019


TL;DR

This paper enhances channel charting by integrating side information into autoencoders, improving the spatial accuracy of user equipment positioning solely from channel-state information without extensive measurements.

Contribution

It introduces representation-constrained autoencoders that incorporate side information to better preserve the global geometry of channel charts for positioning.

Findings

01 Representation constraints improve channel chart quality

02 Autoencoders recover global geometry effectively

03 Positioning accuracy is enhanced without GPS or extensive data

Abstract

Channel charting (CC) has been proposed recently to enable logical positioning of user equipments (UEs) in the neighborhood of a multi-antenna base-station solely from channel-state information (CSI). CC relies on dimensionality reduction of high-dimensional CSI features in order to construct a channel chart that captures spatial and radio geometries so that UEs close in space are close in the channel chart. In this paper, we demonstrate that autoencoder (AE)-based CC can be augmented with side information that is obtained during the CSI acquisition process. More specifically, we propose to include pairwise representation constraints into AEs with the goal of improving the quality of the learned channel charts. We show that such representation-constrained AEs recover the global geometry of the learned channel charts, which enables CC to perform approximate positioning without global…

Tables

Table I: Summary of proposed representation constraints for AEs (known quantities are underlined).

Name	Constraint	Regularizer
Fixed absolute distance (FAD)	$\|\mathbf{y}_{i}-\underline{\mathbf{y}}_{j}\|=\underline{d}_{i,j}$	$(\|\mathbf{y}_{i}-\underline{\mathbf{y}}_{j}\|-\underline{d}_{i,j})^{2}$
Fixed relative distance (FRD)	$\|\mathbf{y}_{i}-\mathbf{y}_{j}\|=\underline{d}_{i,j}$	$(\|\mathbf{y}_{i}-\mathbf{y}_{j}\|-\underline{d}_{i,j})^{2}$
Maximum absolute distance (MAD)	$\|\mathbf{y}_{i}-\underline{\mathbf{y}}_{j}\|\leq\underline{d}_{i,j}$	$\max\{\|\mathbf{y}_{i}-\underline{\mathbf{y}}_{j}\|-\underline{d}_{i,j},0\}^{2}$
Maximum relative distance (MRD)	$\|\mathbf{y}_{i}-\mathbf{y}_{j}\|\leq\underline{d}_{i,j}$	$\max\{\|\mathbf{y}_{i}-\mathbf{y}_{j}\|-\underline{d}_{i,j},0\}^{2}$

Table II: TW, CT, and KS results for channel charting with and without representation constraints.

Metric	K	Q-LoS Plain	Q-LoS FAD	Q-LoS FAD&MRD	Q-NLoS Plain	Q-NLoS FAD	Q-NLoS FAD&MRD
TW	1	0.8468	0.8516	0.8576	0.8480	0.8492	0.8597
TW	51	0.8597	0.8570	0.8651	0.8502	0.8560	0.8665
TW	102	0.8609	0.8642	0.8736	0.8546	0.8626	0.8739
CT	1	0.9700	0.9195	0.9321	0.9281	0.8924	0.9110
CT	51	0.9440	0.9067	0.9215	0.9168	0.8928	0.9128
CT	102	0.9358	0.9081	0.9216	0.9155	0.8964	0.9151
KS	-	0.3548	0.2652	0.2598	0.4096	0.2749	0.2693

Equations

\mathbf{y}_{n} = f^{e}(\mathbf{x}_{n}) \quad \text{and} \quad \mathbf{x}_{n} = f^{d}(\mathbf{y}_{n}), \quad n = 1, \ldots, N.

L(\mathcal{W}^{e}, \mathcal{W}^{d}) = \frac{1}{N} \sum_{n=1}^{N} \|\mathbf{x}_{n} - f^{d}(f^{e}(\mathbf{x}_{n}))\|^{2}.

\nabla_{\mathbf{y}_{i}} (\|\mathbf{y}_{i} - \mathbf{y}_{j}\| - \underline{d}_{i,j})^{2} = 2 (\|\mathbf{y}_{i} - \mathbf{y}_{j}\| - \underline{d}_{i,j}) \frac{\mathbf{y}_{i} - \mathbf{y}_{j}}{\|\mathbf{y}_{i} - \mathbf{y}_{j}\|},

\nabla_{\mathbf{y}_{i}} \max\{\|\mathbf{y}_{i} - \mathbf{y}_{j}\| - \underline{d}_{i,j}, 0\}^{2} = 2 \max\{\|\mathbf{y}_{i} - \mathbf{y}_{j}\| - \underline{d}_{i,j}, 0\} \frac{\mathbf{y}_{i} - \mathbf{y}_{j}}{\|\mathbf{y}_{i} - \mathbf{y}_{j}\|},

\textit{TW}(K) = 1 - \frac{2}{N K (2N - 3K - 1)} \sum_{i=1}^{N} \sum_{j \in \mathcal{U}_{i}^{K}} (r(i,j) - K),

\textit{CT}(K) = 1 - \frac{2}{N K (2N - 3K - 1)} \sum_{i=1}^{N} \sum_{j \in \mathcal{V}_{i}^{K}} (\hat{r}(i,j) - K),

\textit{KS} = \frac{\sum_{n,m} (\delta_{n,m} - \beta \hat{\delta}_{n,m})^{2}}{\sum_{n,m} \delta_{n,m}^{2}},


Full text


Improving Channel Charting with Representation-Constrained Autoencoders

Pengzhi Huang1, Oscar Castañeda1, Emre Gönültaş1, Saïd Medjkouh1,

Olav Tirkkonen2, Tom Goldstein3, and Christoph Studer1

1*School of Electrical and Computer Engineering, Cornell University, Ithaca, NY; email: [email protected]

2School of Electrical Engineering, Aalto University, Finland; e-mail: [email protected]

3Department of Computer Science, University of Maryland, College Park, MD; e-mail: [email protected]

* The work of PH, OC, EG, SM, and CS was supported by Xilinx, Inc. and by the US NSF grants ECCS-1408006, CCF-1535897, CCF-1652065, CNS-1717559, and ECCS-1824379. The work of TG was supported by the US NSF under grant CCF-1535902 and by the US Office of Naval Research grant N00014-17-1-2078. The work of OT was funded in part by the Academy of Finland (grant 319484).

Abstract

Channel charting (CC) has been proposed recently to enable logical positioning of user equipments (UEs) in the neighborhood of a multi-antenna base-station solely from channel-state information (CSI). CC relies on dimensionality reduction of high-dimensional CSI features in order to construct a channel chart that captures spatial and radio geometries so that UEs close in space are close in the channel chart. In this paper, we demonstrate that autoencoder (AE)-based CC can be augmented with side information that is obtained during the CSI acquisition process. More specifically, we propose to include pairwise representation constraints into AEs with the goal of improving the quality of the learned channel charts. We show that such representation-constrained AEs recover the global geometry of the learned channel charts, which enables CC to perform approximate positioning without global navigation satellite systems or supervised learning methods that rely on extensive and expensive measurement campaigns.

I Introduction

Autoencoders (AEs) are single- or multi-layer neural networks widely used for dimensionality-reduction tasks [1, 2, 3, 4]. AEs learn low-dimensional representations (embeddings) of a given high-dimensional dataset and have been shown to accurately preserve spatial relationships in both high- and low-dimensional space for a broad range of synthetic and real-world datasets [2]. With the success of deep neural networks, AEs are also gaining increased attention for unsupervised learning tasks [5]. Notable application examples of AEs include learning word embeddings [6], image compression [7], generative models [8], and channel charting [9, 10]. AEs are typically trained in an unsupervised manner, i.e., without labels; potential side information on the training data is routinely ignored, and application-specific structure is not imposed on the representations during training.

AEs that impose structural constraints on the latent variables include sparse AEs [11] and variational AEs [12]. Sparse AEs enforce sparsity on the representations, which enables one to learn embeddings with low effective dimensionality. Variational AEs learn an embedding drawn from a distribution that represents the high-dimensional input [12]; such AEs have been shown to be able to generate complex data, such as handwritten digits [13], faces [14], or physical scenes [14].

I-A Representation-Constrained Autoencoders for Positioning

A range of dimensionality-reduction applications provide valuable side information that can be imposed on the low-dimensional representations. Such side information may stem either from the dataset itself or from the way data was collected. One example arises when data is acquired over time, where it may be natural to enforce constraints between representations by exploiting the fact that, for temporally correlated datapoints, the associated low-dimensional representations should be similar. Another example arises when a subset of the representations are known a-priori, e.g., when a small part of the training data has been annotated. The obtained information can then be translated into representation constraints, which leads to semi-supervised training of AEs.

Representation constraints are important for positioning users in wireless systems using channel charting (CC) [9, 10]. CC measures high-dimensional channel-state information (CSI) of user equipments (UEs) transmitting data to an access point or cell tower. By collecting CSI at multiple spatial locations over time, one can train an AE for which the low-dimensional representations reflect relative UE positions. While the original CC method [9] enables relative localization without access to global navigation satellite systems (GNSS) and without dedicated measurement campaigns [15, 16], valuable side information should not be ignored when available. As UEs move with finite velocity, one could consider this information when training an AE to ensure that temporally correlated datapoints are nearby in the representation space. One could also imagine that certain points in space with known location (e.g., a coffee shop) can be associated with measured CSI; this helps to pin down a subset of spatial locations in the representation space, which enables absolute positioning. Put simply, enforcing constraints between representations may improve the efficacy of AE-based CC.

I-B Contributions

This paper investigates representation constraints for AEs and provides a framework for including these during training. We propose constraints on pairs of representations in which either the absolute or relative distance among (a subset of) representations is enforced. We formulate these constraints as nonconvex regularizers, which can easily be included into well-established deep-learning frameworks. We highlight the efficacy of representation constraints for the application of CC-based positioning in wireless systems: in particular, we demonstrate that by combining partially-annotated locations with temporal UE constraints, the positioning performance of CC [9] can be improved significantly.

I-C Relevant Prior Art

Autoencoders provide excellent dimensionality-reduction performance with real-world datasets [2]. Despite their success, AEs do not aim at preserving geometric properties on the representations (such as distances between points); this is in stark contrast to, e.g., Sammon’s mapping [17] or multidimensional scaling [18]. Furthermore, AEs commonly ignore side information that stems from the application at hand. We will extend AEs with pairwise representation constraints that can improve performance of dimensionality reduction tasks.

The efficacy of deep learning has been explored recently for wireless positioning [19, 20, 21]. The methods in these papers rely on extensive measurement campaigns and require CSI measurements annotated with exact position information. To avoid the drawback of such supervised methods, CC, as put forward in [9], uses dimensionality reduction to extract relative position information of users without the necessity of costly measurement campaigns. CC exploits the fact that CSI is high-dimensional, but strongly depends on UE position, which is low-dimensional. Dimensionality reduction applied to CSI measurements learns a channel chart, in which nearby points represent nearby locations in true space—exact position information is not available. However, CC naturally provides representation constraints originating from the acquisition process—we will include such constraints into AEs to significantly improve CC.

II Representation-Constrained Autoencoders

We briefly summarize the basics of AEs and then introduce the concept of pairwise representation constraints.

II-A Autoencoders in a Nutshell

AEs take a high-dimensional dataset consisting of $N$ datapoints (vectors) $\mathbf{x}_{n}\in\mathbb{R}^{D}$ , $n=1,\ldots,N$ , of dimension $D$ , and learn two functions: the encoder $f^{e}:\mathbb{R}^{D}\to\mathbb{R}^{D^{\prime}}$ and the decoder $f^{d}:\mathbb{R}^{D^{\prime}}\to\mathbb{R}^{D}$ . The encoder maps datapoints onto low-dimensional representations $\mathbf{y}_{n}\in\mathbb{R}^{D^{\prime}}$ , $n=1,\ldots,N$ , where $D^{\prime}\ll D$ is the dimension of the representation, and the decoder maps representations back to datapoints:

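$$\mathbf{y}_{n}=f^{e}(\mathbf{x}_{n})\quad\text{and}\quad\mathbf{x}_{n}=f^{d}(\mathbf{y}_{n}),\quad n=1,\ldots,N.$$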

The encoder and decoder functions of AEs are implemented as multilayer (shallow or deep) feed-forward neural networks that are trained to minimize the mean-square error (MSE) between the input and the output of the network, specifically:

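$$L(\mathcal{W}^{e},\mathcal{W}^{d})=\frac{1}{N}\sum_{n=1}^{N}\|\mathbf{x}_{n}-f^{d}(f^{e}(\mathbf{x}_{n}))\|^{2}.$$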

The parameters to be learned from $\{\mathbf{x}_{n}\}_{n=1}^{N}$ are the weights and bias terms contained in the sets $\mathcal{W}^{e}$ and $\mathcal{W}^{d}$ , which define the encoder and decoder neural networks, respectively.
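The MSE training objective described above can be illustrated with a minimal NumPy sketch. The linear encoder and decoder, the toy data, and the names `mse_loss`, `encoder`, and `decoder` are assumptions made for illustration only; the paper uses multilayer neural networks.

```python
import numpy as np

def mse_loss(X, encoder, decoder):
    """Mean-square reconstruction error (1/N) * sum_n ||x_n - f_d(f_e(x_n))||^2."""
    Y = encoder(X)        # N x D' low-dimensional representations
    X_hat = decoder(Y)    # N x D reconstructions
    return np.mean(np.sum((X - X_hat) ** 2, axis=1))

# Hypothetical toy data: N = 100 points that lie on a 2-D subspace of R^5.
rng = np.random.default_rng(0)
B = np.linalg.qr(rng.standard_normal((5, 2)))[0]   # orthonormal basis, shape 5 x 2
X = rng.standard_normal((100, 2)) @ B.T

# Linear stand-ins for f^e and f^d (assumption; not the networks of the paper).
encoder = lambda X: X @ B       # R^5 -> R^2
decoder = lambda Y: Y @ B.T     # R^2 -> R^5

loss = mse_loss(X, encoder, decoder)
```

Because the toy data is exactly two-dimensional and the linear maps match its subspace, the loss is zero up to rounding; for data that is not $D^{\prime}$-dimensional the reconstruction is only approximate.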

The $D^{\prime}$-dimensional output of the encoder $f^{e}$ is typically of lower dimension than the intrinsic dimension of the manifold embedding the inputs $\mathbf{x}_{n}$ in $D$ dimensions. Hence, the reconstruction is only approximate, i.e., $\mathbf{x}_{n}\approx f^{d}(f^{e}(\mathbf{x}_{n}))$, $n=1,\ldots,N$, unless the dataset $\{\mathbf{x}_{n}\}_{n=1}^{N}$ is $D^{\prime}$-dimensional and the AE learns the underlying structure. Nevertheless, AEs often find low-dimensional representations $\{\mathbf{y}_{n}\}_{n=1}^{N}$ with small loss that capture the intrinsic dimensionality of the input datapoints.

II-B Pairwise Representation Constraints

In Table I, we propose four distinct pairwise representation constraints, where the underlined quantities represent constant scalars or vectors that are known a priori and used during AE training; non-underlined quantities are optimization variables.

II-B1 Fixed Distance Constraints

The fixed absolute distance (FAD) and fixed relative distance (FRD) constraints enforce a known distance $\underline{d}_{i,j}$ on a pair of representations according to $\|\mathbf{y}_{i}-\mathbf{y}_{j}\|=\underline{d}_{i,j}$. The difference between FAD and FRD is that, for FAD, one of the two representations, e.g., $\underline{\mathbf{y}}_{j}$, is a constant known prior to AE learning; for FRD, both representations $\mathbf{y}_{i}$ and $\mathbf{y}_{j}$ are optimization variables. To facilitate the inclusion of these constraints in deep learning frameworks, we propose to use regularizers (see Table I) for which generalized gradients exist. Concretely, the generalized gradient of the FRD regularizer with respect to the representation $\mathbf{y}_{i}$ is

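$$\nabla_{\mathbf{y}_{i}}(\|\mathbf{y}_{i}-\mathbf{y}_{j}\|-\underline{d}_{i,j})^{2}=2(\|\mathbf{y}_{i}-\mathbf{y}_{j}\|-\underline{d}_{i,j})\frac{\mathbf{y}_{i}-\mathbf{y}_{j}}{\|\mathbf{y}_{i}-\mathbf{y}_{j}\|},$$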

where for the FAD constraint the representation $\mathbf{y}_{j}$ is known a priori, i.e., $\mathbf{y}_{j}=\underline{\mathbf{y}}_{j}$ . If $\underline{d}_{i,j}=0$ , then the FRD regularizer promotes equality among $\mathbf{y}_{i}$ and $\mathbf{y}_{j}$ , whereas the FAD regularizer will learn a representation $\mathbf{y}_{i}$ that is close to the constant vector $\underline{\mathbf{y}}_{j}$ . Intuitively, the FAD constraint for $\underline{d}_{i,j}=0$ acts as a semi-supervised extension in which a subset of representations are known a-priori.
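As a small worked example, the fixed-distance regularizer and its generalized gradient can be written in NumPy as follows. The function names and the `eps` guard for the case $\mathbf{y}_{i}=\mathbf{y}_{j}$ (where the sketch returns a zero gradient) are illustrative assumptions, not part of the paper.

```python
import numpy as np

def fixed_distance_penalty(y_i, y_j, d):
    """(||y_i - y_j|| - d)^2; for FAD, y_j is a known constant (anchor)."""
    return (np.linalg.norm(y_i - y_j) - d) ** 2

def fixed_distance_grad(y_i, y_j, d, eps=1e-12):
    """Generalized gradient with respect to y_i.

    eps only avoids a division by zero when y_i == y_j (assumed convention:
    the gradient is then zero because the difference vector is zero).
    """
    diff = y_i - y_j
    norm = np.linalg.norm(diff)
    return 2.0 * (norm - d) * diff / max(norm, eps)

y_i = np.array([3.0, 4.0])   # learned representation, ||y_i - y_j|| = 5
y_j = np.array([0.0, 0.0])   # second representation (a constant for FAD)
d = 2.0                      # known distance

penalty = fixed_distance_penalty(y_i, y_j, d)   # (5 - 2)^2
grad = fixed_distance_grad(y_i, y_j, d)         # 2 * (5 - 2) * [0.6, 0.8]
```

Here the representation is too far from its partner, so the gradient points away from $\mathbf{y}_{j}$ and a descent step pulls $\mathbf{y}_{i}$ closer.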

II-B2 Maximum Distance Constraints

The maximum absolute distance (MAD) and maximum relative distance (MRD) constraints enforce a maximum a priori known distance $\underline{d}_{i,j}$ between a pair of representations according to $\|\mathbf{y}_{i}-\mathbf{y}_{j}\|\leq\underline{d}_{i,j}$. For MAD, one of the two vectors in the constraint, e.g., $\underline{\mathbf{y}}_{j}$, is known a priori; for MRD, both representations are learned. We include these constraints as regularizers (see Table I) with the generalized gradient

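$$\nabla_{\mathbf{y}_{i}}\max\{\|\mathbf{y}_{i}-\mathbf{y}_{j}\|-\underline{d}_{i,j},0\}^{2}=2\max\{\|\mathbf{y}_{i}-\mathbf{y}_{j}\|-\underline{d}_{i,j},0\}\frac{\mathbf{y}_{i}-\mathbf{y}_{j}}{\|\mathbf{y}_{i}-\mathbf{y}_{j}\|},$$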

where $\mathbf{y}_{j}=\underline{\mathbf{y}}_{j}$ is known for MAD. Note that if $\underline{d}_{i,j}=0$ , then FAD is equivalent to MAD and FRD is equivalent to MRD.
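The maximum-distance regularizer behaves like a squared hinge: it is inactive while the pair respects the bound and grows quadratically once the bound is violated. A minimal NumPy sketch (function names and the `eps` guard are illustrative assumptions) also shows the equivalence with the fixed-distance penalty for $\underline{d}_{i,j}=0$.

```python
import numpy as np

def max_distance_penalty(y_i, y_j, d):
    """max{||y_i - y_j|| - d, 0}^2; for MAD, y_j is a known constant."""
    return max(np.linalg.norm(y_i - y_j) - d, 0.0) ** 2

def max_distance_grad(y_i, y_j, d, eps=1e-12):
    """Generalized gradient with respect to y_i (zero while the bound holds)."""
    diff = y_i - y_j
    norm = np.linalg.norm(diff)
    return 2.0 * max(norm - d, 0.0) * diff / max(norm, eps)

y_j = np.array([0.0, 0.0])
inside = np.array([1.0, 0.0])    # distance 1 <= d = 2: constraint satisfied
outside = np.array([3.0, 4.0])   # distance 5 >  d = 2: constraint violated

p_in, g_in = max_distance_penalty(inside, y_j, 2.0), max_distance_grad(inside, y_j, 2.0)
p_out, g_out = max_distance_penalty(outside, y_j, 2.0), max_distance_grad(outside, y_j, 2.0)

# With d = 0 the hinge never clips, so the penalty equals the fixed-distance one.
p_zero = max_distance_penalty(outside, y_j, 0.0)
p_fixed = (np.linalg.norm(outside - y_j) - 0.0) ** 2
```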

II-B3 Practical Considerations

We implemented a stochastic optimizer that minimizes the sum of the AE fidelity term (the MSE reconstruction loss of Section II-A) and the constraint regularizers using the Keras and TensorFlow frameworks [22, 23]. Because each penalty term represents a pairwise constraint that may involve two datapoints, the stochastic approximation of the regularizers was formed by randomly sampling constraints rather than datapoints.
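The idea of sampling constraints rather than datapoints can be sketched as follows. This is not the authors' Keras/TensorFlow implementation: the representations are treated as a plain array, only MRD penalties are used, and the rescaling by the number of constraints over the batch size is an assumed (standard) choice that keeps the minibatch estimate unbiased.

```python
import numpy as np

def mrd_penalties(Y, pairs, bounds):
    """Vector of max{||y_i - y_j|| - d_ij, 0}^2, one entry per constraint."""
    dist = np.linalg.norm(Y[pairs[:, 0]] - Y[pairs[:, 1]], axis=1)
    return np.maximum(dist - bounds, 0.0) ** 2

def sampled_regularizer(Y, pairs, bounds, batch_size, rng):
    """Minibatch estimate of the summed penalty, drawn over constraints."""
    idx = rng.choice(len(pairs), size=batch_size, replace=False)
    scale = len(pairs) / batch_size   # rescale so the estimate is unbiased
    return scale * np.sum(mrd_penalties(Y, pairs[idx], bounds[idx]))

rng = np.random.default_rng(1)
Y = rng.standard_normal((6, 2))                              # hypothetical representations
pairs = np.array([[0, 1], [1, 2], [2, 3], [3, 4], [4, 5]])   # constraint index pairs (i, j)
bounds = np.full(len(pairs), 0.5)                            # hypothetical bounds d_ij

full = np.sum(mrd_penalties(Y, pairs, bounds))
estimate = sampled_regularizer(Y, pairs, bounds, batch_size=2, rng=rng)
full_batch = sampled_regularizer(Y, pairs, bounds, batch_size=len(pairs), rng=rng)
```

In a full training loop, the sampled penalty would be added to the reconstruction loss of the datapoints that appear in the sampled constraints, possibly with a weighting factor that the text does not specify.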

II-C Performance Metrics

In order to measure the performance of dimensionality reduction, we use two standard metrics for the local neighborhood-preserving performance: trustworthiness (TW) and continuity (CT). TW measures whether mapping high-dimensional datapoints to the representation space introduces new (false) neighbors. TW is defined as

$$\textit{TW}(K)=1-\frac{2}{NK(2N-3K-1)}\sum_{i=1}^{N}\sum_{j\in\mathcal{U}_{i}^{K}}(r(i,j)-K),$$

where $r(i,j)$ represents the rank of the datapoint $\mathbf{x}_{j}$ when all other datapoints are ordered by their distance to $\mathbf{x}_{i}$ in the high-dimensional space. The set $\mathcal{U}^{K}_{i}$ contains the points that are among the $K$ nearest neighbors of point $i$ in representation space, but not in the high-dimensional space. CT measures whether similar datapoints in the original space remain similar in the representation space, and is defined as

$$\textit{CT}(K)=1-\frac{2}{NK(2N-3K-1)}\sum_{i=1}^{N}\sum_{j\in\mathcal{V}_{i}^{K}}(\hat{r}(i,j)-K),$$

where $\hat{r}(i,j)$ represents the rank of the representation $\mathbf{y}_{j}$ when all other representations are ordered by their distance to $\mathbf{y}_{i}$. The set $\mathcal{V}^{K}_{i}$ contains the points that are among the $K$ nearest neighbors of point $i$ in the high-dimensional space, but not in the representation space. Both TW and CT have values in the range $[0,1]$ and large values indicate that neighborhoods are better preserved.
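A compact NumPy sketch of TW and CT is given below. It assumes the usual convention for these metrics: the ranks in TW are taken in the high-dimensional space, the ranks in CT are taken in the representation space, and $K<N/2$ so that the normalization is valid. The helper `_ranks` and the toy data are illustrative only.

```python
import numpy as np

def _ranks(P):
    """ranks[i, j] = rank of point j among the neighbours of point i (1 = nearest)."""
    D = np.linalg.norm(P[:, None, :] - P[None, :, :], axis=-1)
    np.fill_diagonal(D, np.inf)                 # a point is never its own neighbour
    order = np.argsort(D, axis=1)
    ranks = np.empty_like(order)
    rows = np.arange(len(P))[:, None]
    ranks[rows, order] = np.arange(1, len(P) + 1)[None, :]
    return ranks

def trustworthiness_continuity(X, Y, K):
    """Return (TW(K), CT(K)) for datapoints X (N x D) and representations Y (N x D')."""
    N = len(X)
    rx, ry = _ranks(X), _ranks(Y)
    norm = 2.0 / (N * K * (2 * N - 3 * K - 1))
    off = ~np.eye(N, dtype=bool)
    U = (ry <= K) & (rx > K) & off    # neighbours in the chart, not in the original space
    V = (rx <= K) & (ry > K) & off    # neighbours in the original space, not in the chart
    tw = 1.0 - norm * np.sum((rx - K)[U])
    ct = 1.0 - norm * np.sum((ry - K)[V])
    return tw, ct

rng = np.random.default_rng(2)
X = rng.standard_normal((60, 5))                 # hypothetical datapoints
tw_same, ct_same = trustworthiness_continuity(X, 2.0 * X, K=5)   # neighbourhoods identical
tw_rand, ct_rand = trustworthiness_continuity(X, rng.standard_normal((60, 2)), K=5)
```

A rescaled copy of the data keeps every neighborhood intact and therefore scores 1 on both metrics, whereas an unrelated random embedding scores lower.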

Besides measuring the local-neighborhood-preservation properties via TW and CT, we also consider Kruskal’s stress (KS) [24, 25], which measures how well the global structure in the high-dimensional dataset $\{\mathbf{x}_{n}\}_{n=1}^{N}$ is mapped to the low-dimensional embedding $\{\mathbf{y}_{n}\}_{n=1}^{N}$ . KS is defined as

$$\textit{KS}=\frac{\sum_{n,m}(\delta_{n,m}-\beta\hat{\delta}_{n,m})^{2}}{\sum_{n,m}\delta_{n,m}^{2}},$$

where $\delta_{n,m}=\|\mathbf{x}_{n}-\mathbf{x}_{m}\|$, $\hat{\delta}_{n,m}=\|\mathbf{y}_{n}-\mathbf{y}_{m}\|$, and $\beta=\sum_{n,m}\delta_{n,m}\hat{\delta}_{n,m}/\sum_{n,m}\hat{\delta}_{n,m}^{2}$ is the optimal distance scaling factor, i.e., the value of $\beta$ that minimizes the numerator. KS is in the range $[0,1]$ and smaller values indicate that global geometrical structure is preserved better. If $\textit{KS}=0$, then the geometry is perfectly preserved.
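The KS ratio can be computed directly from the two pairwise-distance matrices. The sketch below takes $\beta$ as the least-squares minimizer of the numerator, i.e., normalized by $\sum_{n,m}\hat{\delta}_{n,m}^{2}$; this is an assumption about the intended scaling and is what guarantees a value in $[0,1]$. Function names and toy data are illustrative.

```python
import numpy as np

def pairwise_distances(P):
    """Matrix of Euclidean distances between all rows of P."""
    return np.linalg.norm(P[:, None, :] - P[None, :, :], axis=-1)

def kruskal_stress(X, Y):
    """Ratio form of KS used in the text, with least-squares scaling beta."""
    delta = pairwise_distances(X)        # distances between datapoints
    delta_hat = pairwise_distances(Y)    # distances between representations
    beta = np.sum(delta * delta_hat) / np.sum(delta_hat ** 2)
    return np.sum((delta - beta * delta_hat) ** 2) / np.sum(delta ** 2)

rng = np.random.default_rng(3)
X = rng.standard_normal((50, 4))                          # hypothetical datapoints
ks_scaled = kruskal_stress(X, 3.0 * X)                    # same geometry up to scale
ks_random = kruskal_stress(X, rng.standard_normal((50, 2)))
```

Because $\beta$ absorbs any global scale, a uniformly rescaled copy of the data has zero stress; only distortions of the relative geometry are penalized.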

III Channel Charting with Representation-Constrained Autoencoders

We now augment the original CC framework [9] with representation constraints that naturally arise from the application. We start by outlining the concept of CC and then explain how representation constraints are included. We then demonstrate the efficacy in comparison to the original CC framework.

III-A Channel Charting in a Nutshell

CC measures CSI from users at different spatial locations and learns a low-dimensional channel chart that preserves locally the original spatial geometry. Put simply, users that are physically nearby will be placed nearby in the channel chart and vice versa—global geometry is typically not preserved. In this framework, high-dimensional features are extracted from CSI, then processed with dimensionality-reduction methods to obtain the low-dimensional channel chart. CC operates in an unsupervised manner, i.e., learning is only based on CSI that is passively collected at an infrastructure base-station (BS), but from multiple user locations in the service area over time. CC opens up many location-based applications as it provides BS providers with relative user location information without access to GNSS or fingerprinting methods [15].

The technical concepts behind channel charting are as follows. Suppose that we have $N$ single-antenna users located in real space with coordinates $\mathbf{z}_{n}\in\mathbb{R}^{3}$ . If the $n$ th user at location $\mathbf{z}_{n}$ is transmitting data to a BS, then the BS first extracts high-dimensional CSI in the form of a high-dimensional vector $\mathbf{h}_{n}\in\mathbb{C}^{D}$ , which represents multi-path scattering and path loss of the wireless channel. From the CSI vector $\mathbf{h}_{n}$ , one can extract features $\mathbf{x}_{n}\in\mathbb{R}^{D}$ that represent large-scale fading properties of the wireless channel. The main assumption of CC is that large-scale fading properties are mostly static and are strongly tied to user location. Specifically, due to the underlying physics of electromagnetic wave propagation, each CSI feature is a (noisy) function of the user position, a function that represents the effect of the (unknown) physical environment on the transmitted signal. One can then learn the channel chart from the set of channel features $\{\mathbf{x}_{n}\}_{n=1}^{N}$ in an unsupervised manner by means of dimensionality-reduction methods. If AEs are used in this procedure, the encoder $f^{e}$ corresponds to the forward charting function that maps CSI features $\mathbf{x}_{n}$ to relative position information $\mathbf{y}_{n}$ in the representation space.
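To make the step from CSI vectors to real-valued features concrete, the sketch below shows one plausible feature map for a uniform linear array: normalize each CSI vector, transform it across antennas with a DFT (an angular-domain view), and take entry-wise magnitudes. The unit-norm scaling is a stand-in assumption; the exact feature scaling of [9] is not reproduced here, and `csi_features` is a hypothetical name.

```python
import numpy as np

def csi_features(H):
    """Map complex CSI (N x B, B antennas) to real features (N x B)."""
    H = H / np.linalg.norm(H, axis=1, keepdims=True)      # assumed unit-norm scaling
    H_ang = np.fft.fft(H, axis=1) / np.sqrt(H.shape[1])   # antenna -> angular domain
    return np.abs(H_ang)                                  # entry-wise magnitude

# Hypothetical single plane wave whose spatial frequency matches DFT bin 5.
B, k = 32, 5
h = np.exp(2j * np.pi * k * np.arange(B) / B)[None, :]
features = csi_features(h)
peak_bin = int(np.argmax(features[0]))
```

For this idealized plane wave all energy lands in one angular bin, which illustrates why such features vary with the direction of the UE and hence with its position.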

Reference [9] proposed the use of Sammon’s mapping (SM) and AEs to learn the channel charts. While SM exhibited good performance, AEs scale well to large problem sizes and provide a parametric mapping that enables one to map new, unseen CSI features to a relative location. Despite the advantages of AEs, valuable side information that arises from the application itself has been ignored. First, in contrast to SM, conventional AEs do not enforce any geometric structure on their representations. Second, when a user’s CSI is tracked over time, the low-dimensional representations of consecutive measurements should be similar because the user’s velocity is finite.

III-B Channel Charting with Representation Constraints

We propose the inclusion of representation constraints to overcome the limitations of the original CC framework. We impose MRD constraints on pairs of representations from a user over time, to ensure nearby representations for nearby spatial locations. We can estimate an upper bound on the distance in the representation space from the CSI acquisition rate and the fact that UEs move with finite velocity. Note that this information comes directly from the CSI measurement process and the fact that we know how data was collected.

Furthermore, to enable CC with true positioning capabilities, we unwrap the channel chart using anchor vectors, i.e., points in space for which we know both their CSI as well as their true location. One can imagine measuring CSI at a small set of locations when setting up a new BS (e.g., by knowing the precise location of a fixed access point). With this information, we impose FAD representation constraints on the AE with $\underline{d}_{i,j}=0$ to enforce the exact anchor positions. The inclusion of such constraints leads to a semi-supervised version of CC (and AEs in general). In contrast to conventional fingerprinting methods that are fully supervised and require training at wavelength resolution in space, we only require a small number of anchor vectors and use the rest of the (unlabeled) data to improve the localization accuracy of the channel chart.
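The two kinds of side information described above translate into constraint lists in a straightforward way. In the sketch below, the velocity bound `v_max`, the timestamps, and the anchor coordinates are hypothetical values, and the rule $\underline{d}_{i,j}=v_{\max}\,(t_{j}-t_{i})$ is one simple way to obtain an upper bound from the acquisition times; the text does not prescribe a specific rule.

```python
import numpy as np

def mrd_constraints_from_track(timestamps, v_max):
    """Consecutive samples of one UE: pairs (i, i+1) with bound v_max * (t_{i+1} - t_i)."""
    t = np.asarray(timestamps, dtype=float)
    idx = np.arange(len(t) - 1)
    pairs = np.stack([idx, idx + 1], axis=1)
    bounds = v_max * np.diff(t)
    return pairs, bounds

def fad_constraints_from_anchors(anchor_indices, anchor_positions):
    """Anchors: (sample index, known position, d = 0) so the representation is pinned."""
    return [(int(i), np.asarray(p, dtype=float), 0.0)
            for i, p in zip(anchor_indices, anchor_positions)]

# Hypothetical track sampled at irregular times (seconds), UE speed at most 1.5 m/s.
pairs, bounds = mrd_constraints_from_track([0.0, 0.5, 1.0, 2.0], v_max=1.5)

# Hypothetical anchors: samples 7 and 42 were measured at known 2-D locations (metres).
anchors = fad_constraints_from_anchors([7, 42], [[10.0, 20.0], [-50.0, 300.0]])
```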

III-C Numerical Results

III-C1 Scenario

Figure 1 depicts the scenario. We measure the CSI of $N=2048$ randomly placed user locations (except for the positions on the “vip” curve) within a rectangular area of $1000$ m $\times$ $500$ m. We model the acquisition of CSI at 0 dB SNR. At spatial location $(x,y,z)=(0,0,10)$ meters, we consider a uniform linear BS antenna array with half-wavelength spacing and $32$ antennas. We model narrowband data transmission at $2$ GHz and consider two channel models: (i) Quadriga LoS (Q-LoS; a realistic model for LoS channels that includes scatterers and path loss); and (ii) Quadriga non-LoS (Q-NLoS; a realistic model for channels where there is no direct path between users and the BS antenna array). For both models, we used the “Berlin UMa” scenario, which has been calibrated with real-world measurements [26]. At the BS side, we extract the same features proposed in [9], i.e., we apply feature scaling to the CSI vectors, convert them to the angular domain, and take the entry-wise absolute value. The input has $D=32$ real dimensions; the representation dimension is $D^{\prime}=2$. We use an AE structure similar to that in [9], namely $9$ hidden dense layers: $4$ dense layers for the encoder and another $4$ dense layers for the decoder (the layers consist of $500$, $100$, $50$, and $20$ neurons), and an intermediate layer with $2$ neurons and linear activation functions from which we extract the channel chart.
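The layer structure described above can be written down as a plain NumPy forward pass to check the shapes and the number of trainable parameters. Several details are assumptions: the decoder is taken to mirror the encoder ($20$, $50$, $100$, $500$ neurons), a linear output layer with $32$ units is appended, and `tanh` is used for the remaining hidden layers because the text only specifies the linear activation of the chart layer.

```python
import numpy as np

# 32 inputs -> encoder (500, 100, 50, 20) -> chart (2) -> decoder (20, 50, 100, 500) -> 32 outputs
LAYER_SIZES = [32, 500, 100, 50, 20, 2, 20, 50, 100, 500, 32]
CHART_LAYER = 4   # index of the dense layer whose output is the channel chart

def init_params(sizes, rng):
    """Random weights and zero biases for each dense layer (illustrative initialisation)."""
    return [(rng.standard_normal((m, n)) / np.sqrt(m), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def forward(x, params):
    """Return (channel chart, reconstruction) for a batch x of shape (batch, 32)."""
    h, chart = x, None
    for k, (W, b) in enumerate(params):
        h = h @ W + b
        linear = (k == CHART_LAYER) or (k == len(params) - 1)
        if not linear:
            h = np.tanh(h)        # assumed hidden activation
        if k == CHART_LAYER:
            chart = h             # 2-D representation with linear activation
    return chart, h

rng = np.random.default_rng(4)
params = init_params(LAYER_SIZES, rng)
chart, recon = forward(rng.standard_normal((8, 32)), params)
num_params = sum(W.size + b.size for W, b in params)
```

Under these assumptions the network has 145454 weights and biases, most of them in the two layers that connect the $500$-neuron layers to their $32$- and $100$-dimensional neighbors.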

III-C2 Results

Figure 2 shows the channel charts. The top row (Figures 2(a)–(b)) shows the results of a “plain” AE, i.e., without any representation constraints; these results reproduce those in [9]. The middle row (Figures 2(c)–(d)) shows channel charts from AEs that include FAD constraints, where we randomly selected $10\%$ of the users to be anchor vectors. Clearly, these FAD constraints unwrap the channel chart and lead to a representation with a distance scale comparable to the original scenario in Figure 1. While the global structure is approximately preserved, the points on the “vip” curve are not represented accurately. The bottom row (Figures 2(e)–(f)) shows the combination of FAD and MRD constraints in AEs. Concretely, we also exploit the fact that points on the “vip” curve model a user’s motion, which lets us impose maximum relative distance (MRD) constraints among pairs of representations pertaining to this curve. FAD and MRD combined are able to reproduce the original scenario: points in the channel chart approximately represent the true locations. We also observe that the propagation conditions of the wireless channel do not substantially affect the performance.

Table II lists the TW, CT, and KS results for the channel charts shown in Figure 2. We see that including representation constraints improves TW (with the single exception of FAD at $K=51$ for Q-LoS) while slightly lowering the CT; note that TW and CT are evaluated for $K=1$, $K=51$ ($2.5$% of the dataset), and $K=102$ ($5$% of the dataset). Hence, we observe a tradeoff with respect to neighborhood-preserving properties. More concretely, an increase in TW means that we are introducing fewer “fake” near neighbors; a reduction in CT means that neighborhoods in the original space are not as well preserved in the channel chart as before. With respect to the global geometric structure, we see that KS significantly improves for all representation-constrained AEs; this implies that the inclusion of constraints enables us to recover global geometry. We note this is also visible in Figure 2, especially in the AE results that include both FAD constraints (anchor vectors) and MRD constraints (to enforce continuity of a user’s motion). Once again, we see that the propagation conditions do not substantially affect the performance.
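To put the KS improvement into relative terms, the short calculation below uses only the KS row of Table II; the dictionary layout and the helper name are illustrative.

```python
# KS values copied from Table II.
ks = {
    "Q-LoS":  {"Plain": 0.3548, "FAD": 0.2652, "FAD&MRD": 0.2598},
    "Q-NLoS": {"Plain": 0.4096, "FAD": 0.2749, "FAD&MRD": 0.2693},
}

def relative_reduction(plain, constrained):
    """Fraction by which KS drops relative to the plain AE."""
    return (plain - constrained) / plain

reductions = {
    model: {name: relative_reduction(vals["Plain"], vals[name])
            for name in ("FAD", "FAD&MRD")}
    for model, vals in ks.items()
}

for model, row in reductions.items():
    for name, value in row.items():
        print(f"{model:7s} {name:8s} KS reduced by {100 * value:.1f}%")
```

For these table entries the constrained AEs lower KS by roughly a quarter to a third relative to the plain AE, with the larger relative gains in the Q-NLoS case.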

IV Conclusions

We have shown how to incorporate pairwise representation constraints into autoencoders (AEs). To demonstrate the effectiveness of representation-constrained AEs, we have shown an improvement to the channel charting (CC) framework in [9], where we use side information on user motion and anchor vectors to improve the positioning performance of CC. Numerical results for this application have shown that the use of representation constraints that are readily available in wireless positioning scenarios yields (often significant) improvements in recovered global geometry.

There are many opportunities for future work. While our methods enable approximate positioning, AEs with representation constraints are still not sufficient to enable GNSS-grade accuracy. Promising research directions towards this goal are the development of improved CSI features as well as the inclusion of additional geometric constraints, e.g., when acquiring CSI from multiple cell-towers or access points.

Bibliography

[1] G. E. Hinton and R. R. Salakhutdinov, “Reducing the dimensionality of data with neural networks,” Science, vol. 313, no. 5786, pp. 504–507, Jul. 2006.
[2] L. van der Maaten, E. Postma, and J. Van den Herik, “Dimensionality reduction: A comparative review,” in J. Mach. Learn. Res., vol. 10, Oct. 2009, pp. 66–71.
[3] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016.
[4] P. Baldi, “Autoencoders, unsupervised learning, and deep architectures,” in Proc. ICML Workshop Unsupervised Transfer Learn., vol. 27, Jul. 2012, pp. 37–49.
[5] J. Geng, J. Fan, H. Wang, X. Ma, B. Li, and F. Chen, “High-resolution SAR image classification via deep convolutional autoencoders,” IEEE Geosci. Remote Sens. Lett., vol. 12, no. 11, pp. 2351–2355, Nov. 2015.
[6] T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean, “Distributed representations of words and phrases and their compositionality,” in Adv. Neural Inf. Process. Syst., Dec. 2013, pp. 3111–3119.
[7] L. Theis, W. Shi, A. Cunningham, and F. Huszár, “Lossy image compression with compressive autoencoders,” arXiv, Mar. 2017.
[8] Y. Bengio, L. Yao, G. Alain, and P. Vincent, “Generalized denoising auto-encoders as generative models,” in Adv. Neural Inf. Process. Syst., Dec. 2013, pp. 899–907.