Artificial Intelligence and Location Verification in Vehicular Networks
Ullah Ihsan, Ziqing Wang, Robert Malaney, Andrew Dempster, Shihao, Yan

TL;DR
This paper introduces a neural network-based location verification system for vehicular networks that outperforms traditional methods, especially when prior information about user authenticity is unknown and under challenging signal conditions.
Contribution
The work presents a novel NN-based LVS that functions effectively without prior knowledge of genuine user proportions, improving verification accuracy in vehicular IoT environments.
Findings
NN-LVS outperforms traditional LVS in various real-world scenarios.
Effective under Non-Line-of-Site (NLoS) signal bias.
Maintains high performance even with unknown user authenticity proportions.
Abstract
Location information claimed by devices will play an ever-increasing role in future wireless networks such as 5G, the Internet of Things (IoT). Against this background, the verification of such claimed location information will be an issue of growing importance. A formal information-theoretic Location Verification System (LVS) can address this issue to some extent, but such a system usually operates within the limits of idealistic assumptions on a-priori information on the proportion of genuine users in the field. In this work we address this critical limitation by using a Neural Network (NN) showing how such a NN based LVS is capable of efficiently functioning even when the proportion of genuine users is completely unknown a-priori. We demonstrate the improved performance of this new form of LVS based on Time of Arrival measurements from multiple verifying base stations within the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Artificial Intelligence and Location Verification in Vehicular Networks
Ullah Ihsan1, Ziqing Wang1, Robert Malaney1, Andrew Dempster1 and Shihao Yan2
1School of Electrical Engineering & Telecommunications,
The University of New South Wales,
Sydney, NSW 2052, Australia
2School of Engineering,
Macquarie University,
Sydney, NSW 2109, Australia
Artificial Intelligence and Location Verification in Vehicular Networks
Ullah Ihsan1, Ziqing Wang1, Robert Malaney1, Andrew Dempster1 and Shihao Yan2
1School of Electrical Engineering & Telecommunications,
The University of New South Wales,
Sydney, NSW 2052, Australia
2School of Engineering,
Macquarie University,
Sydney, NSW 2109, Australia
Abstract
Location information claimed by devices will play an ever-increasing role in future wireless networks such as 5G, the Internet of Things (IoT). Against this background, the verification of such claimed location information will be an issue of growing importance. A formal information-theoretic Location Verification System (LVS) can address this issue to some extent, but such a system usually operates within the limits of idealistic assumptions on a-priori information on the proportion of genuine users in the field. In this work we address this critical limitation by using a Neural Network (NN) showing how such a NN based LVS is capable of efficiently functioning even when the proportion of genuine users is completely unknown a-priori. We demonstrate the improved performance of this new form of LVS based on Time of Arrival measurements from multiple verifying base stations within the context of vehicular networks, quantifying how our NN-LVS outperforms the stand-alone information-theoretic LVS in a range of anticipated real-world conditions. We also show the efficient performance for the NN-LVS when the users’ signals have added Non-Line-of-Site (NLoS) bias in them. This new LVS can be applied to a range of location-centric applications within the domain of the IoT.
I Introduction
We are at the verge of new wireless networks that aim to bring communication revolutions in homes, hospitals, education, transportation, and other aspects of society. Emerging Intelligent Transportation Systems (ITS) are particularly exciting due to their potential to save many lives. The success of these new technologies in general, and ITS in particular, find their roots in the true location information of the clients (devices, users, vehicles) involved. In many scenarios it is anticipated that clients can directly obtain their location information [1, 2] through the Global Navigation Satellite System (GNSS). This location information is then usually provided by the clients to other clients or to some central Processing Center (PC), for verification and other network functionality purposes. But what if a client provides incorrect information about its true location intentionally in an attempt to obtain some advantage over other users [3, 4]? Such circumstances could also occur unintentionally due to difficulty in recording the GNSS location information or faulty hardware issues.
The focus of the work in this paper is location verification in the Internet of Things (IoT), in general, and in ITS (a sub-application of IoT) in particular. The location verification framework proposed in this work is applicable to all IoT applications whose performance is related to a user’s reported location. Within ITS, if a malicious user provides inaccurate location information and this goes unnoticed, the possible aftermath could range from sub-optimal traffic routing all the way through to life-threatening collisions [5, 6]. Verification of a user’s reported location information is hence critical for successful operation in ITS [5, 7, 8, 9, 10]. Due to this, Location Verification System (LVS) performance has been a research focus in ITS for well over a decade. Recently, several information-theoretic LVSs have been devised [11, 12, 13]. These LVSs operate under a set of well-defined rules and conditions. Additionally, they have limitations in addressing various anomalies since they usually assume idealized channel conditions [11]. As such, information-theoretic LVSs usually possess performance limitations in real-world situations. One of the most important of these limitations is the a-priori lack of knowledge on the proportion (fraction) of vehicles in the field that will be malicious (alternatively, the fraction that will be genuine).
Neural Networks (NNs) have recently brought breakthroughs into many aspects of modern society. Web mining[14], content filtering[15], image recognition[16, 17], speech processing[18], language identification [19], speaker verification[20], object detection [21, 22], advanced genomics[23], and drug discovery[24] are just a few of the fields impacted. NNs also lay the foundation for many aspects of the self-driving car paradigm[25]. Many of these breakthroughs are achieved through the development of new NN algorithms[26].
We consider an LVS based on Time of Arrival (ToA) measurements [27] under the influence of Non-Line-of-Sight (NLoS) biases. The novel contributions in this work are summarized below.
- •
We show that the NN-LVS proposed in this work outperforms an information-theoretic LVS[27] when the ToA of the users’ signals have added NLoS bias in them.
- •
We also show that unlike the information-theoretic LVS which assumes an a-priori knowledge about the proportion of malicious vehicles in the field, the NN-LVS works satisfactorily in the complete absence of this knowledge.
Recent advancements in digital signal processing and hardware design now provide us with very accurate physical-layer timing information for wireless networks[28, 29]. These developments provide us with the clock synchronization that enables the LVS we study here. As such, we suggest our new NN-LVS can offer a viable and pragmatic solution to the important task of location verification for many IoT applications under real-world conditions and uncertainties.
The remainder of this paper is organized as follows. Section II describes the system model. Section III details the performance analysis using information theory and neural network techniques. Section IV provides the numerical results, and Section V highlights future prospects. Section VI concludes the paper.
II System Model
We outline the system model and assumptions considered for our framework:
The framework consists of trusted Base Stations (BSs) as verifiers with publicly known locations that are assumed to be in the range of the prover (the vehicle whose claimed location is to be authenticated). The location of the i-th BS is where . 2. 2.
The true location from a genuine or malicious vehicle (the prover) is denoted by and is assumed to possess zero localization error. 3. 3.
We refer to the announced (reported) location from a legitimate or malicious vehicle as claimed location and denote it by . For a legitimate vehicle, the claimed location is exactly the same as its true location. On the other hand, a malicious vehicle spoofs its true location to the BSs (to potentially obtain an advantage over other vehicles or to disrupt the system performance). The true location of a malicious vehicle is unknown to the wider network. 4. 4.
One of the BSs is chosen as the PC. Measurements from all BSs are collected at the PC before being processed into a a binary decision related to a vehicle’s claimed location. 5. 5.
Under the null hypothesis , the framework assumes a vehicle to be legitimate, i.e.,
[TABLE] 6. 6.
Under the alternate hypothesis , the framework considers a vehicle to be malicious, i.e.,
[TABLE]
Under , the ToA value measured by the i-th BS from a legitimate vehicle is given by
[TABLE]
where , the BS’s receiver thermal noise, is a zero-mean normal random variable with variance . is the ToA and is given by
[TABLE]
where is the Euclidian distance of i-th BS to a legitimate vehicle’s true location, and is given by
[TABLE]
with as the speed of light.
We assume the measurements made by the N BSs to be independent of each other. Under , they collectively form a vector . Vector follows a multi-variate normal distribution given as
[TABLE]
where is the mean vector under the null hypothesis, and is the covariance matrix.
Under , a malicious vehicle claims to be at a location removed from its true location. In a real-world scenario, we can think of this as if the malicious vehicle pretends to be on the road when he actually is placed off the road in a street or in a building. The ToA value measured by the i-th verifier from a malicious vehicle is given by
[TABLE]
where is a time bias potentially added by malicious vehicle which impacts the overall ToA value. is given by
[TABLE]
where is the Euclidian distance of i-th BS to a malicious vehicle’s true location, and is given by
[TABLE]
We assume the measurements made by N BSs to be independent of each other. Under , they collectively form a vector . Vector follows a multi-variate normal distribution given as
[TABLE]
where and is a vector equal to the length of the number of BSs with all its elements set to 1. For later convenience we set and rewrite (8) as
[TABLE]
III Performance Analysis
The outcome of an LVS is a binary result, i.e., legitimate or malicious. This is different from a localization system where the output is an estimated location. We measure the performance of our LVS using two methodologies; through information-theoretic analysis followed in [27] and, through the newly designed NN method which makes use of the machine-learning techniques. In both cases, a Bayes average cost function is chosen as the performance metric for LVS in terms of the ‘Total Error’. The information-theoretic method is based on the a-priori assumption that the proportion of malicious vehicles is known. Usually this is set to 0.5 in the absence of any other information. On the other hand, the NN-LVS calculates the Total Error irrespective of any such a-priori assumption and it can function with any proportion of malicious vehicles in the field. The Total Error is given by
[TABLE]
where and are the a priori probabilities of occurrences of (i.e., legitimate vehicle) and (i.e., malicious vehicle), respectively, and are set equal, i.e., 0.5. represents the false positive rate (the rate of legitimate vehicles being detected incorrectly) and represents the detection rate (the rate of malicious vehicles being detected correctly). Equation (10) then takes the form
[TABLE]
III-A LVS Performance Analysis Using Information Theory
The Likelihood Ratio Test (LRT) is used for performance measurement of the LVS. It has been proven earlier that the LRT achieves the optimum detection results for a given false positive rate[30]. This leads to the conclusion that the LRT minimizes the Total Error and maximizes the mutual information between input and output of LVS[31]. We follow the decision rule below for the LRT
[TABLE]
where is the likelihood ratio, is the decision threshold, and and are the binary decision values (i.e., whether the vehicle is legitimate or malicious). Given the multi-variate normal form of the observations, (12) can be reformulated as
[TABLE]
III-B LVS Performance Analysis Using Neural Networks
This section highlights the novel approach used to design a classification framework for the verification of a vehicle’s claimed location through supervised machine-learning techniques. The framework uses a multi-layer feed-forward NN for the binary classification of a vehicle as either legitimate or malicious.
For uniformity, the framework considers the same inputs as considered for the information-theoretic method. These inputs include the ToA of the signal based on the user’s claimed location, and (the observation vector influenced by the thermal noise ). Based on a series of trials with changing architectures for the NN-LVS, we finalised a framework that has an input vector, a hidden layer (with 10 neurons), and a binary output layer as shown in Fig. 1. The NN LVS achieved optimum performance through the use of a Hyperbolic tangent sigmoid, and linear transfer functions in the hidden, and output layers, respectively.
IV Numerical Results
We now present some numerical results based on our analysis from the information-theoretic and NN-based LVS. In carrying out these simulations, BSs are located in a 1000 meters by 500 meters area at fixed publicly known locations. This area closely resembles a small district of a city and corresponds to the context of ITS where the BSs are trusted verifiers located on the roadside or in the nearby parking lots. The claimant vehicle (the prover whose location is yet to be verified) resides in a 500 X 500 meters area in between the BSs. In order to simulate the attacking scenario and thus study the performance of both the LVS, we assume there are two claimants that are within the communication range, namely a legitimate vehicle which is reporting its true location to the BSs and a malicious vehicle which is performing the location-spoofing attack. Both the vehicles can overhear the communication between the BSs and thus both acquire the locations of the BSs. If malicious, the vehicle can also overhear the communication between legitimate vehicles and the BSs so that it can forge its claimed location to that of the legitimate vehicle’s true location.
The malicious vehicle sets its true location at a far-off point so that its transmitted signal (with the appropriate timing offset) has equal ToA at all the BSs (in the limit of the true location of a malicious vehicle being much greater that any other scale all NLoS biases at all BSs are the same). Under this approximation the mean ToA at the BSs is just the mean of the timings anticipated from a vehicle at the claimed location. The resultant alteration in ToA due to the receiver’s thermal noise is extracted from a Gaussian random distribution with fixed standard deviation. The value of standard deviation considered in our simulation is set to 300 nanoseconds.
We use simulated ToA data in our numerical experiments. The claimed locations for genuine and malicious vehicles in equal proportion are generated randomly in the specified area. The ToA from the claimed locations at the 4 BSs is calculated using equation (4). The receivers in the BSs are under the influence of independent thermal noise and thus the ToA measurements they make have a certain degree of variation. We extract this variation (in nanoseconds) from a Gaussian random function that has a fixed standard deviation. The area around the vehicles is non LoS and therefore their transmitted signal cannot reach the BSs directly and hence their ToA have an additional NLoS bias in them. To mimic reality, we extract from an exponential distribution as given below
[TABLE]
where is the scale parameter.
For the information-theoretic LVS, we calculate the Total Error, the false positive rate, and the detection rate using equations (11) and (13), respectively.
The data considered for the information-theoretic LVS analysis is used to also train the NN-LVS. We call this data the training data.111By training data we mean ToA data received from vehicles who we know a priori to be legitimate or malicious. Use of such data in order to set the neural network parameters, prior to its use on ‘unlabeled’ data (i.e., data from vehicles who we do not know *a priori * to be legitimate or malicious), is known as the training phase. In the training phase, we feed the NN-LVS with random vehicles data at a speed of one vehicle data per second. During each second the NN-LVS is trained with the available training data. The backpropagation algorithm has a set of internal parameters to terminate the training phase for the NN-LVS. We observe that in most of the cases the maximum validation failures ; which is the maximum number of iterations in a row during which the NN-LVS’s performance fails to improve or remains the same, terminates the training phase. We set this parameter to 6. The weights and biases are considered as optimised once the training phase has concluded. The NN-LVS afterwards can be used to classify a vehicle as genuine or malicious in the test data222The test data is simulated under a different realization with same settings as training data. Further, the test data has no labels..
The NN-LVS is trained during the \nth1 second with an input training data from a single random vehicle. At the end of \nth1 second, we subject the NN-LVS (with its weights and biases optimized) to calculate a Total Error for the test data. In the \nth2 second, we add another random vehicle training data to the previously available single vehicle training data. The combined data forms a new training data set which is used to retrain the NN-LVS (from \nth1 second). After a re-training, the NN-LVS is used to calculate a new Total Error for the test data. We add yet another random vehicle training data to the previously available training data in the \nth3 second and use the updated data set to once again train the NN-LVS. At the end of the third second a revised Total Error is calculated for the test data. This process of updating the training data set, retraining the NN-LVS and recalculating a new Total Error for the test data continues in the following seconds. The Total Error keeps on decreasing with the passage of time.
In Fig. 2 we initially calculate the Total Error for a data set that has genuine and malicious vehicles in equal proportions (Po). The standard deviation for (extracted from a Gaussian distribution) is 300 nanoseconds while the standard deviation for NLoS bias (extracted from an exponential distribution) is indicated by the different curves. The number of BSs used is 4. The LRT (i.e., the Total Error arising from the information-theoretic LVS) corresponding to each NLoS curve is indicated by the dashed arrow lines. We can see that performance for the information-theoretic LVS deteriorates as the NLoS bias increases while the performance for the NN-LVS improves with an increase in the NLoS bias in the ToA data. It is clear that the NN-LVS is able to accommodate the NLoS conditions significantly better than an information-theoretic LVS.
Next, we study the impact of changing Po (the proportion of malicious vehicles) over the performance of LVS. We train a NN-LVS through similar procedures as described earlier but change the proportion of malicious vehicles in the test data. In one of the experiments, we fix the standard deviation for and the NLoS bias both to 300 nanoseconds. The number of BSs are 4. We can see in Fig. 3 that NN-LVS performs consistently even when Po is different in the test data. We can see that NN-LVS performance is satisfactory even when the test data has 99.95% genuine vehicles and 0.05% malicious vehicles. The red line in the Fig. 3 shows the Total Error for the information-theoretic LVS when the genuine and malicious vehicles are in equal proportions in the data (with no changes to and NLoS bias). Our study shows that unlike the information-theoretic LVS whose performance is conditioned to the a-priori knowledge of Po, the NN-LVS’s performance is largely independent of Po.
In Fig. 4, we change the standard deviation for NLoS bias to 500 nanoseconds. still is extracted from a Gaussian random distribution with a standard deviation of 300 nanoseconds. We can observe that NN-LVS’s performance is independent of the Po value.
In Fig. 5, we change number of BSs to 6 while the standard deviation for NLoS bias and are kept the same as in Fig. 4. Still we can see a promising performance from the NN-LVS under different Po(s).
In another work, we also analyzed the performance of the NN-LVS proposed here by subjecting it to a real-world experimental situation where measured Received Signal Strength (RSS) inputs were used instead of simulated ToA values[32]. The RSS from the vehicle was measured at 3 BSs. The performance results for the NN-LVS using input RSS measurements were consistent with the claims of this work, showing our NN-LVS’s adaptability to other metrics beyond ToA.
The NN-LVS framework proposed in this work can be applied to many sub-applications within IoT and 5G whose performance largely depends on the true and verified locations of users. Map services, smart parking, Massive MIMO, Enhanced beamforming, coverage enrichment, and interference mitigation are just a few of the applications which can benefit from a NN-LVS. The same framework can also assist in defence-related operations where location and its verification is of prime importance.
V Future Work
We aim to add more prominent features related to the channel environment so as to further investigate gains achieved by NN-based LVSs. These additional features will better describe the real-world channels. We also plan to compliment ToA systems by adding RSS and Angle of Arrival (AoA) measurements thus enhancing the reliability of NN-based LVSs. A combination of ToA, RSS and AoA will make the NN LVS even more reliable and efficient in its location verification. We also plan to extend the NN framework to more complex channel fading models such as Rician fading channels. Estimating channel parameters through machine-learning will also result in an extended performance for the NN LVS. In this case the neural network architecture will need to be extended so as to accommodate the additional unknowns that must be learned.
VI Conclusion
Information-theoretic LVS frameworks, due to their operating limitations, are not practical in many real-world scenarios. To address this gap, we have proposed the use of a neural network approach to location verification. This approach is particulary useful when we consider that one of the key inputs to any LVS is knowledge on the proportion of vehicles anticipated to be malicious - an input usually unknown. Using simulated ToA data, we have shown how a NN-based LVS outperforms a state-of-the-art information-theoretic LVS. Unlike the information-theoretic LVS, the working of the NN-LVS is shown to be largely independent of the proportion of malicious vehicles in the area. Unknown channel conditions, such as NLoS bias effects, were also shown to be better accommodated by the NN-LVS approach.
More salient real-world channel features, will be considered in our future work to further improve the overall system performance. This will help us develop a more robust state-of-the-art artificially intelligent LVS, an LVS which will be wholly practical in terms of its location verification performance in a wide range of future wireless networks beyond the ITS we have studied here. We believe the novel approach for enhancing the performance of real-world LVSs that we have developed here potentially forms the foundation for all future works in this important area.
Acknowledgment
The authors acknowledge support by the University of New South Wales, Australia, and Macquarie University, Australia. Ullah Ihsan acknowledges financial support from the Australian Government through its Research Training Program.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] B. Hofmann-Wellenhof, H. Lichtenegger, and J. Collins, Global positioning system: theory and practice . Springer Science & Business Media, 2012.
- 2[2] N. Bulusu, J. Heidemann, and D. Estrin, “GPS-less low-cost outdoor localization for very small devices,” IEEE Personal Communications , vol. 7, no. 5, pp. 28–34, Oct. 2000.
- 3[3] A. Jaeger, N. Bißmeyer, H. Stübing, and S. Huss, “A novel framework for efficient mobility data verification in vehicular ad-hoc networks,” International Journal of Intelligent Transportation Systems Research , vol. 10, no. 1, pp. 11–21, Jan. 2012.
- 4[4] B. Yu, C.-Z. Xu, and B. Xiao, “Detecting sybil attacks in VANE Ts,” Journal of Parallel and Distributed Computing , vol. 73, no. 6, pp. 746–756, Jun. 2013.
- 5[5] T. Leinmüller, E. Schoch, F. Kargl, and C. Maihöfer, “Influence of falsified position data on geographic ad-hoc routing,” in Proceedings of the European Workshop on Security in Ad-hoc and Sensor Networks . Springer, Dec. 2005, pp. 102–112.
- 6[6] T. Leinmüller and E. Schoch, “Greedy routing in highway scenarios: The impact of position faking nodes,” in Workshop On Intelligent Transportation , Jan. 2006.
- 7[7] P. Papadimitratos, V. Gligor, and J. Hubaux, “Securing vehicular communications - assumptions, requirements, and principles,” Workshop on Embedded Security in Cars (ESCAR) , pp. 5–14, Nov. 2006.
- 8[8] M. Raya and J. Hubaux, “Securing vehicular ad-hoc networks,” Journal of Computer Security , vol. 15, no. 1, pp. 39–68, Jul. 2007.
