Robust commuter movement inference from connected mobile devices

Baoyang Song; Hasan Poonawala; Laura Wynter; Sebastien Blandin

arXiv:1903.01045·cs.LG·March 6, 2019

Robust commuter movement inference from connected mobile devices

Baoyang Song, Hasan Poonawala, Laura Wynter, Sebastien Blandin

PDF

Open Access

TL;DR

This paper presents a robust, unsupervised approach to infer train movements and estimate commuter demand in a city-wide public transport network using noisy WiFi data from connected devices, combining clustering and classification models.

Contribution

It introduces a novel robust clustering method for train inference and a real-time commuter pattern classification model from noisy IoT data.

Findings

01

Achieved high accuracy on large-scale anonymized dataset

02

Demonstrated effective real-time demand estimation

03

Validated robustness against noisy data

Abstract

The preponderance of connected devices provides unprecedented opportunities for fine-grained monitoring of the public infrastructure. However while classical models expect high quality application-specific data streams, the promise of the Internet of Things (IoT) is that of an abundance of disparate and noisy datasets from connected devices. In this context, we consider the problem of estimation of the level of service of a city-wide public transport network. We first propose a robust unsupervised model for train movement inference from wifi traces, via the application of robust clustering methods to a one dimensional spatio-temporal setting. We then explore the extent to which the demand-supply gap can be estimated from connected devices. We propose a classification model of real-time commuter patterns, including both a batch training phase and an online learning component. We describe…

Tables4

Table 1. TABLE I: Hit rate and RMSE: metrics are averaged over the peak hours of five workdays.

Station ID	Hit rate		RMSE
	Baseline	SC	Baseline	SC
A	0.80	0.89	0.75	0.51
B	0.92	0.91	0.23	0.26
C	0.96	0.96	0.21	0.21
D	0.85	0.88	0.30	0.26
E	1.00	1.00	0.00	0.00
F	1.00	1.00	0.00	0.00

Table 2. TABLE II: Hit rate and RMSE during a train incident: metrics are averaged from 7 7 7 to 9 9 9 am at station 15 15 15 .

Hit rate		RMSE
Baseline	Spectral clustering	Baseline	Spectral clustering
0.06	0.52	7.50	3.63

Table 3. TABLE III: Feature weights: weights from logistic regression model to classify an occurrence of a train arrival as DSG or not.

Feature	Weight
Intercept	-7.3645
Count	0.00033
Missed Count	0.06824
75% Quantile WaitTime	0.00682
Std Dev WaitTime	0.00014
Headway time	0.00251

Table 4. TABLE IV: Model hierarchy: complexity and performance of different model categories.

Category	$#$ Models	Precision	Recall	Accuracy
Network	1	75	72	98
Line	1 to 10	77	72	98
Station	10 to 100	85	75	99

Equations7

t = ((t_{1}^{A}, t_{1}^{D}), \dots, (t_{N_{i}}^{A}, t_{N_{i}}^{D}))_{i \in N}

t = ((t_{1}^{A}, t_{1}^{D}), \dots, (t_{N_{i}}^{A}, t_{N_{i}}^{D}))_{i \in N}

sim_{soft} (t_{1}, t_{2})

sim_{soft} (t_{1}, t_{2})

sim_{hard} (t_{1}, t_{2})

θ_{s, t} = \frac{Y _{s, t}}{X _{s, t}} .

θ_{s, t} = \frac{Y _{s, t}}{X _{s, t}} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Mobility and Location-Based Analysis · Traffic Prediction and Management Techniques · Transportation Planning and Optimization

Full text

Robust commuter movement inference from connected mobile devices

Baoyang Song

*Ecole Polytechnique *

Palaiseau, France

[email protected]

Hasan Poonawala

IBM Research *

Singapore

[email protected]

Laura Wynter

IBM Research *

Singapore

[email protected]

Sebastien Blandin

IBM Research *

Singapore

[email protected]

Abstract

The preponderance of connected devices provides unprecedented opportunities for fine-grained monitoring of the public infrastructure. However while classical models expect high quality application-specific data streams, the promise of the Internet of Things (IoT) is that of an abundance of disparate and noisy datasets from connected devices. In this context, we consider the problem of estimation of the level of service of a city-wide public transport network. We first propose a robust unsupervised model for train movement inference from wifi traces, via the application of robust clustering methods to a one dimensional spatio-temporal setting. We then explore the extent to which the demand-supply gap can be estimated from connected devices. We propose a classification model of real-time commuter patterns, including both a batch training phase and an online learning component. We describe our deployment architecture and assess our system accuracy on a large-scale anonymized dataset comprising more than $10$ billion records.

Index Terms:

Public transport; Real-time estimation; Online learning; Unsupervised models; Classification.

I Introduction

Artificial Intelligence methods have seen tremendous progress in recent years, as illustrated by the significant progress made on curated and customized datasets and in the context of games. In practice, however, data streams are rarely curated in advance and exhibit a number of properties making it inadequate for off-the-shelf machine learning methods.

In this work, we consider the problem of fine-grained monitoring of public transport systems, via the creation of a digital twin of the system, able to faithfully capture its fundamental properties. While traditional public infrastructure monitoring has historically been based on proprietary and application-specific data streams, the democratization of connected devices is a considerable game changer, making available datasets that are orders of magnitude larger and noisier.

We focus our efforts on assessing the extent to which key performance indicators and real-time operational information needed for monitoring public transport systems can be obtained with sufficient accuracy from passive sensing via the use of appropriate machine learning techniques. One example is determining the true (as opposed to the published) train arrival timings of a public transport system in real-time. Often, there is no publicly available source for this information, yet, a reliable timetable is indispensable for the real-time monitoring of public transport service levels, and is of great value to commuters when planning their trips.

Another important metric to assess service quality of public transport is the demand-supply gap (DSG), evidenced by the inability for a passenger to board a train due to the train being already at capacity when it enters a station. These events signal an inadequacy of the supply with respect to the demand and as such are an effective way to measure public transport service quality. Historically, demand-supply gap has been evaluated from necessarily sub-sampled surveys conducted manually and available offline compared to the events they refer to.

I-A Literature review

There has been considerable work in recent years on understanding mobility patterns using data from mobile devices [14, 13, 16, 5, 18, 3, 12, 4], and using them in the context of incident management [19]. These works often make use of telecommunications data to trace movements of people. However, spatial resolution of the telecommunications data is usually quite coarse. GPS signal enables fine-grained estimation [10]. In the context of public transport, wifi traces have the advantage of having very fine spatial resolution, and do not require a proprietary sensing mechanism.

In [9], the authors design a system to estimate the number of passengers in public transport vehicles. The authors of [2] build a system on top of a Raspberry Pi to track users’ locations at a mass event using probe, association and re-association requests. In [20], the authors build a system to “sniff” wifi signals of office workers along with an online SVM-based model to predict their lengths of stay.

In [22], the authors study queue waiting times using single-point wifi and propose a Bayesian network to estimate queue length. The authors of [15] design a dwell time prediction framework for retail store environments using various sensors from smartphones including wifi signal strength and data transmission rate; their approach relies, however, on features requiring users to install dedicated software on their devices, limiting the applicability of the framework.

In an offline setting, [17] leverage probe requests to reveal past behaviors of users. Similarly, [6] employ the spatio temporal information of probe requests to reveal the underlying social relationships within a small sample of users. In [1] the authors build snapshots of users involved in a large scale event.

The problem of inferring or predicting train arrivals and hence delays of public transport services has been addressed in a number of studies, assuming access to train locations from, for example, the train signaling system. In practice, though, only the train operator has access to the actual train location data. This gap is partially addressed by [11] which derives regional train timetables using cell phone data by detecting bursts in number of cell phone subscribers. The authors report a precision of $85\%$ within $5$ minutes, but a recall rate of only $49\%$ . While their method works reasonably well for highly separated regional train lines, it would not work well on a dense urban metro system. In addition, as with train signaling data, cell phone records are not readily available.

The problem of detecting events of commuters left behind in a subway system is addressed in [23]. The authors rely on offline farecard data, and estimate a model using a maximum likelihood approach assuming known distributions of waiting times and walking times of the passengers.

I-B Contributions

Our contribution is thus to define means for using passive, universally-accessible wifi data to infer train movements and train service levels. More specifically:

•

we formulate the problem of fine-grained passive sensing and illustrate the associated challenges,

•

we propose unsupervised and supervised models for train movement inference and demand-supply gap estimation that are robust to the inherent limitations of wifi-based sensing,

•

we deploy our models within a Big Data architecture handling city-scale data volume on the order of hundreds of stations and millions of users in real-time,

•

we conduct a thorough model evaluation and present accuracy results.

The rest of the article is organized as follows: Section II formulates the problem and goes over the challenges associated with sensing from connected devices. Sections III and IV present the spectral clustering approach for train movement inference, and the demand-supply gap learning model, respectively. Section V describes our numerical results and concluding remarks are provided in Section VI.

II Problem statement

In this section we outline the specificities of the problem of real-time estimation from connected devices.

II-A Passive sensing

The ubiquity of connected devices creates the conditions for large-scale decentralized open monitoring systems. Wifi networks are probably exemplary of this paradigm, since wifi access points (AP) are omnipresent and most mobile devices support the wifi protocol.

In order to discover and automatically connect to known wifi networks, mobile stations scan periodically the wifi bands by broadcasting on all available channels their so-called probe requests [7, 9], providing information of the spatio-temporal locations of connected devices. It is important to note that even if no AP is present, probe requests are systematically sent and thus connected devices are observable.

In fact, thanks to the openness of the protocol, it is possible without proprietary technical knowledge, to collect these probe requests by using a wifi sniffer such as tcpdump or Wireshark. In addition, accessing probe requests is device-agnostic and non-intrusive.

On the other hand, because the wifi protocol stems from the need for efficient wireless communication, it does not exhibit properties conducive to accurate and effective sensing. For instance it is clear that probe requests are correlated with the device activity, and hence inherently event-based, yet the event is of no immediate relevance to spatio-temporal sensing. Additionally, for a variety of reasons, device identifiers are sometimes randomized, leading to further noise in the traces.

In the following section, we go over the main challenges associated with passive sensing for fine-grained public infrastructure monitoring.

II-B Wifi-based public transport monitoring

In this work, we assume that wifi access points are located at the platforms of a train network so that wifi connectivity occurs when a device is physically on or near a railway platform. A record consists of a device identifier, a location expressed in a specific system of coordinates, and a timestamp.

Because devices scan for access points at a frequency (typically $10-100$ Hz) much greater than the frequency of relevant public transport events (typically $0.1-0.01$ Hz), we first pre-process the raw data and identify a journey with:

[TABLE]

where $i$ denotes the device id, and $(t^{A}_{j},t^{D}_{j})$ denote the first and last times when the device is observed at location $j$ . By ignoring the events that are not the first or last event at a given location, we significantly reduce the amount of data to be processed (by several orders of magnitude) and preserve the spatio-temporal statistics of events of interest since we are considering location-aggregate quantities (where a location is typically a station). In this work we do not consider within-location spatially heterogeneous phenomena.

II-B1 Uncertain observation probability

We first note that since the sensing mechanism is passive, there is no guarantee for a commuter to be observed. This can be due to a number of causes; variable frequency of probe requests, randomization of MAC addresses, lack of reliability of sensing or positioning scheme, etc. We illustrate this property in Figure 1.

While it is clear using a continuity argument that the commuter associated with this device travels from station $10$ to station $1$ , a significant proportion of observations are missing.

II-B2 Non-stationary sensing properties

The second challenge associated with wifi-based sensing is the lack of consistency of the sensing scheme parameters. Considering for instance the proportion of commuters sampled by the wifi-based sensing scheme, one would expect a relatively stable proportion, only depending on the penetration of connected devices in the population considered.

We illustrate in Figure 2 that the proportion of commuters observed is much less regular than one might expect and is both highly time-varying and location-dependent.

II-B3 Spatio-temporal accuracy limitations

Finally we discuss the lack of accuracy of position estimates generated from connected devices traces. Figure 3 presents a set of wifi traces across a set of $3$ stations over a $10$ minutes interval.

While the blue traces are indicative of reliable traces because they qualitatively seem to correspond to distinct trace clusters suggesting distinct trains, it is clear that the red trace is non-physical, in that it reflects the movement of a device starting its trip at station $5$ around $7:16$ , and then catching up with the previous train at station $4$ around $7:20$ . This may be caused by the device not being sampled towards the end of its stay at station $5$ . Such examples illustrate the lack of accuracy in the spatial and temporal quantities provided by the individual records, and the need for robustness in the estimation algorithms considered.

II-C Big Data platform

Our application is deployed as a real-time analytics platform in a city-scale context, ingests data from on the order of hundreds of stations, processes the data and runs the machine learning components described subsequently in real-time. Given operational targets, it is important for the real-time pipeline to scale, in particular during peak hours when the traffic is an order of magnitude greater than off-peak traffic.

We illustrate in Figure 4 our real-time machine learning pipeline, architected around the IBM InfoSphere Streams processing engine able to seamlessly spawn new processing streams as required. The IBM Integration Bus handles mini-batch data records received from the on the ground sensing system, and we also use an in-memory database in order to minimize end-to-end latency.

Model training, not depicted here, follows a classical Spark architecture, in its IBM Big Insights implementation. We typically train the models using $1$ year of wifi traces, i.e. more than $10$ billion records.

In the following section we present the machine learning models.

III Spectral clustering of noisy traces

In this section we introduce the unsupervised learning model used for train movement identification. We first present a baseline model and its limitations, motivating the use of a more sophisticated model.

III-A Baseline model

The baseline method considers each station independently, performs station-wise clustering of event timestamps, and then re-identifies clusters across stations. For univariate time-series data, a well-known clustering method is DBSCAN [8]. For robustness the final step consists of pruning the events not associated with train movements (i.e. for instance corresponding to idle commuters). The procedure is iterative: suppose that clusters at all stations $j$ such that $j<i$ are successfully identified. Given a station $i$ and a cluster $l_{i}$ , if a record is seen previously in a cluster $l_{j}$ at a station $j(j<i)$ , and the timestamp difference is within a tolerable range (in our experiments within $30$ minutes), we consider that $l_{i}$ should actually be labeled $l_{j}$ . We examine each record and use majority voting to determine the final cluster label.

A limitation of the baseline method is that it can fail to distinguish distinct trains, especially at peak times when the headway is short. Consider the following example: a line with three stations $i,j$ and $k$ and a single train traveling from station $i$ to station $k$ via station $j$ . Suppose that at station $j$ no passenger having boarded at $i$ is observed, but at station $k$ , all passengers boarding at $i$ and $j$ are recognized. Given that none of the passengers observed at station $i$ is observed at station $j$ , the baseline model considers the train going through station $j$ to be distinct from the train defined by commuters seen at both station $i$ and station $k$ . This is an example of an erroneous detection, when a train is wrongly detected as distinct from the existing train set.

The opposite type of error is illustrated in Figure 3, and is associated with the fact that two traces are considered as belonging to the same cluster when they do not in practice.

III-B Spatio-temporal embedding

In order to address the limitations of the baseline model presented in the previous section, we propose an approach defined by a global view of reconstructed train trips. We first define a line-level similarity metric for any two journeys.

Definition 1 (Soft and hard embeddings).

For $\bm{t}_{1},\bm{t}_{2}$ two points as in (1), we define

[TABLE]

In (2), the $L^{0}$ term quantifies the spatial similarity, i.e. the number of stations where both journeys are recorded, the infinite norm term $L^{\infty}$ quantifies the temporal similarity, i.e. the maximum time difference at stations where both journeys are recorded. Note that Definition 1 is robust to mis-identified journeys, that is, records belonging to two different trains but identified as a single journey. Figure 3 shows such an example (the journey in red). According to Definition 1, the blue journeys are dissimilar to the red journey due to the infinite norm term in (2). The red journey thus has very low similarity with the blue journeys, and the two trains to which the two blue journeys belong are unlikely to be wrongly assigned to a single cluster due the red journey.

III-C Robust spectral clustering

Spectral clustering considers the entire journey of a traveler, hence in the context of our application, can contribute to addressing issues stemming from the local nature of the baseline model. For more details on spectral clustering, we refer the interested reader to the tutorial of [21].

One of the difficulties with spectral clustering lies in choosing a-priori the number of clusters $k$ . However, the similarity graph associated with the embedding from Definition 1 is usually very sparse, therefore, we use the eigengap heuristic [21], which consists of finding the largest gap between the eigenvalues of the Laplacian matrix, to determine a-priori the optimal value of $k$ . Figure 5 shows the result of spectral clustering with soft similarity metric for our toy problem, and using the eigengap heuristic to define the number of clusters.

We observe that most of the model errors are associated with distinct clusters being wrongly associated. The converse issue of clusters being fragmented is much less common because from the spectral clustering output and centrality estimates on travel-times, cluster fragments are easily reconciled, hence do not require complex special treatment. In order to address the issue of clusters being wrongly associated, for example the cluster at 7:00 at station $13$ and the cluster at 7:00 at station $10$ being wrongly connected, we post-process the clustering results to eliminate wrong cluster associations, and specifically the following two types of outliers:

•

data points whose label is different from their neighbors label : we remove these by thresholding labels of a k-NN model. This approach is not only robust to mis-classified journeys, but also to having too many clusters k.

•

data points who are not similar to their neighbors: we remove these by thresholding the similarity metric for each cluster.

The resulting clusters with outlier detection are shown in Figure 6.

As we can see, most wrongly associated clusters are detected and removed.

In the following section we present a model using the spectral clustering output in order to provide estimates of the demand-supply gap.

IV Inference model for demand-supply gap estimation

In this section we first present the intermediary processes used to estimate relevant features, and then describe the core model for demand-supply gap estimation.

IV-A Preliminaries

The count of passengers waiting to board a train at any given point in time is an informative albeit indirect measure of the unmet demand. However no sensor provides a complete measurement of that quantity. In particular CCTV provides observations on portions of the platform and is notoriously difficult to use for measuring accurately the demand-supply gap. Ticketing data provides only the entry counts, reflecting the demand rather than the demand-supply gap.

In order to estimate the count of passengers waiting to board a train, given that noisy observations of the number of connected devices are available, we propose to learn scaling factors relating the number of connected devices observations to the number of commuters. As illustrated in Figure 2, this number is both time-varying and station-dependent.

Assuming that the scaling factor is uniform within each station considered. Let $Y$ be the count of passengers entering the station as observed by the fare gate sensors, with $X$ the count of passengers entering the station as observed from connected devices, for a station $s$ , and time $t$ , the scaling factor $\theta_{s,t}$ reads:

[TABLE]

We update this estimate online using an auto-regressive approach. The offline calibration process is illustrated in Fig. 7.

Using the learned scaling factor, the count of passengers waiting to board a train can be derived from the number of observations from connected devices at any point in time, and in particular before and after a train arrives. More complex online or noise-modeling methods could be considered to improve the accuracy of the scaling factor.

Another relevant feature is the commuter wait time, which can be directly estimated by considering robust statistics of the time difference between the last observed record and the first observed record from connected devices who are observed to be traveling. In the next section we explain how we use these features in order to estimate the demand-supply gap.

IV-B Demand-supply gap (DSG) estimation

We propose using a discriminative classification method in order to estimate the demand-supply gap based on observations from connected devices. We consider the following feature set:

•

count of commuters waiting to board a train,

•

count of commuters “missing the train”, i.e. observed continuing to wait for the next train once a train departs,

•

waiting time third quartile,

•

waiting time standard deviation,

•

train headway obtained from the robust spectral clustering method.

We highlight that we are trying to estimate the macroscopic demand-supply gap, and not whether specific commuters will be left behind. We use greedy forward feature selection to select the most relevant features for model building. Since the datasets are highly skewed, with the vast majority of samples reflecting no DSG occurrences, we invoke a bootstrapping procedure to obtain an unbiased classification result.

Given the low number of DSG events, it is unrealistic to rely solely on station-specific models for accurate estimation. On the other hand, given the lack of stationarity of the underlying processes across stations and times, we cannot readily train models across the entire dataset. We normalize the features across the entire dataset, and then build a hierarchical model.

We train the models in a top-down fashion starting from a network-wide model for all stations on the network to line specific models, with different models for each unique line on the network, and finally to fine-grained models for each unique station on the network.

In the following section we present numerical results for both the movement model and the demand-supply gap model.

V Numerical results

V-A Train movement estimation

V-A1 Performance metric

To evaluate the performance of the train arrival detection method, we compare the estimated train arrivals derived by the baseline and the spectral clustering method to ground-truth arrival times at six stations. We use the following accuracy metrics:

•

hit rate: the proportion of arrival times within $1$ minute of the ground-truth,

•

root mean squared error (RMSE) of arrival time, expressed in minutes.

We first evaluate the methods under typical conditions, and then during an incident, during which train movements differ from their regular schedule.

V-A2 Typical conditions

We analyze the results during typical conditions during the most challenging peak travel times, with short train headways, over the five workdays of a given week. Table I presents the hit rate and RMSE.

Best performance (excluding ties) is marked in bold. Both methods perform well under these circumstances, though spectral clustering moderately outperforms the baseline method.

V-A3 Incident scenario

We further evaluate the two methods during a two-hour train incident. Specifically, we consider a real incident during which a train disruption occurs from around 7:15 to 8:15, illustrated in Figure 8. The traffic is interrupted at station $15$ for half an hour and then partially resumes but remains perturbed until $9$ am.

Figure 8 depicts commuter traces during an incident. The data being extremely noisy, journeys can easily be mis-identified, and clusters mis-detected. We analyze the performance of both the spectral clustering model and the baseline model over this time period.

Table II shows the hit rate and RMSE at Station $15$ (chosen because ground truth at other stations is unavailable).

In this case of complex movements with overlapping clusters, the spectral clustering method significantly outperforms the baseline approach, by a factor $10$ in terms of hit rate, and by a factor $2$ for the RMSE metric.

V-B Demand-supply gap model evaluation

V-B1 Performance metric

We consider a performance metric based on the DSG value over $30$ minute time intervals, as this is how it is measured by transport operators. With the objective of making results easier to interpret, we express the DSG as the percentage of commuters left behind, i.e. unable to board the first train they waited for, over the set of commuters who intended to board during the $30$ minute interval.

We first present the model parameters obtained, then results of the detection of occurrences of DSG, and finally we provide more details on the accuracy of the DSG estimates and its robustness properties.

V-B2 Model training

We make use of 100 K ground-truth DSG event labels (positive and negative instances) collected over a period of $8$ months at $60$ stations. The dataset is highly skewed with a majority of non-DSG occurrences. The associated raw wifi traces dataset consists of $1$ GB of raw wifi traces per day, or about $10$ billion records over the period considered. The dataset is split into $75\%$ training and $25\%$ testing.

We perform $10$ -fold cross validation. Outlier Winsorization is performed with a percentile of $0.99$ . We build a GLM model with logistic regression in R. The F1 score is used as the metric in training. A grid search reveals a probability cutoff threshold of $0.1$ on the cross validation set. The test set yields a precision of $84\%$ and a recall of $75\%$ . Other models examined including SVM, Random forests and LogitBoost did not improve accuracy.

In Table III we present the feature weights obtained for a logistic regression model focused on detecting the severity of the DSG measured as the average number of trains an impacted passenger had to miss.

It is clear that the results are most sensitive to the feature indicating the number of commuters still waiting to board after the train departs, as this feature is a noisy aggregate observation of the DSG. We observe that other features such as wait time statistics have a reduced albeit significant impact. This is explained by the inherent noise in the count estimates while time-based estimates are less subject to non-stationary sampling rate, hence more stable by construction.

V-B3 Model testing

We now analyze the benefit of the hierarchical model structure. In Table IV we present the Precision, Recall, and Accuracy of detecting DSG with specific models.

Observe that the use of specific models moderately improves model accuracy, in particular in terms of reducing the number of false alarms. On the other hand, it significantly increases (by 2 orders of magnitude) the complexity associated with model maintenance, i.e. automated training, deployment, and monitoring. As these algorithms form the basis of a large-scale, real-time system, the importance of model maintenance is not to be neglected.

Overall accuracy results are presented in Figure 9 and Figure 10 in terms of the mean absolute error of DSG values. We further split the DSG into values whereby left behind passengers only miss $1$ train (DSG $1$ ), or miss $2$ or more trains (DSG $2+$ ). The error statistics are reported at a set of $7$ stations where a DSG is observed.

The inherent variability due to non-stationarity of the overall dataset is visible from the results. The median error remains relatively low for most stations.

V-B4 Model robustness

Given the inherent noise of the data, in particular as regards the variable proportion of commuters observed, we further analyze the robustness of the model estimates to that proportion:

•

we train the model on the full dataset, i.e. a sampling factor of $100\%$ . Recall that the true sampling factor, with respect to the number of commuters, is unknown, time-varying and location-dependent.

•

we test the model with subsets of the data obtained by down-sampling the full dataset. We down-sample by increments of $10\%$ over the interval $[0,100]$ .

Figure 11 presents the average Precision and Recall as a function of the down-sampling value.

We observe that the model is able to maintain good performance when the sampling factor is above $60\%$ .

VI Conclusion

In this work we considered the problem of large-scale inference of public transport level of service based on connected devices data. We investigated the specific properties of such datasets, shared in particular with IoT data. We highlighted the inherent noise of the data, not only in terms of spatio-temporal accuracy of the data, but also in terms of the non-stationarity of the underlying sensing scheme such as its sampling factor.

We introduced unsupervised learning methods appropriate for estimating global conditions of the public transport network such as train headways and train line level of service. We further developed robust classification methods using estimated train movements as a building block in order to estimate the demand-supply gap in real-time and in a tractable manner. We emphasize that the demand-supply gap is a very general quantity of relevance outside of public transport.

Numerous extensions can be considered. First it is clear that more complex approaches can be developed for the training phase. In order to compensate the impact of data uncertainty, one could for instance jointly consider all network events as informative of the conditions at any specific station. Second, it is clear that a key aspect of the deployment of such systems is the ability to monitor model stability in real-time and re-train if needed. Lastly, in order to support more diverse crowd monitoring applications, it would be of interest to develop similar inference processes in two and three-dimensional space.

Bibliography23

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Marco V. Barbera, Alessandro Epasto, Alessandro Mei, Vasile C. Perta, and Julinda Stefa. Signals from the crowd: Uncovering social relationships through smartphone probes. In Proceedings of the 2013 Conference on Internet Measurement Conference , IMC ’13, pages 265–276, New York, NY, USA, 2013. ACM.
2[2] B. Bonné, A. Barzan, P. Quax, and W. Lamotte. Wifipi: Involuntary tracking of visitors at mass events. In World of Wireless, Mobile and Multimedia Networks (Wo W Mo M), 2013 IEEE 14 th International Symposium and Workshops on a , pages 1–6, June 2013.
3[3] Francesco Calabrese, Laura Ferrari, and Vincent D Blondel. Urban sensing using mobile phone network data: a survey of research. Acm computing surveys (csur) , 47(2):25, 2015.
4[4] Julián Candia, Marta C González, Pu Wang, Timothy Schoenharl, Greg Madey, and Albert-László Barabási. Uncovering individual and collective human dynamics from mobile phone records. Journal of physics A: mathematical and theoretical , 41(22):224015, 2008.
5[5] Adam Caspari, Brian Levine, Jeffrey Hanft, Principal Transportation Planner, Rail Network Planning, and Alla Reddy. Real-time estimation of platform crowding for new york city subway: 1 case study at wall st 2/3 station in financial district 2. situations , 60:61, 2017.
6[6] N. Cheng, P. Mohapatra, M. Cunche, M. A. Kaafar, R. Boreli, and S. Krishnamurthy. Inferring user relationship from hidden information in wlans. In MILCOM 2012 - 2012 IEEE Military Communications Conference , pages 1–6, October 2012.
7[7] Mathieu Cunche. Smartphone, Wi-Fi et vie privée : comment votre smartphone peut se révéler être votre pire ennemi, October 2013. 1.
8[8] Martin Ester, Hans peter Kriegel, Jörg Sander, and Xiaowei Xu. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Knowledge Discovery and Data Mining , pages 226–231, 1996.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Robust commuter movement inference from connected mobile devices

Abstract

Index Terms:

I Introduction

I-A Literature review

I-B Contributions

II Problem statement

II-A Passive sensing

II-B Wifi-based public transport monitoring

II-B1 Uncertain observation probability

II-B2 Non-stationary sensing properties

II-B3 Spatio-temporal accuracy limitations

II-C Big Data platform

III Spectral clustering of noisy traces

III-A Baseline model

III-B Spatio-temporal embedding

Definition 1** (Soft and hard embeddings).**

III-C Robust spectral clustering

IV Inference model for demand-supply gap estimation

IV-A Preliminaries

IV-B Demand-supply gap (DSG) estimation

V Numerical results

V-A Train movement estimation

V-A1 Performance metric

V-A2 Typical conditions

V-A3 Incident scenario

V-B Demand-supply gap model evaluation

V-B1 Performance metric

V-B2 Model training

V-B3 Model testing

V-B4 Model robustness

VI Conclusion

Definition 1 (Soft and hard embeddings).