Occupancy Counting with Burst and Intermittent Signals in Smart Buildings
Bekir Sait Ciftler, Sener Dikmese, Ismail Guvenc, Kemal Akkaya,, Abdullah Kadri

TL;DR
This paper proposes a passive WiFi-based occupancy counting method in smart buildings using probe requests and a multi-model Kalman filter, achieving accurate zone-level occupancy estimates without extensive sensor deployment.
Contribution
It introduces a novel approach combining WiFi probe request analysis with an interacting multi-model Kalman filter for improved occupancy counting accuracy.
Findings
WiFi probe requests can effectively estimate occupancy in smart buildings.
The multi-model Kalman filter enhances localization accuracy.
Numerical results demonstrate viability in real building environments.
Abstract
Zone-level occupancy counting is a critical technology for smart buildings and can be used for several applications such as building energy management, surveillance, and public safety. Existing occupancy counting techniques typically require installation of large number of occupancy monitoring sensors inside a building as well as an established network. In this study, in order to achieve occupancy counting, we consider the use of WiFi probe requests that are continuously transmitted from WiFi enabled smart devices. To this end, WiFi Pineapple equipment are used for passively capturing ambient probe requests from WiFi devices such as smart phones and tablets, where no connectivity to a WiFi network is required. This information is then used to localize users within coarsely defined occupancy zones, and subsequently to obtain occupancy count within each zone at different time scales. An…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 6
Figure 6
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20
Figure 21
Figure 22
Figure 23
Figure 24
Figure 25
Figure 26
Figure 27
Figure 28
Figure 29
Figure 30
Figure 31| No Bayesian Tracking | Bayesian Tracking | |
|---|---|---|
| Heuristic-1 | Model-1 () | |
| Heuristic-2 | Model-2 () | |
| Heuristic-3 | Model-3 () | |
| NLS | Model-3 () |
| Parameter | Value |
|---|---|
| WiFi protocol | IEEE 802.11n |
| Floor width and length | m x m |
| Sampling window length (T) | seconds |
| Sampling window hold length (L) | to |
| Occupancy grace period | minutes |
| Number of the reference nodes | |
| Path loss exponent () | |
| Reference distance () | meter |
| Signal strength at reference distance () | dBm |
| Noise floor | dBm |
| Detection threshold | dBm |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndoor and Outdoor Localization Technologies · Context-Aware Activity Recognition Systems · Human Mobility and Location-Based Analysis
Occupancy Counting with Burst and Intermittent Signals in Smart Buildings††thanks: This work was made possible by the National Science Foundation Grant AST–1443999. The statements made herein are solely the responsibility of the authors. ††thanks: Bekir S. Ciftler and Kemal Akkaya are with the Department of Electrical and Computer Engineering, Florida International University, Miami, FL, USA (e-mail: bcift001; kakkaya@fiu.edu). ††thanks: Sener Dikmese and Ismail Guvenc are with the Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, NC, USA (e-mail: sdikmes; iguvenc@ncsu.edu).††thanks: Abdullah Kadri is with Qatar Mobility Innovation Center, Qatar University, Doha, Qatar (e-mail: [email protected]).
Bekir Sait Çiftler, , Sener Dikmese, , İsmail Güvenç, , Kemal Akkaya, , and Abdullah Kadri
Abstract
Zone-level occupancy counting is a critical technology for smart buildings and can be used for several applications such as building energy management, surveillance, and public safety. Existing occupancy counting techniques typically require installation of large number of occupancy monitoring sensors inside a building as well as an established network. In this study, in order to achieve occupancy counting, we consider the use of WiFi probe requests that are continuously transmitted from WiFi enabled smart devices. To this end, WiFi Pineapple equipment are used for passively capturing ambient probe requests from WiFi devices such as smart phones and tablets, where no connectivity to a WiFi network is required. This information is then used to localize users within coarsely defined occupancy zones, and subsequently to obtain occupancy count within each zone at different time scales. An interacting multi-model Kalman filter technique is developed to improve occupancy counting accuracy. Our numerical results using WiFi data collected at a university building show that utilization of WiFi probe requests can be considered a viable solution tool for zone-level occupancy counting in smart buildings.
Index Terms:
Indoor localization, occupancy counting, probe request, smart building, smart city, tracking, WiFi.
I Introduction
Smart cities of the future are expected to provide better use of public resources, increase quality of service offered to citizens, and reduce operational costs of public administrators [1]. Internet of Things (IoT) technology is a key enabler for smart cities, and it can support a plethora of services, ranging from building health inspection, to waste management, noise/air quality monitoring, traffic congestion control, city energy consumption reduction, smart parking/lighting, and automation of smart buildings [2, 3, 4]. Realizing the vision of smart cities necessitate effective use of IoT technologies for proximity detection, localization, tracking of objects and humans, and occupancy monitoring [5, 6].
Smart buildings constitute a key component of smart cities, which will benefit extensively from the use of IoT technologies for health monitoring, energy management, public safety, and surveillance, [7]. In particular, buildings are among the largest consumers of electricity in the United States: they account for of primary energy consumption and of electricity consumption[1]. An important portion of the electricity consumption of buildings is used for heating, ventilation, and air conditioning (HVAC). To this end, low cost, seamless, and accurate occupancy counting can help in achieving significant energy savings in smart buildings, such as by dynamically scheduling HVAC activity based on real-time building occupancy levels at different areas [8].
Occupancy counting in smart buildings can be implemented via video processing and camera systems or deployment of occupancy sensors throughout the building [9]. These options require installation of new equipment that are often costly to deploy. An alternative way is to use ambient wireless signals of opportunity that uniquely match to building occupants. While other technologies such as sensors, cameras, and RFID provide occupancy tracking at certain building zones, WiFi technology is already available in most buildings, and can provide good occupancy monitoring coverage. Even though several positioning solutions exist based on available WiFi infrastructure, they typically require a connection between user equipment and WiFi access points (APs) [9][10].
In this paper, we consider the use passive sniffing of WiFi probe requests for occupancy monitoring and tracking in smart buildings as illustrated in Fig. 1. Probe requests are signals that are continuously broadcast from devices with WiFi technology, such as smartphones, laptops, and tablets[11, 12, 13, 14, 15]. The probe requests are not encrypted, and can be captured and decoded with the help of passive sniffers as shown in Fig. 1(a), without connecting to a particular network. Probe requests are also burst in nature, since they are broadcasted in the air in search of WiFi networks to get connected, to get a list of available networks, or to handover between WiFi APs. Frequent transmission of probe requests from mobile devices introduces an opportunity to track the occupancy count in different zones of a building by simply monitoring the probe requests (see Fig. 1(b)). In particular, we can capture the received signal strength (RSS) of probe requests using sniffers such as WiFi Pineapple (WiFi-PA) [16], which can then be used for occupancy monitoring inside the building.
To the best knowledge of the authors, there are no detailed studies in the literature that report efficiency of occupancy counting using WiFi probe requests using varying number of reference nodes to track individual devices. In this paper, we use WiFi probe requests captured at various reference locations for occupancy monitoring in smart buildings as summarized in Fig. 1. To this end, seven WiFi-PAs are deployed at various locations within the FIU Engineering Center, and probe requests are collected over multiple days. The burst and intermittent nature of probe requests require post processing of the data to make it ready for the localization and tracking. Subsequently, at every sampling window, various localization techniques are used to obtain location estimates of each WiFi device, which are further refined using an Interacting Multiple Model (IMM) filter for tracking user location. Position estimates are then aggregated into one of the eight occupancy zones inside the building for real-time occupancy counting.
The main novel contributions of this paper can be summarized as follows
- •
We propose an adaptive tracking algorithm, which enables to study occupancy count with burst and intermittent measurements, as well as the varying number of positioning reference nodes (sniffers) during location tracking. To authors’ best knowledge, there is no similar study considering both conditions in the literature.
- •
We develop estimators and heuristic methods to localize and track the target node even with measurements from a single reference node in the worst case. In our experiments, we observe that of the time, probe requests are detected by a single reference node whereas of them are received by two reference nodes. Thus, including them in tracking and occupancy counting is crucial for robustly estimating occupancy count in building zones.
- •
We implement simulations with realistic WiFi channel models to show the occupancy detection performance of our proposed framework. We also show the accuracy of our approach using real world data, by carrying out an experiment with WiFi-PAs as the probe request sniffers for occupancy tracking for an indoor university campus environment. Proposed method achieves up to performance in zone-level tracking. Our simulation results agree with experimental results, which show close performance.
This paper is organized as follows. In Section II, a brief overview of the existing literature related to the tracking techniques is presented. Models for the tracking algorithm are explained in Section III. Localization and initialization for user tracking using probe request data are presented in Section IV, while tracking with the IMM filters and zone-level occupancy counting are provided in Section V. Subsequently, gathering of probe requests is explained in Section VI in detail. Simulation and experimental results are presented in Section VII, and finally, concluding remarks and future prospects are summarized in Section VIII.
II Literature Review
Use of probe requests have recently received interest from researchers for various applications. For example, they are used for load balancing in wireless networks to find hidden and mobile nodes in [17], and to analyze the handover processes of 802.11 network in [18]. A WiFi flood attack detection system, which is a method based on the probe request and probe response timeouts, is proposed in [19]. Privacy issues with probe requests are evaluated in [20], where authors utilize probe requests to link the devices by creating a table for the requested WiFi network names and comparing them with the other devices.
Although RSS-based tracking with Kalman filters (KF) is a well-established area, it is still an interesting research topic due to its various new applications for accurate localization [21, 22]. In [23] two-slope RSS model is used with two Extended Kalman Filters (EKF) considering an IMM framework to improve tracking accuracy. In [24], the authors present a novel exponential-Rayleigh RSS model for device-free localization and tracking with KF, where the main contribution is to include multipath components in RSS model for improving localization and tracking accuracy. The accuracy is shown to increase significantly compared to the standard RSS model. In [25], the RSS variance problem in tracking due to the hardware differences, device placement, and environmental changes are studied, and a particle filter based solution is proposed. The results show that the tracking accuracy is more robust against the RSS variance when the accelerometer and digital compass readings are included in system. In [26], a novel approach based on compressive sensing is used for the localization and tracking of WiFi devices. In particular, coarse estimates of RSS fingerprinting are improved by the compressive sensing techniques, and KF with map information is used to track the devices.
In the papers mentioned above, a constant number of reference nodes are always assumed to continuously localize and track the users. However, in practice, reference nodes may fail to detect transmitted signal or may not be available at all times. Even though there are several studies suggesting increased localization and tracking accuracy by introducing secondary reference nodes based on their location estimates in cooperative networks [27], or self-adaptive localization techniques with a similar approach [28], they have not investigated localization with the limited and dynamically changing number of the reference nodes.
III System Model
In this paper, we consider an occupancy counting scenario as seen in Fig. 2. There are reference nodes placed through the tracking zone. The known location of th reference node is denoted with , and a target node has an unknown position . Burst and intermittent probe requests are broadcasted from the mobile target node as shown in Fig. 3. We consider a predefined sampling window, where data from all the burst probe requests within the window are aggregated. In particular, the th sampling window of duration spans the interval between time instances between and . Eventually, the true location of the target node in the sampling window is represented with .
During tracking of the mobile target node, some reference nodes may not detect transmitted probe requests by the target node due to being far away from the target, poor link quality, and collision of probe request packets, as illustrated in Fig. 2. In such cases, we need to be able to work with RSS from the rest of the reference nodes, which are called available measurements in sampling window , and denoted by a set , where represents RSS value at reference node at sampling window .
We refer as a representative RSS value for window at node , since there might be multiple probe requests observed from a certain target node within the same sampling window as seen in Fig. 3. Then, we have to extract the representative RSS value from these burst of observations. Even though the RSS values are usually close or equal to each other for spatio-temporally close probe requests, there might be outliers due to the varying channel conditions. Using the mean of the RSS values would therefore bias our results towards outliers; therefore, median value of the RSS values within the sampling window is used as a representative RSS for that sampling window as follows
[TABLE]
where the th reference node is considered to receive different probe requests within the sampling window . Let the RSS measurement for the probe request with the median RSS value be defined as
[TABLE]
where is the signal strength at the reference distance , is the path loss exponent (PLE), is the Euclidian distance between the th reference node and the target device at sampling window ,
and is the noise modeled by additive white Gaussian noise (AWGN) with a variance of and mean of zero.
Once the representative RSS measurements are obtained for each reference node within the th sampling window, the number of the available representative RSS measurements from the different reference nodes at sampling window is given by , which is the cardinality of set . Due to the intermittent nature of probe requests that may result in unavailable measurements within subsequent measurement windows, a sample-and-hold approach is used to hold RSS values for a certain number of sampling windows at different reference nodes. Otherwise, the useful ranging information from the difference nodes would be lost. In Fig. 3, an example sampling windows is given with , where is the holding length.
Assume that the last probe request detected by the th reference node is within the sampling window , and the th reference node did not receive any probe requests within future sampling window(s). Then, the RSS value of the th reference node for the samples after the sampling window using a holding length can be written as
[TABLE]
for , given . If there is still no available signal received at the th reference node after the sample-and-hold window, the RSS measurement will be removed from the set of the available RSS values which represented with purple color in Fig. 3.
Given this system model and framework, our main goal in this paper is to determine zone occupancy based on tracking position of WiFi devices with varying number of the available reference nodes and intermittent transmission of the probe requests. In the following sections, we explain the localization of the individual target nodes, IMM-based tracking with varying conditions, and finally zone-level mapping and occupancy counting of WiFi devices.
IV Localization and Initialization Techniques
In this section, RSS-based localization of the target node with varying number of reference nodes and different levels of a priori information will be presented. The technique used for localization changes with the number of available reference nodes and the unknown parameters is shown in Table I. Since the unknown parameters are revealed in time, we will use them in the following sampling windows as prior information. The novelty of our system lies in the use of any number of reference nodes without discarding any single probe request.
Below, we will first consider heuristics and algorithms for estimating target location when we do not know the prior location information of the target (e.g., the target enters a building and no prior measurements available). Subsequently, in Section V we will investigate how we can improve the localization accuracy when the prior trajectory of the target is available.
IV-A RSS Available at Three or Less Reference Nodes
First, consider the case that the RSS measurement in the sampling window is available at only a single reference node (i.e. when ), and and are unknown. Since and can only be estimated when (see Section IV-B), we consider that (as in [29]) and are obtained by averaging over (potentially limited number of) ground truth measurements where the location of the target is known.
Then, the target device is assumed to be at the vicinity of the available reference node, and we may estimate the location of the target device to be
[TABLE]
where is the set of possible locations in the coverage of th reference node, considering a circular coverage area for all reference nodes. In other words, (4) implies that the location is estimated to be in the median location of possible positions sampled densely within the coverage area of the reference node. On the other hand, as shown in Fig. 4(a), coverage area of several reference nodes may overlap, and the location of the target node is assumed not to be within the coverage area of the reference nodes that do not have the RSS measurement from the target node. Even though the location estimate in (4) is relatively coarse and based solely on the coverage area of the reference nodes, it is acceptable for the zone-level estimation, and we can still utilize the probe request measurements even if they are available only at a single reference node. It also allows to initialize the IMM tracking algorithm in Section V at a better location for quicker and more accurate estimation, and to avoid local optimums.
Second, consider that the RSS information from the target node at sampling window is available only at two reference nodes () and no prior information is available on the location of the target node. Then, the location estimate for the sampling window can be estimated as follows
[TABLE]
where represents the distance between the th and th reference nodes, and the estimated distance between the th reference node and the target node () based on the RSS measurements is given as
[TABLE]
In other words, as shown in Fig. 4(c), the target node is assumed to be on the straight line that passes through the locations of the two reference nodes, and positioned on the line depending on the RSS measurements at each node.
Third, consider that at sampling windows , RSS measurements are available at three or more reference nodes (). Then, we consider a linear least squares (LLS) solution as a low computational complexity method [30, 31] that has acceptable localization performance as shown in Fig. 4(e). The location of the target device can be obtained by solving the LLS problem in the form with
[TABLE]
where and are defined with the known position of the reference nodes () and ranging information for reference nodes using (6). The solution to the LLS problem can then be found by solving for , as follows
[TABLE]
IV-B RSS Available at Four or More Reference Nodes
Consider now that the RSS information is available at least at four reference nodes, i.e., . Then, adapting [32], a nonlinear least squares algorithm is used to solve the localization problem with unknown parameters. The optimization problem is based on the relaxation of a maximum likelihood estimator with Taylor expansions of RSS defined in (2), around assumed values of unknown parameters[32]. The general optimization problem is defined as a special type of quadratic problem, general trust region problem, given as follows
[TABLE]
where is the translation matrix and is the vector for unknown parameters and their derivations for case , and holds the measurements regarding the unknown parameters. The constraint defines the feasible set for solution of optimization, where and are the selection diagonal and vector, respectively. Each of these vectors and matrices are defined and given explicitly for three different availability level of a priori information below (i.e., estimated or unknown and ).
Unknown ()
In the case of unknown and with knowledge of , at least three reference nodes are required to estimate . If the number of the reference nodes are less than three, approaches introduced in Section IV-A are used. The optimization problem given in (10) for this case is written as
[TABLE]
where and , and solution for minimization problem is defined as \boldsymbol{y}_{1}\triangleq\big{[}\|\boldsymbol{x}^{\prime}_{k}\|^{2}~{}\boldsymbol{x}^{\prime T}_{k}~{}\alpha\big{]}.
Unknown ()
In the case of unknown and with knowledge of , optimization matrices and vectors become
[TABLE]
where and is the tuning parameter of the algorithm for . The final solution of the optimization is defined as \boldsymbol{y}_{2}\triangleq\big{[}\|\boldsymbol{x}^{\prime}_{k}\|~{}\boldsymbol{x}^{\prime T}_{k}~{}\delta\big{]} where shows the suitability of the tuning parameter.
Unknown and ()
In the worst case, which there is no prior information on the parameters, we have to estimate both the location of the target nodes and channel parameters with below matrices and vectors
[TABLE]
where and is the tuning parameter for . Solution for the corresponding NLS to a general trust region problem is defined as y_{3}\triangleq\big{[}\|\boldsymbol{x}^{\prime}_{k}\|^{2}~{}\boldsymbol{x}^{\prime T}_{k}~{}\gamma~{}\gamma\delta\big{]}^{T} where is a measure on tuning parameter suitability. The IMM based tracking takes the localization estimates as an input, which is explained in the following section in details.
V Interacting Multiple Model Tracking and Occupancy Counting
In this section, we study Bayesian tracking of the individual WiFi devices by exploiting prior location information of the mobile device. Probe requests are intermittent, which means that the target device may not send any probe signals over a duration of many sampling windows. Even when the probe requests are transmitted by the mobile device, they may not always be captured at the reference node. Therefore, using a standard Kalman filtering technique (see e.g. [33, 34]) that assumes uniform set of system parameters will not work effectively. Considering varying system properties such as the number of the available measurements and prior information related to the device, we consider an IMM framework that employs multiple, interacting Kalman filters in this study. The overall model [35] is represented with a linear system as follows
[TABLE]
where is the state of the system during sampling window , is the transition matrix for the model order , where the model order depends on the number of available reference nodes (see Table I). which is in effect during the sampling windows , and is the process noise which is Gaussian distributed with zero mean and covariance for the th model. The measurement on the sampling window is denoted by , where is the th measurement model matrix, and is the measurement noise on the step for th model with Gaussian distribution of mean zero and covariance . As explained earlier, each model in the system has its own Kalman filter parameters, and the resulting estimate is calculated with the selection of the correct model based on the status of the system such as and prior information of channel parameters.
A standard Wiener process velocity model is used for all of the models, with varying measurement noise and step sizes based on sampling window length . Defining matrices of the models are given as
[TABLE]
V-A Review of a Generic Kalman Filter
Kalman filtering, in general, has two steps: 1) prediction and 2) update. In the prediction step, next state of the system is predicted with the given previous measurements, and in the update step, the current state of the system is estimated with the given measurement at that sampling window [35]. In below, first the general procedure for the prediction and update stages are mathematically summarized (model index will be dropped for brevity), and we will subsequently discuss how Kalman filters with multiple models can be utilized for our scenario.
Prediction
[TABLE]
where and are the predicted mean vector and the predicted covariance matrix of the state in sampling window , respectively, while and are the estimated mean vector and the estimated covariance matrix of the state in sampling window . Mean and covariance of states consist of position and velocity information of the target device.
Update
[TABLE]
where and are the measurement residual and measurement prediction covariance on state for sampling window , respectively. Kalman gain is the tuning parameter, which adjusts prediction’s correction for sampling window . Estimated mean and covariance of the state at sampling window with adding the measurements are represented with and , respectively.
V-B IMM Based Kalman Tracking of Target Location
Section V-A summarized the model for a generic Kalman filter; in this section, we consider a multi-modal Kalman filter, whose parameters change with respect to the model order, and the models are selected based on the number of available RSS measurements at different reference nodes. These models also vary by their measurement error since the number of measurements affects the localization accuracy. Each filter receives a raw location estimate as an input (relying solely on measurements), and outputs a filtered location output using a technique that varies by the model.
First, consider that the RSS of the probe request from the target mobile device is available only at a single reference node (i.e. ). Then, since we have access to prior location estimate, a better location estimate than that was introduced in (4) and Fig. 4(a) can be obtained, as summarized in Fig. 4(b) as Model-1 (). The target node is assumed to be on the straight line between the predicted position for sampling window (represented by ), and the position of the available reference node (), utilizing ranging information from the reference node as
[TABLE]
where is as in (6). In other words, position of target node is estimated as far from available reference node (from measurements), towards predicted position by IMM. In particular, the predicted position based on previous location estimate and trajectory are used for reducing the error of the estimation in Fig. 4(a) which relied only on a single reference node measurement.
When (probe RSS measurement available at two reference nodes), a similar approach as illustrated in Fig. 4(d) called as Model-2 is followed. In particular, we consider the predicted position of the target node () in (14) for compensating bias with estimated location in (5) as
[TABLE]
Simply, the position of the target node is first estimated as in (5) using the measurements from the two reference nodes, and then the bias of the estimation is adjusted with the predicted position by taking average of the estimate and the prediction.
Finally, in Model-3 (i.e. when ), LLS or NLS estimator is used based on the number of reference nodes and unknown parameters. If either or is unavailable, we use NLS to estimate the location as well as that parameter, as described in Section IV-B. If there are no unknown parameters (i.e. all the parameters are estimated), LLS is used to estimate the location due to its lower computational complexity.
V-C Interaction of Multiple Models
IMM consists of three major steps: interaction (mixing), filtering and combination. In each sampling window , the initial conditions are obtained for certain model, from the measurement and previous sampling window with the knowledge of , hence right model is selected as given in Table I. Then standard Kalman filtering is performed for each model, and after a weighted combination of updated state estimates produced by all the filters yielding a final estimate for the state () and covariance of the state () in that particular sampling window . The weights are chosen according to the probabilities of the models, which is binary and deterministic in our case since is known. Thus, and are updated for IMM filter as follows
[TABLE]
In each model, the measurement noise covariance matrix varies with the number of measurements as follows
[TABLE]
where represents the variance of the error in the location estimates. Using the covariance matrix as in (24) for different models ensures that the prediction is weighted more when there are limited number of available reference nodes, while measurements are weighted more when there are at least three or more available reference nodes.
V-D Occupancy Counting
Our final goal is real-time occupancy monitoring, by counting the number of users within coarsely defined occupancy zones in the area of interest. The IMM filtering results lead to mapping of the each target node to a zone which is created as illustrated in Fig. 1. Sum of the individual target nodes in a zone gives the occupancy count of that zone. In this case, there might be a problem when a user switches the device’s screen off since probe requests may not be sent for seconds to seconds[36]. Thus, this interval should be considered as a grace period for that individual device, and the user should be kept within the occupancy count for that zone, until the time threshold expires. In our experimental model, we consider the grace period to be seconds, after which we remove a user from the building zone which has been last localized.
VI Sniffing Probe Requests
In the experiments, seven different WiFi-PA equipment are deployed at various locations on the floor of a university campus building as in Fig. 5. The data captured at WiFi-PA include absolute time stamp providing the time at which the data was captured, MAC address of the WiFi device, and the RSS of the WiFi device. Gathered data is transferred to a server for execution of the occupancy counting.
Probe request mechanism is explained in [18]. Probe requests, which are sent by WiFi devices to scan WiFi APs at certain channels, are active mechanisms to discover APs around the mobile device. APs respond with probe responses and beacons. A problem with probe requests is passing-by and static devices. For instance, there might be people outside of the building passing by a WiFi device and it still can be detected by the reference nodes which are close to the border. Such devices usually send a few low power probe requests while in the vicinity of the reference node. Static devices (such as wireless printers) send thousands of probe requests with the same RSS value usually periodically. Passing-by and static devices are removed from the occupancy count in our analysis using outlier detection and filtering techniques. Our experimental results indicate that only of the detected devices are in our region of interest as shown in Fig. 6. In total, unique and legitimate MAC addresses are detected within a period of one week.
Since probe requests are only used to discover APs, they are burst and intermittent as shown in Fig. 3. After connecting to a network, probe requests are rarely sent to search for a network with better quality or handover possibilities. Therefore, they could be sent 50 times within a minute (i.e., burst), or do not trigger for 5 to 10 minutes (i.e., intermittent). This depends on the implementation of the IEEE 802.11 protocol at the device in terms of both the design of hardware and software. Due to the mobility, WiFi device needs to send probe requests more frequently since it causes to lose connectivity with the AP, or degrade the quality of the connection. Therefore, probe requests can effectively facilitate localization and tracking of the WiFi devices.
VII Numerical Results
In this section, we will first investigate the performance of the proposed framework with computer simulations using realistic WiFi signal propagation data. Subsequently, we will present our experimental results and occupancy counting based on the measurement data that has been collected at FIU College of Engineering and Computing building.
VII-A Simulation Results
The proposed framework is simulated considering an indoor environment with propagation parameters and indoor channel characteristics summarized in Table LABEL:table:parameter. In our simulations, we used a realistic multipath indoor channel model to generate the RSS values. It is compared with the ideal flat fading channel results, where there is only a single strong path together with the AWGN. Probe request statistics such as interarrival times and detection probability are included from experimental measurements as well to increase the reliability of the results. By introducing detection probability, detection of probe requests by single or multiple reference nodes is modeled realistic. Also, path loss exponents and reference received powers are used from the obtained measurements.
An example trajectory from the simulations are plotted as seen in Fig. 7 with the individual position estimates before filtering, as well as the IMM output as the estimated path. The estimated path is very close to the real path as we expect. The mean RMSE for the estimated path is meters in this realization, while mean RMSE for individual localization estimates is meters which is significantly higher. Cumulative distribution function (CDF) of the mean tracking RMSE for bare localization estimates without IMM filtering (EST), and the IMM filtering results are compared for a given scenario in Fig. 8. Simulations are averaged over 1000 Monte Carlo realizations with the realistic indoor channel parameters. Median RMSE for IMM filter is meters, whereas , , and meters for Model-1 (M-1), Model-2 (M-2), and Model-3 (M-3), respectively. It is also meters for the individual location estimates (EST), which is almost twice the RMSE obtained from the IMM filter.
This performance is also reflected to the zone-level localization performance (i.e., percentage of correctly localizing a target device within the building zone where its true location belongs) as shown in Fig. 9. Even though the zone-level localization performance varies from to for IMM, the median performance is . Individual Kalman filters are very close to each other with the median performance of with range of to . Similar to the tracking RMSE, performance of zone-level localization is much lower for individual location estimates as the median performance is , with a range between to .
VII-B Experimental Results
In addition to simulations, extensive experimental data are collected at FIU to validate the proposed methods with real measurements. First, several experimental walks are considered using WiFi devices to compare the simulation and experimental results of the ground-truth data on a fixed path within the building, using the reference nodes as shown in Fig. 7. The track is walked at different times of the day along with several WiFi devices. The comparison of the zone-level localization performance is presented in Fig. 10. The performance of simulations with the AWGN and realistic indoor channel are close to each other for both the individual location estimates and the IMM. The median performance of the individual location estimates is under ideal AWGN, whereas it is under realistic indoor channel. Performance with ground-truth data is lower with . The IMM improves the median performance to in the simulation results, whereas it increases to in the experimental results with real data. The growth in the experimental performance using IMM is substantial with increase. Hence, the IMM plays a critical role in the zone-level localization of the individual target devices.
Once we localize all the target mobile users in building zones (filtered out from passing by users and other fixed devices as shown in Fig. 6) using IMM, we can obtain the real-time occupancy count in different building zones using these location estimates. Using experimental data, occupancy counting results over a period of 24 hours are shown in Fig. 11 as a sum of total occupants over all the zones in a week day. As expected, peak hours of the occupancy in the building is between PM and PM, where most of the classes are scheduled. The lowest occupancy is around AM. Fig. 11 also shows that if we enforce the localization and occupancy counting technique to use at least reference nodes (), a significant portion of the occupancy is not detected. Therefore, the proposed techniques in Fig. 4 and the IMM method can help in significantly improving the occupancy counting.
Fig. 11 also compares the effects of the sample-and-hold window length on occupancy counting results. Even though the total occupancy count with (i.e. the measurements are dropped if ) at peak hours increases to using a sample-and-hold method using holding length of , this is still significantly lower compared to when we use IMM based tracking with (i.e. measurements are used regardless of number of available reference nodes). Use of a larger holding window also smooths the total occupancy count when the IMM approach is used.
In Fig. 12 and Fig. 13, we provide occupancy counting results in each of the eight individual zones over a period of one whole week day and four hours, respectively, considering different time resolutions. Although the individual zones show similar behavior throughout the day, their peak hours are different as seen in Fig. 12. For instance, Zone-E has a peak at 11 AM, while Zone-D has its peak at 5 PM. The most common characteristic is that all of the zones have their minimum around 5 AM in the morning. Zone-C has the highest peak among all zones, with people around 3 PM. Another interesting observation is that Zone-H, which consists a student study area and senior design laboratory, has occupants until very late hours, while Zone-C has least occupied zone at midnight, since it consists mostly administrative offices. In Fig. 13, occupancy counts of individual zones are shown within peak hours, i.e. from 10 AM to 2 PM. The movement of people from one zone to another is more visible in this figure.
In Fig. 14, percentage of occupants based on hours spent in the building is presented. For instance, of the people spend to hours in the building, whereas the people spending less than hour is less than . More than half of the occupants spend less than hours in the building, with of them spending to hours.
VIII Conclusion
In this paper, we propose a new framework for real-time occupancy counting of buildings by passively sniffing WiFi probe requests. We first differentiate the static and passing-by devices in the building using probe request statistics, and then use the information for the remaining users for localization, tracking, and occupancy counting inside the building. The proposed method assures the utilization of the probe requests regardless of the number of the available reference nodes that they are detected, which allows to detect three times more users in the occupancy counting process when compared to requiring at at least four reference nodes for localization. In our experimental results, we are able to perform zone-level occupancy tracking with up to accuracy. We are also able to determine the peak hours of the individual zones as well as the quiet times of the building. Our future work includes use of more advanced filtering techniques such as particle filters to further improve zone level localization and occupancy counting performance.
Acknowledgements
We would like to thank Edwin Vattapparamban for helping in probe request data collection within Florida International University, and Yusuf Said Eroglu for reviewing the manuscript and providing feedback.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] V. Erickson, S. Achleitner, and A. Cerpa, “POEM: power-efficient occupancy-based energy management system,” in Proc. IEEE Information Processing in Sensor Networks (IPSN) , Apr 2013, pp. 203–216.
- 2[2] A. Zanella, N. Bui, A. Castellani, L. Vangelista, and M. Zorzi, “Internet of Things for smart cities,” IEEE Internet of Things J. , vol. 1, no. 1, pp. 22–32, Feb. 2014.
- 3[3] J. Jin, J. Gubbi, S. Marusic, and M. Palaniswami, “An information framework for creating a smart city through Internet of Things,” IEEE Internet of Things J. , vol. 1, no. 2, pp. 112–121, Apr. 2014.
- 4[4] P. Vlacheas, R. Giaffreda, V. Stavroulaki, D. Kelaidonis, V. Foteinos, G. Poulios, P. Demestichas, A. Somov, A. Biswas, and K. Moessner, “Enabling smart cities through a cognitive management framework for the Internet of Things,” IEEE Commun. Mag. , vol. 51, no. 6, pp. 102–111, June 2013.
- 5[5] K. Pahlavan, P. Krishnamurthy, and Y. Geng, “Localization challenges for the emergence of the smart world,” IEEE Access , vol. 3, pp. 3058–3067, 2015.
- 6[6] G. Pan, G. Qi, W. Zhang, S. Li, Z. Wu, and L. Yang, “Trace analysis and mining for smart cities: issues, methods, and applications,” IEEE Commun. Mag. , vol. 51, no. 6, pp. 120–126, June 2013.
- 7[7] S. Tadokoro, Q. shan Jia, Q. Zhao, H. Darabi, G. Huang, B. Becerik-Gerber, H. Sandberg, and K. Johansson, “Smart building technology,” IEEE Robotics Automation Mag. , vol. 21, no. 2, pp. 18–20, June 2014.
- 8[8] Y. Agarwal, B. Balaji, R. Gupta, J. Lyles, M. Wei, and T. Weng, “Occupancy-driven energy management for smart building automation,” in Proc. ACM Workshop on Embedded Sensing Systems for Energy-Efficiency in Building , 2010, pp. 1–6.
