Designing a Roadside Sensor Infrastructure to Support Automated Driving
Florian Geissler, S\"oren Kohnert, Reinhard Stolle

TL;DR
This paper explores the design of roadside sensor networks, analyzing coverage and occlusion to support automated driving, using simulations based on real traffic data to determine optimal sensor configurations.
Contribution
It provides a systematic analysis of sensor placement and parameters to optimize roadside infrastructure for automated vehicle support.
Findings
Sensor network design significantly affects traffic detection completeness.
Optimal sensor range and orientation improve coverage and reduce occlusions.
Proposed setup could substantially support automated vehicle perception.
Abstract
Automation of complex traffic scenarios is expected to rely on input from a roadside infrastructure to complement the vehicles' environment perception. We here explore design requirements for a prototypical setup of virtual vision or RADAR sensors along one roadside. Explicitly, we analyze the road coverage and the probability of vehicle occlusions, with the objective of evaluating the completeness of information that is captured by the sensor field. Simulation case studies are performed based on real traffic data acquired at the German Autobahn 9 near Munich. Our findings indicate how the sensor network should be designed in terms of sensor range, orientation and opening angle, in order to enable effective traffic detection. The achieved degree of completeness suggests that such a setup could be used to support automated vehicles to a substantial extent.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Designing a Roadside Sensor Infrastructure to Support Automated Driving
Florian Geissler
Dependability Research Lab, Intel Labs Europe
85579 Neubiberg, Germany
Email: [email protected]
Sören Kohnert and Reinhard Stolle
Augsburg University of Applied Sciences
86161 Augsburg, Germany
Email: [email protected]
(March 16, 2024)
Abstract
Automation of complex traffic scenarios is expected to rely on input from a roadside infrastructure to complement the vehicles’ environment perception. We here explore design requirements for a prototypical setup of virtual vision or RADAR sensors along one roadside. Explicitly, we analyze the road coverage and the probability of vehicle occlusions, with the objective of evaluating the completeness of information that is captured by the sensor field. Simulation case studies are performed based on real traffic data acquired at the German Autobahn 9 near Munich. Our findings indicate how the sensor network should be designed in terms of sensor range, orientation and opening angle, in order to enable effective traffic detection. The achieved degree of completeness suggests that such a setup could be used to support automated vehicles to a substantial extent.
I Introduction
For the last decades, the realization of automated driving (AD) has experienced tremendous progress, powered by continuous technological innovations throughout a variety of fields. Improved object detection techniques (see e.g. [1, 2]), data fusion and tracking schemes [3, 4], dedicated wireless communication paths [5], as well as smart driving strategies [6], have contributed to the functional safety and dependability of AD. So far, most concepts focus on an in-vehicle perspective: Automated cars are equipped with various sensors, comprising for instance camera, RADAR and LIDAR, to be able to detect threats in their environment. It appears, however, that this development faces some important challenges. First, the vehicle horizon is limited to the range of the sensors on board. Second, the processing of data collected by all sensors in the vehicle demands a lot of computational resources. To overcome such difficulties, one approach is to share information and resources with other agents on the road, or with the infrastructure. Corresponding technologies are commonly referred to as vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I), respectively (e.g. [7, 8]). The latter is typically understood as the exchange of information with individual elements of an existing infrastructure, such as traffic lights.
In this paper, we extend this concept to study a system where the perception of road traffic, and therefore the information relevant for an autonomous agent, is exclusively acquired by the infrastructure. This not only shifts the computational load off the vehicle, but also provides long-range vision for early road hazard warnings and traffic flow management. The infrastructure is in this case represented by an array of sensors mounted along one roadside at fixed spatial intervals. This particular setup is motivated by a highway scenario where a sensor network for road surveillance can be established, exploiting existing equipment, by attaching each sensor to one of the roadside guiding posts. The fact that we restrict ourselves to a single roadside here reflects the practical difficulties of placing sensors on the median strip of the highway. Detections of individual sensors are collected, processed in an close-by edge computation platform, and are eventually broadcast to the vehicle. We particularly focus on RADAR sensors here, even though the first part of our analysis applies to any generic type of sensor, that is specified only by the form of its field of view (FoV). RADAR sensors were shown to be feasible instruments for vehicle detection [9, 10, 11], however, to the best of our knowledge, the possibility of a seamless road surveillance by a network of RADAR sensors has not been studied so far.
A crucial concern for the type of setup described above, is whether the entirety of information obtained by the infrastructure is of an adequate quality to dictate safe driving decisions. In this paper, we address this question with respect to the completeness of information, which is one of the paramount quality attributes in this context (see e.g. [12]), and a prerequisite for the system’s functional safety. Hereby, we define completeness as the extent to which data are of sufficient scope to represent the universe of discourse [13, 14]. We thus analyze, to what degree the system captures objects on the road, simulating different detection mechanisms such as vision and RADAR sensing, and using realistic road data. Completeness of information is generally challenged by false negative detections, that can occur for the following reasons: i) An object is missed because it is not in the sensors FoV, ii) an object is in the sensors FoV, but the infrastructure fails to detect it, for instance due to a low signal-to-noise ratio (SNR), iii) an object can not be detected because it is occluded by other objects. False positive detections further complicate traffic detection, however, are typically far less safety-critical than false negatives [6]. They do not affect the information completeness, and are therefore not studied explicitly here. Importantly, the specifications of the setup differ from previous work (see e.g.[15, 16]). A special emphasis is put on the influence of vehicle occlusions on a seamless road surveillance. The relation between occlusions and traffic density, based on sample measurements, was for instance explored in [17]. The goal of the present paper is to investigate the fitness of a roadside sensor infrastructure for the purpose of guiding automated vehicles. Our analysis can be used to design sensor network topologies with a high degree of detection completeness.
The article is organized as follows: In Sec. II we analytically derive the requirements for full and multiple road coverage in the given setup, in the absence of road traffic. In order to avoid systematic incompleteness of traffic detection, full coverage is a key aspect for the design of a dependable sensor network. Subsequently, in Sec. III, we address the problem of detecting moving objects taking into account vehicle occlusions. We discuss several simulation results that are based on different detection mechanisms and traffic conditions. Finally, we conclude in Sec. IV.
II Road coverage
II-A Setup
In the following, we consider a generic sensor model of reduced complexity. All sensors forming the network are identical, oriented the same way, and arranged periodically along a single, straight roadside. The symmetries of the system allow to study the overlap of two neighboring sensor sectors only, in order to analyze the requirements for full road coverage of the entire sensor network. In this section, we derive the positions of the two possible intersection points of a pair of neighboring sensor FoVs. Full road coverage is given if the road boundaries are within the corridor specified by those lower and upper intersection points. Furthermore, we presume that a sensor FoV is two-dimensional and has the geometric form of a circle sector, thus it can be parametrized by opening angle and range. The defining parameters of the sensor array topology are illustrated in Fig. 1: sensor range (), sensor opening angle (), rotation angle of a sensor relative to the -axis (), distance of a sensor to the nearest road boundary (), distance between two neighboring sensors, i.e. two neighboring pylons, (), and road width (). For convenience, let us also define the angles and .
A sensor FoV that points away from the road does not yield any additional information, and the structure can always be transformed into a more efficient one by modifying and accordingly. Therefore, we restrict ourselves to
[TABLE]
II-B Requirements for full road coverage
To derive the requirements for full coverage, we have to find the lower and upper intersection points of two overlapping sensor sectors. The former can emerge either as the intersection of two lines representing the lateral edges of the FoV, or as the intersection of one boundary line with a circle arc, representing the front edge of a sensor FoV. Similarly, the latter intersection point can arise as the intersection of a line and a circle arc, or as the intersection of two circle arcs, leaving us with a total of four different geometrical constellations.
We first study the lower intersection point. The lines describing the left and right lateral edge of a sensors FoV are given by
[TABLE]
Here, () is the lateral distance of a point on the right (left) boundary line from its original sensor, while the vertical distances are given by and , respectively. The boundary lines of two neighboring sensors cross at
[TABLE]
With (2), this leads us to
[TABLE]
This defines the lower intersection point in case the sensor range is sufficiently large, so if the crossing point is in reach of both sensors, . It follows that the first requirement for full road coverage can be phrased as
[TABLE]
Second, if the sensor range is not sufficient for the above requirement, the sensor sectors can still overlap if the crossing point emerges as an intersection of one line and one circle arc. We have
[TABLE]
where we use to account for both cases of and . The constraints for the crossing point in (3) lead us to
[TABLE]
with the two solutions
[TABLE]
Those solutions are real (an intersection exists) if . To identify the lower crossing point in this case, we choose the smaller solution (lower sign) and therefore obtain the requirement for full coverage,
[TABLE]
Here, an additional constraint needed to be taken into account – the range has to be sufficiently large to reach the crossing point at all, and . These terms were added in the last two lines of (9).
Next we study the vertical distance of the upper crossing point. As before, we have to distinguish two cases. First, the intersection can emerge as the crossing point of two circle arcs. This is the case if the opening angle is wide enough, explicitly,
[TABLE]
In this case, the crossing point is simply given by
[TABLE]
which brings us to the requirement for full coverage of
[TABLE]
Furthermore, the upper crossing point can arise as an intersection point of a circle arc and a line. We then make use of (8), where the desired maximum distance is now associated with the upper sign, , and full road coverage is given if
[TABLE]
As before, we check for sufficient sensor range by and .
Full road coverage is achieved if (5) or (9), and either of (12) and (13) holds.
II-C Implications
The findings of the above section allow us to derive analytically the threshold between the regimes of incomplete and complete road coverage. We note the following implications.
The minimum sensor range that possibly provides full road coverage is determined by the points the furthest away from any sensor. This distance is given by (11), and we find
[TABLE]
The angle on the other hand has no lower bound – for infinitely large range it approaches zero.
Without a repositioning of the sensors, road coverage can be optimized by varying dynamically the parameters , and . Mounting the sensors very close to the road, such that in particular (which seems realistic for our scenario), is a challenge to full road coverage. A large rotation angle , almost to the full extent of , is required then. We therefore fix in the following to this maximum, and investigate the impact of the remaining parameters and on the road coverage. Fig. 2 gives an example, and illustrates the validity of the above analysis by comparing it to the outcome of a numerical integration approach.
Importantly, the analysis of Sec. II-B can be easily extended to identify regimes of multiple road coverage. To do so, we replace in (5), (9), (12), and (13) the parameter by , where is a positive integer. We then obtain the constraints for full road coverage by a subset of sensors that are -nearest neighbors. This is equivalent to the statement that each point of the road is in range of at least sensors, so we face multiple road coverage of degree (see also Fig. 2). This finding can be used to design a robust sensor network that maintains full road coverage despite temporary sensor outages.
By inspection of Fig. 2, we state that full (and even multiple) road coverage is readily possible with current state-of-the-art RADAR technology [11].
III Occlusions
III-A Traffic model
In this section, we investigate the influence of object occlusions on the detection capabilities of the sensor network, depending on the sensor FoV. To account for a realistic traffic situation, we use statistical road traffic data recorded at the Autobahn 9 close to Neufahrn, near Munich, which was provided by the traffic authority of Southern Bavaria [18]. This data contains for instance the average number of vehicles per hour per lane, average speed per lane, and the distribution of vehicle types per lane. The latter follows a classification scheme with five different groups, including cars, trucks, buses, as well as cars and trucks with trailers. The considered highway segment offers three lanes, with an additional emergency lane that can be opened for vehicles on demand by the traffic control authority. We compare three selected scenarios here: Christmas Eve morning, which represents a low-density traffic scenario, an average Tuesday morning, representing typical commuting traffic, and a traffic jam, as an example for a high-density traffic situation. In our simulation we adopt a simplified traffic model. The road is randomly populated with vehicles, weighted according to traffic density and object type distributions. All vehicles on a lane move with the same, constant speed – lane changes and related maneuvers are neglected here.
III-B Vision sensing
In the first stage of the simulation, we study occlusions using a line-of-sight (LoS) sensing model. A ray casting algorithm calculates the shadows thrown by the objects. Explicitly, if at least one resolution cell of a very small size of of a vehicle’s bounding box is illuminated, the car is classified as visible. With such LoS sensors we model cameras with almost perfect detection capabilities, that act as a reference detection frame. The simulation returns the percentage of missed vehicles over time, which equals the degree of incompleteness in this setup.
III-C RADAR sensing
Next, we simulate a network of RADAR sensors using existing MATLAB*®* toolboxes [19]. This provides us with generic RADAR detection generators, that can be used to model object detections without further specification of the signal processing inside a sensor. The radiation pattern of the RADAR sensors is idealized as conoidal and unambiguous, meaning in particular that sidelobes are neglected. We assume an azimuth angle resolution of , as it was achieved with a linear array of 12 antennas, where the number of antennas is constant in contrast to the aperture size. Furthermore, we use a radial resolution of , and the false alarm rate is quantified as [20]. As before, we assume that an area within the FoV of a given sensor is occluded for RADAR detections if it is shadowed by an object such that there exists no clear LoS to that sensor. The RADAR cross section (RCS) of a vehicle is taken as a constant across the object dimensions here for simplicity. In order to determine the number of missed objects at an instant of time, each vehicle is assigned a unique ID, which is registered for an individual sensor detection. Completeness is then evaluated by comparing the list of detected IDs with the ground truth. Eventually, the time average over the course of the simulation is performed.
We further advance our analysis by studying object tracking based on the RADAR detections. Generally, tracking allows to compensate temporary losses of information in situations where objects are occluded, or out of sensor range. On the other hand, the sensitivity to errors, and the temporal delay of a measurement-to-track association reduces the success rate of object detections. In the following we therefore observe a competition of these two opposing trends. Before tracks are created, all individual detections are collected by a central authority – for instance the edge computing platform – and fused to clusters according to spatial proximity. Such a centralized fusion scheme facilitates tracking in situations where an object travels from one sensor sector to another. As a cluster length, we hereby choose the width of a car, i.e. the smallest object length scale studied in the present setup. Subsequently, clusters are assigned to tracks using a linear Kalman filter (KF), that is based on a two-dimensional constant-velocity motion model. A KF uses a series of consecutive measurements to predict the next vehicle state, and updates this estimation by comparing it to the subsequent measurements [3]. With the help of such periodic prediction-update cycles, the filter is able to track vehicle trajectories. Key tracking parameters of our model, such as the track confirmation threshold and the number of coasting updates, were set to the standard values specified in [19].
In the simulation, the following aspects further impede a complete track object detection: i) If an object enters the sensor FoV, a track is confirmed only after a threshold number of detections that match with the KF predictions. In the meantime, it is missed. ii) Multiple objects can be assigned to the same track, this applies in particular to two objects in close vicinity on different lanes. iii) False alarm detections due to noise disarray tentative tracks. Note that long objects, like trucks and buses in this model, are typically assigned to several tracks at once, however, such redundant false positive detections will not affect completeness here, as discussed above.
III-D Results
Our simulation results are shown in Fig. 3. First, we observe the following general trends. The degree of completeness of the captured information decreases with increasing traffic density, due to the growing probability of vehicle occlusions. Furthermore, the perfect cameras and the generic RADAR sensors exhibit very similar detection capabilities, taking into account random fluctuations in the vehicle distribution for a respective traffic scenario. This implies that the postulated difference in resolution does not affect object detection much. Tracking improves the completeness of detections especially for poor road coverage (e.g. small sensor range), since a vehicle trajectory can still be predicted for some time after the object has left the sensors’ FoV. On the other hand, the measurement-to-track association becomes progressively difficult for distant targets, such that detections without tracking perform again better than the KF in the long-range limit.
For a more detailed analysis, we compare the selected traffic scenarios. In the low traffic density scenario of Fig. 3a (Christmas Eve), with only very few vehicles on the road at the same time, almost all objects are detected by camera, RADAR or RADAR tracking, given that the sensor network provides sufficient road coverage. Detection failures at small sensor ranges are attributed to objects not being in the sensors’ FoV. Complete traffic detection up to a degree of can be readily achieved with feasible sensor parameters.
The scenario in Fig. 3b describes a typical commuting traffic. Here, object occlusions have a noticeable impact on the completeness of the detected information. However, by improving sensor range and opening angle one can eliminate most of the occlusion events, and obtain an average completeness of about . Importantly, for small to moderate sensor ranges, traffic detection benefits greatly from tracking here. To clarify this, we note the following mechanism. A vehicle that is on a distant lane (i.e. a lane that is not closest to the sensor network) experiences alternating periods in which there either exists a clear LoS to at least one sensor, or in which it is occluded by another object. Accordingly, a detection of the vehicle is possible or not. In this scenario we find that, typically, the intervals in which a selected object is visible are large compared to the track confirmation time, while the occlusion times are small or comparable to the coasting time, such that the filter algorithm can work efficiently.
Last, we study the high-density traffic scenario of a congested highway in Fig. 3c. Due to the increased frequency of occlusions, the degree of completeness achieved in the simulation is reduced, and we find values of about . Note that the KF does not perform very well in this scenario: In contrast to the preceding cases, the periods in which a selected object is occluded are typically large compared to the KF coasting time, such that the relative benefit of tracking is diminished. In addition, the increased proximity of vehicles impairs the association of measurements to tracks. Since the highest vehicle density is usually on the lane which is closest to the sensor network, our system is fairly sensitive to occlusions in this scenario. We expect a higher degree of completeness if the sensors are mounted on the opposite, or on either roadside. To some extent, the detection capability of the infrastructure can also be improved by refining the tracking parameters.
IV Summary
Assessing the dependability of a roadside infrastructure for AD is a challenge of great complexity, that needs to be tackled from various angles. Here, we approach this problem by addressing the perceptional completeness of the sensor network in a given setup. Our analysis offers guidelines for the design of such an infrastructure, e.g. in terms of a favorable sensor positioning.
Since any missing information about an object is potentially safety-critical, to this point no meaningful threshold for the map completeness can be given that was able to guarantee safe automated driving. Therefore, we here consider the sensor infrastructure as an instrument to support AD, e.g. in the form of augmenting agent-based vision, or for improved traffic orchestration. Those services do not necessarily require an absolute perceptional completeness of the sensor network, yet become more feasible and beneficial with an increasing degree of information completeness. We verify the correlation of the latter with the highway traffic density, to demonstrate that the quality of the support the infrastructure can offer is best for low to moderate traffic density scenarios, while in situations of very dense traffic a substantial portion of the information is lost due to vehicle occlusions. Note that the sensors were placed only on a single roadside in our model, as such a setup is expected to be of practical relevance. The system design could be easily extended e.g. by deploying sensors on either roadside, or by elevating the sensors, in order to decrease the probability of occlusion events. A global design methodology is beyond the scope of this article, however will be a subject of further investigations. The same holds for instance for the analysis of a more refined traffic model, a RCS modeling physical vehicle shapes, or multipath propagation of RADAR waves (e.g. on the road surface for transvision effects).
V Acknowledgment
We thank Frederik Bachmann (Technische Universität München) and the Traffic Authority of Southern Bavaria for providing us with traffic measurement data. This work was funded by the German Federal Ministry of Transport, Building and Urban Development (BMVI) within the project KoRA9 (grant No. 16AVF1032A).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Z. Sun, G. Bebis, and R. Miller, “On-Road Vehicle Detection: A Review,” IEEE Trans. on Pattern Analysis and Machine Intelligence , vol. 28, no. 5, pp. 694–711, 2006.
- 2[2] A. Geiger, P. Lenz, and R. Urtasun, “Are we ready for autonomous driving? The KITTI vision benchmark suite,” Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition , pp. 3354–3361, 2012.
- 3[3] R. E. Kalman, “A New Approach to Linear Filtering and Prediction Problems,” Journal of Basic Engineering, Transactions of the ASME , vol. 82, no. Series D, pp. 35–45, 1960.
- 4[4] R. Danescu, F. Oniga, and S. Nedevschi, “Modeling and tracking the driving environment with a particle-based occupancy grid,” IEEE Transactions on Intelligent Transportation Systems , vol. 12, no. 4, pp. 1331–1342, 2011.
- 5[5] “Federal Communications Commission. FCC 99-305. FCC Report and Order,” 1999.
- 6[6] S. Shalev-Shwartz, S. Shammah, and A. Shashua, “On a Formal Model of Safe and Scalable Self-driving Cars,” ar Xiv:1708.06374 [cs.RO] , 2017.
- 7[7] H. Wu, R. Fujimoto, and G. Riley, “Analytical models for information propagation in vehicle-to-vehicle networks,” 2004 IEEE 60th Vehicular Technology Conference, 2004. VTC 2004-Fall , vol. 6, no. C, pp. 4548–4552, 2004.
- 8[8] J. Santa, A. F. Gómez-Skarmeta, and M. Sánchez-Artigas, “Architecture and evaluation of a unified V 2V and V 2I communication system based on cellular networks,” Computer Communications , vol. 31, no. 12, pp. 2850–2861, 2008.
