A Robust Roll Angle Estimation Algorithm Based on Gradient Descent
Rui Fan, Lujia Wang, Ming Liu, Ioannis Pitas

TL;DR
This paper introduces a more efficient roll angle estimation algorithm using gradient descent to optimize a global energy function, reducing iterations while maintaining accuracy.
Contribution
It replaces the golden section search with gradient descent for faster, more computationally efficient roll angle estimation from disparity maps.
Findings
Fewer iterations needed for the same precision.
Maintains accuracy comparable to previous methods.
Improved computational efficiency.
Abstract
This paper presents a robust roll angle estimation algorithm, which is developed from our previously published work, where the roll angle was estimated from a dense disparity map by minimizing a global energy using golden section search algorithm. In this paper, to achieve greater computational efficiency, we utilize gradient descent to optimize the aforementioned global energy. The experimental results illustrate that the proposed roll angle estimation algorithm takes fewer iterations to achieve the same precision as the previous method.
| Method | (rad) | |||
|---|---|---|---|---|
| GSS | 16 | 21 | 26 | 30 |
| GD | 3 | 4 | 4 | 4 |
| Method | (rad) | |||
|---|---|---|---|---|
| GSS | 1.551 | 1.986 | 2.457 | 2.810 |
| GD | 0.412 | 0.401 | 0.511 | 0.512 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Measurement and Detection Methods · Autonomous Vehicle Technology and Safety
A Robust Roll Angle Estimation Algorithm
Based on Gradient Descent
Rui Fan1, , Lujia Wang2, ,
Ming Liu1, , Ioannis Pitas3, ,
1*Robotics and Multi-Perception Laboratory, Robotics Institute, *
*the Hong Kong University of Science and Technology, Hong Kong. *
2Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
3*School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece. *
{eeruifan, eelium}@ust.hk, [email protected], [email protected]
Abstract
This paper presents a robust roll angle estimation algorithm, which is developed from our previously published work, where the roll angle was estimated from a dense disparity map by minimizing a global energy using golden section search algorithm. In this paper, to achieve greater computational efficiency, we utilize gradient descent to optimize the aforementioned global energy. The experimental results illustrate that the proposed roll angle estimation algorithm takes fewer iterations to achieve the same precision as the previous method.
I Introduction
Stereo vision is one of the most commonly used technologies for 3D information extrapolation [1, 2, 3]. This technology extrapolates the depth of the real-world scenery by comparing the difference between a pair of stereo images [4, 5]. The relative positional difference between each pair of correspondence points is generally known as disparity [2, 6]. The disparity information enables autonomous systems to segment road regions more effectively [7]. For example, Labayrade et al. [8] introduced an accurate obstacle detection method based on the analysis of a so-called “v-disparity” image, which was created by computing the histogram of each horizontal row of the disparity map. Since then, many algorithms based on v-disparity image analysis have been proposed for road region segmentation [9, 10, 11]. These algorithms generally hypothesize that the road surface is planar and the stereo rig baseline is perfectly parallel to the horizontal road surface [12, 13]. These assumptions make it feasible to represent the road disparity projections using a linear model [14, 15, 16]. The road regions can, therefore, be extracted by comparing the difference between the actual disparity values and the estimated linear road disparity projection model [17].
However, in practice, the stereo rig baseline is not always perfectly parallel to the horizontal road surface. This fact can introduce a non-zero roll angle , shown in Fig. 1, into the imaging process, where and represent the baseline and the mounting height of the stereo rig, respectively; and are the origins of the left and right camera coordinate systems, respectively. The stereo rig baseline can become perfectly parallel to the horizontal road surface by transforming the original world coordinate system to a new world coordinate system . The original disparity map is shown in Fig. 2, where readers can observe that the disparity values change gradually in the horizontal direction. This makes the way of representing the road disparity projections with a linear model problematic [18]. Therefore, the effects caused by the non-zero roll angle have to be eliminated before performing road region segmentation [18].
In recent years, many researchers have proposed to estimate the roll angle from the disparity map. For example, in [19] and [20], the road surface was assumed to be planar, and the disparities in a selected small area were modeled by a plane , where and are the horizontal and vertical disparity coordinates, respectively. The roll angle was then calculated as . However, finding a proper small area for plane modeling is always challenging, because such an area may contain an obstacle [18]. This can severely affect the plane modeling precision [21]. In this regard, an iterative roll angle estimation algorithm was proposed in [21], where multiple small areas were selected for plane modeling. The roll angle was then estimated from the optimum selected area. However, the aforementioned algorithms can only work for planar road surfaces. Therefore, in our previous work [18], the projections of road disparities were represented by a parabola, and the roll angle estimation was formulated as an energy minimization problem. To achieve greater processing efficiency, golden section search (GSS) algorithm [22] was utilized to reduce the roll angle search range. However, this method still takes extensive iterations to converge to the minimum. Therefore, the algorithm proposed in this paper solves the energy minimization problem using gradient descent (GD) [23], which greatly reduces the computational complexity of roll angle estimation.
The remainder of this paper is structured as follows: Section II presents the proposed roll angle estimation algorithm. The experimental results for performance evaluation are illustrated in Section III. Finally, Section IV summarizes the paper and provides recommendations for future work.
II Methodology
The input of our proposed algorithm is a dense disparity map, which was obtained from a stereo road image pair (see Fig. 2 and 2) through our previously published disparity estimation algorithm [20]. The estimated disparity map is shown in Fig. 2, and its v-disparity map is shown in Fig. 2. To rotate the disparity map around a given angle , each set of original coordinates is transformed to a set of new coordinates using the following equations:
[TABLE]
[TABLE]
The parabola representing the road disparity projections can, therefore, be written as follows:
[TABLE]
The coefficients of can, therefore, be estimated by solving an energy minimization problem as follows:
[TABLE]
where
[TABLE]
the column vector stores the disparity values, the column vector stores the coefficients of , and
[TABLE]
The energy minimization problem in (4) has a closed-form solution [24]:
[TABLE]
where
[TABLE]
The minimum energy can also be obtained by combining (5) and (7):
[TABLE]
According to [18], the desirable roll angle corresponds to the minimum . Therefore, the proposed roll angle estimation algorithm is equivalent to the following energy minimization problem:
[TABLE]
However, deriving the numeric solution of (10) is significantly challenging [18]. Therefore, the proposed algorithm utilizes GD to approximate by formulating (10) as an iterative optimization problem as follows [25]:
[TABLE]
where
[TABLE]
[TABLE]
is an identity matrix, and is the learning rate. If is too high, (11) may overshoot the minimum. On the other hand, if is set to a relatively low value, the convergence of (11) may take too many iterations [22]. Therefore, selecting a proper is always essential for GD. Instead of fixing the learning rate with a constant, we utilize backtracking line search to produce an adaptive learning rate:
[TABLE]
The initial learning rate is set to 26. We will discuss the selection of in Section III. The initial approximation is set to [math], because in practical experiments the absolute value of the roll angle is usually not that high. The optimization iterates until the absolute difference between and is smaller than a preset threshold . The performance of the proposed roll angle estimation algorithm will be discussed in Section III.
III Experimental Results
In this section, we evaluate the performance of the proposed roll angle estimation algorithm both qualitatively and quantitatively. The proposed algorithm was implemented in Matlab 2019a on an Intel Core i7-8700K CPU (3.7 GHz) using a single thread.
Firstly, we discuss the selection of . In our experiments, we utilized a ZED stereo camera111https://www.stereolabs.com/ to capture stereo road image pairs. The disparity maps estimated from the stereo image pairs were then utilized to estimate the roll angles. Some experimental results of our pothole datasets222www.ruirangerfan.com are shown in Fig. 3, where readers can observe that the disparities on each row in the rotated disparity maps have similar values. Therefore, the proposed algorithm can effectively estimate the roll angle from a given disparity map.
Furthermore, we select a range of and record the number of iterations that GD takes to converge with respect to different , as shown in Fig. 4, where readers can see that is the optimum value for different . When rad (), GD takes three iterations to converge, while GD iterates four times, when is set to rad (), rad () or rad ().
Moreover, we compare the computational efficiency of the proposed algorithm with [18]. As discussed in [18], the computational complexity of GSS is , where is the golden section ratio[22]. The comparison of the computational efficiency between GSS and GD is shown in Table I, where we can see that GSS takes more iterations to converge with the decrease of , while the performance of GD largely remains the same. Therefore, the proposed roll angle estimation algorithm performs more robustly than [18] with respect to different thresholds .
However, GD involves the computations of both and , which are more complex than the computation of . Therefore, we also compare the runtime of GSS and GD with respect to different thresholds , as shown in Table II. Readers can observe that the runtime of GSS goes up gradually with the decrease of , while the runtime of GD changes slightly with respect to different . Overall, the proposed algorithm performs much faster than [18].
To quantify the accuracy of the proposed roll angle estimation algorithm, we utilize a synthesized stereo dataset from EISATS333https://ccv.wordpress.fos.auckland.ac.nz/ [26, 27], where the roll angle is perfectly zero. We manually rotate the disparity maps around a given angle, and then estimate the roll angles from the rotated disparity maps. Examples of the EISATS stereo datasets are shown in Fig. 5, where readers can see that the effects caused by the roll angle are successfully eliminated by performing the proposed algorithm. Furthermore, we compare the accuracy of the estimated roll angles between the proposed method and [18] with respect to different . The comparison results are shown in Fig. 6, where represents the average difference between the actual and estimated roll angles. From Fig. 6, we can find that GD performs more accurately than GSS when is set to a lower value. Additionally, the computational efficiency of GD is also greater than that of GSS. Therefore, GD is a better solution than GSS in terms of solving the energy minimization problem in roll angle estimation.
IV Conclusion and Future Work
In this paper, a robust GD-based algorithm was proposed to estimate the roll angle from disparity maps. The proposed algorithm was developed based on our previously published work, where roll angle estimation was formulated as an energy minimization problem, which was then solved using GSS. The experimental results demonstrated that the proposed algorithm outperforms our previous method in terms of both accuracy and efficiency, and that it can successfully eliminate the effects caused by the non-zero roll angle. In the future, we aim to design a neural network to estimate the roll angle with an unsupervised learning strategy.
Acknowledgment
This work is supported by grants from the Research Grants Council of the Hong Kong SAR Government, China (No. 11210017, No. 16212815, No. 21202816, NSFC U1713211) awarded to Prof. Ming Liu. This work is also supported by grants from the Shenzhen Science, Technology and Innovation Commission (JCYJ20170818153518789), National Natural Science Foundation of China (No. 61603376) and Guangdong Innovation and Technology Fund (No. 2018B050502009) awarded to Dr. Lujia Wang. This work is also supported by funding awarded to Prof. Ioannis Pitas received from the European Union Horizon 2020 research and innovation programme, under grant agreement No. 731667 (Multidrone).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Fan, “Real-time computer stereo vision for automotive applications,” Ph.D. dissertation, University of Bristol, 2018.
- 2[2] E. Trucco and A. Verri, Introductory techniques for 3-D computer vision . Prentice Hall Englewood Cliffs, 1998, vol. 201.
- 3[3] R. Fan, Y. Liu, M. J. Bocus, and M. Liu, “Real-time subpixel fast bilateral stereo,” ar Xiv preprint ar Xiv:1807.02044 , 2018.
- 4[4] N. Qian, “Binocular disparity and the perception of depth,” Neuron , vol. 18, no. 3, pp. 359–368, 1997.
- 5[5] R. Fan and N. Dahnoun, “Real-time implementation of stereo vision based on optimised normalised cross-correlation and propagated search range on a gpu,” in 2017 IEEE International Conference on Imaging Systems and Techniques (IST) . IEEE, 2017, pp. 1–6.
- 6[6] R. Fan, Y. Liu, X. Yang, M. J. Bocus, N. Dahnoun, and S. Tancock, “Real-time stereo vision for road surface 3-d reconstruction,” in 2018 IEEE International Conference on Imaging Systems and Techniques (IST) . IEEE, 2018, pp. 1–6.
- 7[7] R. Fan, J. Jiao, J. Pan, H. Huang, S. Shen, and M. Liu, “Real-time dense stereo embedded in a uav for road inspection,” ar Xiv preprint ar Xiv:1904.06017 .
- 8[8] R. Labayrade, D. Aubert, and J.-P. Tarel, “Real time obstacle detection in stereovision on non flat road geometry through” v-disparity” representation,” in Intelligent Vehicle Symposium, 2002. IEEE , vol. 2. IEEE, 2002, pp. 646–651.
