Light Weight Color Image Warping with Inter-Channel Information
Chuangye Zhang, Yan Niu, Tieru Wu, Ximing Li

TL;DR
This paper introduces a low-cost color image warping method that leverages inter-channel correlations to improve quality, suitable for real-time mobile multimedia applications.
Contribution
The paper proposes a novel warping scheme that combines intra-channel interpolation with cross-channel information, enhancing quality with minimal computational overhead.
Findings
Significant improvement in warping quality using inter-channel correlation.
Method achieves high efficiency suitable for mobile devices.
Validated through extensive experiments.
Abstract
Image warping is a necessary step in many multimedia applications such as texture mapping, image-based rendering, panorama stitching, image resizing and optical flow computation etc. Traditionally, color image warping interpolation is performed in each color channel independently. In this paper, we show that the warping quality can be significantly enhanced by exploiting the cross-channel correlation. We design a warping scheme that integrates intra-channel interpolation with cross-channel variation at very low computational cost, which is required for interactive multimedia applications on mobile devices. The effectiveness and efficiency of our method are validated by extensive experiments.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7| Dataset | Bilinear |
|
Bicubic |
|
||||
|---|---|---|---|---|---|---|---|---|
| Set5 | 28.86 | 30.43 | 28.64 | 29.73 | ||||
| Set14 | 26.55 | 27.50 | 26.23 | 26.85 | ||||
| BSD100 | 26.44 | 26.89 | 26.02 | 26.16 | ||||
| Urban100 | 23.57 | 24.12 | 23.17 | 23.45 |
| Bilinear | Bicubic | Lanczos | |
|---|---|---|---|
| 0.094 | 0.045 | 0.032 | |
| 0.119 | 0.064 | 0.041 | |
| 0.195 | 0.096 | 0.058 | |
| 0.008 | 0.010 | 0.015 | |
| 0.180 | 0.089 | 0.054 | |
| -0.003 | 0.003 | 0.008 |
| Independent | Correlated | ||||||
| PSNR | SSIM | Time | PSNR | SSIM | Time | ||
| Bilinear | 30.42 | 0.9566 | 0.0013 | 32.00 | 0.9673 | 0.0062 | |
| 27.78 | 0.9274 | 0.0012 | 28.55 | 0.9368 | 0.0052 | ||
| 25.84 | 0.8966 | 0.0011 | 26.82 | 0.9096 | 0.0056 | ||
| Bicubic | 31.81 | 0.9657 | 0.0020 | 32.37 | 0.9694 | 0.0075 | |
| 28.63 | 0.9365 | 0.0019 | 29.07 | 0.9413 | 0.0065 | ||
| 26.70 | 0.9087 | 0.0018 | 27.13 | 0.9139 | 0.0068 | ||
| Lanczos | 32.43 | 0.9687 | 0.0020 | 32.74 | 0.9706 | 0.0074 | |
| 29.05 | 0.9402 | 0.0018 | 29.29 | 0.9427 | 0.0063 | ||
| 27.07 | 0.9129 | 0.0017 | 27.33 | 0.9157 | 0.0067 | ||
| Dataset | GR | Ours | |||
|---|---|---|---|---|---|
| PSNR | Time | PSNR | Time | ||
| Set5 | 33.04 | 0.34 | 32.74 | 0.0074 | |
| 29.50 | 0.22 | 29.29 | 0.0063 | ||
| 27.49 | 0.19 | 27.33 | 0.0067 | ||
| Set14 | 29.85 | 0.67 | 29.62 | 0.0138 | |
| 26.92 | 0.46 | 26.75 | 0.0133 | ||
| 25.21 | 0.38 | 25.08 | 0.0124 | ||
| BSD100 | 28.90 | 0.45 | 28.97 | 0.0073 | |
| 26.38 | 0.32 | 26.30 | 0.0070 | ||
| 25.03 | 0.26 | 24.97 | 0.0070 | ||
| Urban100 | 26.30 | 2.31 | 26.20 | 0.0480 | |
| 23.69 | 1.46 | 23.50 | 0.0427 | ||
| 22.28 | 1.24 | 22.08 | 0.0428 | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image and Signal Denoising Methods
Light Weight Color Image Warping with Inter-Channel Information
Abstract
Image warping is a necessary step in many multimedia applications such as texture mapping, image-based rendering, panorama stitching, image resizing and optical flow computation etc. Traditionally, color image warping interpolation is performed in each color channel independently. In this paper, we show that the warping quality can be significantly enhanced by exploiting the cross-channel correlation. We design a warping scheme that integrates intra-channel interpolation with cross-channel variation at very low computational cost, which is required for interactive multimedia applications on mobile devices. The effectiveness and efficiency of our method are validated by extensive experiments.
**Index Terms— ** Image warping, inter-channel correlation, Laplacian filtering, image enhancement
1 Introduction
Image warping is fundamental for a variety of multimedia applications such as image resizing (e.g, [1]), texture mapping (e.g., [2]), image-based rendering (e.g., [3]), panorama stitching (e.g., [4]), stereo reconstruction (e.g., [5]), optical flow computation (e.g., [6]), to name but a few. Image warping can be briefly described as transforming a source image into a target image under a geometric mapping , which can be linear, affine, perspective, non-parametric etc.[7]. Particularly, assuming has rows and columns, each pixel in the target image lattice has a correspondence image point in the source image, from which the intensity value is assigned to . The central problem of image warping is to estimate from pixels in the vicinity of , which are generally not on the integer grid, and hence the true intensity is not immediately available.
The most popular estimation techniques for image warping in real practice are nearest neighbour, bilinear and bicubic (spline) interpolation schemes [8], which do not adapt well to irregular image regions. Therefore, academic research effort has been mainly dedicated to edge guided interpolation, such as anisotropic filtering (e.g., [9], [10]) and variational regularization (e.g., [11]). Particularly on the sub-topic of Signal Image Super-resolution, i.e., in the special case that degenerates to a scaling function, mathematical models such as compressive sensing (e.g., [12]), dictionary learning (e.g., [13]) and Convolutional Neural Networks (CNN, e.g., [1]) have also been intensively investigated.
Although numerous image warping schemes have been proposed, they are mostly designed for monochrome images. This means that on warping color images, these methods should be applied to each channel independently (e.g., [14]) or to the luminance channel only (e.g.[15]), without exploiting the cross-channel correlation. The only exception is end-to-end CNNs that take RGB images as input. In this situation, the color correlation is captured during the training process, but the learned correlation is CNN specific rather than being general. In contrast, cross-channel correlation is thoroughly studied for Image Demosaicking, which recovers the full RGB images from subsampled color channels, with only one primary color component available at each pixel [16]. Although each channel can be recovered by merely using the intra-channel interpolation, it is commonly known that inter-channel correlation information can significantly improve the demosaicking accuracy.
In this paper, we integrate intra-channel image warping with cross-channel correlation, and show that our approach significantly improve the warping accuracy at very low cost. The proposed integration scheme is inspired by the Malvar-He-Cutler High Quality Linear Interpolation (HQLI) demosaicking method [17], which is probably the fastest demosaicking algorithm in the literature (except the Nearest Neighbour and Bilinear interpolation), but its trade-off between accuracy and speed is superior to many sophisticated algorithms [18]. We first examine the potential of cross-channel correlation for image warping, by applying HQLI to postprocess color images upsampled with intra-channel interpolation. We then generalize this combination to formulate general color image warping. The performance of our method is assessed in the scenario of image super-resolution, as it is the most active sub-area of image warping. Despite its very low cost, the proposed algorithm achieves comparable accuracy to state of the art algorithms at about times faster speed. Compared to CNN-based methods, our model has the flexibility of resizing the image to arbitrary size. These features are desirable for real time interactive applications on multimedia devices that have limited computation power (e.g., mobile smart phones).
2 Inter-Channel Correlation Gain
Image warping is generally implemented by backward mapping. Given the geometric mapping from the source image to the target image , and be its inverse mapping, maps the coordinate frame of back to the coordinate frame of . Image is generated in such a way that, for each pixel of , the intensity value is copied from , where
[TABLE]
As and are generally not integers, has to be estimated from the surrounding pixels. Let and be the floor integer parts of and , and define , . For example, the bilinear warping scheme estimates by
[TABLE]
Today, multimedia images are usually chromatic rather than being grayscale. Therefore, and generally have RGB three channels. Typically, bilinear warping for color images is to replace in Eq.6 with its color channels , and , to estimate , and , without cross-channel interaction.
The importance of cross-channel correlation is well investigated for the problem of Image Demosaicking, entailed in current consumer-level digital cameras, which commonly use Color Filter Arrays (CFA) to sample one of the RGB components for each pixel. Fig.1 shows an example of the Bayer pattern CFA, adopted for most digital cameras.
Define , and to be the sets that collect the pixels where the original red, green and blue values are captured respectively. Malvar-He-Cutler propose estimating the missing color values by a weighted combination of the local intra-channel average and the Laplacian of another channel [17]. For example, at a pixel , the green value is estimated by
[TABLE]
where constant is learned from training images, stands for the average of , , and , and
[TABLE]
The other missing color values are recovered in the same fashion. This demosaicking method, namely HQLI, adds second-order details (the Laplacian) of one channel to the average of another channel, thereby it performs image enhancement.
To examine whether the inter-channel information supplements intra-channel recovery in image warping. We conduct a preliminary experiment using image super-resolution as an example application. We upsample test images by bilinear and bicubic interpolation, and then refine the upsampled image values by sequential HQLI demosaicking. In particular, we first apply the GRBG CFA pattern to filter the upsampled color image, and estimate the filtered-out color values by the remaining color values. Subsequently, we apply the RGGB CFA to the demosaicked full RGB image, and perform the second-round demosaicking. We then change the CFA pattern to BGGR, and repeat HQLI demosaicking for the final round. In this way, each color component of a pixel is refined by the intensity values in the local neighbourhood of the other two channels. Table1 compares the reconstruction accuracy, measured by Peak Signal to Noise Ratio (PSNR), before and after demosaicking (i.e., the cross-channel information) on benchmark datasets Set5 [19], Set14 [20], BSD100 [21] and Urban100 [22]. It can be seen that the inter-channel correlation introduced by demosaicking notably improves the intra-channel interpolation, on all test datasets. Numerically, the improvement is upto dB and dB for bilinear and bicubic intra-channel interpolation respectively.
Note that to simulate the real world applications, here the low resolution images are obtained by Nearest Neighbour downsampling of the high resolution test images. This is different from the common SISR algorithm evaluation routine, which utilizes Bicubic downsampling. The Nearest Neighbour sampling is known to suffer severe aliasing and blurring, whereas Bicubic downsampling implicitly performs low pass filtering before sampling, thereby largely avoids such artifacts [16]. However, using Bicubic downsampling also implicitly assumes that the high resolution images are known, which is not the case for real applications. Therefore we only apply bicubic downsampling in the comparison experiments for fair conditions. Due to the coarse downsampling, bicubic warping has lower accuracy than bilinear warping, which is seemingly counter intuitive.
3 Cross-Channel Color Image Warping
3.1 General Formulation
Based on the analysis of the gain achieved by the inter-channel information, we formulate general color image warping by
[TABLE]
where , and can be any intra-channel estimation of , and ; also, , and can be estimated by interpolating , and respectively, where stands for the local neighbourhood of pixel . The involved interpolation schemes can be either linear or non-linear, isotropic or anisotropic, depending on the particular requirements of the applications at hand. The weights are to be learned from training images, as described in Section3.2.
3.2 Parameter Training
We illustrate the weight learning process using the Single Image Super Resolution (SISR) application as an example, for which the geometric mapping and its inverse mapping are
[TABLE]
where in a real number larger than . Note that is not restricted to be an integer. We downsample the groundtruth images as the source images (i.e., the data) by bicubic interpolation, following the traditional experimental setting in the literature of SISR, as the learned weights are to be used for algorithm evaluation in comparison with state of the art. We use the groundtruth images (taken from benchmark training dataset BSD200 [21]) as target images (i.e., the labels). For each pixel in the an image , the RGB values of its correspondence in thus are known in the training process. , and , as well as , and are computed on the pixels in each source image by interpolation function . Subsequently, Eq.9 can be rewritten as linear systems of the weights . For example,
[TABLE]
for Eq.9a. Here integer is the total number of randomly selected training pixels, indexed by the subscripts of , , and . Eq.9b and Eq.9c can be re-organized similarly. The Mean Square Error (MSE) solutions or Mean Absolute Error (MAE) solutions to the linear systems can be easily solved in closed-form. This yields the weights , ,,, and .
Seemingly, the learned weight values should be specific to the scaling factor and interpolation function . However, we observe that weights learned for large actually generalizes well to small . Therefore, we suggest conducting the training for a large scaling factor (e.g., ), and apply the learned weights directly to applications whose scaling factor is smaller.
Table2 lists the weights we learned from Eq.15 by linear regression for , with the interpolation function being Bilinear, Bicubic and Lanczos (with footprint).
4 Experimental Results
We conduct two sets of experiments to evaluate the proposed color image warping technique. The first experiment measures the SISR accuracy before and after incorporating the cross-channel terms to intra-channel warping, at various scaling factors and using different real time interpolation functions. The second experiments compares our technique with the state of the art SISR method Global Regression (GR) proposed by Timofte-De Smet-Van Gool in [23], using the released Matlab source code. The GR method is the fastest SISR method proposed in recent years to the best of our knowledge, and hence is chosen for comparison in our experiment. It should be mentioned that, the weight GR is significantly higher than our linear isotropic scheme used for testing.
For comparison in equal conditions, the low resolution test images are obtained by bicubicly downsampling the ground truth image at sampling factor . That is, the low resolution images have rows and columns. Laplacians are computed at each pixel in each channel of the low resolution images. Both the Laplacian maps and low resolution images are upsampled by the same interpolation function . In our experiments, is set to be 2, 3 or 4; and is set to be Bilinear, Bicubic or Lanczos (of kernel size ).
Evaluation is carried out on benchmark datasets Set5, Set14, BSD100 and Urban100. Beside the PSNR accuracy measure, we also report the Structure Similarity (SSIM) measure [24] and running time (in seconds) of the SISR algorithms. All experiments are conducted on an Intel Core i7-7700 3.60GHz CPU with 8GB RAM, using Matlab.
4.1 Improvement on Popular Warping Methods
We compare the performance of popular real time warping methods and the proposed method. Bilinear warping probably has the best trade-off between performance and speed in the literature, as pointed out by Zitová-Flusser [8]. Hence it is recommended for video and animation applications, such as frame registration for motion estimation in OpenCV and texture mapping in OpenGL. Bicubic warping is another frequently adopted option, especially for still image processing. For example, image editing softwares Photoshop and GIMP employ bicubic interpolation for image resizing and perspective view rectification.
Table3 presents the PSNR, SSIM and running time averaged over Set5 for various and . In this table, the plain intra-channel warping is referred to as “Independent”, and the proposed inter-channel color warping is referred to as “Correlated”. In all situations, the proposed color warping scheme achieves evidential improvement, either in terms of PSNR or SSIM, in real time computation. The most remarkable quantitative improvement is obtained for bilinear interpolation, the PSNR values of which are increased by 1.58dB, 0.77dB and 0.98dB for 2, 3 and 4 times upscaling respectively. It can be observed that the improvement decreases with the complexity of the interpolation function. However, in the case of Lanczos interpolation (the most complex function here), the accuracy of our color warping scheme for upscaling is probably sufficient in many applications, and its computation is merely in several milliseconds.
4.2 Comparison with State of the Art
To the best of our knowledge, among the state of the art SISR algorithms that have Matlab source code released, GR [23] is the fastest. We compare the accuracy and speed of our method with GR. In this experiment, we fix the interpolation function to be Lanczos. Table 4 shows the comparison, and Fig.2 shows a visual assessment. Relative to GR, the PSNR of our method is generally around dB lower for various scaling factors. However, on BSD100 the accuracy performance are very close. This might be attributed to the integration weights learned from BSD200, and suggests the potential for higher performance by using more sophisticated training strategy. It should be noted that the proposed warping model has merely 6 parameters, and the computation is tens of times faster.
5 Conclusion
In this paper, we have proposed a light weight color image warping technique, which integrates intra-channel interpolation with cross-channel details. Extensive experiments on four benchmark datasets have validated that, the proposed technique substantially improves the most popular image warping algorithms, in very simple form and at trivial computational cost. Our algorithm has also been shown to push the performance of traditional Lanczos upsampling scheme to be comparable with state of the art methods, while being real time.
The proposed technique can be readily applied to interactive multimedia applications that require both fast image warping and realistic visual effects, such as Computer Gaming and Image Editing. As our integration of cross-channel information is of light weight, it is suitable for multimedia devices with limited computation power. It should be noted that our baseline interpolation, integration formulation and weight training are all very basic. Actually, each step can be extended for higher performance, if the computation cost budget allows. Moreover, our method can also be used as an high quality initialization to many CNN-based methods.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew P Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, et al., “Photo-realistic single image super-resolution using a generative adversarial network.,” in IEEE Conference on Computer Vision and Pattern Recognition , 2017.
- 2[2] Mason Woo, Jackie Neider, Tom Davis, and Dave Shreiner, Open GL programming guide: the official guide to learning Open GL, version 1.2 , Addison-Wesley Longman Publishing Co., Inc., 1999.
- 3[3] Leonard Mc Millan, “Image-based rendering using image warping,” in COM-PUTER GRAPHICS Proceedings, Annual Conference Sefies, ACM SIG-GRAP, New Orleans, Louisiana , 2009, pp. 123–125.
- 4[4] Kaiming He, Huiwen Chang, and Jian Sun, “Rectangling panoramic images via warping,” ACM Transactions on Graphics (TOG) , vol. 32, no. 4, pp. 79, 2013.
- 5[5] Michael Bleyer, Margrit Gelautz, Carsten Rother, and Christoph Rhemann, “A stereo approach that handles the matting problem via image warping,” in Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on , 2009, pp. 501–508.
- 6[6] Jean-Yves Bouguet, “Pyramidal implementation of the affine lucas kanade feature tracker description of the algorithm,” Intel Corporation , vol. 5, no. 1-10, pp. 4, 2001.
- 7[7] Chris A Glasbey and Kantilal Vardichand Mardia, “A review of image-warping methods,” Journal of applied statistics , vol. 25, no. 2, pp. 155–171, 1998.
- 8[8] Barbara Zitova and Jan Flusser, “Image registration methods: a survey,” Image and vision computing , vol. 21, no. 11, pp. 977–1000, 2003.
