TL;DR
This paper introduces a comprehensive dataset of 24 scenes under 18 different lighting conditions, designed for evaluating colour constancy and segmentation algorithms in realistic scenarios.
Contribution
It provides a diverse, annotated dataset with spectral data and ground truth for benchmarking colour research and computer vision tasks.
Findings
Dataset covers varied spectral colours, intensities, and distances.
Includes pixel-level ground truth for colour segmentation.
Facilitates evaluation of colour constancy algorithms.
Abstract
We present a collection of 24 multiple object scenes each recorded under 18 multiple light source illumination scenarios. The illuminants are varying in dominant spectral colours, intensity and distance from the scene. We mainly address the realistic scenarios for evaluation of computational colour constancy algorithms, but also have aimed to make the data as general as possible for computational colour science and computer vision. Along with the images of the scenes, we provide spectral characteristics of the camera, light sources and the objects and include pixel-by-pixel ground truth annotation of uniformly coloured object surfaces thus making this useful for benchmarking colour-based image segmentation algorithms. The dataset is freely available at https://github.com/visillect/mls-dataset.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20
Figure 21
Figure 22
Figure 23
Figure 24
Figure 25
Figure 26
Figure 27
Figure 28
Figure 29
Figure 30
Figure 31
Figure 32
Figure 33
Figure 34
Figure 35
Figure 36
Figure 37
Figure 38
Figure 39
Figure 40Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Multiple Light Source Dataset for Colour Research
Anna Smagina\supit1
Egor Ershov\supit1
Anton Grigoryev\supit1, 2 \skiplinehalf\supit1Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of Sciences
\supit2Moscow Institute of Physics and Technology (State University)
Abstract
We present a collection of 24 multiple object scenes recorded under 18 multiple light source illumination scenarios each. The illuminants are varying in dominant spectral colours, intensity and distance from the scene. We mainly address the realistic scenarios for evaluation of computational colour constancy algorithms, but also have aimed to make the data as general as possible for computational colour science and computer vision. Along with the images, we provide also spectral characteristics of the camera, light sources, and the objects and include pixel-by-pixel ground truth annotation of uniformly coloured object surfaces. The dataset is freely available at https://github.com/visillect/mls-dataset.
keywords:
computer vision, colour constancy, illuminant estimation, multiple light source
1 Introduction
In this paper we describe a new laboratory dataset mainly designed for evaluation of computational colour constancy[1, 2]. 39 years have already passed since the GrayWorld method invention [3], which is one of the first and the most popular colour constancy approaches. Colour constancy algorithms find their application in such tasks as face identification[4], illumination-invariant object tracking [5], shadow suppression for image segmentation[6]. The research activity in this topic is not declining even now, which is largely due to the increase in available computing power and the to development of technologies for creating better optical sensors. One can learn more about the history of the colour constancy research up to from the work [1].
Indicative of the relevance of colour constancy to modern science is the fact that in the period from to the present, on average, each year one colour constancy dataset was published. Such datasets could be essentially be divided into two types: ones collected in the laboratory and ones collected under uncontrolled conditions. And while the latter are usually created to evaluate existing algorithms under real-life conditions, the former are more likely to accurately evaluate the quality of existing solutions, as well as to determine solvability of tasks that have not yet been solved. The list of the latter should include such problems as the colour estimation of multiple light sources (about ten works have already been written on this topic, in particular [7]), restoring the position of light sources in the camera coordinate system from an image, creating low-parametric spectral models for describing colour transformations [8], and others.
Datasets collected under natural and uncontrolled conditions, for example, REC [9], CubePlus [10], GrayBall [11], NUS [12], are poorly suited for quality evaluation of problems listed above. The estimation of source colour in such datasets is performed using the calibration object (or objects) illuminated by several light sources simultaneously. But since its proportions are unknown, the estimation of the source colour is performed inaccurately. The same is true for sets of optical multichannel images [13, 14, 15]. Although they allow not only a more accurate colorimetric algorithms evaluation, but also an augmentation of lighting. For example, [16] proposes a method for synthesis images such that different parts of the scene are illuminated by various sources.
More precisely, the illuminant estimation can be performed either via increasing the number of calibration objects in the scene, for example, this is done in [17], or, more accurately but more time-consuming, by measuring the emission spectra of illuminants (our method). While forming this dataset, we were focusing precisely on accurate measuring of all characteristics in the scene, both spectra and the location of objects. In the context of the tasks mentioned above, the following requirements were defined for the dataset:
- •
The presence of different light sources (different chromaticities) with known spectral characteristics;
- •
The presence of scenes illuminated by one or several light sources of various types (spot and diffuse);
- •
Various object shapes and materials—metals (preferably chromatic), dielectrics, flat / non-flat (important for evaluation of algorithms based on reflectance models [18, 19]);
- •
Known spectral characteristics of the objects reflectance.
- •
Known camera characteristics and linear images.
Among the published laboratory datasets [20, 21, 22, 23] there is no one that would simultaneously satisfy all the items on the list, which served as the motivation for the construction of the one described in this paper. Although, collected data is also usefull for colour-based image segmentation, validation of spectral colour models, and other colour vision problems.
2 Scene setup
The setup used to acquire the image collection consists of a softbox (FALCON EYES PBF-60AB), 4 different light sources and a camera, see Fig. 1. The halogen lights are mounted at equal distances (1.5 m) from two opposite sides of the softbox to produce a diffuse lighting. In immediate vicinity of the softbox the desk and the LED lamps are placed to produce close lighting. Objects are positioned manually inside the of the softbox. The camera is pointing on the scene through the open side of the softbox.
The halogen lamps are 35 W each with similar light spectra. Duration of the whole experimental procedure (setting-up, calibrating, and recording the database, approx. 100 hours) was only a fraction of the average life time of the lamps (about 1000 hours), hence ageing effects affecting colour temperature can be considered minimal. The brightness of the incandescent lamp (60W) was adjustable and was set to 30% of the maximum. The RGB LED strip is under computer control via in-house made USB controller, which allows to tune the emitting power of red, green and blue LEDs individually.
Images are recorded with Canon 5D Mark III camera equipped with Canon EF 24–70 mm f/2.8L II USM lens. The shooting was performed in a RAW format with 5760 x 3840 resolution in the manual mode. The camera settings was adjusted under the maximal bright in the experiment illumination in the way that avoids over-exposures. Zoom was switched off. Focal length was set to 25 mm, ISO speed to 320. Aperture was closed to F/16 to achieve sharpness across most of the scene. The exposure time was set to 1 second and the 2 second delay was used for shooting. The in-camera white balance was adjusted to 6500K. The spectral responses for Canon 5D Mark III was measured by Baek et al. in [24]. To extract this data published as figures we use Web Plot Digitizer [25]. The resulting csv-files are provided along with the dataset.
3 Objects
To compose the scenes we used objects without colour-textured surfaces and mirroring effects to provide clear colouring annotation (section 4) and spectral characteristics (section 5). The objects represent 122 surfaces of various materials and shapes. These include 88 dielectric surfaces (both matte and glance), 6 achromatic and 10 chromatic metals. Also, we also recorded a scene with DGK Color Tools WDKK Color Chart consisting of 18 patches with known colors (fig. 2).
4 Image data
The scenes were composed from multiple objects placed on a coloured background. The illumination of each scene was varied in 18 configurations, which are shown at figure 2 and include the following combinations of light sources used for shooting:
- •
2HAL — two halogen lights only,
- •
2HAL_DESK – two halogen lights with the desktop lamp,
- •
2HAL_DESK_LED-{R, RG, BG, B}{025, 050, 075, 100} — two halogen lights with the desktop lamp and red (R), blue (B), red and green (RG) or blue and green (BG) lights of the LED strip turned on at 25%, 50%, 70%, and 100% of the emitting power correspondingly.
Each image is provided in 3 formats: a full-resolution raw (single-channel RGGB Bayer array) image cropped and converted to 16-bit PNG format from the camera-specific raw CR2 image, a half-resolution colour PNG (in linear sensor-specific RGB coordinates) image obtained by naive demosaicing algorithm (the camera sensor has a built-in optical low-pass filter, so no moire artifacts), and a quarter-resolution colour JPEG image for preview. The raw and 16-bit colour images are presented without any auto-contrast or range stretching and may appear very dark when viewed in ordinary software, however they contain all information to reconstruct the full-colour images as given by previews.
Each scene is accompanied with the pixel-wise annotation masks of uniformly coloured object surfaces. The masks are given in a 8-bit PNG file, where pixels corresponding to uniformly coloured object surface have the same, unique on the mask, colour. For annotation purposes, images were first automatically split into small regions (but not less than 100 pixels) with guaranteed colour constancy using the algorithm given in [26]. Then, regions were manually merged in accordance to uniform colouring of the scene. Each such region is associated with a reflectance spectrum (section 5). Same as with the scene images, the colouring annotation is provided in full-, half- and quarter-resolutions.
5 Spectral data
In addition to colour images under various lighting conditions, we also collected a set of spectral measurements, both for illuminant emission and for object reflectance. The measurement was conducted using a miniature spectrometer OceanOptics FLAME-S-VIS-NIR-ES with a corresponding set of accessories from OceanOptics, including two light sources (HL-2000 for reflectance measurement and HL-3P-CAM for radiometric calibration for illuminant characterization) and a reflectance standard WS-1.
The source spectra were collected for all individual sources (two halogen lamps, the desk lamp and the 3 components of the RGB LED lamp) measured through the softbox, to characterise the real spectra illuminating the objects. Also, the spectrum of both halogen lamps measured near the camera lens to get some information on relative spectral intensities. The dataset is accompanied by the photos illustrating the spectral measurement conditions.
The measurements were taken with built-in non-linearity correction and calibrated for the uniform sensitivity for all wavelengths using the radiometrically calibrated light source. The dark room background was taken into account and subtracted, however, some small non-uniformity (e.g. imperfectly blinking indicator lights on) can be visible in the resulting spectra. The chromaticities of illuminants relative to our camera are shown in the figure 4a.
The reflectance spectra were measured relative to the polytetrafluoroethylene standard (Ocean Optics WS-1). The reflection probe holder was used to control the angle of the measurement. measurement angle was used for all materials, the specular component of metallic or metallised surfaces was also measured with probe perpendicular to the surface. For each object the spectra for all major constituent materials visible in the scene were measured. Chromacities of all surfaces taken at measurement angle are shown in the figure 4b. The materials are specified by a unique string (e.g. “red_plastic_ball” or “brown_metallic_doorknob”) containing the colour, the material and the object identification.
6 Consistency of spectral and image data
Here we demonstrate the expected accuracy in illuminant or surface chromaticity estimation using this data, to provide a better support for benchmarking of colour vision algorithms.
For demonstration we used the image of the colour chart taken under 2HAL illumination (Fig. 2). For each patch we calculated the chromaticity in a two ways: first, using measured spectral data (surface spectrum, illuminant spectrum and camera spectral responses), second, directly from the RGB image – observed chromaticities. The former was obtained in terms of Lambertian image formation model. The result is given in the figure 5a. As it could be seen, the chromaticity coordinates of white and neutral patches are densely grouped together, while the coordinates of light blue (on the left) and red, brown, and purple patches (on the ride side of the chromaticity distribution) matches poorly.
Miscalculations, shown in figure 5a, could be at least partially explained by linear miscalculations in camera spectral sensitivity and lighting setup. Considering, that the chomaticities across a linear change in capture condition (light color, shading and imaging device) are homography apart [27], we applied better fitting of observed chromaticity coordinates to the measured ones. As it shown in figure 5b, the homography transform of observed chromaticitiy coordinates provides visibly better correspondence – the measured and observed coordinates on the left and the right side of the chromaticity distribution now more closely located to each other. To estimate planar homography matrix we conduct fitting above all of the patches chromaticity coordinates.
However, some inaccuracies are still remain, which are probably caused by non-linear effects in image noise – such as, for example, because of the colour chart was glance, not matte. Also note, that in this work we did not measured the camera responses by ourselves (section 2).
7 Conclusion
We present a collection of 24 scenes images recorded under 18 different multiple light source illuminations. The main feature of the dataset is a completeness, containing spectral data both for illuminants and for object surfaces. The dataset mainly offers a evaluation for computation colour constancy, but also can be used in variety of computer vision involving a colour- or material based image segmentation, photometric invariant extraction and others.
ACKNOWLEDGEMENTS
This work is partially supported by Russian Foundation for Basic Research (projects 17-29-03514 and 17-29-03370).
We are grateful to Alexander Belokopytov for his help in constructing the acquisition setup. We thank Dmitry Nikolaev for his valuable comments in experimental design and result verification. We thank Alexey Glikin for the technical help. For contributing the objects, we acknowledge Artem Sereda, Sergey Gladilin, Dmitry Bocharov.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Gijsenij, T. Gevers, and J. Van De Weijer, “Computational color constancy: Survey and experiments,” IEEE Transactions on Image Processing 20 (9), 2475–2489 (2011).
- 2[2] P. Nikolayev, “About new estimation methods for primary illuminant chromaticity in colour constancy algorithms,” Sensory systems 21 (1), 29–44 (2007). In Russian.
- 3[3] G. Buchsbaum, “A spatial processor model for object colour perception,” Journal of the Franklin institute 310 (1), 1–26 (1980).
- 4[4] S. Sengupta, A. Kanazawa, C. D. Castillo, and D. W. Jacobs, “Sfsnet: Learning shape, reflectance and illuminance of faces in the wild’,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 6296–6305 (2018).
- 5[5] V. Agarwal, B. R. Abidi, A. Koschan, and M. A. Abidi, “An overview of color constancy algorithms,” Journal of Pattern Recognition Research 1 (1), 42–54 (2006).
- 6[6] H. Zhou, Y. Gao, G. Yuan, and R. Ji, “Moving cast shadow resistant for foreground segmentation based on shadow properties analysis,” in Eighth International Conference on Machine Vision (ICMV 2015) , 9875 , 98750 B, International Society for Optics and Photonics (2015).
- 7[7] S. Bianco, C. Cusano, and R. Schettini, “Single and multiple illuminant estimation using convolutional neural networks,” IEEE Transactions on Image Processing 26 (9), 4347–4362 (2017).
- 8[8] N. Gusamutdinova, E. Ershov, S. Gladilin, and D. Nikolaev, “Verification of applicability two multiplicative closed spectral models for multiple reflection effect description,” in 2016 International Conference on Robotics and Machine Vision , 10253 , 1025305, International Society for Optics and Photonics (2017).
