Flash and Ambient Illuminations Dataset
The Dataset
The Flash and Ambient Illuminations Dataset (FAID) consists of aligned flash-only and ambient-only illumination pairs captured with mobile devices by many crowd-workers participated in our collection effort. This dataset accompanies our ECCV 2018 paper.
Capture Setup
The photographs were captured in 12-bit raw and jpeg formats using iPhone 6s and iPhone 7 devices by the crowd-workers we reached through the Amazon's Mechanical Turk platform. The raw images were used to generate and represent the flash and ambient illuminations. The jpeg images proccesed by iPhones are provided as well in the dataset. In our setup, the flash and no-flash photograph pairs were captured in burst, typically 0.5-1.0 seconds apart. The two photographs were then aligned via homography estimations, cropped and resized to 1080x1440 resolution. The aligned raw flash and no-flash photographs are then used to compute the flash-only illumination. The photographs and their alignment went through a review process before getting included in the dataset.
Scenes and Illumination Conditions
The scenes are typically well-illuminated indoor spaces, without any visible direct light sources. The objects and materials otherwise has a wide variety. We defined the scenes to be indoors for a reliable flash illumination estimation, and well-illuminated in order to get a useful ambient illumination as well. As two illuminations are provided separately, the ambient illumination can be made dimmer and added to the flash illumination using the raw values to create a typical flash photograph taken in a dark envrionment.
Categories
The dataset is divided into 6 categories: People, Shelves, Plants, Toys, Rooms, and Objects. The first 5 categories were defined in order to give the crowd-workers a starting point when they are photographing, and the Objects category was then defined to include images that did not fit well in any of the defined categories.
Color Mapping
The illuminations are provided as raw sensor readings, and the corresponding exif information are available in the dataset as separate files. The illuminations can be mapped to first XYZ and then RGB color spaces via the color matrix. The color matrix and the calibration illuminant we used are available in the exif data of the PNG files as well as the .mat files with the complete exif information. A sample code in MATLAB for the color mapping is available here. Note that even after the color mapping, the images will not match the colors of the jpeg photographs we provide, as the jpeg photographs went through the pre-processing pipeline of the cameras.
File Description
We provide the illumination pairs in 16-bit PNG files. These files already have the color matrix and calibration illuminant information necessary for the color mapping. The full exif information originally attached to the DNG files are provided as .mat files separately. We also provide the flash and no-flash photographs processed by the iPhones. Note that the flash photograph in this case includes both flash and ambient illuminations.
License and Credit
We provide this dataset for research purposes only, and it is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. We hereby give the rights to use the images in figures and supplementary material of scientific publications. If you use this dataset, please cite our paper:
author={Ya\u{g}{\i}z Aksoy and Changil Kim and Petr Kellnhofer and Sylvain Paris and Mohamed Elgharib and Marc Pollefeys and Wojciech Matusik},
booktitle={Proc. ECCV},
title={A Dataset of Flash and Ambient Illumination Pairs from the Crowd},
year={2018},
}
Included in our extended dataset
–
We combine and extend existing flash/no-flash datasets to create a diverse set of real-world flash/ambient pairs suitable for training deep networks. Our dataset is constructed from three existing datasets: The Multi-Illumination Dataset (MID), The Flash and Ambient Illuminations Dataset (FAID), and the Deep Flash Portrait Dataset (DPD). We propose a pipeline for compositing portraits from DPD onto backgrounds from FAID, and discuss multiple considerations for normalizing and augmenting flash/no-flash data.
This dataset accompanies our CVPR 2023 paper.
@INPROCEEDINGS{Maralan2023Flash,
author={Sepideh Sarajian Maralan and Chris Careaga and Ya\u{g}{\i}z Aksoy}, title={Computational Flash Photography through Intrinsics}, journal={Proc. CVPR}, year={2023}, } |
Files
Sample Set (1GB) | |
The sample set includes illuminations, photographs and EXIF files for 60 pairs, 10 from each category. | |
Illuminations | Photographs |
The main dataset | Processed photographs |
People (4.3GB) | People (1.3GB) |
Shelves (5.4GB) | Shelves (1.7GB) |
Plants (3.2GB) | Plants (1.1GB) |
Toys (5.3GB) | Toys (1.7GB) |
Rooms (10.3GB) | Rooms (3.0GB) |
Objects (7.5GB) | Objects (2.3GB) |
EXIF info | |
EXIFs (32MB) |
ECCV Publication