This is a Tensorflow implementation of Learning to See in the Dark in CVPR 2018

Last update: Jan 01, 2023

Related tags

Overview

Learning-to-See-in-the-Dark

This is a Tensorflow implementation of Learning to See in the Dark in CVPR 2018, by Chen Chen, Qifeng Chen, Jia Xu, and Vladlen Koltun.

Project Website
Paper

This code includes the default model for training and testing on the See-in-the-Dark (SID) dataset.

Demo Video

https://youtu.be/qWKUFK7MWvg

Setup

Requirement

Required python (version 2.7) libraries: Tensorflow (>=1.1) + Scipy + Numpy + Rawpy.

Tested in Ubuntu + Intel i7 CPU + Nvidia Titan X (Pascal) with Cuda (>=8.0) and CuDNN (>=5.0). CPU mode should also work with minor changes but not tested.

Dataset

Update Aug, 2018: We found some misalignment with the ground-truth for image 10034, 10045, 10172. Please remove those images for quantitative results, but they still can be used for qualitative evaluations.

You can download it directly from Google drive for the Sony (25 GB) and Fuji (52 GB) sets.

There is download limit by Google drive in a fixed period of time. If you cannot download because of this, try these links: Sony (25 GB) and Fuji (52 GB).

New: we provide file parts in Baidu Drive now. After you download all the parts, you can combine them together by running: "cat SonyPart* > Sony.zip" and "cat FujiPart* > Fuji.zip".

The file lists are provided. In each row, there are a short-exposed image path, the corresponding long-exposed image path, camera ISO and F number. Note that multiple short-exposed images may correspond to the same long-exposed image.

The file name contains the image information. For example, in "10019_00_0.033s.RAF", the first digit "1" means it is from the test set ("0" for training set and "2" for validation set); "0019" is the image ID; the following "00" is the number in the sequence/burst; "0.033s" is the exposure time 1/30 seconds.

Testing

Clone this repository.
Download the pretrained models by running

python download_models.py

Run "python test_Sony.py". This will generate results on the Sony test set.
Run "python test_Fuji.py". This will generate results on the Fuji test set.

By default, the code takes the data in the "./dataset/Sony/" folder and "./dataset/Fuji/". If you save the dataset in other folders, please change the "input_dir" and "gt_dir" at the beginning of the code.

Training new models

To train the Sony model, run "python train_Sony.py". The result and model will be save in "result_Sony" folder by default.
To train the Fuji model, run "python train_Fuji.py". The result and model will be save in "result_Fuji" folder by default.

Loading the raw data and proccesing by Rawpy takes significant more time than the backpropagation. By default, the code will load all the groundtruth data processed by Rawpy into memory without 8-bit or 16-bit quantization. This requires at least 64 GB RAM for training the Sony model and 128 GB RAM for the Fuji model. If you need to train it on a machine with less RAM, you may need to revise the code and use the groundtruth data on the disk. We provide the 16-bit groundtruth images processed by Rawpy: Sony (12 GB) and Fuji (22 GB).

Citation

If you use our code and dataset for research, please cite our paper:

Chen Chen, Qifeng Chen, Jia Xu, and Vladlen Koltun, "Learning to See in the Dark", in CVPR, 2018.

License

MIT License.

FAQ

Can I test my own data using the provided model?

The proposed method is designed for sensor raw data. The pretrained model probably not work for data from another camera sensor. We do not have support for other camera data. It also does not work for images after camera ISP, i.e., the JPG or PNG data.

Will this be in any product?

This is a research project and a prototype to prove a concept.

How can I train the model using my own raw data?

Generally, you just need to subtract the right black level and pack the data in the same way of Sony/Fuji data. If using rawpy, you need to read the black level instead of using 512 in the provided code. The data range may also differ if it is not 14 bits. You need to normalize it to [0,1] for the network input.

Why the results are all black?

It is often because the pre-trained model not downloaded properly. After downloading, you should get 4 checkpoint related files for the model.

Questions

If you have additional questions after reading the FAQ, please email to [email protected].

This is a Tensorflow implementation of Learning to See in the Dark in CVPR 2018

Related tags

Overview

Learning-to-See-in-the-Dark

Demo Video

Setup

Requirement

Dataset

Testing

Training new models

Citation

License

FAQ

Questions

Owner

Fast Scattering Transform with CuPy/PyTorch

Repository for the paper "Exploring the Sensory Spaces of English Perceptual Verbs in Natural Language Data"

Official Python implementation of the FuzionCoin protocol

Yolact-keras实例分割模型在keras当中的实现

A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes".

Transformer Tracking (CVPR2021)

PyTorch implementation of paper "IBRNet: Learning Multi-View Image-Based Rendering", CVPR 2021.

Anchor-free Oriented Proposal Generator for Object Detection

Python PID Tuner - Makes a model of the System from a Process Reaction Curve and calculates PID Gains

Exploration of some patients clinical variables.

Using Convolutional Neural Networks (CNN) for Semantic Segmentation of Breast Cancer Lesions (BRCA)

disentanglement_lib is an open-source library for research on learning disentangled representations.

Implementation of Gans

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Embracing Single Stride 3D Object Detector with Sparse Transformer

source code for 'Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge' by A. Shah, K. Shanmugam, K. Ahuja

Official TensorFlow code for the forthcoming paper

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition