Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Last update: Sep 26, 2022

Related tags

Overview

Unsupervised Image Denoising with Frequency Domain Knowledge (BMVC 2021 Oral) : Official Project Page

This repository provides the official PyTorch implementation of the following paper:

Unsupervised Image Denoising with Frequency Domain Knowledge

Nahyun Kim* (KAIST), Donggon Jang* (KAIST), Sunhyeok Lee (KAIST), Bomi Kim (KAIST), and Dae-Shik Kim (KAIST) (*The authors have equally contributed.)

BMVC 2021, Accepted as Oral Paper.

Abstract: Supervised learning-based methods yield robust denoising results, yet they are inherently limited by the need for large-scale clean/noisy paired datasets. The use of unsupervised denoisers, on the other hand, necessitates a more detailed understanding of the underlying image statistics. In particular, it is well known that apparent differences between clean and noisy images are most prominent on high-frequency bands, justifying the use of low-pass filters as part of conventional image preprocessing steps. However, most learning-based denoising methods utilize only one-sided information from the spatial domain without considering frequency domain information. To address this limitation, in this study we propose a frequency-sensitive unsupervised denoising method. To this end, a generative adversarial network (GAN) is used as a base structure. Subsequently, we include spectral discriminator and frequency reconstruction loss to transfer frequency knowledge into the generator. Results using natural and synthetic datasets indicate that our unsupervised learning method augmented with frequency information achieves state-of-the-art denoising performance, suggesting that frequency domain information could be a viable factor in improving the overall performance of unsupervised learning-based methods.

Requirements

To install requirements:

conda env create -n [your env name] -f environment.yaml
conda activate [your env name]

To train the model

Synthetic Noise (AWGN)

Download DIV2K dataset for training in here
Randomly split the DIV2K dataset into Clean/Noisy set. Please refer the .txt files in split_data.
Place the splitted dataset(DIV2K_C and DIV2K_N) in ./dataset directory.

dataset
└─── DIV2K_C
└─── DIV2K_N
└─── test

Use gen_dataset_synthetic.py to package dataset in the h5py format.
After that, run this command:

sh ./scripts/train_awgn_sigma15.sh # AWGN with a noise level = 15
sh ./scripts/train_awgn_sigma25.sh # AWGN with a noise level = 25
sh ./scripts/train_awgn_sigma50.sh # AWGN with a noise level = 50

After finishing the training, .pth file is stored in ./exp/[exp_name]/[seed_number]/saved_models/ directory.

Real-World Noise

Download SIDD-Medium Dataset for training in here
Radnomly split the SIDD-Medium Dataset into Clean/Noisy set. Please refer the .txt files in split_data.
Place the splitted dataset(SIDD_C and SIDD_N) in ./dataset directory.

dataset
└─── SIDD_C
└─── SIDD_N
└─── test

Use gen_dataset_real.py to package dataset in the h5py format.
After that, run this command:

sh ./scripts/train_real.sh

After finishing the training, .pth file is stored in ./exp/[exp_name]/[seed_number]/saved_models/ directory.

To evaluate the model

Synthetic Noise (AWGN)

Download CBSD68 dataset for evaluation in here
Place the dataset in ./dataset/test directory.

dataset
└─── train
└─── test
     └─── CBSD68
     └─── SIDD_test

After that, run this command:

sh ./scripts/test_awgn_sigma15.sh # AWGN with a noise level = 15
sh ./scripts/test_awgn_sigma25.sh # AWGN with a noise level = 25
sh ./scripts/test_awgn_sigma50.sh # AWGN with a noise level = 50

Real-World Noise

Download the SIDD test dataset for evaluation in here
Place the dataset in ./dataset/test directory.

dataset
└─── train
└─── test
     └─── CBSD68
     └─── SIDD_test

After that, run this command:

sh ./scripts/test_real.sh

Pre-trained model

We provide pre-trained models in ./checkpoints directory.

checkpoints
|   AWGN_sigma15.pth # pre-trained model (AWGN with a noise level = 15)
|   AWGN_sigma25.pth # pre-trained model (AWGN with a noise level = 25)
|   AWGN_sigma50.pth # pre-trained model (AWGN with a noise level = 50)
|   SIDD.pth # pre-trained model (Real-World noise)

Acknowledgements

This code is built on U-GAT-IT,CARN, SSD-GAN. We thank the authors for sharing their codes.

Contact

If you have any questions, feel free to contact me ([email protected])

Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Related tags

Overview

Unsupervised Image Denoising with Frequency Domain Knowledge (BMVC 2021 Oral) : Official Project Page

Requirements

To train the model

Synthetic Noise (AWGN)

Real-World Noise

To evaluate the model

Synthetic Noise (AWGN)

Real-World Noise

Pre-trained model

Acknowledgements

Contact

Owner

Donggon Jang

Moving Object Segmentation in 3D LiDAR Data: A Learning-based Approach Exploiting Sequential Data

The source codes for TME-BNA: Temporal Motif-Preserving Network Embedding with Bicomponent Neighbor Aggregation.

Repositorio oficial del curso IIC2233 Programación Avanzada 🚀✨

This project aims to be a handler for input creation and running of multiple RICEWQ simulations.

Companion code for the paper Theoretical characterization of uncertainty in high-dimensional linear classification

An open source object detection toolbox based on PyTorch

Deep Reinforcement Learning for Keras.

Code for our paper "Multi-scale Guided Attention for Medical Image Segmentation"

FaRL for Facial Representation Learning

MassiveSumm: a very large-scale, very multilingual, news summarisation dataset

Job Assignment System by Real-time Emotion Detection

This repository is an open-source implementation of the ICRA 2021 paper: Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling.

Jupyter notebooks for using & learning Keras

Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds (CVPR 2022, Oral)

Video2x - A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR.

Solving reinforcement learning tasks which require language and vision

Heterogeneous Deep Graph Infomax

Using NumPy to solve the equations of fluid mechanics together with Finite Differences, explicit time stepping and Chorin's Projection methods

[AAAI2021] The source code for our paper 《Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion》.

Json2Xml tool will help you convert from json COCO format to VOC xml format in Object Detection Problem.