Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Image Deraining"

Last update: Oct 17, 2022

Overview

SAPNet

This repository contains the official Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Image Deraining"

Updates:

Code will be updated before 2021/11/23 **Arxiv Link is available at https://arxiv.org/abs/2111.08892

Abstract

Deep learning algorithms have recently achieved promising deraining performances on both the natural and synthetic rainy datasets. As an essential low-level pre-processing stage, a deraining network should clear the rain streaks and preserve the fine semantic details. However, most existing methods only consider low-level image restoration. That limits their performances at high-level tasks requiring precise semantic information. To address this issue, in this paper, we present a segmentation-aware progressive network (SAPNet) based upon contrastive learning for single image deraining. We start our method with a lightweight derain network formed with progressive dilated units (PDU). The PDU can significantly expand the receptive field and characterize multi-scale rain streaks without the heavy computation on multi-scale images. A fundamental aspect of this work is an unsupervised background segmentation (UBS) network initialized with ImageNet and Gaussian weights. The UBS can faithfully preserve an image's semantic information and improve the generalization ability to unseen photos. Furthermore, we introduce a perceptual contrastive loss (PCL) and a learned perceptual image similarity loss (LPISL) to regulate model learning. By exploiting the rainy image and groundtruth as the negative and the positive sample in the VGG-16 latent space, we bridge the fine semantic details between the derained image and the groundtruth in a fully constrained manner. Comprehensive experiments on synthetic and real-world rainy images show our model surpasses top-performing methods and aids object detection and semantic segmentation with considerable efficacy.

Preparing Dataset

First, download training and testing dataset from either link BaiduYun OneDrive

Next, create new folders called dataset. Then create sub-folders called train and test under that folder. Finally, place the unzipped folders into ./datasets/train/ (training data) and ./datasets/test/ (testing data)

Training

Run the following script in terminal

python train.py

Testing

Run the following script in terminal

bash main.sh

Hyperparameters

General Hyperparameters

Name	Type	Default
preprocess	bool	False
batch_size	int	12
epochs	int	100
milestone	int	[30,50,80]
lr	float	0.001
save_path	str	logs/SAPNet/Model11
save_freq	int	1

Train/Test Hypeparameters

Name	Type	Default
test_data_path	str	datasets/test/Rain100H
output_path	str	results/Rain100H/Model11
data_path	str	datasets/train/RainTrainH
use_contrast	bool	True
use_seg_stage1	bool	True
use_stage1	bool	True
use_dilation	bool	True
recurrent_iter	int	6
num_of_SegClass	int	21

Contact

Please reach [email protected] for further questions. You can also open an issue (prefered) or a pull request in this Github repository

Acknowledgement

This repository is borrowed heavily from PreNet. Thanks for sharing!

Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Image Deraining"

Related tags

Overview

SAPNet

Updates:

Abstract

Preparing Dataset

Training

Testing

Hyperparameters

General Hyperparameters

Train/Test Hypeparameters

Contact

Acknowledgement

TODO List

Owner

A voice recognition assistant similar to amazon alexa, siri and google assistant.

FastFace: Lightweight Face Detection Framework

Code for "NeRS: Neural Reflectance Surfaces for Sparse-View 3D Reconstruction in the Wild," in NeurIPS 2021

Instant Real-Time Example-Based Style Transfer to Facial Videos

Use AI to generate a optimized stock portfolio

Anomaly detection analysis and labeling tool, specifically for multiple time series (one time series per category)

This is a demo app to be used in the video streaming applications

✨✨✨An awesome open source toolbox for stereo matching.

We have implemented shaDow-GNN as a general and powerful pipeline for graph representation learning. For more details, please find our paper titled Deep Graph Neural Networks with Shallow Subgraph Samplers, available on arXiv (https//arxiv.org/abs/2012.01380).

Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)

A Python package for generating concise, high-quality summaries of a probability distribution

using yolox+deepsort for object-tracker

This is the official implementation code repository of Underwater Light Field Retention : Neural Rendering for Underwater Imaging (Accepted by CVPR Workshop2022 NTIRE)

Deep Learning GPU Training System

Official code of "Mitigating the Mutual Error Amplification for Semi-Supervised Object Detection"

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

Multi-objective gym environments for reinforcement learning.

Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"

Asymmetric Bilateral Motion Estimation for Video Frame Interpolation, ICCV2021

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)