Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Image Deraining"

Last update: Oct 17, 2022

Overview

SAPNet

This repository contains the official Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Image Deraining"

Updates:

Code will be updated before 2021/11/23 **Arxiv Link is available at https://arxiv.org/abs/2111.08892

Abstract

Deep learning algorithms have recently achieved promising deraining performances on both the natural and synthetic rainy datasets. As an essential low-level pre-processing stage, a deraining network should clear the rain streaks and preserve the fine semantic details. However, most existing methods only consider low-level image restoration. That limits their performances at high-level tasks requiring precise semantic information. To address this issue, in this paper, we present a segmentation-aware progressive network (SAPNet) based upon contrastive learning for single image deraining. We start our method with a lightweight derain network formed with progressive dilated units (PDU). The PDU can significantly expand the receptive field and characterize multi-scale rain streaks without the heavy computation on multi-scale images. A fundamental aspect of this work is an unsupervised background segmentation (UBS) network initialized with ImageNet and Gaussian weights. The UBS can faithfully preserve an image's semantic information and improve the generalization ability to unseen photos. Furthermore, we introduce a perceptual contrastive loss (PCL) and a learned perceptual image similarity loss (LPISL) to regulate model learning. By exploiting the rainy image and groundtruth as the negative and the positive sample in the VGG-16 latent space, we bridge the fine semantic details between the derained image and the groundtruth in a fully constrained manner. Comprehensive experiments on synthetic and real-world rainy images show our model surpasses top-performing methods and aids object detection and semantic segmentation with considerable efficacy.

Preparing Dataset

First, download training and testing dataset from either link BaiduYun OneDrive

Next, create new folders called dataset. Then create sub-folders called train and test under that folder. Finally, place the unzipped folders into ./datasets/train/ (training data) and ./datasets/test/ (testing data)

Training

Run the following script in terminal

python train.py

Testing

Run the following script in terminal

bash main.sh

Hyperparameters

General Hyperparameters

Name	Type	Default
preprocess	bool	False
batch_size	int	12
epochs	int	100
milestone	int	[30,50,80]
lr	float	0.001
save_path	str	logs/SAPNet/Model11
save_freq	int	1

Train/Test Hypeparameters

Name	Type	Default
test_data_path	str	datasets/test/Rain100H
output_path	str	results/Rain100H/Model11
data_path	str	datasets/train/RainTrainH
use_contrast	bool	True
use_seg_stage1	bool	True
use_stage1	bool	True
use_dilation	bool	True
recurrent_iter	int	6
num_of_SegClass	int	21

Contact

Please reach [email protected] for further questions. You can also open an issue (prefered) or a pull request in this Github repository

Acknowledgement

This repository is borrowed heavily from PreNet. Thanks for sharing!

Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Image Deraining"

Related tags

Overview

SAPNet

Updates:

Abstract

Preparing Dataset

Training

Testing

Hyperparameters

General Hyperparameters

Train/Test Hypeparameters

Contact

Acknowledgement

TODO List

Owner

Label-Free Model Evaluation with Semi-Structured Dataset Representations

Pytorch cuda extension of grid_sample1d

Implementation for paper: Self-Regulation for Semantic Segmentation

Block Sparse movement pruning

Implementation of " SESS: Self-Ensembling Semi-Supervised 3D Object Detection" (CVPR2020 Oral)

Spectral Temporal Graph Neural Network (StemGNN in short) for Multivariate Time-series Forecasting

RGB-D Local Implicit Function for Depth Completion of Transparent Objects

Provide baselines and evaluation metrics of the task: traffic flow prediction

MediaPipeのPythonパッケージのサンプルです。2020/12/11時点でPython実装のある4機能(Hands、Pose、Face Mesh、Holistic)について用意しています。

Official implementation of Densely connected normalizing flows

Advances in Neural Information Processing Systems (NeurIPS), 2020.

Negative Interactions for Improved Collaborative Filtering:

Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

Implementation of paper "DCS-Net: Deep Complex Subtractive Neural Network for Monaural Speech Enhancement"

ESTDepth: Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks (CVPR 2021)

OpenAi's gym environment wrapper to vectorize them with Ray

Object detection, 3D detection, and pose estimation using center point detection:

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

[ICLR'19] Trellis Networks for Sequence Modeling