Official code for "Stereo Waterdrop Removal with Row-wise Dilated Attention (IROS2021)"

Last update: Oct 01, 2022

Related tags

Deep Learning Stereo-Waterdrop-Removal

Overview

Stereo-Waterdrop-Removal-with-Row-wise-Dilated-Attention

This repository includes official codes for "Stereo Waterdrop Removal with Row-wise Dilated Attention (IROS2021)".

Stereo Waterdrop Removal with Row-wise Dilated Attention
Zifan Shi, Na Fan, Dit-Yan Yeung, Qifeng Chen
HKUST

[Paper] [Datasets]

Introduction

Existing vision systems for autonomous driving or robots are sensitive to waterdrops adhered to windows or camera lenses. Most recent waterdrop removal approaches take a single image as input and often fail to recover the missing content behind waterdrops faithfully. Thus, we propose a learning-based model for waterdrop removal with stereo images. A real-world dataset that contains stereo images with and without waterdrops is provided to benefit the related research.

Installation

Clone this repo.

git clone https://github.com/VivianSZF/Stereo-Waterdrop-Removal.git
cd Stereo-Waterdrop-Removal/

We have tested our code on Ubuntu 18.04 LTS with PyTorch 1.6.0 and CUDA 10.2. Please install dependencies by

conda env create -f environment.yml

Datasets

The dataset can be downloaded from the link.

'train', 'val' and 'test' refer to training set, validation set and test set captured by ZED 2. 'test_mynt' contains test images from MYNT EYE camera. In each folder, '000' denotes the waterdrop-free image (Ground truth). 'xxx_0' is the left image while 'xxx_1' is the right image. The dataset can be put under the 'dataset' folder.

Training

The arguments for training are listed in train.py. To train the model, run with the following code

sh train.sh

The checkpoints and the validation ressults will be saved into ./result/{exp_name}/train/.

Test

Download the pretrained checkpoints and put them under ./result/{exp_name}/train/. The arguments for test are listed in test.py. You can specify them in test.sh and run the command

sh test.sh

The output images are available under ./result/{exp_name}/test/

Citation

@inproceedings{shi2021stereo,
  title = {Stereo Waterdrop Removal with Row-wise Dilated Attention},
  author = {Shi, Zifan and Fan, Na and Yeung, Dit-Yan and Chen, Qifeng},
  booktitle = {IROS},
  year = {2021}
}

Official code for "Stereo Waterdrop Removal with Row-wise Dilated Attention (IROS2021)"

Related tags

Overview

Stereo-Waterdrop-Removal-with-Row-wise-Dilated-Attention

Introduction

Installation

Datasets

Training

Test

Citation

Owner

[ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models

This is a simple face recognition mini project that was completed by a team of 3 members in 1 week's time

Source code for paper: Knowledge Inheritance for Pre-trained Language Models

Official Implementation of Domain-Aware Universal Style Transfer

A Weakly Supervised Amodal Segmenter with Boundary Uncertainty Estimation

Code for our NeurIPS 2021 paper 'Exploiting the Intrinsic Neighborhood Structure for Source-free Domain Adaptation'

Project code for weakly supervised 3D object detectors using wide-baseline multi-view traffic camera data: WIBAM.

Robustness between the worst and average case

Official implementation for "Symbolic Learning to Optimize: Towards Interpretability and Scalability"

SymPy-powered, Wolfram|Alpha-like answer engine totally in your browser, without backend computation

Learning 3D Part Assembly from a Single Image

Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.

An implementation of Equivariant e2 convolutional kernals into a convolutional self attention network, applied to radio astronomy data.

Justmagic - Use a function as a method with this mystic script, like in Nim

这是一个yolox-pytorch的源码，可以用于训练自己的模型。

Attack on Confidence Estimation algorithm from the paper "Disrupting Deep Uncertainty Estimation Without Harming Accuracy"

PyTorch implementation of Algorithm 1 of "On the Anatomy of MCMC-Based Maximum Likelihood Learning of Energy-Based Models"

Cross-modal Deep Face Normals with Deactivable Skip Connections

This repo provides a demo for the CVPR 2021 paper "A Fourier-based Framework for Domain Generalization" on the PACS dataset.

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT