MoCoPnet - Deformable 3D Convolution for Video Super-Resolution

Last update: Dec 15, 2022

Related tags

Deep Learning MoCoPnet

Overview

Deformable 3D Convolution for Video Super-Resolution

Pytorch implementation of local motion and contrast prior driven deep network (MoCoPnet). [PDF]

Overview

Requirements

Python 3
pytorch >= 1.6
numpy, PIL

Datasets

Training & test datasets

Download SAITD dataset.

SAITD dataset is a large-scale high-quality semi-synthetic dataset of infrared small target. We employ the 1st-50th sequences with target annotations as the test datasets and the remaining 300 sequences as the training datasets.

Download Hui and Anti-UAV.

Hui and Anti-UAV datasets are used as the test datasets to test the robustness of our MoCoPnet to real scenes. In Anti-UAV dataset, only the sequences with infrared small target (i.e., The target size is less than 0.12% of the image size) are selected as the test set (21 sequences in total). Note that, we only use the first 100 images of each sequence for test to balance computational/time cost and generalization performance.

For simplicity, you can also Download the test datasets in https://pan.baidu.com/s/1oobhklwIChvNJIBpTcdQRQ?pwd=1113 and put the folder in code/data.

Data format:

The training dataset is in code/data/train/SAITD.

train
  └── SAITD
       └── 1
              ├── 0.png
              ├── 1.png
              ├── ...
       └── 2
              ├── 00001
              ├── 00002
              ├── ...		
       ...

The test datasets are in code/data/test as below:

 test
  └── dataset_1
         └── scene_1
              ├── 0.png  
              ├── 1.png  
              ├── ...
              └── 100.png    
               
         ├── ...		  
         └── scene_M
  ├── ...    
  └── dataset_N

Results

Quantitative Results of SR performance

Table 1. PSNR/SSIM achieved by different methods.

Table 2. SNR and CR results of different methods achieved on super-resolved LR images and super-resolved HR images.

Qualitative Results of SR performance

Figure 1. Visual results of different SR methods on LR images for 4x SR.

Figure 2. Visual results of different SR methods on LR images for 4x SR.

Quantitative Results of detection

Table 3. Quantitative results of Tophat, ILCM, IPI achieved on super-resolved LR images.

Table 4. Quantitative results of Tophat, ILCM, IPI achieved on super-resolved HR images.

Figure 3. ROC results of Tophat, ILCM and IPI achieved on super-resolved LR images.

Figure 4. ROC results of Tophat, ILCM and IPI achieved on super-resolved HR images.

Qualitative Results of detection

Figure 5. Qualitative results of super-resolved LR image and detection results.

Figure 6. Qualitative results of super-resolved HR image and detection results.

Citiation

@article{MoCoPnet,
  author = {Ying, Xinyi and Wang, Yingqian and Wang, Longguang and Sheng, Weidong and Liu, Li and Lin, Zaipin and Zhou, Shilin},
  title = {MoCoPnet: Exploring Local Motion and Contrast Priors for Infrared Small Target Super-Resolution},
  journal={arXiv preprint arXiv:2201.01014},
  year = {2020},
}

Contact

Please contact us at [email protected] for any question.

MoCoPnet - Deformable 3D Convolution for Video Super-Resolution

Related tags

Overview

Deformable 3D Convolution for Video Super-Resolution

Overview

Requirements

Datasets

Training & test datasets

Download SAITD dataset.

Download Hui and Anti-UAV.

Data format:

Results

Quantitative Results of SR performance

Qualitative Results of SR performance

Quantitative Results of detection

Qualitative Results of detection

Citiation

Contact

Owner

Xinyi Ying

Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification

Semantic Segmentation Architectures Implemented in PyTorch

RoadMap and preparation material for Machine Learning and Data Science - From beginner to expert.

A Python package for faster, safer, and simpler ML processes

Semantic Segmentation in Pytorch. Network include: FCN、FCN_ResNet、SegNet、UNet、BiSeNet、BiSeNetV2、PSPNet、DeepLabv3_plus、 HRNet、DDRNet

Attention-driven Robot Manipulation (ARM) which includes Q-attention

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

IGCN : Image-to-graph convolutional network

Learning Neural Network Subspaces

(ICCV 2021) PyTorch implementation of Paper "Progressive Correspondence Pruning by Consensus Learning"

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

Hypernetwork-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.

Simple (but Strong) Baselines for POMDPs

PASSL包含 SimCLR，MoCo，BYOL，CLIP等基于对比学习的图像自监督算法以及 Vision-Transformer，Swin-Transformer，BEiT，CVT，T2T，MLP_Mixer等视觉Transformer算法

Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment.

Code for paper [ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot] (ICCV 2021, oral))

DPC: Unsupervised Deep Point Correspondence via Cross and Self Construction (3DV 2021)

An Straight Dilated Network with Wavelet for image Deblurring