Detail-Preserving Transformer for Light Field Image Super-Resolution

Last update: Jan 01, 2023

Related tags

Overview

DPT

Official Pytorch implementation of the paper "Detail-Preserving Transformer for Light Field Image Super-Resolution" accepted by AAAI 2022 .

Updates

2022.01: Our method is available at the newly-released repository BasicLFSR, an open-source and easy-to-use toolbox for LF image SR.
2022.01: The code is released.

Requirements

Python 3.7.7
Pytorch=1.5.0
torchvision=0.6.0
h5py=2.8.0
Matlab

Dataset

We use the EPFL, HCInew, HCIold, INRIA and STFgantry datasets for both training and testing. You can download the above dataset from Baidu Drive (key:912V).

Download the visual results

We share the super-resolved results generated by our DPT. Then, researchers can compare their methods to our DPT without performing inference. Results are available at Baidu Drive (key:912V).

Prepare the datasets

To generate the training data,

 Using Matlab to run `GenerateTrainingData.m`

To generate the testing data,

 Using Matlab to run `GenerateTestData.m`

We also provide the processed datasets we used in the paper. The processed datasets are avaliable at Baidu Drive (key:912V).

Train

To perform DPT training, please run

python train.py

Checkpoint will be saved to ./log/.

Test

To evaluate DPT performance, please run

python test.py

The performance of DPT on five datasets will be printed on the screen. The visual result of each scene will be saved in ./Results/. The PSNR and SSIM values of each scene will aslo be saved in ./PSNRSSIM/.

Generate visual results

To generate the visual super-resolved results,

Using Matlab to run `GenerateResultImages.m`

The '.mat' files in ./Results/ will be converted to '.png' images to ./SRimages/.

To generate the visual gradient results, please run

python generate_visual_gradient_map.py

Gradient results will be saved to ./GRAimages/.

Citation

If you find this work helpful, please consider citing the following paper:

@article{wang2022detail,
  title={Detail Preserving Transformer for Light Field Image Super-Resolution},
  author={Wang, Shunzhou and Zhou, Tianfei and Lu, Yao and Di, Huijun},
  journal={arXiv preprint arXiv:2201.00346},
  year={2022}
}

Acknowledgements

This code is heavily based on LF-DFNet. We also refer to the codes in VSR-Transformer, COLA-Net, and SPSR. We thank the authors for sharing the codes. We would like to thank Yingqian Wang for his help with LFSR. We would also like to thank Zhengyu Liang for adding our DPT to the repository BasicLFSR.

Contact

If you have any question about this work, feel free to concat with me via [email protected].

Detail-Preserving Transformer for Light Field Image Super-Resolution

Related tags

Overview

DPT

Updates

Requirements

Dataset

Download the visual results

Prepare the datasets

Train

Test

Generate visual results

Citation

Acknowledgements

Contact

Owner

The open source code of SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation.

GrabGpu_py: a scripts for grab gpu when gpu is free

CoReNet is a technique for joint multi-object 3D reconstruction from a single RGB image.

Hand gesture recognition model that can be used as a remote control for a smart tv.

My 1st place solution at Kaggle Hotel-ID 2021

Deep Learning Based EDM Subgenre Classification using Mel-Spectrogram and Tempogram Features"

A neuroanatomy-based augmented reality experience powered by computer vision. Features 3D visuals of the Atlas Brain Map slices.

MTA:SA Server Configer.

Zero-shot Synthesis with Group-Supervised Learning (ICLR 2021 paper)

SMPLpix: Neural Avatars from 3D Human Models

Multimodal Descriptions of Social Concepts: Automatic Modeling and Detection of (Highly Abstract) Social Concepts evoked by Art Images

A style-based Quantum Generative Adversarial Network

Playable Video Generation

Implementation of the paper Recurrent Glimpse-based Decoder for Detection with Transformer.

A deep learning CNN model to identify and classify and check if a person is wearing a mask or not.

pytorch implementation for PointNet

Multi-Objective Reinforced Active Learning

This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

Deep Watershed Transform for Instance Segmentation

PaSST: Efficient Training of Audio Transformers with Patchout