Detail-Preserving Transformer for Light Field Image Super-Resolution

Last update: Jan 01, 2023

Related tags

Overview

DPT

Official Pytorch implementation of the paper "Detail-Preserving Transformer for Light Field Image Super-Resolution" accepted by AAAI 2022 .

Updates

2022.01: Our method is available at the newly-released repository BasicLFSR, an open-source and easy-to-use toolbox for LF image SR.
2022.01: The code is released.

Requirements

Python 3.7.7
Pytorch=1.5.0
torchvision=0.6.0
h5py=2.8.0
Matlab

Dataset

We use the EPFL, HCInew, HCIold, INRIA and STFgantry datasets for both training and testing. You can download the above dataset from Baidu Drive (key:912V).

Download the visual results

We share the super-resolved results generated by our DPT. Then, researchers can compare their methods to our DPT without performing inference. Results are available at Baidu Drive (key:912V).

Prepare the datasets

To generate the training data,

 Using Matlab to run `GenerateTrainingData.m`

To generate the testing data,

 Using Matlab to run `GenerateTestData.m`

We also provide the processed datasets we used in the paper. The processed datasets are avaliable at Baidu Drive (key:912V).

Train

To perform DPT training, please run

python train.py

Checkpoint will be saved to ./log/.

Test

To evaluate DPT performance, please run

python test.py

The performance of DPT on five datasets will be printed on the screen. The visual result of each scene will be saved in ./Results/. The PSNR and SSIM values of each scene will aslo be saved in ./PSNRSSIM/.

Generate visual results

To generate the visual super-resolved results,

Using Matlab to run `GenerateResultImages.m`

The '.mat' files in ./Results/ will be converted to '.png' images to ./SRimages/.

To generate the visual gradient results, please run

python generate_visual_gradient_map.py

Gradient results will be saved to ./GRAimages/.

Citation

If you find this work helpful, please consider citing the following paper:

@article{wang2022detail,
  title={Detail Preserving Transformer for Light Field Image Super-Resolution},
  author={Wang, Shunzhou and Zhou, Tianfei and Lu, Yao and Di, Huijun},
  journal={arXiv preprint arXiv:2201.00346},
  year={2022}
}

Acknowledgements

This code is heavily based on LF-DFNet. We also refer to the codes in VSR-Transformer, COLA-Net, and SPSR. We thank the authors for sharing the codes. We would like to thank Yingqian Wang for his help with LFSR. We would also like to thank Zhengyu Liang for adding our DPT to the repository BasicLFSR.

Contact

If you have any question about this work, feel free to concat with me via [email protected].

Detail-Preserving Transformer for Light Field Image Super-Resolution

Related tags

Overview

DPT

Updates

Requirements

Dataset

Download the visual results

Prepare the datasets

Train

Test

Generate visual results

Citation

Acknowledgements

Contact

Owner

This is a library for training and applying sparse fine-tunings with torch and transformers.

Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods

Planning from Pixels in Environments with Combinatorially Hard Search Spaces -- NeurIPS 2021

Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"

Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).

Official implementation of the method ContIG, for self-supervised learning from medical imaging with genomics

Pytorch implementation of Learning Rate Dropout.

Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices,

DTCN IJCAI - Sequential prediction learning framework and algorithm

Contrastive Learning for Many-to-many Multilingual Neural Machine Translation(mCOLT/mRASP2), ACL2021

Wav2Vec for speech recognition, classification, and audio classification

Indonesian Car License Plate Character Recognition using Tensorflow, Keras and OpenCV.

Accelerated Multi-Modal MR Imaging with Transformers

MIM: MIM Installs OpenMMLab Packages

Code for our paper Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation

SwinIR: Image Restoration Using Swin Transformer

a basic code repository for basic task in CV(classification,detection,segmentation)

A hyperparameter optimization framework

SurfEmb (CVPR 2022) - SurfEmb: Dense and Continuous Correspondence Distributions

Morphable Detector for Object Detection on Demand