edge-SR: Super-Resolution For The Masses

Last update: Nov 10, 2022


Citation

Pablo Navarrete Michelini, Yunhua Lu and Xingqun Jiang. "edge-SR: Super-Resolution For The Masses", in IEEE Winter Conference on Applications of Computer Vision (WACV), 2022.

BibTeX

@inproceedings{eSR,
    title     = {edge--{SR}: Super--Resolution For The Masses},
    author    = {Navarrete~Michelini, Pablo and Lu, Yunhua and Jiang, Xingqun},
    booktitle = {Proceedings of the {IEEE/CVF} Winter Conference on Applications of Computer Vision ({WACV})},
    month     = {January},
    year      = {2022},
    pages     = {1078--1087},
    url       = {https://arxiv.org/abs/2108.10335}
}

Instructions:

Place the input images in the input directory (provided empty). Color images will be converted to grayscale.
To upscale the images, run: python run.py

The upscaled images are written to the output directory.
The GPU number and model file can be changed in run.py, at the lines marked with the comment "CHANGE HERE".

Requirements:

Python 3, PyTorch, NumPy, Pillow, OpenCV

Experiment results

The data directory contains the file tests.pkl, which stores a Python dictionary with all our test results on different devices. The following sample code shows how to read the file:

>>> import pickle
>>> with open('tests.pkl', 'rb') as f:  # the with block closes the file after loading
...     test = pickle.load(f)
...
>>> test['Bicubic_s2']
    {'psnr_Set5': 33.72849620514912,
     'ssim_Set5': 0.9283912810369976,
     'lpips_Set5': 0.14221979230642318,
     'psnr_Set14': 30.286027790636204,
     'ssim_Set14': 0.8694934108301432,
     'lpips_Set14': 0.19383049915943826,
     'psnr_BSDS100': 29.571233006609656,
     'ssim_BSDS100': 0.8418117904964167,
     'lpips_BSDS100': 0.26246454380452633,
     'psnr_Urban100': 26.89378248655882,
     'ssim_Urban100': 0.8407461069831571,
     'lpips_Urban100': 0.21186692919582129,
     'psnr_Manga109': 30.850672809780587,
     'ssim_Manga109': 0.9340133711400112,
     'lpips_Manga109': 0.102985977955641,
     'parameters': 104,
     'speed_AGX': 18.72132628065749,
     'power_AGX': 1550,
     'speed_MaxQ': 632.5429857814075,
     'power_MaxQ': 50,
     'temperature_MaxQ': 76,
     'memory_MaxQ': 2961,
     'speed_RPI': 11.361346064182795,
     'usage_RPI': 372.8714285714285}
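
The entry above is flat, with quality metrics stored under keys of the form metric_dataset. A minimal sketch of regrouping them by dataset follows. The helper name quality_by_dataset is hypothetical and not part of the repository, and the entry is a hand-copied subset of the 'Bicubic_s2' values shown above, so the snippet runs without tests.pkl:

```python
from collections import defaultdict

# Subset of the 'Bicubic_s2' entry, copied from the output above.
entry = {
    'psnr_Set5': 33.72849620514912,
    'ssim_Set5': 0.9283912810369976,
    'lpips_Set5': 0.14221979230642318,
    'psnr_Set14': 30.286027790636204,
    'ssim_Set14': 0.8694934108301432,
    'lpips_Set14': 0.19383049915943826,
    'parameters': 104,
    'speed_AGX': 18.72132628065749,
}

QUALITY_METRICS = ('psnr', 'ssim', 'lpips')


def quality_by_dataset(entry):
    """Group 'metric_dataset' keys into {dataset: {metric: value}}.

    Hypothetical helper: keys that are not quality metrics
    (parameters, speed, power, ...) are ignored.
    """
    grouped = defaultdict(dict)
    for key, value in entry.items():
        metric, _, dataset = key.partition('_')
        if metric in QUALITY_METRICS and dataset:
            grouped[dataset][metric] = value
    return dict(grouped)


table = quality_by_dataset(entry)
for dataset, metrics in table.items():
    print(f"{dataset}: PSNR={metrics['psnr']:.2f} "
          f"SSIM={metrics['ssim']:.4f} LPIPS={metrics['lpips']:.4f}")
```

With the full dictionary loaded from tests.pkl, the same function could be called as quality_by_dataset(test['Bicubic_s2']).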

The keys of the dictionary identify the name of each model and its hyper-parameters using the following format:

Bicubic_s#,
eSR-MAX_s#_K#_C#,
eSR-TM_s#_K#_C#,
eSR-TR_s#_K#_C#,
eSR-CNN_s#_C#_D#_S#,
ESPCN_s#_D#_S#, or
FSRCNN_s#_D#_S#_M#,

where # represents an integer giving the value of the corresponding hyper-parameter. For each model, the dictionary holds a second dictionary with the information displayed above. This includes: the number of model parameters; the image quality metrics PSNR, SSIM and LPIPS measured on five different datasets; as well as power, speed, CPU usage, temperature and memory usage measured on the devices AGX (Jetson AGX Xavier), MaxQ (GTX 1080 MaxQ) and RPI (Raspberry Pi 400).
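
The key format above can be split back into a model name and its hyper-parameters. The sketch below is one possible way to do it: parse_model_key is a hypothetical helper, not a function from the repository. The README does not spell out what each letter stands for, so the letters are kept as opaque, case-sensitive codes (s and S are distinct). The key 'eSR-CNN_s2_C4_D3_S6' only illustrates the format and may not be present in tests.pkl.

```python
import re

# One letter code followed by an integer, e.g. 's2' or 'C4'.
# The letters are the ones that appear in the formats listed above.
_FIELD = re.compile(r'^([sKCDSM])(\d+)$')


def parse_model_key(key):
    """Split a key such as 'eSR-CNN_s2_C4_D3_S6' into (name, params).

    Hypothetical helper: returns the model name and a dict mapping
    each letter code to its integer value.
    """
    name, *fields = key.split('_')
    params = {}
    for field in fields:
        match = _FIELD.match(field)
        if match is None:
            raise ValueError(f'unexpected field {field!r} in key {key!r}')
        params[match.group(1)] = int(match.group(2))
    return name, params


print(parse_model_key('Bicubic_s2'))
print(parse_model_key('eSR-CNN_s2_C4_D3_S6'))  # illustrative key only
```

Applied to the loaded dictionary, a comprehension such as {k: parse_model_key(k) for k in test} would index every model by its parsed name and hyper-parameters, assuming all keys follow the listed formats.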


Owner

Pablo
