Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Overview

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Ported from https://github.com/xinntao/Real-ESRGAN

Dependencies

  • NumPy
  • PyTorch, preferably with CUDA. Note that torchvision and torchaudio are not required and hence can be omitted from the PyTorch installation command.
  • VapourSynth

Installation

pip install --upgrade vsrealesrgan
python -m vsrealesrgan

Usage

from vsrealesrgan import RealESRGAN

ret = RealESRGAN(clip)

See __init__.py for the description of the parameters.
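
A typical pipeline converts the clip to an RGB float format before calling RealESRGAN and converts back to YUV afterwards, as the scripts in the comments below also do. The following is a minimal sketch, not the only valid setup: the source filter, matrix ("470bg"), and output format are assumptions to adapt to your material.

import vapoursynth as vs
from vsrealesrgan import RealESRGAN

core = vs.core

# Hypothetical source file; any source filter works.
clip = core.ffms2.Source(source="input.mkv")

# Convert to an RGB float format (RGBS, or RGBH for half precision), which is
# what the scripts in the comments below feed to RealESRGAN.
clip = core.resize.Bicubic(clip, format=vs.RGBS, matrix_in_s="470bg", range_s="limited")

# Upscale (the default model upscales by 4x) on the first CUDA device.
clip = RealESRGAN(clip, device_index=0)

# Back to YUV for encoding/preview.
clip = core.resize.Bicubic(clip, format=vs.YUV420P10, matrix_s="470bg", range_s="limited")

clip.set_output()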

Comments
  • Installing on portable vapoursynth?


    I'm getting this error:

    python -m pip install --upgrade vsrealesrgan
    Collecting vsrealesrgan
      Using cached vsrealesrgan-3.1.0-py3-none-any.whl (7.4 kB)
    Collecting tqdm
      Using cached tqdm-4.64.0-py2.py3-none-any.whl (78 kB)
    Requirement already satisfied: numpy in d:\vapoursynth\lib\site-packages (from vsrealesrgan) (1.22.3)
    Collecting VapourSynth>=55
      Using cached VapourSynth-58.zip (558 kB)
      Preparing metadata (setup.py) ... error
      error: subprocess-exited-with-error

      × python setup.py egg_info did not run successfully.
      │ exit code: 1
      ╰─> [15 lines of output]
          Traceback (most recent call last):
            File "C:\Users*\AppData\Local\Temp\pip-install-2415kpn4\vapoursynth_712c69d39f4a4718a3f6b523a85b39eb\setup.py", line 64, in <module>
              dll_path = query(winreg.HKEY_LOCAL_MACHINE, REGISTRY_PATH, REGISTRY_KEY)
            File "C:\Users*\AppData\Local\Temp\pip-install-2415kpn4\vapoursynth_712c69d39f4a4718a3f6b523a85b39eb\setup.py", line 38, in query
              reg_key = winreg.OpenKey(hkey, path, 0, winreg.KEY_READ)
          FileNotFoundError: [WinError 2] The system cannot find the file specified

          During handling of the above exception, another exception occurred:

          Traceback (most recent call last):
            File "<string>", line 2, in <module>
            File "<pip-setuptools-caller>", line 34, in <module>
            File "C:\Users\**\AppData\Local\Temp\pip-install-2415kpn4\vapoursynth_712c69d39f4a4718a3f6b523a85b39eb\setup.py", line 67, in <module>
              raise OSError("Couldn't detect vapoursynth installation path")
          OSError: Couldn't detect vapoursynth installation path
          [end of output]

      note: This error originates from a subprocess, and is likely not a problem with pip.
    error: metadata-generation-failed

    × Encountered error while generating package metadata.
    ╰─> See above for output.

    note: This is an issue with the package mentioned above, not pip.
    hint: See above for details.

    opened by manus693 8
  • 'vapoursynth.VideoFrame' object is not subscriptable


    Error on frame 15 request: 'vapoursynth.VideoFrame' object is not subscriptable

    py3.6.4
    vs.core.version: VapourSynth Video Processing Library
    Copyright (c) 2012-2018 Fredrik Mellbin
    Core R44
    API R3.5
    Options: -
    torch.version: 1.10.0+cu111

    vpy:

    import vapoursynth as vs
    import sys
    sys.path.append("C:\C\Transcoding\VapourSynth\core64\plugins\Scripts")
    import mvsfunc as mvf
    sys.path.append(r"C:\Users\liujing\AppData\Local\Programs\Python\Python36\Lib\site-packages\vsrealesrgan")
    from vsrealesrgan import RealESRGAN

    core = vs.get_core(accept_lowercase=True)
    source = core.ffms2.Source(sourcename)
    source = mvf.ToRGB(source, depth=32)
    source = RealESRGAN(source)
    source = mvf.ToYUV(source, depth=16)
    source.set_output()
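
    A note on this error: the package requires VapourSynth>=55 (see the pip output in the first comment), and Core R44 predates the frame API that lets planes be read by subscripting the frame, which is what the "not subscriptable" message points at. A minimal illustration of the difference, not code taken from the plugin itself:

    import numpy as np
    import vapoursynth as vs

    core = vs.core
    clip = core.std.BlankClip(format=vs.RGBS, width=64, height=64, length=1)
    frame = clip.get_frame(0)

    # Newer VapourSynth cores: planes are subscriptable.
    plane0 = np.asarray(frame[0])

    # Older cores such as R44 only expose frame.get_read_array(plane); on them,
    # frame[0] fails with "'vapoursynth.VideoFrame' object is not subscriptable".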

    opened by splinter21 4
  • TensorRT "Ran out of input"?

    Using:

    # Imports
    import vapoursynth as vs
    # getting Vapoursynth core
    core = vs.core
    import site
    import os
    # Adding torch dependencies to PATH
    path = site.getsitepackages()[0]+'/torch_dependencies/'
    path = path.replace('\\', '/')
    os.environ["PATH"] = path + os.pathsep + os.environ["PATH"]
    # Loading Plugins
    core.std.LoadPlugin(path="i:/Hybrid/64bit/vsfilters/Support/fmtconv.dll")
    core.std.LoadPlugin(path="i:/Hybrid/64bit/vsfilters/SourceFilter/LSmashSource/vslsmashsource.dll")
    # source: 'G:\TestClips&Co\files\test.avi'
    # current color space: YUV420P8, bit depth: 8, resolution: 640x352, fps: 25, color matrix: 470bg, yuv luminance scale: limited, scanorder: progressive
    # Loading G:\TestClips&Co\files\test.avi using LWLibavSource
    clip = core.lsmas.LWLibavSource(source="G:/TestClips&Co/files/test.avi", format="YUV420P8", stream_index=0, cache=0, prefer_hw=0)
    # Setting color matrix to 470bg.
    clip = core.std.SetFrameProps(clip, _Matrix=5)
    clip = clip if not core.text.FrameProps(clip,'_Transfer') else core.std.SetFrameProps(clip, _Transfer=5)
    clip = clip if not core.text.FrameProps(clip,'_Primaries') else core.std.SetFrameProps(clip, _Primaries=5)
    # Setting color range to TV (limited) range.
    clip = core.std.SetFrameProp(clip=clip, prop="_ColorRange", intval=1)
    # making sure frame rate is set to 25
    clip = core.std.AssumeFPS(clip=clip, fpsnum=25, fpsden=1)
    clip = core.std.SetFrameProp(clip=clip, prop="_FieldBased", intval=0)
    original = clip
    from vsrealesrgan import RealESRGAN
    # adjusting color space from YUV420P8 to RGBH for VsRealESRGAN
    clip = core.resize.Bicubic(clip=clip, format=vs.RGBH, matrix_in_s="470bg", range_s="limited")
    # resizing using RealESRGAN
    clip = RealESRGAN(clip=clip, device_index=0, trt=True, trt_cache_path="G:/Temp", num_streams=4) # 2560x1408
    # resizing 2560x1408 to 640x352
    # adjusting resizing
    clip = core.resize.Bicubic(clip=clip, format=vs.RGBS, range_s="limited")
    clip = core.fmtc.resample(clip=clip, w=640, h=352, kernel="lanczos", interlaced=False, interlacedd=False)
    original = core.resize.Bicubic(clip=original, width=640, height=352)
    # adjusting output color from: RGBS to YUV420P8 for x264Model
    clip = core.resize.Bicubic(clip=clip, format=vs.YUV420P8, matrix_s="470bg", range_s="limited", dither_type="error_diffusion")
    original = core.text.Text(clip=original,text="Original",scale=1,alignment=7)
    clip = core.text.Text(clip=clip,text="Filtered",scale=1,alignment=7)
    stacked = core.std.StackHorizontal([original,clip])
    # Output
    stacked.set_output()
    

    I get

    Failed to evaluate the script: Python exception: Ran out of input

    Traceback (most recent call last):
    File "src\cython\vapoursynth.pyx", line 2866, in vapoursynth._vpy_evaluate
    File "src\cython\vapoursynth.pyx", line 2867, in vapoursynth._vpy_evaluate
    File "C:\Users\Selur\Desktop\test_2.vpy", line 32, in 
    clip = RealESRGAN(clip=clip, device_index=0, trt=True, trt_cache_path="G:/Temp", num_streams=4) # 2560x1408
    File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\torch\autograd\grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
    File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\vsrealesrgan\__init__.py", line 284, in RealESRGAN
    module = [torch.load(trt_engine_path) for _ in range(num_streams)]
    File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\vsrealesrgan\__init__.py", line 284, in 
    module = [torch.load(trt_engine_path) for _ in range(num_streams)]
    File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\torch\serialization.py", line 795, in load
    return _legacy_load(opened_file, map_location, pickle_module, **pickle_load_args)
    File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\torch\serialization.py", line 1002, in _legacy_load
    magic_number = pickle_module.load(f, **pickle_load_args)
    EOFError: Ran out of input
    

    Works fine with trt=False.

    Any idea what is going wrong there?
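
    For context on the message itself: "Ran out of input" is the EOFError that torch.load raises (via pickle) when the file it reads is empty, so the cached engine under trt_cache_path was most likely zero bytes, e.g. from an interrupted or failed TensorRT build. A minimal reproduction with a hypothetical file name (on torch versions of the era shown in the traceback; newer releases may wrap the error differently):

    import os
    import tempfile

    import torch

    # Hypothetical stand-in for a stale engine file left in trt_cache_path.
    stale_engine = os.path.join(tempfile.gettempdir(), "stale_engine.pth")
    open(stale_engine, "wb").close()  # zero-byte file

    try:
        torch.load(stale_engine)
    except Exception as err:  # EOFError("Ran out of input") on comparable torch versions
        print(type(err).__name__, err)

    # Plausible workaround (assumption, not from the plugin's docs): delete the
    # zero-byte engine file so it is rebuilt on the next run with trt=True.
    if os.path.exists(stale_engine) and os.path.getsize(stale_engine) == 0:
        os.remove(stale_engine)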

    opened by Selur 3
  • [REQ] SwinIR port


    opened by forart 1
  • Vapoursynth R58 support


    When trying to install vs-realesrgan in Vapoursynth R58 I get:

    I:\Hybrid\64bit\Vapoursynth>python -m pip install --upgrade vsrealesrgan
    Collecting vsrealesrgan
      Using cached vsrealesrgan-2.0.0-py3-none-any.whl (12 kB)
    Collecting VapourSynth>=55
      Using cached VapourSynth-57.zip (567 kB)
      Preparing metadata (setup.py) ... error
      error: subprocess-exited-with-error
    
      × python setup.py egg_info did not run successfully.
      │ exit code: 1
      ╰─> [15 lines of output]
          Traceback (most recent call last):
            File "C:\Users\Selur\AppData\Local\Temp\pip-install-7_na63f8\vapoursynth_4864864388024a95a1e8b4adda80b293\setup.py", line 64, in <module>
              dll_path = query(winreg.HKEY_LOCAL_MACHINE, REGISTRY_PATH, REGISTRY_KEY)
            File "C:\Users\Selur\AppData\Local\Temp\pip-install-7_na63f8\vapoursynth_4864864388024a95a1e8b4adda80b293\setup.py", line 38, in query
              reg_key = winreg.OpenKey(hkey, path, 0, winreg.KEY_READ)
          FileNotFoundError: [WinError 2] The system cannot find the file specified
    
          During handling of the above exception, another exception occurred:
    
          Traceback (most recent call last):
            File "<string>", line 2, in <module>
            File "<pip-setuptools-caller>", line 34, in <module>
            File "C:\Users\Selur\AppData\Local\Temp\pip-install-7_na63f8\vapoursynth_4864864388024a95a1e8b4adda80b293\setup.py", line 67, in <module>
              raise OSError("Couldn't detect vapoursynth installation path")
          OSError: Couldn't detect vapoursynth installation path
          [end of output]
    
      note: This error originates from a subprocess, and is likely not a problem with pip.
    error: metadata-generation-failed
    
    × Encountered error while generating package metadata.
    ╰─> See above for output.
    
    note: This is an issue with the package mentioned above, not pip.
    hint: See above for details.
    

    any idea how to fix this?

    opened by Selur 0
  • 'vapoursynth.VideoFrame' object has no attribute 'get_read_array'


    I have been trying to use this plugin; however, I get the error below when previewing the video in VapourSynth Editor r19-mod-2-x86_64:

    Error on frame 0 request: 'vapoursynth.VideoFrame' object has no attribute 'get_read_array'

    The code I am getting this error from is below

    from vapoursynth import core
    from vsrealesrgan import RealESRGAN
    import havsfunc as haf
    import vapoursynth as vs
    video = core.ffms2.Source(source='EDIT.mkv')
    video = haf.QTGMC(video, Preset="slow", MatchPreset="slow", MatchPreset2="slow", SourceMatch=3, TFF=True)
    video = core.std.SelectEvery(clip=video, cycle=2, offsets=0)
    video = core.std.Crop(clip=video, left=8, right=8, top=0, bottom=0)
    video = core.resize.Spline36(clip=video, width=640, height=480)
    video = core.resize.Bicubic(clip=video, format=vs.RGBS, matrix_in_s="470bg", range_s="limited")
    video = RealESRGAN(clip=video, device_index=0)
    video = core.resize.Bicubic(clip=video, format=vs.YUV420P10, matrix_s="470bg", range_s="limited")
    video = core.resize.Spline36(clip=video, width=1440, height=1080)
    video = core.std.AssumeFPS(clip=video, fpsnum=30000, fpsden=1001)
    video.set_output()
    
    opened by silentsudin 0
Releases (v4.0.1)

Owner: Holy Wu