Using VapourSynth with super resolution models and speeding them up with TensorRT.

Last update: Jan 05, 2023

Related tags

Overview

VSGAN-tensorrt-docker

Using image super resolution models with vapoursynth and speeding them up with TensorRT. Using NVIDIA/Torch-TensorRT combined with rlaphoenix/VSGAN. This repo makes the usage of tiling and ESRGAN models very easy. Models can be found on the wiki page. Further model architectures are planned to be added later on.

Currently working:

ESRGAN
RealESRGAN (adjust model load manually in inference.py, settings wont be adjusted automatically currently)

Usage:

# install docker, command for arch
yay -S docker nvidia-docker nvidia-container-toolkit
# Put the dockerfile in a directory and run that inside that directory
docker build -t vsgan_tensorrt:latest .
# run with a mounted folder
docker run --privileged --gpus all -it --rm -v /home/Desktop/tensorrt:/workspace/tensorrt vsgan_tensorrt:latest
# you can use it in various ways, ffmpeg example
vspipe --y4m inference.py - | ffmpeg -i pipe: example.mkv

If docker does not want to start, try this before you use docker:

# fixing docker errors
systemctl start docker
sudo chmod 666 /var/run/docker.sock

Windows is mostly similar, but the path needs to be changed slightly:

Example for C://path
docker run --privileged --gpus all -it --rm -v //c/path:/workspace/tensorrt vsgan_tensorrt:latest

If you don't want to use docker, vapoursynth install commands are here and a TensorRT example is here.

Set the input video path in inference.py and access videos with the mounted folder.

It is also possible to directly pipe the video into mpv, but you most likely wont be able to archive realtime speed. Change the mounted folder path to your own videofolder and use the mpv dockerfile instead. If you use a very efficient model, it may be possible on a very good GPU. Only tested in Manjaro.

yay -S pulseaudio

# i am not sure if it is needed, but go into pulseaudio settings and check "make pulseaudio network audio devices discoverable in the local network" and reboot

# start docker
docker run --rm -i -t \
    --network host \
    -e DISPLAY \
    -v /home/Schreibtisch/test/:/home/mpv/media \
    --ipc=host \
    --privileged \
    --gpus all \
    -e PULSE_COOKIE=/run/pulse/cookie \
    -v ~/.config/pulse/cookie:/run/pulse/cookie \
    -e PULSE_SERVER=unix:${XDG_RUNTIME_DIR}/pulse/native \
    -v ${XDG_RUNTIME_DIR}/pulse/native:${XDG_RUNTIME_DIR}/pulse/native \
    vsgan_tensorrt:latest
    
# run mpv
vspipe --y4m inference.py - | mpv -

Comments

Invalid data found when processing input

Hey when i start the inference.py script this happen :

someone can help me ?


> ffmpeg version N-62110-g4d45f5acbd-static https://johnvansickle.com/ffmpeg/  Copyright (c) 2000-2022 the FFmpeg developers
>   built with gcc 8 (Debian 8.3.0-6)
>   configuration: --enable-gpl --enable-version3 --enable-static --disable-debug --disable-ffplay --disable-indev=sndio --disable-outdev=sndio --cc=gcc --enable-fontconfig --enable-frei0r --enable-gnutls --enable-gmp --enable-libgme --enable-gray --enable-libaom --enable-libfribidi --enable-libass --enable-libvmaf --enable-libfreetype --enable-libmp3lame --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenjpeg --enable-librubberband --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libvorbis --enable-libopus --enable-libtheora --enable-libvidstab --enable-libvo-amrwbenc --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libdav1d --enable-libxvid --enable-libzvbi --enable-libzimg
>   libavutil      57. 26.100 / 57. 26.100
>   libavcodec     59. 33.100 / 59. 33.100
>   libavformat    59. 24.100 / 59. 24.100
>   libavdevice    59.  6.100 / 59.  6.100
>   libavfilter     8. 40.100 /  8. 40.100
>   libswscale      6.  6.100 /  6.  6.100
>   libswresample   4.  6.100 /  4.  6.100
>   libpostproc    56.  5.100 / 56.  5.100
> Information: Generating grammar tables from /usr/lib/python3.8/lib2to3/Grammar.txt
> Information: Generating grammar tables from /usr/lib/python3.8/lib2to3/PatternGrammar.txt
> Script evaluation failed:
> Python exception: libtorch_cuda_cu.so: cannot open shared object file: No such file or directory
> 
> Traceback (most recent call last):
>   File "src\cython\vapoursynth.pyx", line 2890, in vapoursynth._vpy_evaluate
>   File "src\cython\vapoursynth.pyx", line 2891, in vapoursynth._vpy_evaluate
>   File "inference.py", line 85, in <module>
>     clip = ESRGAN_inference(clip=clip, model_path="/workspace/RealESRGAN_x4plus_anime_6B.pth", tile_x=480, tile_y=480, tile_pad=16, fp16=False, tta=False, tta_mode=1)
>   File "/workspace/tensorrt/src/esrgan.py", line 680, in ESRGAN_inference
>     import torch_tensorrt
>   File "/usr/local/lib/python3.8/dist-packages/torch_tensorrt/__init__.py", line 11, in <module>
>     from torch_tensorrt._compile import *
>   File "/usr/local/lib/python3.8/dist-packages/torch_tensorrt/_compile.py", line 2, in <module>
>     from torch_tensorrt import _enums
>   File "/usr/local/lib/python3.8/dist-packages/torch_tensorrt/_enums.py", line 1, in <module>
>     from torch_tensorrt._C import dtype, DeviceType, EngineCapability, TensorFormat
> ImportError: libtorch_cuda_cu.so: cannot open shared object file: No such file or directory
> 
> pipe:: Invalid data found when processing input

opened by NeoBurgerYT 10

Module not found 'scipy'

I can't run my inference.py without getting this error message. Can someone direct me to where I can get the repo?

File "/usr/local/lib/python3.8/dist-packages/mmedit/core/evaluation/metrics.py", line 7, in from scipy.ndimage import convolve ModuleNotFoundError: No module named 'scipy'

pipe:: Invalid data found when processing input

opened by terminatedkhla 8
Tutorial?

Hi! This is amazing technology! I’m blown away. I’d love to contact you directly on how to use it in colab, I’m quite confused with the process. I’ve tried running it but not sure I’m running it correctly. Thanks in advance!

opened by AIManifest 6
Trying On A M1 Mac

So I followed this tutorial https://www.youtube.com/watch?v=B134jvhO8yk&t=0s But when docker run --privileged --gpus all -it --rm -v /home/vsgan_path/:/workspace/tensorrt styler00dollar/vsgan_tensorrt:latest it just gives me an error that it doesn't find the right amd64 or somthing and I rage quit deleted it without seeing the full error. PLS HELP ME :(

opened by Ghostkwebb 6

Crash when using RIFE ensemble models in vsmlrt

I get this error

vapoursynth.Error: operator (): expects 8 input planes

from this

import vapoursynth as vs
from vapoursynth import core
core = vs.core
import vsmlrt

clip = core.lsmas.LWLibavSource(source=r"R:\output.mkv",cache=1, prefer_hw=1)
clip = core.resize.Bicubic(clip, matrix_in_s="709", transfer_in_s='709', format=vs.RGBS)
clip = vsmlrt.RIFE(clip, multi=4, model=46, backend=vsmlrt.Backend.TRT(fp16=True), tilesize=[1920,1088])
clip = core.std.AssumeFPS(clip=clip, fpsnum=60, fpsden=1)
clip = core.resize.Bicubic(clip, format=vs.RGB24, matrix_in_s="709")
clip.set_output()

opened by banjaminicc 4

Support for AITemplate?

There is something that came out recently and it's look promising in terms of performance/speed. Would it be possible to implement it for ESERGAN mode? https://github.com/facebookincubator/AITemplate

opened by kodxana 4
CUDA out of Memory

System Specs: Ryzen 9 5900HX, NVidia 3070 Mobile, Arch Linux (EndeavorOS) on Kernel 5.17.2

Whenever I try to run a model that is relying on CUDA, for example cugan, the program exits with

Error: Failed to retrieve frame 0 with error: CUDA out of memory. Tried to allocate 148.00 MiB (GPU 0; 7.80 GiB total capacity; 5.53 GiB already allocated; 68.56 MiB free; 5.69 GiB reserved in total by PyTorch)

and stops after having output 4 frames.

However, TensorRT works fine for models that support it (like RealESRGAN for example).

Edit: Running nvidia-smi while the command is executed reveals that vspipe is allocating GPU Memory, but <2 GiB of VRAM, far from the 8GiB my model has.

opened by mmkzer0 4
No module named 'vsbasicvsrpp'

Traceback (most recent call last): File "src\cython\vapoursynth.pyx", line 2832, in vapoursynth._vpy_evaluate File "src\cython\vapoursynth.pyx", line 2833, in vapoursynth._vpy_evaluate File "inference.py", line 12, in from vsbasicvsrpp import BasicVSRPP ModuleNotFoundError: No module named 'vsbasicvsrpp'

opened by xt851231 4
Google colab request?

I recently stumbled upon this VSGAN-tensorrt-docker and found it so incredible! Could anyone make a google colab notebook that features everything from this VSGAN-tensorrt-docker, so that we could experience the speed of TensorRT! Thanks in advance!

opened by mikebilly 3
model conversion from onnx to trt

@styler00dollar this is not issue but a question, I read the scripts in inference.py and found real-esrgan 2x is loaded from trt engine file, since real-2x uses dynamic shapes as input, could you share any ideas how to convert this model to trt, thanks!

opened by deism 3
ESRGAN with full episode

Hello,

I'm trying to upscale MKV files of full episodes with ESRGAN. I tried using vspipe -c y4m inference.py - | ffmpeg -i pipe: example.mkv, and it seems to run up to the point where it starts to give an ETA. Once there the time doesn't move and eventually, it says it was killed.

Can you give me some tips on how to make this work better? I'm not familiar with most of the tools I've been given.

opened by Ultramonte 2
[SUGGESTION] per-scene processing

Hi there, this project is awesome so thanks for your - voluntary - work !

Since GANs-based processing is quite heavy computing task, it could be very useful to split it into multiple "segments" to allow parallel/scalable/collaborative/resumable instances.

We suggest you to check @master-of-zen's Av1an framework, wich implements it.

Hope that inspires.

opened by forart 1

Releases(models)

models(Feb 11, 2022)

Just a place to store models. Sources are in the README.

ffmpeg was compiled with markus-perl/ffmpeg-build-script.
Source code(tar.gz)
Source code(zip)
4x_fatal_Anime_500000_G.onnx(63.83 MB)
4x_fatal_Anime_500000_G.pth(63.85 MB)
compact2x_ncnn.tar(2.30 MB)
cugan_pro-conservative-up2x.pth(4.91 MB)
cugan_pro-conservative-up2x_opset13.onnx(4.92 MB)
cugan_pro-conservative-up2x_opset14.onnx(4.92 MB)
cugan_pro-conservative-up2x_opset15.onnx(4.92 MB)
cugan_pro-conservative-up2x_opset16.onnx(4.92 MB)
cugan_pro-conservative-up2x_opset17.onnx(4.92 MB)
cugan_pro-conservative-up3x.pth(4.92 MB)
cugan_pro-conservative-up3x_opset13.onnx(4.93 MB)
cugan_pro-conservative-up3x_opset14.onnx(4.93 MB)
cugan_pro-conservative-up3x_opset15.onnx(4.93 MB)
cugan_pro-conservative-up3x_opset16.onnx(4.93 MB)
cugan_pro-conservative-up3x_opset17.onnx(4.93 MB)
cugan_pro-denoise3x-up2x.pth(4.91 MB)
cugan_pro-denoise3x-up2x_opset13.onnx(4.92 MB)
cugan_pro-denoise3x-up2x_opset14.onnx(4.92 MB)
cugan_pro-denoise3x-up2x_opset15.onnx(4.92 MB)
cugan_pro-denoise3x-up2x_opset16.onnx(4.92 MB)
cugan_pro-denoise3x-up2x_opset17.onnx(4.92 MB)
cugan_pro-denoise3x-up3x.pth(4.92 MB)
cugan_pro-denoise3x-up3x_opset13.onnx(4.93 MB)
cugan_pro-denoise3x-up3x_opset14.onnx(4.93 MB)
cugan_pro-denoise3x-up3x_opset15.onnx(4.93 MB)
cugan_pro-denoise3x-up3x_opset16.onnx(4.93 MB)
cugan_pro-denoise3x-up3x_opset17.onnx(4.93 MB)
cugan_pro-no-denoise-up2x.pth(4.91 MB)
cugan_pro-no-denoise-up2x_opset13.onnx(4.92 MB)
cugan_pro-no-denoise-up2x_opset14.onnx(4.92 MB)
cugan_pro-no-denoise-up2x_opset15.onnx(4.92 MB)
cugan_pro-no-denoise-up2x_opset16.onnx(4.92 MB)
cugan_pro-no-denoise-up2x_opset17.onnx(4.92 MB)
cugan_pro-no-denoise-up3x.pth(4.92 MB)
cugan_pro-no-denoise-up3x_opset13.onnx(4.93 MB)
cugan_pro-no-denoise-up3x_opset14.onnx(4.93 MB)
cugan_pro-no-denoise-up3x_opset15.onnx(4.93 MB)
cugan_pro-no-denoise-up3x_opset16.onnx(4.93 MB)
cugan_pro-no-denoise-up3x_opset17.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x.pth(4.92 MB)
cugan_pro-no-denoise3x-up3x_opset13.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x_opset14.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x_opset15.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x_opset16.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x_opset17.onnx(4.93 MB)
cugan_up2x-latest-conservative.pth(4.90 MB)
cugan_up2x-latest-conservative_opset13.onnx(4.92 MB)
cugan_up2x-latest-conservative_opset14.onnx(4.92 MB)
cugan_up2x-latest-conservative_opset15.onnx(4.92 MB)
cugan_up2x-latest-conservative_opset16.onnx(4.92 MB)
cugan_up2x-latest-conservative_opset17.onnx(4.92 MB)
cugan_up2x-latest-denoise1x.pth(4.90 MB)
cugan_up2x-latest-denoise1x_opset13.onnx(4.92 MB)
cugan_up2x-latest-denoise1x_opset14.onnx(4.92 MB)
cugan_up2x-latest-denoise1x_opset15.onnx(4.92 MB)
cugan_up2x-latest-denoise1x_opset16.onnx(4.92 MB)
cugan_up2x-latest-denoise1x_opset17.onnx(4.92 MB)
cugan_up2x-latest-denoise2x.pth(4.90 MB)
cugan_up2x-latest-denoise2x_opset13.onnx(4.92 MB)
cugan_up2x-latest-denoise2x_opset14.onnx(4.92 MB)
cugan_up2x-latest-denoise2x_opset15.onnx(4.92 MB)
cugan_up2x-latest-denoise2x_opset16.onnx(4.92 MB)
cugan_up2x-latest-denoise2x_opset17.onnx(4.92 MB)
cugan_up2x-latest-denoise3x.pth(4.90 MB)
cugan_up2x-latest-denoise3x_opset13.onnx(4.92 MB)
cugan_up2x-latest-denoise3x_opset14.onnx(4.92 MB)
cugan_up2x-latest-denoise3x_opset15.onnx(4.92 MB)
cugan_up2x-latest-denoise3x_opset16.onnx(4.92 MB)
cugan_up2x-latest-denoise3x_opset17.onnx(4.92 MB)
cugan_up2x-latest-no-denoise.pth(4.90 MB)
cugan_up2x-latest-no-denoise_opset13.onnx(4.92 MB)
cugan_up2x-latest-no-denoise_opset14.onnx(4.92 MB)
cugan_up2x-latest-no-denoise_opset15.onnx(4.92 MB)
cugan_up2x-latest-no-denoise_opset16.onnx(4.92 MB)
cugan_up2x-latest-no-denoise_opset17.onnx(4.92 MB)
cugan_up3x-latest-conservative.onnx(4.91 MB)
cugan_up3x-latest-conservative.pth(4.91 MB)
cugan_up3x-latest-denoise3x.onnx(4.91 MB)
cugan_up3x-latest-denoise3x.pth(4.91 MB)
cugan_up3x-latest-no-denoise.onnx(4.91 MB)
cugan_up3x-latest-no-denoise.pth(4.91 MB)
cugan_up4x-latest-conservative.onnx(5.37 MB)
cugan_up4x-latest-conservative.pth(5.37 MB)
cugan_up4x-latest-denoise3x.onnx(5.37 MB)
cugan_up4x-latest-denoise3x.pth(5.37 MB)
cugan_up4x-latest-no-denoise.onnx(5.37 MB)
cugan_up4x-latest-no-denoise.pth(5.37 MB)
DF2K_JPEG_ncnn.tar.gz(29.46 MB)
DF2K_ncnn.tar.gz(29.46 MB)
dpir_drunet_color.onnx(124.52 MB)
dpir_drunet_deblocking_color.onnx(124.52 MB)
dpir_drunet_deblocking_grayscale.onnx(124.51 MB)
dpir_drunet_gray.onnx(124.51 MB)
EGVSR_iter420000.pth(9.89 MB)
eisai_anime_interp_full.ckpt(23.73 MB)
eisai_dtm.pt(56.88 KB)
eisai_ssl.pt(10.53 MB)
ffmpeg(69.71 MB)
ffmpeg_colab(72.76 MB)
FILM.tar.gz(366.17 MB)
GMFSS_union_fusionnet_vanilla.pkl(7.92 MB)
GMFSS_union_fusionnet_wgan.pkl(7.92 MB)
GMFSS_union_metric_vanilla.pkl(183.07 KB)
GMFSS_union_metric_wgan.pkl(183.07 KB)
GMFupSS_flownet.pkl(18.04 MB)
GMFupSS_fusionnet.pkl(7.88 MB)
GMFupSS_metric.pkl(158.82 KB)
IFRNet_GoPro.pth(18.94 MB)
IFRNet_L_GoPro.pth(75.16 MB)
IFRNet_L_Vimeo90K.pth(75.16 MB)
IFRNet_S_GoPro.pth(10.71 MB)
IFRNet_S_Vimeo90K.pth(10.71 MB)
IFRNet_Vimeo90K.pth(18.93 MB)
IFUNet.pth(123.46 MB)
M2M.pth(29.10 MB)
PANx2_DF2K.pth(1.02 MB)
PANx3_DF2K.pth(1.02 MB)
PANx4_DF2K.pth(1.06 MB)
RealBasicVSR_x4.pth(200.72 MB)
realesr-animevideov3.onnx(2.37 MB)
realesr-general-wdn-x4v3_opset13.onnx(4.63 MB)
realesr-general-wdn-x4v3_opset14.onnx(4.63 MB)
realesr-general-wdn-x4v3_opset15.onnx(4.63 MB)
realesr-general-wdn-x4v3_opset16.onnx(4.63 MB)
RealESRGANv2-animevideo-xsx2.pth(2.30 MB)
RealESRGANv2-animevideo-xsx2_opset13.onnx(2.29 MB)
RealESRGANv2-animevideo-xsx2_opset15.onnx(2.29 MB)
RealESRGANv2-animevideo-xsx2_opset16.onnx(2.29 MB)
RealESRGANv2-animevideo-xsx4.onnx(2.37 MB)
RealESRGANv2-animevideo-xsx4.pth(2.38 MB)
RealESRGAN_x4plus_anime_6B.pth(17.10 MB)
RealESRGAN_x4plus_anime_6B_opset13.onnx(17.08 MB)
RealESRGAN_x4plus_anime_6B_opset14.onnx(17.08 MB)
RealESRGAN_x4plus_anime_6B_opset15.onnx(17.08 MB)
RealESRGAN_x4plus_anime_6B_opset16.onnx(17.08 MB)
rife40.pth(32.15 MB)
rife40_ensembleFalse_fastTrue_opset16.onnx(19.76 MB)
rife40_ensembleTrue_fastFalse_opset16.onnx(32.29 MB)
rife41.pth(32.15 MB)
rife41_ensembleFalse_fastTrue_opset16.onnx(19.77 MB)
rife41_ensembleTrue_fastFalse_opset16.onnx(32.30 MB)
rife42.pth(32.11 MB)
rife42_ensembleFalse_fastTrue_opset16.onnx(19.74 MB)
rife42_ensembleTrue_fastFalse_opset16.onnx(32.27 MB)
rife43.pth(32.11 MB)
rife43_ensembleFalse_fastTrue_opset16.onnx(19.74 MB)
rife43_ensembleTrue_fastFalse_opset16.onnx(32.27 MB)
rife44.pth(32.11 MB)
rife44_ensembleFalse_fastTrue_opset16.onnx(19.74 MB)
rife44_ensembleTrue_fastFalse_opset16.onnx(32.27 MB)
rife45.pth(20.16 MB)
rife45_ensembleFalse_opset16.onnx(20.21 MB)
rife45_ensembleTrue_opset16.onnx(20.25 MB)
rife46.pth(20.28 MB)
rife46_ensembleFalse_opset16.onnx(20.31 MB)
rife46_ensembleFalse_opset17.onnx(20.32 MB)
rife46_ensembleTrue_opset16.onnx(20.34 MB)
rife46_ensembleTrue_opset17.onnx(20.37 MB)
rvpV1_105661_G.pt(68.09 MB)
rvpV1_105661_G.pth(67.61 MB)
scunet_color_15.pth(68.64 MB)
scunet_color_25.pth(68.64 MB)
scunet_color_50.pth(68.64 MB)
scunet_color_real_gan.pth(68.64 MB)
scunet_color_real_psnr.pth(68.64 MB)
sc_efficientformerv2_s0+rife46_84119_224.pth(12.92 MB)
sc_efficientformerv2_s0_12263_224.pth(12.91 MB)
sc_efficientformerv2_s0_29735_224.pth(12.91 MB)
sc_efficientnetv2b0+rife46_flow_1362_256.pth(22.75 MB)
sc_efficientnetv2b0_17957_256.pth(22.73 MB)
sc_efficientnetv2b0_int8_18964_256.pth(23.21 MB)
sc_maxvit_small+rife46_1512_224.pth(258.29 MB)
sc_maxvit_small_9072_224.pth(258.25 MB)
sc_regnetz_005_33142_256.pth(23.64 MB)
sc_repvgg_b0_7575_256.pth(55.76 MB)
sc_resnetrs50_4840_256.pth(128.69 MB)
sc_resnetv2_50_1815_256.pth(89.97 MB)
sc_rexnet_100_7264_256.pth(13.72 MB)
sc_swinv2_small_window16+rife46_1814_256.pth(192.04 MB)
sc_swinv2_small_window16_10412_256.pth(191.94 MB)
sc_TimeSformer_2592_224.pth(241.98 MB)
sc_uniformerv2_b16_36288_224.pth(513.07 MB)
sepconv.pth(51.76 MB)
stmfnet.pth(80.68 MB)
sudo_RealESRGAN2x_Dropout_3.799.042_opset13.onnx(16.94 MB)
sudo_RealESRGAN2x_Dropout_3.799.042_opset14.onnx(16.94 MB)
sudo_RealESRGAN2x_Dropout_3.799.042_opset15.onnx(16.94 MB)
sudo_RealESRGAN2x_Dropout_3.799.042_opset16.onnx(16.94 MB)
sudo_rife4_269.662_testV1_ensembleFalse_fastTrue.bin(9.86 MB)
sudo_rife4_269.662_testV1_ensembleFalse_fastTrue.param(19.76 KB)
sudo_rife4_269.662_testV1_ensembleTrue_fastFalse.bin(26.49 MB)
sudo_rife4_269.662_testV1_ensembleTrue_fastFalse.param(61.86 KB)
sudo_rife4_269.662_testV1_ensembleTrue_fastTrue.bin(19.72 MB)
sudo_rife4_269.662_testV1_ensembleTrue_fastTrue.param(41.15 KB)
sudo_rife4_269.662_testV1_scale1.pth(32.15 MB)
sudo_UltraCompact_2x_1.121.175_G.pth(1.16 MB)
sudo_UltraCompact_2x_1.121.175_G_opset13.onnx(1.16 MB)
sudo_UltraCompact_2x_1.121.175_G_opset14.onnx(1.16 MB)
sudo_UltraCompact_2x_1.121.175_G_opset15.onnx(1.16 MB)
sudo_UltraCompact_2x_1.121.175_G_opset16.onnx(1.16 MB)
vapsr2x_opset16.onnx(1.35 MB)
vapsr3x_opset16.onnx(1.38 MB)
vapsr4x_opset16.onnx(1.41 MB)
vs_precompiled_colab.7z(112.06 MB)
waifu2x_anime_style_art_noise1_model.onnx(1.09 MB)
waifu2x_anime_style_art_noise2_model.onnx(1.09 MB)
waifu2x_anime_style_art_noise3_model.onnx(1.09 MB)
waifu2x_anime_style_art_rgb_noise0_model.onnx(1.11 MB)
waifu2x_anime_style_art_rgb_noise1_model.onnx(1.11 MB)
waifu2x_anime_style_art_rgb_noise2_model.onnx(1.11 MB)
waifu2x_anime_style_art_rgb_noise3_model.onnx(1.11 MB)
waifu2x_anime_style_art_rgb_scale2.0x_model.onnx(1.11 MB)
waifu2x_anime_style_art_scale2.0x_model.onnx(1.09 MB)
waifu2x_cunet_noise0_model.onnx(4.90 MB)
waifu2x_cunet_noise0_scale2.0x_model.onnx(4.91 MB)
waifu2x_cunet_noise1_model.onnx(4.90 MB)
waifu2x_cunet_noise1_scale2.0x_model.onnx(4.91 MB)
waifu2x_cunet_noise2_model.onnx(4.90 MB)
waifu2x_cunet_noise2_scale2.0x_model.onnx(4.91 MB)
waifu2x_cunet_noise3_model.onnx(4.90 MB)
waifu2x_cunet_noise3_scale2.0x_model.onnx(4.91 MB)
waifu2x_cunet_scale2.0x_model.onnx(4.91 MB)
waifu2x_photo_noise0_model.onnx(1.11 MB)
waifu2x_photo_noise1_model.onnx(1.11 MB)
waifu2x_photo_noise2_model.onnx(1.11 MB)
waifu2x_photo_noise3_model.onnx(1.11 MB)
waifu2x_photo_scale2.0x_model.onnx(1.11 MB)
waifu2x_ukbench_scale2.0x_model.onnx(1.11 MB)
waifu2x_upconv_7_anime_style_art_rgb_noise0_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_anime_style_art_rgb_noise1_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_anime_style_art_rgb_noise2_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_anime_style_art_rgb_noise3_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_anime_style_art_rgb_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_noise0_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_noise1_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_noise2_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_noise3_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_scale2.0x_model.onnx(2.10 MB)
waifu2x_upresnet10_noise0_scale2.0x_model.onnx(1.61 MB)
waifu2x_upresnet10_noise1_scale2.0x_model.onnx(1.61 MB)
waifu2x_upresnet10_noise2_scale2.0x_model.onnx(1.61 MB)
waifu2x_upresnet10_noise3_scale2.0x_model.onnx(1.61 MB)
waifu2x_upresnet10_scale2.0x_model.onnx(1.61 MB)

Owner

I like Google Colab and Python.

GitHub Repository

This is an official repository of CLGo: Learning to Predict 3D Lane Shape and Camera Pose from a Single Image via Geometry Constraints

CLGo This is an official repository of CLGo: Learning to Predict 3D Lane Shape and Camera Pose from a Single Image via Geometry Constraints An earlier

32 Dec 20, 2022

PyTorch implementation of ENet

PyTorch-ENet PyTorch (v1.1.0) implementation of ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation, ported from the lua-torc

333 Dec 29, 2022

Garbage classification using structure data.

垃圾分类模型使用说明 1.包含以下数据文件文件描述 data/MaterialMapping.csv 物体以及其归类的信息 data/TestRecords 光谱原始测试数据 CSV 文件 data/TestRecordDesc.zip CSV 文件描述文件 data/Boundaries.cs

1 Dec 10, 2021

Learning hidden low dimensional dyanmics using a Generalized Onsager Principle and neural networks

OnsagerNet Learning hidden low dimensional dyanmics using a Generalized Onsager Principle and neural networks This is the original pyTorch implemenati

3 Aug 24, 2022

[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

897 Jan 05, 2023

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset

COPA-SSE Repository for COPA-SSE: Semi-Structured Explanations for Commonsense Reasoning. COPA-SSE contains crowdsourced explanations for the Balanced

5 Jul 31, 2022

An official PyTorch implementation of the TKDE paper "Self-Supervised Graph Representation Learning via Topology Transformations".

Self-Supervised Graph Representation Learning via Topology Transformations This repository is the official PyTorch implementation of the following pap

2 Oct 31, 2022

Multi-query Video Retreival

17 Nov 22, 2022

Code for "PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds", CVPR 2021

PV-RAFT This repository contains the PyTorch implementation for paper "PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clou

43 Dec 05, 2022

Python PID Tuner - Based on a FOPDT model obtained using a Open Loop Process Reaction Curve

PythonPID_Tuner Step 1: Takes a Process Reaction Curve in csv format - assumes data at 100ms interval (column names CV and PV) Step 2: Makes a rough e

6 Jan 14, 2022

ByteTrack超详细教程！训练自己的数据集&&摄像头实时检测跟踪

45 Dec 19, 2022

[CVPR 2022 Oral] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

EPro-PnP EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation In CVPR 2022 (Oral). [paper] Hanshen

842 Jan 04, 2023

Using VapourSynth with super resolution models and speeding them up with TensorRT.

Related tags

Overview

VSGAN-tensorrt-docker

Comments

Invalid data found when processing input

Module not found 'scipy'

Tutorial?

Trying On A M1 Mac

Crash when using RIFE ensemble models in vsmlrt

Support for AITemplate?

CUDA out of Memory

No module named 'vsbasicvsrpp'

Google colab request?

model conversion from onnx to trt

ESRGAN with full episode

[SUGGESTION] per-scene processing

Releases(models)

models(Feb 11, 2022)

Owner

This is an official repository of CLGo: Learning to Predict 3D Lane Shape and Camera Pose from a Single Image via Geometry Constraints

PyTorch implementation of ENet

Garbage classification using structure data.

Learning hidden low dimensional dyanmics using a Generalized Onsager Principle and neural networks

[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset

An official PyTorch implementation of the TKDE paper "Self-Supervised Graph Representation Learning via Topology Transformations".

Multi-query Video Retreival

Code for "PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds", CVPR 2021

Python PID Tuner - Based on a FOPDT model obtained using a Open Loop Process Reaction Curve

ByteTrack超详细教程！训练自己的数据集&&摄像头实时检测跟踪

[CVPR 2022 Oral] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

[ICCV2021] Learning to Track Objects from Unlabeled Videos

PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators

Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation".

natural image generation using ConvNets

Over-the-Air Ensemble Inference with Model Privacy

Radar-to-Lidar: Heterogeneous Place Recognition via Joint Learning

code for our paper "Source Data-absent Unsupervised Domain Adaptation through Hypothesis Transfer and Labeling Transfer"