「PyTorch Implementation of AnimeGANv2」を用いて、生成した顔画像を元の画像に上書きするデモ

Last update: Oct 18, 2022

Overview

AnimeGANv2-Face-Overlay-Demo

PyTorch Implementation of AnimeGANv2を用いて、生成した顔画像を元の画像に上書きするデモです。

Requirement

mediapipe 0.8.9 or later
OpenCV 4.5.3.56 or later
onnxruntime-gpu 1.9.0 or later
※onnxruntimeでも動作しますが、推論時間がかかるのでGPUをお勧めします

処理速度参考値

GeForce GTX 1050 Ti：約3.3fps
GeForce RTX 3060：約9fps

Demo

デモの実行方法は以下です。

python main.py

--device
カメラデバイス番号の指定
デフォルト：0
--movie
動画ファイルの指定 ※指定時はカメラデバイスより優先
デフォルト：指定なし
--width
カメラキャプチャ時の横幅
デフォルト：960
--height
カメラキャプチャ時の縦幅
デフォルト：540
--fd_model_selection
顔検出モデル選択(0：2m以内の検出に最適なモデル、1：5m以内の検出に最適なモデル)
デフォルト：model/face_paint_512_v2_0.onnx
--min_detection_confidence
顔検出信頼値の閾値
デフォルト：0.5
--animegan_model
AnimeGANv2のモデル格納パス
デフォルト：model/face_paint_512_v2_0.onnx
--animegan_input_size
AnimeGANv2のモデルの入力サイズ
デフォルト：512
--ss_model_selection
モデル種類指定
0：Generalモデル(256x256x1 出力)
1：Landscapeモデル(144x256x1 出力)
デフォルト：0
--ss_score_th
スコア閾値(閾値以上：人間、閾値未満：背景)
デフォルト：0.1
--debug
デバッグウィンドウを表示するか否か
デフォルト：指定なし
--debug_subwindow_ratio
デバッグウィンドウの拡大率
デフォルト：0.5

※デバッグ表示有効時は以下のようなウィンドウを表示

Reference

bryandlee/animegan2-pytorch
Kazuhito00/AnimeGANv2-ONNX-Sample
同梱しているONNXはAnimeGANv2-ONNX-Sampleのノートブックを利用

Author

高橋かずひと(https://twitter.com/KzhtTkhs)

License

AnimeGANv2-Face-Overlay-Demo is under MIT License.

「PyTorch Implementation of AnimeGANv2」を用いて、生成した顔画像を元の画像に上書きするデモ

Related tags

Overview

AnimeGANv2-Face-Overlay-Demo

Requirement

処理速度参考値

Demo

Reference

Author

License

Owner

KazuhitoTakahashi

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

CHERRY is a python library for predicting the interactions between viral and prokaryotic genomes

Sum-Product Probabilistic Language

Instant-nerf-pytorch - NeRF trained SUPER FAST in pytorch

Rate-limit-semaphore - Semaphore implementation with rate limit restriction for async-style (any core)

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.

DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition, TPAMI 2021

Latex code for making neural networks diagrams

Implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

LabelImg is a graphical image annotation tool.

Deep Latent Force Models

Dynamic Slimmable Network (CVPR 2021, Oral)

A flexible framework of neural networks for deep learning

Pytorch implementation of AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

Codes for "CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation"

Optimizing Deeper Transformers on Small Datasets

BLEND: A Fast, Memory-Efficient, and Accurate Mechanism to Find Fuzzy Seed Matches

FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification