ArcaneGAN by Alex Spirin

Last update: Dec 28, 2022

Related tags

Deep Learning ArcaneGAN

Overview

ArcaneGAN by Alex Spirin

Changelog

2021-12-12 ArcaneGAN v0.3 is live
2021-12-09 Thanks to ak92501 we now have a huggingface demo

ArcaneGAN v0.3

Videos processed by the huggingface video inference colab.

obama2.mp4

ryan2.mp4

Image samples

Faces were enhanced via GPEN before applying the ArcaneGAN v0.3 filter.

ArcaneGAN v0.2

The release is here

Implementation Details

It does something, but not much at the moment.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Comments

How to convert the FastAI model to Pytorch JIT

Hi,

I trained a model with unet_learner but I can't convert it to jit.

I run the following code: torch.jit.save(torch.jit.script(learn.model), 'jit.pt')

Here is the error:

UnsupportedNodeError: GeneratorExp aren't supported: File "/usr/local/lib/python3.7/dist-packages/fastai/callbacks/hooks.py", line 21 "Applieshook_functomodule,input,output." if self.detach: input = (o.detach() for o in input ) if is_listy(input ) else input.detach() ~ <--- HERE output = (o.detach() for o in output) if is_listy(output) else output.detach() self.stored = self.hook_func(module, input, output)

May I know how you convert it to a jit model? Thanks

opened by ramtiin 2
Ошибка

Добрый вечер.В ArcaneGAN на colab for videos,выдаёт ошибку:

RuntimeError: CUDA out of memory. Tried to allocate 2.80 GiB (GPU 0; 11.17 GiB total capacity; 5.74 GiB already allocated; 2.21 GiB free; 8.44 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

Помогите пожалуйста!

opened by Zzip7 2
How do you change the style of the whole image

Nice work! My only confusion is how you change the style of the whole image instead of just the face. Usually, StyleGAN generates aligned face images by fine-tuning the FFHQ checkpoint. How does the pix2pix model trained with these face image pairs work with the full image or frame.

opened by zhanglonghao1992 2
Architecture for video

Hi, what does the architecture look like? Is it similar to Pix2Pix? And for processing of the video, are you doing anything extra to make sure the frames are consistent?

opened by unography 2
How to prevent eyes occur in nose?

Hello, I try your model and it's amazing, but I find in some pictures if the nose is too big, there will be eyes in the nose. I try to lower the 'target_face' and it can work. But the details like the light of the eyes and background will also lose when I lower the 'target_face'. So I wonder is there a way to prevent the eyes occurs in the nose and keep the details in the meantime?

opened by Folkfive 1
support arbitrary image size?

Great work!

The unet prediction result will be cropped to be the same size as the training input, e.g. 256 or 512. For arbitrary image size (e.g. 1280*720), how to config or set the model to output the same size of the input image as your colab did? Thank you.

opened by foobarhe 1
RuntimeError: CUDA out of memory

Добрый вечер.Извините,это опять я.Снова эта ошибка появляется.Можно ли,самому эту ошибку решать?Или исправлять можете только вы?Обьясните пожалуйста подробно.

opened by Zzip7 1
about the paired datasets generated by stylegan

how do you make sure the background and expression similarity between the generated input(face) and target(style face) ? I find that the style is too weak when less finetune and the similarity is too weak when more finetune, how do you solve it ? Would you like to share the paired datasets generated code with me ? thanks a lot ~

opened by Leocien 1
Any news for training code?

Interesting topic... I wonder how you trained the model, especially the augmentation part. Fixed crop limitation is a well-known problem and would like to know how you handle it. :)

opened by dongyun-kim-arch 0
tuple issue

Was trying the ArcaneGan video colab but I am having a tuple issue can you please help, i am really excited to try the Arcane video can you please help out

opened by mau021 0
What GPU is used for training?

Hi,

I want to train the Fastai u-net model. However, when I try to train the critic (learn_critic.fit_one_cycle(6, 1e-3)), I get the following error:

CUDA out of memory. Tried to allocate 4.00 GiB (GPU 0; 14.76 GiB total capacity; 9.78 GiB already allocated; 891.75 MiB free; 12.57 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

The GPU is a Tesla T4 with 16 GB of VRAM. My batch size is 4 and the training images size is 512*512. I also tried with lower numbers, but I'm still getting the same error.

opened by ramtiin 2
How to make the style stronger?

The following are input image, my training output from pair label supervision, and the output from your test model。 I trained my model (Super-Resolution model) on the images from your model outputs, I find it difficult to change the facial features。 Like the eyes and face texture are changed, how to do it ? I use L1Loss (weight is 1) + PerceptualLoss (weight is 1)+ GANLoss (weight is 0.1),

opened by xuanandsix 1

Releases(v0.4)

v0.4(Dec 25, 2021)
ArcaneGAN v0.4

The main differences are:

lighter styling (closer to original input)

sharper result

happier faces

reduced childish eyes effect

reduced stubble on feminine faces

increased temporal stability on videos

reduced mouth\teeth artifacts

Image samples

v0.3 vs v0.4

Video samples

https://user-images.githubusercontent.com/11751592/146966428-f4e27929-19dd-423f-a772-8aee709d2116.mp4

https://user-images.githubusercontent.com/11751592/146966462-6511998e-77f5-4fd2-8ad9-5709bf0cd172.mp4
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.4.jit(59.75 MB)
v0.3(Dec 12, 2021)

ArcaneGAN v0.3

Video samples

This is a stronger-styled version. It performs okay on videos, though visible flickering is present. Here are some video examples.

https://user-images.githubusercontent.com/11751592/145702737-c02b8b00-ad30-4358-98bf-97c8ad7fefdf.mp4

https://user-images.githubusercontent.com/11751592/145702740-afd3377d-d117-467d-96ca-045e25d85ac6.mp4

Image samples

Faces were enhanced via GPEN before applying the ArcaneGAN v0.3 filter.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.3.jit(79.40 MB)
v0.2(Dec 7, 2021)

ArcaneGAN v0.2 This version is a bit better at doing something other than making images darker :D

Here are some image pairs. I've specifically picked various images to see how the model performs in the wild, not on aligned and cropped faces.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Inference notebook is here
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.2.jit(79.52 MB)
v0.1(Dec 6, 2021)

ArcaneGAN v0.1 This is a proof of concept release. The model is in beta (which means it's beta than nothin')

Here are some image pairs. I've specifically picked various images to see how the model performs in the wild, not on aligned and cropped faces.

It does something, but not much at the moment.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Inference notebook is here
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.1.jit(79.53 MB)

Owner

Alex

GitHub Repository

Object detection on multiple datasets with an automatically learned unified label space.

Simple multi-dataset detection An object detector trained on multiple large-scale datasets with a unified label space; Winning solution of E

407 Dec 30, 2022

Meli Data Challenge 2021 - First Place Solution

My solution for the Meli Data Challenge 2021

23 Mar 09, 2022

METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)

Nautilus-OCR The National Library of Luxembourg (BnL) started its first initiative in digitizing newspapers, with layout recognition and OCR on articl

36 Dec 05, 2022

nnFormer: Interleaved Transformer for Volumetric Segmentation

nnFormer: Interleaved Transformer for Volumetric Segmentation Code for paper "nnFormer: Interleaved Transformer for Volumetric Segmentation ". Please

610 Dec 28, 2022

Python Auto-ML Package for Tabular Datasets

Tabular-AutoML AutoML Package for tabular datasets Tabular dataset tuning is now hassle free! Run one liner command and get best tuning and processed

18 Nov 20, 2022

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).

Fisher Induced Sparse uncHanging (FISH) Mask This repo contains the code for Fisher Induced Sparse uncHanging (FISH) Mask training, from "Training Neu

37 Dec 30, 2022

Stacked Hourglass Network with a Multi-level Attention Mechanism: Where to Look for Intervertebral Disc Labeling

⚠️ ‎‎‎ A more recent and actively-maintained version of this code is available in ivadomed Stacked Hourglass Network with a Multi-level Attention Mech

14 Oct 24, 2022

Fedlearn支持前沿算法研发的Python工具库 | Fedlearn algorithm toolkit for researchers

FedLearn-algo Installation Development Environment Checklist python3 (3.6 or 3.7) is required. To configure and check the development environment is c

89 Nov 14, 2022

Diverse Object-Scene Compositions For Zero-Shot Action Recognition

Diverse Object-Scene Compositions For Zero-Shot Action Recognition This repository contains the source code for the use of object-scene compositions f

7 Sep 21, 2022

Face recognition with trained classifiers for detecting objects using OpenCV

Face_Detector Face recognition with trained classifiers for detecting objects using OpenCV Libraries required to be installed using pip Command: cv2 n

0 Oct 31, 2021

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples This repository is the official implementation of paper [Qimera: Data-free Q

21 Nov 03, 2022

Super Resolution for images using deep learning.

Neural Enhance Example #1 — Old Station: view comparison in 24-bit HD, original photo CC-BY-SA @siv-athens. As seen on TV! What if you could increase

11.7k Dec 29, 2022

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

Region_Learner The Pytorch implementation for "Video-Text Pre-training with Learned Regions" (arxiv) We are still cleaning up the code further and pre

0 Mar 20, 2022

Vertex AI: Serverless framework for MLOPs (ESP / ENG)

Vertex AI: Serverless framework for MLOPs (ESP / ENG) Español Qué es esto? Este repo contiene un pipeline end to end diseñado usando el SDK de Kubeflo

2 Apr 28, 2022

PRTR: Pose Recognition with Cascade Transformers

PRTR: Pose Recognition with Cascade Transformers Introduction This repository is the official implementation for Pose Recognition with Cascade Transfo

133 Dec 30, 2022

Point-NeRF: Point-based Neural Radiance Fields

Point-NeRF: Point-based Neural Radiance Fields Project Sites | Paper | Primary c

662 Jan 01, 2023

pytorch, hand(object) detect ,yolo v5，手检测

YOLO V5 物体检测，包括手部检测。项目介绍手部检测手部检测示例如下：视频示例：项目配置作者开发环境： Python 3.7 PyTorch = 1.5.1 数据集手部检测数据集该项目数据集采用 TV-Hand 和 COCO-Hand (COCO-Hand-Big 部分) 进

11 Dec 20, 2022

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs Abstract: Image-to-image translation has recently achieved re

23 Apr 14, 2022

Prompts - Read a textfile of prompts and import into anki via ankiconnect

prompts read a textfile of prompts and import into anki via ankiconnect Usage In

2 Jul 28, 2022

Clustering with variational Bayes and population Monte Carlo

pypmc pypmc is a python package focusing on adaptive importance sampling. It can be used for integration and sampling from a user-defined target densi

45 Feb 06, 2022

ArcaneGAN by Alex Spirin

Related tags

Overview

ArcaneGAN by Alex Spirin

ArcaneGAN v0.3

Image samples

ArcaneGAN v0.2

Implementation Details

Comments

Releases(v0.4)

v0.4(Dec 25, 2021)

ArcaneGAN v0.4

Image samples

Video samples

v0.3(Dec 12, 2021)

ArcaneGAN v0.3

Video samples

Image samples

v0.2(Dec 7, 2021)

v0.1(Dec 6, 2021)

Owner

Alex

Object detection on multiple datasets with an automatically learned unified label space.

Meli Data Challenge 2021 - First Place Solution

METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)

nnFormer: Interleaved Transformer for Volumetric Segmentation

Python Auto-ML Package for Tabular Datasets

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).

Stacked Hourglass Network with a Multi-level Attention Mechanism: Where to Look for Intervertebral Disc Labeling

Fedlearn支持前沿算法研发的Python工具库 | Fedlearn algorithm toolkit for researchers

Diverse Object-Scene Compositions For Zero-Shot Action Recognition

Face recognition with trained classifiers for detecting objects using OpenCV

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples

Super Resolution for images using deep learning.

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

Vertex AI: Serverless framework for MLOPs (ESP / ENG)

PRTR: Pose Recognition with Cascade Transformers

Point-NeRF: Point-based Neural Radiance Fields

pytorch, hand(object) detect ,yolo v5，手检测

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

Prompts - Read a textfile of prompts and import into anki via ankiconnect

Clustering with variational Bayes and population Monte Carlo