Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Last update: Dec 20, 2022

Overview

Artifact Detection/Correction - Official PyTorch Implementation

This repo provides the official PyTorch implementation of the following paper:

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?
Hwanil Choi, Wonjoon Chang, Jaesik Choi*
Korea Advanced Institute of Science and Technology, KAIST

Abstract
Even though image generation with Generative Adversarial Networks (GANs) has been showing remarkable ability to generate high-quality images, GANs do not always guarantee photorealistic images will be generated. Sometimes they generate images that have defective or unnatural objects, which are referred to as 'artifacts'. Research to determine why the artifacts emerge and how they can be detected and removed has not been sufficiently carried out. To analyze this, we first hypothesize that rarely activated neurons and frequently activated neurons have different purposes and responsibilities for the progress of generating images. By analyzing the statistics and the roles for those neurons, we empirically show that rarely activated neurons are related to failed results of making diverse objects and lead to artifacts. In addition, we suggest a correction method, called 'sequential ablation', to repair the defective part of the generated images without complex computational cost and manual efforts.
https://arxiv.org/abs/1812.04948
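
To make the idea in the abstract concrete, here is a minimal sketch of measuring how often each neuron activates and zeroing out the rarely activated ones. It is not the code in this repository: the function names, the `threshold` parameter, and the toy data are assumptions made for illustration, and it works on a single layer only, whereas the paper's sequential ablation is applied across layers.

```python
import numpy as np


def activation_rates(activations: np.ndarray) -> np.ndarray:
    """Fraction of samples in which each neuron is active (output > 0).

    `activations` has shape (num_samples, num_neurons).
    """
    return (activations > 0).mean(axis=0)


def rare_neuron_mask(rates: np.ndarray, threshold: float) -> np.ndarray:
    """Boolean mask of neurons whose activation rate is below `threshold`.

    `threshold` is a hypothetical hyperparameter, not a value from the paper.
    """
    return rates < threshold


def ablate(features: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Return a copy of `features` with the masked neurons set to zero."""
    out = features.copy()
    out[..., mask] = 0.0
    return out


# Toy data: 4 samples, 3 neurons. Neuron 2 fires in only one sample.
acts = np.array([
    [0.5, 0.0, 0.0],
    [1.2, 0.3, 0.0],
    [0.7, 0.0, 0.9],
    [0.1, 0.4, 0.0],
])
rates = activation_rates(acts)       # per-neuron activation frequency
mask = rare_neuron_mask(rates, 0.3)  # selects only the rarely active neuron
corrected = ablate(acts[2], mask)    # third sample with that neuron removed
```

In a real generator the activations would come from an intermediate layer collected over many sampled latent codes (the `--sample_size` option controls how many), and the ablated features would be passed through the remaining layers to render the corrected image.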

Dependencies

PyTorch 1.4.0
Python 3.6
CUDA 10.0.x
cuDNN 7.6.3
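
One way to compare your environment against the versions listed above is to print what the interpreter and PyTorch report. This is a small sketch, not part of the repository; the helper name `report_environment` is hypothetical, and it falls back gracefully when PyTorch is not installed.

```python
import platform


def report_environment() -> dict:
    """Collect the versions relevant to the dependency list."""
    info = {"python": platform.python_version()}
    try:
        import torch
    except ImportError:
        info["torch"] = None  # PyTorch is not installed
        return info
    info["torch"] = torch.__version__
    info["cuda"] = torch.version.cuda               # None on CPU-only builds
    info["cudnn"] = torch.backends.cudnn.version()  # None if cuDNN is unavailable
    return info


for name, version in report_environment().items():
    print(f"{name}: {version}")
```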

Pre-Trained Models (Official) - GenForce

Dataset \ Model	PGGAN	StyleGAN2
CelebA-HQ (Official)	1024 x 1024	X
FFHQ (Official)	X	1024 x 1024
LSUN-Church (Official)	256 x 256	256 x 256
LSUN-CAT (Official)	256 x 256	256 x 256

For the usage example below, download the StyleGAN2 FFHQ weights into the current directory. Otherwise, set the '--weight_path' option to the directory that contains them.

More pre-trained weights are available in genforce-model-zoo

Optional: StyleGAN3

Implementation

Options

optional arguments:
  -h, --help                show this help message and exit
  --gpu GPU                 gpu index number
  --batch_size BATCH_SIZE
                            batch size for pre processing and generating process
  --sample_size SAMPLE_SIZE
                            sample size for statistics
  --freq_path FREQ_PATH
                            loading saved frequencies of neurons
  --model MODEL             pggan, stylegan2
  --dataset DATASET         ffhq, cat, church, etc
  --resolution RESOLUTION
                            dataset resolution
  --weight_path WEIGHT_PATH
                            pre-trained weight path
  --detection DETECTION
                            implement normal/artifact detection
  --correction CORRECTION
                            implement correction task
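
For readers who want to script around these options, the help text above could come from an argparse parser along the following lines. This is a sketch only and may differ from the repository's main.py: the argument types and the `str2bool` helper are assumptions. The helper is there because passing `type=bool` to argparse would treat the string 'False' as true.

```python
import argparse


def str2bool(value: str) -> bool:
    """Parse 'True'/'False' style strings; bool('False') would be True."""
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented options (types are assumed)."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--gpu", type=int, help="gpu index number")
    parser.add_argument("--batch_size", type=int,
                        help="batch size for pre processing and generating process")
    parser.add_argument("--sample_size", type=int, help="sample size for statistics")
    parser.add_argument("--freq_path", type=str,
                        help="loading saved frequencies of neurons")
    parser.add_argument("--model", type=str, help="pggan, stylegan2")
    parser.add_argument("--dataset", type=str, help="ffhq, cat, church, etc")
    parser.add_argument("--resolution", type=int, help="dataset resolution")
    parser.add_argument("--weight_path", type=str, help="pre-trained weight path")
    parser.add_argument("--detection", type=str2bool,
                        help="implement normal/artifact detection")
    parser.add_argument("--correction", type=str2bool, help="implement correction task")
    return parser


# Parse an explicit argument list instead of sys.argv so the example is reproducible.
args = build_parser().parse_args([
    "--gpu", "0", "--model", "stylegan2", "--dataset", "ffhq",
    "--resolution", "1024", "--freq_path", "./stats",
    "--detection", "True", "--correction", "False",
])
```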

Usage

python main.py --gpu 0 --batch_size 30 --sample_size 30000 --freq_path ./stats \
               --model stylegan2 --dataset ffhq --resolution 1024 --weight_path ./ \
               --detection True --correction True

If you are running on a remote server, you need X11 forwarding to display the results.

You can also run our code in a Jupyter notebook, which gives you more flexibility. Use the 'notebook.ipynb' file.

Detection results for 50K samples

Bottom 60 images

Top 60 images

Correction results

Owner

CHOI HWAN IL
