Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Last update: Dec 20, 2022

Artifact Detection/Correction - Official PyTorch Implementation

This repo provides the official PyTorch implementation of the following paper:

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?
Hwanil Choi, Wonjoon Chang, Jaesik Choi*
Korea Advanced Institute of Science and Technology, KAIST

Abstract
Even though image generation with Generative Adversarial Networks (GANs) has been showing remarkable ability to generate high-quality images, GANs do not always guarantee photorealistic images will be generated. Sometimes they generate images that have defective or unnatural objects, which are referred to as 'artifacts'. Research to determine why the artifacts emerge and how they can be detected and removed has not been sufficiently carried out. To analyze this, we first hypothesize that rarely activated neurons and frequently activated neurons have different purposes and responsibilities for the progress of generating images. By analyzing the statistics and the roles for those neurons, we empirically show that rarely activated neurons are related to failed results of making diverse objects and lead to artifacts. In addition, we suggest a correction method, called 'sequential ablation', to repair the defective part of the generated images without complex computational cost and manual efforts.
https://arxiv.org/abs/1812.04948
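The statistic the abstract relies on, how often each neuron is activated over a set of generated samples, can be illustrated with a small sketch. This is not the authors' implementation: the function names, the activation threshold of 0.0, the `max_rate` cutoff and the toy data are all assumptions chosen for illustration, and "ablation" here simply means setting a neuron's output to zero. Plain Python lists stand in for the feature maps of a real generator.

```python
from typing import List, Sequence


def activation_rates(activations: Sequence[Sequence[float]],
                     threshold: float = 0.0) -> List[float]:
    """Fraction of samples in which each neuron's activation exceeds `threshold`.

    activations[s][n] is the activation of neuron n for sample s.
    """
    if not activations:
        raise ValueError("need at least one sample")
    counts = [0] * len(activations[0])
    for sample in activations:
        for n, value in enumerate(sample):
            if value > threshold:
                counts[n] += 1
    return [c / len(activations) for c in counts]


def rare_neurons(rates: Sequence[float], max_rate: float) -> List[int]:
    """Indices of neurons whose activation rate is at most `max_rate` (assumed cutoff)."""
    return [n for n, r in enumerate(rates) if r <= max_rate]


def rare_activation_score(sample: Sequence[float], rare: Sequence[int],
                          threshold: float = 0.0) -> int:
    """Number of rarely activated neurons that fire for this sample."""
    return sum(1 for n in rare if sample[n] > threshold)


def ablate(sample: Sequence[float], rare: Sequence[int]) -> List[float]:
    """Copy of `sample` with the given neurons set to zero."""
    rare_set = set(rare)
    return [0.0 if n in rare_set else v for n, v in enumerate(sample)]


# Toy data (hypothetical): 4 samples x 3 neurons.
toy = [
    [1.0, 0.0, 0.5],
    [2.0, 0.0, 0.0],
    [0.3, 0.9, 0.0],
    [1.5, 0.0, 0.0],
]
rates = activation_rates(toy)
rare = rare_neurons(rates, max_rate=0.25)
scores = [rare_activation_score(s, rare) for s in toy]
repaired = ablate(toy[2], rare)
print(rates, rare, scores, repaired)
```

In this toy setting, samples with a higher score activate more of the rarely used neurons, which is the kind of signal the abstract associates with artifacts.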

Dependencies

PyTorch 1.4.0
Python 3.6
CUDA 10.0.x
cuDNN 7.6.3

Pre-Trained Models (Official) - GenForce

Dataset \ Model	PGGAN	StyleGAN2
CelebA-HQ (Official)	1024 x 1024	X
FFHQ (Official)	X	1024 x 1024
LSUN-Church (Official)	256 x 256	256 x 256
LSUN-CAT (Official)	256 x 256	256 x 256

To run the implementation below, download the StyleGAN2 FFHQ weights into the current directory. If you store them elsewhere, set the '--weight_path' option to that directory.

More pre-trained weights are available in the genforce-model-zoo.

Optional: StyleGAN3

Implementation

Options

optional arguments:
  -h, --help                show this help message and exit
  --gpu GPU                 gpu index number
  --batch_size BATCH_SIZE
                            batch size for preprocessing and generation
  --sample_size SAMPLE_SIZE
                            sample size for statistics
  --freq_path FREQ_PATH
                            loading saved frequencies of neurons
  --model MODEL             pggan, stylegan2
  --dataset DATASET         ffhq, cat, church, etc
  --resolution RESOLUTION
                            dataset resolution
  --weight_path WEIGHT_PATH
                            pre-trained weight path
  --detection DETECTION
                            implement normal/artifact detection
  --correction CORRECTION
                            implement correction task

Usage

python main.py --gpu 0 --batch_size 30 --sample_size 30000 --freq_path ./stats \
               --model stylegan2 --dataset ffhq --resolution 1024 --weight_path ./ \
               --detection True --correction True
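For readers who want to see how options like these could be declared, here is a minimal argparse sketch. It is not the repo's actual main.py: the argument types, the default values and the `str2bool` helper are assumptions, while the option names and help strings follow the listing above. The helper exists because argparse's `type=bool` would treat any non-empty string, including "False", as true.

```python
import argparse


def str2bool(value: str) -> bool:
    """Parse 'True'/'False'-style strings into real booleans (hypothetical helper)."""
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented options; defaults are assumed."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--gpu", type=int, default=0, help="gpu index number")
    parser.add_argument("--batch_size", type=int, default=30,
                        help="batch size for preprocessing and generation")
    parser.add_argument("--sample_size", type=int, default=30000,
                        help="sample size for statistics")
    parser.add_argument("--freq_path", type=str, default="./stats",
                        help="loading saved frequencies of neurons")
    parser.add_argument("--model", type=str, default="stylegan2",
                        choices=["pggan", "stylegan2"])
    parser.add_argument("--dataset", type=str, default="ffhq",
                        help="ffhq, cat, church, etc")
    parser.add_argument("--resolution", type=int, default=1024,
                        help="dataset resolution")
    parser.add_argument("--weight_path", type=str, default="./",
                        help="pre-trained weight path")
    parser.add_argument("--detection", type=str2bool, default=True,
                        help="implement normal/artifact detection")
    parser.add_argument("--correction", type=str2bool, default=True,
                        help="implement correction task")
    return parser


# Parse the same flags as the usage example, passed explicitly as a list.
args = build_parser().parse_args([
    "--gpu", "0", "--batch_size", "30", "--sample_size", "30000",
    "--freq_path", "./stats", "--model", "stylegan2", "--dataset", "ffhq",
    "--resolution", "1024", "--weight_path", "./",
    "--detection", "True", "--correction", "False",
])
print(args.detection, args.correction)
```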

If you are working on a remote server, enable X11 forwarding so that the results can be displayed.
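Before launching the script over SSH, it can help to confirm that a display is actually reachable. The sketch below checks the DISPLAY environment variable, which SSH sets on the remote side when X11 forwarding is active; the function name `has_x11_display` is hypothetical and not part of this repo.

```python
import os
from typing import Mapping, Optional


def has_x11_display(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True if DISPLAY is set to a non-empty value in `env`.

    Defaults to the current process environment when `env` is None.
    """
    env = os.environ if env is None else env
    return bool(env.get("DISPLAY", "").strip())


if not has_x11_display():
    # Result windows cannot open without a display.
    print("DISPLAY is not set; reconnect with X11 forwarding enabled.")
```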

You can also run our code in a Jupyter Notebook, which gives you more flexibility. Use the 'notebook.ipynb' file.

Detection results for 50K samples

Bottom 60 images

Top 60 images

Correction results

Owner

CHOI HWAN IL
