Artstation-Artistic-face-HQ Dataset (AAHQ)

Last update: Dec 16, 2022

Artstation-Artistic-face-HQ (AAHQ) is a high-quality dataset of artistic face images. It was proposed in:

BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation
Mingcong Liu, Qiang Li, Zekui Qin, Guoxin Zhang, Pengfei Wan, Wen Zheng
https://arxiv.org/abs/2110.11728
source code: https://github.com/onion-liu/BlendGAN

Images in this dataset are collected from the "portraits" channel of Artstation. Since the original images are subject to copyright, we do not make them available directly. Instead, we provide the image URLs and the associated face landmarks. The dataset consists of about 25,000 high-quality artistic images (fewer than the number reported in the paper, because the original URLs of some images are no longer valid). It covers a wide variety of painting styles, color tones and face attributes.

Please note that this dataset is made available for academic research purposes only. The copyright of the images belongs to the original owners. The dataset itself (including JSON metadata, download script, and documentation) is made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. You can use, redistribute, and adapt it for non-commercial purposes, as long as you (a) give appropriate credit by citing our paper, (b) indicate any changes that you've made, and (c) distribute any derivative works under the same license.

If any of the images belongs to you and you would like it removed, please let us know and we will remove it from our dataset immediately.

Caution: Images in AAHQ inherit all the biases of Artstation. Please be careful of unintended societal, gender, racial and other biases when training or deploying models trained on this data.

Prepare dataset

1. Download the original images from Artstation (~19 GB):

python download.py

2. Crop and align the images (~24 GB):

python face_alignment.py
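
As a rough illustration of what the alignment step involves, the sketch below computes an FFHQ-style oriented crop rectangle from eye and mouth landmarks. It is not the code in face_alignment.py: the function name ffhq_style_quad and its four-point input are hypothetical, the layout of the landmarks in the JSON metadata is not assumed here, and the constants (2.0, 1.8, 0.1) are taken from the commonly reproduced FFHQ alignment recipe, so the actual script may differ.

```python
import math


def ffhq_style_quad(eye_left, eye_right, mouth_left, mouth_right):
    """Return (quad, side) for an FFHQ-style oriented face crop.

    Hypothetical helper for illustration only. Each argument is an (x, y)
    point in image coordinates (y grows downward). `quad` lists the crop
    corners as top-left, bottom-left, bottom-right, top-right; `side` is the
    edge length of the square crop in source pixels.
    """
    eye_avg = ((eye_left[0] + eye_right[0]) / 2, (eye_left[1] + eye_right[1]) / 2)
    eye_to_eye = (eye_right[0] - eye_left[0], eye_right[1] - eye_left[1])
    mouth_avg = ((mouth_left[0] + mouth_right[0]) / 2, (mouth_left[1] + mouth_right[1]) / 2)
    eye_to_mouth = (mouth_avg[0] - eye_avg[0], mouth_avg[1] - eye_avg[1])

    # Horizontal crop axis: the eye line combined with the eye-to-mouth
    # vector rotated by 90 degrees, so both cues vote on the face orientation.
    x = (eye_to_eye[0] + eye_to_mouth[1], eye_to_eye[1] - eye_to_mouth[0])
    norm = math.hypot(*x)
    if norm == 0:
        raise ValueError("degenerate landmarks: cannot determine orientation")

    # Half-size of the crop, scaled from the eye distance or the
    # eye-to-mouth distance, whichever gives the larger box.
    scale = max(math.hypot(*eye_to_eye) * 2.0, math.hypot(*eye_to_mouth) * 1.8)
    x = (x[0] / norm * scale, x[1] / norm * scale)
    y = (-x[1], x[0])  # vertical crop axis, perpendicular to x

    # Crop center sits slightly below the eyes, toward the mouth.
    c = (eye_avg[0] + eye_to_mouth[0] * 0.1, eye_avg[1] + eye_to_mouth[1] * 0.1)

    quad = [
        (c[0] - x[0] - y[0], c[1] - x[1] - y[1]),
        (c[0] - x[0] + y[0], c[1] - x[1] + y[1]),
        (c[0] + x[0] + y[0], c[1] + x[1] + y[1]),
        (c[0] + x[0] - y[0], c[1] + x[1] - y[1]),
    ]
    return quad, math.hypot(*x) * 2


if __name__ == "__main__":
    # Made-up landmark positions for an upright face.
    quad, side = ffhq_style_quad((30, 40), (70, 40), (40, 80), (60, 80))
    print(quad, side)
```

The resulting quad can then be passed to an image library's quadrilateral transform to resample the face into a fixed-size square; that resampling step is omitted here.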

Bibtex

If you use this dataset for your research, please cite our paper:

@inproceedings{liu2021blendgan,
    title = {BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation},
    author = {Liu, Mingcong and Li, Qiang and Qin, Zekui and Zhang, Guoxin and Wan, Pengfei and Zheng, Wen},
    booktitle = {Advances in Neural Information Processing Systems},
    year = {2021}
}

Acknowledgements

We sincerely thank all the reviewers for their comments. We also thank Zhenyu Guo for help in preparing the comparison to StarGANv2. The face alignment code borrows heavily from the FFHQ dataset.

Owner

onion
