PyTorch implementation of the paper: AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer, ICCV 2021.

Last update: Dec 30, 2022

AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer

[Paper] [PyTorch Implementation] [Paddle Implementation]

Overview

This repository contains the official PyTorch implementation of the paper:

AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer,

Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Meiling Wang, Xin Li, Zhengxing Sun, Qian Li, Errui Ding

ICCV 2021

Prerequisites

Linux or macOS
Python 3
PyTorch 1.7+ and other dependencies (torchvision, visdom, dominate, and other common Python libraries)
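
The prerequisites above can be checked before running anything. The following is a minimal sketch, not part of the repository: the helper names `missing_modules` and `version_at_least` are hypothetical, and it assumes the import names match the package names listed above.

```python
import importlib.util
import re

# Module names taken from the prerequisites list above.
REQUIRED_MODULES = ["torch", "torchvision", "visdom", "dominate"]


def missing_modules(names):
    """Return the names that the import system cannot locate."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def version_at_least(version, minimum):
    """Compare the leading 'major.minor' of a version string with a (major, minor) tuple."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return (int(match.group(1)), int(match.group(2))) >= tuple(minimum)


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        import torch  # imported late so the lookup above works without it

        ok = version_at_least(torch.__version__, (1, 7))
        print("PyTorch", torch.__version__, "OK" if ok else "is older than 1.7")
```

The version check only looks at the major and minor numbers, so local build suffixes such as `+cu110` are ignored.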

Getting Started

Clone this repository:

git clone https://github.com/Huage001/AdaAttN
cd AdaAttN

Inference:
- Create a directory for checkpoints if it does not already exist:
```
mkdir -p checkpoints
```
- Download the pretrained model from Google Drive, move it to the checkpoints directory, and unzip it:
```
mv [Download Directory]/AdaAttN_model.zip checkpoints/
unzip checkpoints/AdaAttN_model.zip
rm checkpoints/AdaAttN_model.zip
```
- First, set content_path and style_path in test_adaattn.sh to the folders containing the test content images and test style images, respectively.
- Then, simply run:
```
bash test_adaattn.sh
```
- Check the results in the results/AdaAttN folder.
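
Before running the test script, it can help to confirm that the content and style folders actually contain images. The sketch below is illustrative only and is not part of the repository: `list_images` is a hypothetical helper, and the set of accepted file suffixes is an assumption rather than something the repository documents.

```python
from pathlib import Path

# Assumption: these suffixes cover the images in your folders; adjust as needed.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".bmp"}


def list_images(folder):
    """Return the sorted image files directly inside `folder` (no recursion)."""
    root = Path(folder)
    if not root.is_dir():
        raise NotADirectoryError(f"not a directory: {root}")
    return sorted(
        p for p in root.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
```

Calling `list_images` on each of the two folders configured in test_adaattn.sh and checking that neither result is empty catches a mistyped path before the model is loaded.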

Train:
- Download the COCO and WikiArt datasets and extract them.
- Set content_path and style_path in train_adaattn.sh to the folders containing the training content images and training style images, respectively.
- Before training, start the visdom server:
```
python -m visdom.server
```
- Then, simply run:
```
bash train_adaattn.sh
```
- You can monitor training status at http://localhost:8097/; models are saved in the checkpoints/AdaAttN folder.
- Feel free to try the other training options listed in train_adaattn.sh.
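
Training expects the visdom server to be running already, so a quick reachability check can save a failed launch. The following is a minimal sketch, not part of the repository: `port_is_open` is a hypothetical helper, and it only tests that something accepts TCP connections on the port from the monitoring URL above, not that the listener is actually visdom.

```python
import socket


def port_is_open(host, port, timeout=1.0):
    """Return True if a TCP connection to (host, port) succeeds within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # 8097 is the port from the monitoring URL above.
    if port_is_open("localhost", 8097):
        print("visdom server is reachable")
    else:
        print("visdom server is not reachable; start it with: python -m visdom.server")
```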

Citation

If you find the ideas or code useful for your research, please cite:

@inproceedings{liu2021adaattn,
  title={AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer},
  author={Liu, Songhua and Lin, Tianwei and He, Dongliang and Li, Fu and Wang, Meiling and Li, Xin and Sun, Zhengxing and Li, Qian and Ding, Errui},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  year={2021}
}

Acknowledgments

This implementation is based on the code framework of pytorch-CycleGAN-and-pix2pix by Jun-Yan Zhu et al.
