Official PaddlePaddle implementation of Paint Transformer

Last update: Dec 31, 2022

Related tags

Deep Learning PaintTransformer

Overview

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

[Paper] [Paddle Implementation]

Update

We have optimized the serial inference procedure to achieve better rendering quality and faster speed.

Overview

This repository contains the official PaddlePaddle implementation of paper:

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction,

Songhua Liu*, Tianwei Lin*, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang (* indicates equal contribution)

ICCV 2021 (Oral)

Prerequisites

Linux or macOS
Python 3.6+
PaddlePaddle 2.0+ and other dependencies (numpy, cv2, and other common python libs)
```
python -m pip install paddlepaddle-gpu
```

Getting Started

Clone this repository:

git clone https://github.com/wzmsltw/PaintTransformer
cd PaintTransformer

Download pretrained model from Google Drive and move it to inference directory:
```
mv [Download Directory]/paint_best.pdparams inference/
cd inference
```
Inference:
```
python inference.py
```
- Input image path, output path, and etc can be set in the main function.
- Notably, there is a flag serial as one parameter of the main function:
  - If serial is True, strokes would be rendered serially. The consumption of video memory will be low but it requires more time. Serial inference can achieve better rendering quality.
  - If serial is False, strokes would be rendered in parallel. The consumption of video memory will be high but it would be faster.
  - If animated results are required, serial must be True.
Train:
- You can send email to us for the training codes.

More Results

Input	Animated Output

App

Do not want to run the code? Try an App 一刻相册 downloaded from here!

Citation

If you find ideas or codes useful for your research, please cite:

@inproceedings{liu2021paint,
  title={Paint Transformer: Feed Forward Neural Painting with Stroke Prediction},
  author={Liu, Songhua and Lin, Tianwei and He, Dongliang and Li, Fu and Deng, Ruifeng and Li, Xin and Ding, Errui and Wang, Hao},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  year={2021}
}

Contact

For any question, please file an issue or contact

Songhua Liu: s[email protected]
Tianwei Lin: [email protected]

Official PaddlePaddle implementation of Paint Transformer

Related tags

Overview

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

Update

Overview

Prerequisites

Getting Started

More Results

App

Citation

Contact

Owner

TianweiLin

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

A crash course in six episodes for software developers who want to become machine learning practitioners.

The repo for reproducing Seed-driven Document Ranking for Systematic Reviews: A Reproducibility Study

Free Book about Deep-Learning approaches for Chess (like AlphaZero, Leela Chess Zero and Stockfish NNUE)

Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

N-gram models- Unsmoothed, Laplace, Deleted Interpolation

[ACM MM 2021] Yes, "Attention is All You Need", for Exemplar based Colorization

ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with ONNX, TensorRT, ncnn, and OpenVINO supported.

3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans.

Agent-based model simulator for air quality and pandemic risk assessment in architectural spaces

Model-based reinforcement learning in TensorFlow

Pipeline code for Sequential-GAM(Genome Architecture Mapping).

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.

PyTorch implementation of our ICCV 2021 paper Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer.

You Only Look Once for Panopitic Driving Perception

Learning Open-World Object Proposals without Learning to Classify

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Title: Heart-Failure-Classification