PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

Last update: Jul 08, 2022

Overview

Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

[Code] [Data] [Project Page]

Official PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation, published at ICCV 2021.

Have you ever looked at a painting and wondered what the story behind it is? This work presents a framework to bring art closer to people by generating comprehensive descriptions of fine-art paintings. Generating informative descriptions for artworks, however, is extremely challenging, as it requires 1) describing multiple aspects of the image, such as its style, content, or composition, and 2) providing background and contextual knowledge about the artist, their influences, or the historical period. To address these challenges, we introduce a multi-topic and knowledgeable art description framework, which modulates the generated sentences according to three artistic topics and, additionally, enhances each description with external knowledge. The framework is validated through an exhaustive analysis, both quantitative and qualitative, as well as a comparative human evaluation, demonstrating outstanding results in terms of both topic diversity and information veracity.

Setup

Requirements

The code is tested under Python 3.6 with the following packages:

torch==1.1.0
torchvision==0.2.2
numpy==1.16.2
visdom==0.1.8.9
transformers==2.1.1
nltk==3.2.3
stanfordcorenlp==3.9.1.1
scipy==1.3.1
pandas==0.25.1
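
As a quick sanity check before training, the pinned versions above can be compared against what is installed in the current environment. The following is a minimal sketch, not part of the repository: the helper names (`parse_pins`, `installed_version`, `find_mismatches`) are hypothetical, and it assumes the distribution names match the package names in the list.

```python
"""Compare installed package versions against the pins listed above."""

PINNED = """
torch==1.1.0
torchvision==0.2.2
numpy==1.16.2
visdom==0.1.8.9
transformers==2.1.1
nltk==3.2.3
stanfordcorenlp==3.9.1.1
scipy==1.3.1
pandas==0.25.1
"""


def parse_pins(text):
    """Turn 'name==version' lines into a {name: version} dict."""
    pins = {}
    for line in text.strip().splitlines():
        name, _, version = line.strip().partition("==")
        pins[name] = version
    return pins


def installed_version(name):
    """Return the installed version of `name`, or None if it is missing."""
    try:
        from importlib.metadata import PackageNotFoundError, version  # Python 3.8+
    except ImportError:
        # Fallback for Python 3.6/3.7, where importlib.metadata is absent.
        import pkg_resources
        try:
            return pkg_resources.get_distribution(name).version
        except pkg_resources.DistributionNotFound:
            return None
    try:
        return version(name)
    except PackageNotFoundError:
        return None


def find_mismatches(pins, lookup=installed_version):
    """Return {name: (expected, found)} for every package that differs."""
    mismatches = {}
    for name, expected in pins.items():
        found = lookup(name)
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for name, (expected, found) in find_mismatches(parse_pins(PINNED)).items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
```

Run as a script, it prints one line per package whose installed version differs from the pin, and nothing if the environment matches.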

Prepare Data

1. Download the dataset from this repository

2. Put the annotation folder into the MaskedSentenceGeneration directory
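
To confirm the data landed in the right place, a small check like the one below may help. It is only a sketch: the folder name `annotation` is an assumption based on the wording of step 2, and `annotation_dir` and `is_data_ready` are hypothetical helpers, not functions from the repository.

```python
from pathlib import Path


def annotation_dir(repo_root, folder_name="annotation"):
    """Return the expected location of the annotation folder.

    `folder_name` is an assumption; adjust it to the name used in the dataset.
    """
    return Path(repo_root) / "MaskedSentenceGeneration" / folder_name


def is_data_ready(repo_root, folder_name="annotation"):
    """True if the annotation folder exists and contains at least one entry."""
    target = annotation_dir(repo_root, folder_name)
    return target.is_dir() and any(target.iterdir())


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    print("annotation folder found:", is_data_ready("."))
```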

Masked Sentence Generation

cd MaskedSentenceGeneration
python prepare_dataset.py
bash train.sh
bash test_one.sh    # or: bash test_all.sh

Knowledge Retrieval

Please refer to the instructions here

Knowledge Filling

cd KnowledgeFilling
python create_dataset_drqa_src.py
bash train.sh
bash test.sh
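
The commands of the Masked Sentence Generation and Knowledge Filling stages can also be chained from Python so that a failing step stops the run. This is a minimal sketch under stated assumptions: `STAGES`, `run_stage`, and `run_pipeline` are hypothetical names, `test_all.sh` is picked over `test_one.sh` arbitrarily, and the Knowledge Retrieval step, which is documented separately, is not covered and may need to be run between the two stages.

```python
import subprocess

# Stage directories and commands as listed in this README.
STAGES = [
    ("MaskedSentenceGeneration", [
        ["python", "prepare_dataset.py"],
        ["bash", "train.sh"],
        ["bash", "test_all.sh"],
    ]),
    ("KnowledgeFilling", [
        ["python", "create_dataset_drqa_src.py"],
        ["bash", "train.sh"],
        ["bash", "test.sh"],
    ]),
]


def run_stage(directory, commands, runner=subprocess.run):
    """Run each command inside `directory`, stopping at the first failure."""
    for command in commands:
        # With subprocess.run, check=True raises CalledProcessError
        # when a command exits with a non-zero status.
        runner(command, cwd=directory, check=True)


def run_pipeline(stages=STAGES, runner=subprocess.run):
    """Run all stages in the order given."""
    for directory, commands in stages:
        run_stage(directory, commands, runner=runner)
```

Calling `run_pipeline()` from the repository root runs both stages back to back; `run_stage("MaskedSentenceGeneration", STAGES[0][1])` runs only the first one.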

Citation

If you find the data in this repository useful, please cite our paper:

@InProceedings{bai2021explain,
   author    = {Zechen Bai and Yuta Nakashima and Noa Garcia},
   title     = {Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation},
   booktitle = {International Conference on Computer Vision},
   year      = {2021},
}

Owner

Zechen Bai