LSUN Dataset Documentation and Demo Code

Last update: Jan 02, 2023

Related tags

Overview

LSUN

Please check LSUN webpage for more information about the dataset.

Data Release

All the images in one category are stored in one lmdb database file. The value of each entry is the jpg binary data. We resize all the images so that the smaller dimension is 256 and compress the images in jpeg with quality 75.

Citing LSUN

If you find LSUN dataset useful in your research, please consider citing:

@article{yu15lsun,
    Author = {Yu, Fisher and Zhang, Yinda and Song, Shuran and Seff, Ari and Xiao, Jianxiong},
    Title = {LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop},
    Journal = {arXiv preprint arXiv:1506.03365},
    Year = {2015}
}

Download data

Please make sure you have cURL installed

# Download the whole latest data set
python3 download.py
# Download the whole latest data set to <data_dir>
python3 download.py -o <data_dir>
# Download data for bedroom
python3 download.py -c bedroom
# Download testing set
python3 download.py -c test

Demo code

Dependency

Install Python

Install Python dependency: numpy, lmdb, opencv

Usage:

View the lmdb content

python3 data.py view <image db path>

Export the images to a folder

python3 data.py export <image db path> --out_dir <output directory>

Example:

Export all the images in valuation sets in the current folder to a "data" subfolder.

python3 data.py export *_val_lmdb --out_dir data

Submission

We expect one category prediction for each image in the testing set. The name of each image is the key value in the LMDB database. Each category has an index as listed in index list. The submitted results on the testing set will be stored in a text file with one line per image. In each line, there are two fields separated by a whitespace. The first is the image key and the second is the predicted category index. For example:

0001c44e5f5175a7e6358d207660f971d90abaf4 0
000319b73404935eec40ac49d1865ce197b3a553 1
00038e8b13a97577ada8a884702d607220ce6d15 2
00039ba1bf659c30e50b757280efd5eba6fc2fe1 3
...

The score for the submission is the percentage of correctly predicted labels. In our evaluation, we will double check our ground truth labels for the testing images and we may remove some images with controversial labels in the final evaluation.

LSUN Dataset Documentation and Demo Code

Related tags

Overview

LSUN

Data Release

Citing LSUN

Download data

Demo code

Dependency

Usage:

Example:

Submission

Owner

Fisher Yu

Unofficial Implement PU-Transformer

This is a beginner-friendly repo to make a collection of some unique and awesome projects. Everyone in the community can benefit & get inspired by the amazing projects present over here.

Split your patch similarly to `git add -p` but supporting multiple buckets

Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

Implementation for paper "STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement" (ICCV 2021).

Pytorch implementation of NEGEV method. Paper: "Negative Evidence Matters in Interpretable Histology Image Classification".

Molecular Sets (MOSES): A benchmarking platform for molecular generation models

An Unsupervised Graph-based Toolbox for Fraud Detection

A FAIR dataset of TCV experimental results for validating edge/divertor turbulence models.

This is the official implementation of our proposed SwinMR

PyTorch code of "SLAPS: Self-Supervision Improves Structure Learning for Graph Neural Networks"

《Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis》(2021)

Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models (published in ICLR2018)

A new GCN model for Point Cloud Analyse

ICRA 2021 "Towards Precise and Efficient Image Guided Depth Completion"

This tool uses Deep Learning to help you draw and write with your hand and webcam.

Confidence Propagation Cluster aims to replace NMS-based methods as a better box fusion framework in 2D/3D Object detection

Picasso: A CUDA-based Library for Deep Learning over 3D Meshes

Generating synthetic mobility data for a realistic population with RNNs to improve utility and privacy

A generalist algorithm for cell and nucleus segmentation.