A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Last update: Nov 01, 2022

Related tags

Deep Learning imagenet-tools

Overview

This is a set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Make TFRecords

To run the script setup a virtualenv with the following libraries installed.

tensorflow: Install with pip install tensorflow

Once you have all the above libraries setup, you should register on the Imagenet website and download the ImageNet .tar files. It should be extracted and provided in the format:

Training images: train/n03062245/n03062245_4620.JPEG
Validation Images: validation/ILSVRC2012_val_00000001.JPEG

To run the script to preprocess the raw dataset as TFRecords, run the following command:

python3 make_tfrecords.py \
  --raw_data_dir="path/to/imagenet" \
  --local_scratch_dir="path/to/output"

Note that the label is from 1 to 1000.

Make index files

To run the script setup a virtualenv with the following libraries installed.

nvidia.dali: See documentation

python3 make_idx.py --tfrecord_root="path/to/tfrecords"

Build subset of Imagenet-1K

This can help you build a subset of Imagenet-1K (TFRecord format):

python3 build_subset.py "path/to/tfrecords" "output_dir" \
  --train_num_shards=128 \
  --valid_num_shards=16 \
  --num_classes=100

Classes are selected randomly.

DALI dataloader

We also provide a DALI dataloader which can read the processed dataset. The dataloader is equipped with Mixup.

Here is an simple example to construct it:

import glob
import os


def build_dali_train(root):
    train_pat = os.path.join(root, 'train/*')
    train_idx_pat = os.path.join(root, 'idx_files/train/*')
    return DaliDataloader(
        sorted(glob.glob(train_pat)),
        sorted(glob.glob(train_idx_pat)),
        batch_size=BATCH_SIZE,
        shard_id=SHARD_ID,
        num_shards=NUM_SHARDS,
        training=True,
        gpu_aug=True,
        cuda=True,
        mixup_alpha=0.0,
        num_threads=16,
    )

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Related tags

Overview

Overview

Make TFRecords

Make index files

Build subset of Imagenet-1K

DALI dataloader

Owner

This repository contains the code for the paper in EMNLP 2021: "HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression".

Simple-Image-Classification - Simple Image Classification Code (PyTorch)

Styled Augmented Translation

Roach: End-to-End Urban Driving by Imitating a Reinforcement Learning Coach

Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

Kinetics-Data-Preprocessing

Tracking code for the winner of track 1 in the MMP-Tracking Challenge at ICCV 2021 Workshop.

HNN: Human (Hollywood) Neural Network

Learning to Initialize Neural Networks for Stable and Efficient Training

A clear, concise, simple yet powerful and efficient API for deep learning.

Anomaly Localization in Model Gradients Under Backdoor Attacks Against Federated Learning

Example repository for custom C++/CUDA operators for TorchScript

Contrastively Disentangled Sequential Variational Audoencoder

This is our ARTS test set, an enriched test set to probe Aspect Robustness of ABSA.

Rethinking Transformer-based Set Prediction for Object Detection

Simple reimplemetation experiments about FcaNet

INSPIRED: A Transparent Dialogue Dataset for Interactive Semantic Parsing

Springer Link Download Module for Python

A spatial genome aligner for analyzing multiplexed DNA-FISH imaging data.

QA-GNN: Question Answering using Language Models and Knowledge Graphs