Active Learning at the ImageNet Scale

This repo contains code for the paper Active Learning at the ImageNet Scale by Zeyad Emam*, Hong-Min Chu*, Ping-Yeh Chiang*, Wojtek Czaja, Richard Leapman, Micah Goldblum, and Tom Goldstein.

Requirements

pip install -r requirements.txt

Comet and Logging

This project uses Comet ML to log experiments. You must install comet_ml (included in requirements.txt), but the code does not require you to have a Comet ML account or to enable Comet logging at all. If you choose to use Comet ML, put your API key in ~/.comet.config in your home directory (see the Comet ML documentation for details). To turn on Comet logging, pass the flag --enable_comet.
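As a minimal sketch of the config step above, the snippet below writes an API key to a Comet config file with Python's standard configparser. It assumes the [comet] section with an api_key entry described in the Comet ML documentation. The helper name write_comet_config and the placeholder key are hypothetical and not part of this repo.

```python
import configparser
from pathlib import Path


def write_comet_config(api_key, config_path=None):
    """Write a minimal Comet config file and return its path.

    Hypothetical helper for illustration; not part of this repo.
    Defaults to ~/.comet.config when no path is given.
    """
    path = Path(config_path) if config_path else Path.home() / ".comet.config"
    config = configparser.ConfigParser()
    # Assumed layout: a [comet] section holding the api_key entry.
    config["comet"] = {"api_key": api_key}
    with open(path, "w") as f:
        config.write(f)
    return path


# Example (overwrites any existing ~/.comet.config):
# write_comet_config("YOUR_API_KEY")
```

After the file is in place, adding --enable_comet to a run should be enough for the experiment to be logged.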

Logs and network weights are stored according to the command line arguments --log_dir and --ckpt_path.

Loading SSP checkpoints

Self-supervised pretrained checkpoints must be obtained separately and specified in ./src/arg_pools for each argpool, under the key "init_pretrained_ckpt_path". To access the checkpoints used in our experiments, please use the following links:
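Before launching a long run, it can help to confirm that every arg pool points at a checkpoint that actually exists. The sketch below assumes each arg pool is a plain dictionary keyed by name. That layout, the helper find_missing_checkpoints, and the example path are illustrative assumptions; check the files in ./src/arg_pools for the real structure.

```python
import os

# Hypothetical arg pool entry; the real definitions live in ./src/arg_pools.
example_arg_pools = {
    "ssp_linear_evaluation": {
        "init_pretrained_ckpt_path": "/path/to/ssp_checkpoint.pth",
    },
}


def find_missing_checkpoints(arg_pools):
    """Return the names of arg pools whose checkpoint path is unset or not a file."""
    missing = []
    for name, pool in arg_pools.items():
        ckpt = pool.get("init_pretrained_ckpt_path")
        if not ckpt or not os.path.isfile(ckpt):
            missing.append(name)
    return missing
```

An empty list from find_missing_checkpoints means every listed arg pool has a checkpoint file on disk.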

Sample Commands to Reproduce the Results in the Paper

Each ImageNet experiment was conducted on a cluster node with a single V100-SXM2 GPU (32 GB VRAM), 64 GB of RAM, and 16 2.3 GHz Intel Gold 6140 CPUs. If more than one GPU is available on the node, the code automatically distributes batches across all GPUs using DistributedDataParallel training.

Below is a sample command for running an experiment. The full list of command line arguments can be found in src/utils/parser.py.

python main_al.py --dataset_dir --exp_name RandomSampler_arg_ssp_linear_evaluation_imagenet_b10000 --dataset imagenet --arg_pool ssp_linear_evaluation --model SSLResNet50 --strategy RandomSampler --rounds 8 --round_budget 10000 --init_pool_size 30000 --subset_labeled 50000 --subset_unlabeled 80000 --freeze_feature --partitions 10 --init_pool_type random
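When launching variations of this command, assembling it programmatically avoids quoting and naming mistakes. The sketch below rebuilds the sample command from its parts using only the flags shown above. The function build_command and the experiment-name pattern (strategy, arg pool, dataset, and round budget joined as in the sample name) are assumptions inferred from that one example, not something the repo documents.

```python
import shlex


def build_command(dataset_dir, strategy="RandomSampler", round_budget=10000):
    """Assemble the sample main_al.py command as a shell-safe string.

    Hypothetical helper; flag values other than dataset_dir, strategy, and
    round_budget are copied from the sample command.
    """
    arg_pool = "ssp_linear_evaluation"
    dataset = "imagenet"
    # Assumed naming pattern, inferred from the sample experiment name.
    exp_name = f"{strategy}_arg_{arg_pool}_{dataset}_b{round_budget}"
    args = [
        "python", "main_al.py",
        "--dataset_dir", dataset_dir,
        "--exp_name", exp_name,
        "--dataset", dataset,
        "--arg_pool", arg_pool,
        "--model", "SSLResNet50",
        "--strategy", strategy,
        "--rounds", "8",
        "--round_budget", str(round_budget),
        "--init_pool_size", "30000",
        "--subset_labeled", "50000",
        "--subset_unlabeled", "80000",
        "--freeze_feature",
        "--partitions", "10",
        "--init_pool_type", "random",
    ]
    # shlex.join quotes any argument that contains spaces or shell metacharacters.
    return shlex.join(args)
```

Calling build_command with the path to your local ImageNet copy returns a string that can be pasted into a shell or a job script.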

The full list of commands to reproduce all plots in the paper can be obtained by running python src/gen_jobs.py.


