Large-Scale Unsupervised Object Discovery

Last update: Sep 19, 2022

Huy V. Vo, Elena Sizikova, Cordelia Schmid, Patrick Pérez, Jean Ponce [PDF]

We propose a novel ranking-based large-scale unsupervised object discovery algorithm that scales up to 1.7M images.
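
As a rough illustration of what "ranking-based" can mean in general, the sketch below runs a generic power-iteration PageRank over a tiny, made-up similarity graph of region proposals. It is not the code from this repository: the function name `pagerank_scores`, the `weights` matrix and the damping value are all assumptions chosen for the example, and the actual LOD formulation should be taken from the paper.

```python
def pagerank_scores(weights, damping=0.85, iters=100, tol=1e-10):
    """Rank nodes of a weighted graph with generic PageRank (power iteration).

    weights[i][j] is a hypothetical non-negative similarity between region
    proposals i and j. It is an illustrative input, not a quantity taken
    from the LOD code.
    """
    n = len(weights)
    # Total outgoing weight of each node j (column sums).
    col_sums = [sum(weights[i][j] for i in range(n)) for j in range(n)]
    scores = [1.0 / n] * n
    for _ in range(iters):
        # Score held by nodes without any outgoing weight is spread uniformly.
        dangling = sum(scores[j] for j in range(n) if col_sums[j] == 0)
        new_scores = []
        for i in range(n):
            link = sum(
                weights[i][j] / col_sums[j] * scores[j]
                for j in range(n)
                if col_sums[j] > 0
            )
            new_scores.append((1 - damping) / n + damping * (link + dangling / n))
        delta = sum(abs(a - b) for a, b in zip(new_scores, scores))
        scores = new_scores
        if delta < tol:
            break
    return scores


# Toy graph (assumed data): proposals 0, 1 and 2 are mutually similar,
# proposal 3 is only weakly linked to proposal 2.
weights = [
    [0.0, 1.0, 1.0, 0.0],
    [1.0, 0.0, 1.0, 0.0],
    [1.0, 1.0, 0.0, 0.1],
    [0.0, 0.0, 0.1, 0.0],
]
scores = pagerank_scores(weights)
# Proposal indices sorted from highest to lowest score.
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
print(ranking)
```

In this toy graph the well-connected proposals end up ahead of the weakly connected one, which is the intuition behind picking the top-ranked regions as object candidates.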

This repository contains code used in the paper.

Quantitative Results

Installation

Follow INSTALL.md and DATA.md to install LOD and prepare data for running it.

Run LOD on a small toy dataset

Follow GETTING_STARTED_small_dataset.md to run LOD with VGG16 features on a small subset of 60 images from the Pascal VOC2007 dataset.

Getting Started

Follow GETTING_STARTED.md to run LOD with VGG16 features, or GETTING_STARTED_OBOW.md to run it with VGG16-based OBoW features, on the C20K dataset.

Citations

@inproceedings{Vo21LOD,
  title     = {Large-Scale Unsupervised Object Discovery},
  author    = {Vo, Huy V. and Sizikova, Elena and Schmid,
               Cordelia and P{\'e}rez, Patrick and Ponce, Jean},
  booktitle = {Advances in Neural Information Processing Systems 34 (NeurIPS 2021)},
  year      = {2021},
}

Acknowledgments

This work was supported in part by the Inria/NYU collaboration, the Louis Vuitton/ENS chair on artificial intelligence and the French government under the management of Agence Nationale de la Recherche as part of the “Investissements d’avenir” program, reference ANR-19-P3IA-0001 (PRAIRIE 3IA Institute). Elena Sizikova was supported by the Moore-Sloan Data Science Environment initiative (funded by the Alfred P. Sloan Foundation and the Gordon and Betty Moore Foundation) through the NYU Center for Data Science. Huy V. Vo was supported in part by a Valeo/Prairie CIFRE PhD Fellowship.

License

This project is licensed under the MIT License - see the LICENSE file for details.
