Generalized Category Discovery

This repo is a placeholder for code for our paper: Generalized Category Discovery

Abstract: In this paper, we consider a highly general image recognition setting wherein, given a labelled and an unlabelled set of images, the task is to categorize all images in the unlabelled set. Here, the unlabelled images may come from labelled classes or from novel ones. Existing recognition methods cannot handle this setting because they make several restrictive assumptions, such as the unlabelled instances coming only from known classes or only from unknown classes, and the number of unknown classes being known a priori. We address the more unconstrained setting, naming it 'Generalized Category Discovery', and challenge all these assumptions. We first establish strong baselines by taking state-of-the-art algorithms from novel category discovery and adapting them for this task. Next, we propose the use of vision transformers with contrastive representation learning for this open-world setting. We then introduce a simple yet effective semi-supervised k-means method to cluster the unlabelled data into seen and unseen classes automatically, substantially outperforming the baselines. Finally, we also propose a new approach to estimate the number of classes in the unlabelled data. We thoroughly evaluate our approach on public datasets for generic object classification, including CIFAR10, CIFAR100 and ImageNet-100, and for fine-grained visual recognition, including CUB, Stanford Cars and Herbarium19, benchmarking on this new setting to foster future research.
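The official code is not released yet, so the following is only an illustrative sketch of the general idea behind a semi-supervised k-means step, not the authors' implementation. It assumes feature vectors have already been extracted, that every known class has at least one labelled example, and that the total number of clusters `k` is given (the paper estimates it separately). The function name `semi_supervised_kmeans`, its parameters, and the k-means++-style seeding of the extra centroids are assumptions made for illustration. Labelled points always stay in their ground-truth cluster; only unlabelled points are reassigned.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def semi_supervised_kmeans(labelled, labels, unlabelled, k, n_iter=50, seed=0):
    """Hypothetical sketch of constrained k-means.

    labelled   -- feature vectors with known classes
    labels     -- class ids 0..n_known-1, one per labelled vector
    unlabelled -- feature vectors from known or novel classes
    k          -- total number of clusters (k >= number of known classes)
    Returns (cluster id per unlabelled vector, list of centroids).
    """
    rng = random.Random(seed)
    n_known = max(labels) + 1

    # Known-class centroids start at the mean of their labelled examples.
    centroids = [
        mean([p for p, y in zip(labelled, labels) if y == c])
        for c in range(n_known)
    ]

    # Extra centroids for novel classes: k-means++-style seeding, sampling
    # unlabelled points with probability proportional to squared distance
    # from the nearest existing centroid.
    while len(centroids) < k:
        weights = [min(sq_dist(p, c) for c in centroids) for p in unlabelled]
        if sum(weights) == 0:
            weights = [1.0] * len(unlabelled)  # degenerate case: go uniform
        centroids.append(list(rng.choices(unlabelled, weights=weights, k=1)[0]))

    assign = None
    for _ in range(n_iter):
        # Assignment step: only unlabelled points may change cluster.
        new_assign = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            for p in unlabelled
        ]
        if new_assign == assign:
            break  # converged
        assign = new_assign

        # Update step: labelled points are pinned to their own class cluster.
        for c in range(k):
            members = [p for p, y in zip(labelled, labels) if y == c]
            members += [p for p, a in zip(unlabelled, assign) if a == c]
            if members:
                centroids[c] = mean(members)

    return assign, centroids
```

Cluster ids below the number of known classes correspond to labelled classes, and the remaining ids are candidate novel categories. Pinning the labelled points is what distinguishes this from plain k-means, where every point would be free to move.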

Code Coming Soon!
