Generalized Category Discovery

This repo is a placeholder for code for our paper: Generalized Category Discovery

Abstract: In this paper, we consider a highly general image recognition setting wherein, given a labelled and unlabelled set of images, the task is to categorize all images in the unlabelled set. Here, the unlabelled images may come from labelled classes or from novel ones. Existing recognition methods are not able to deal with this setting, because they make several restrictive assumptions, such as the unlabelled instances only coming from known --- or unknown --- classes and the number of unknown classes being known a-priori. We address the more unconstrained setting, naming it `Generalized Category Discovery', and challenge all these assumptions. We first establish strong baselines by taking state-of-the-art algorithms from novel category discovery and adapting them for this task. Next, we propose the use of vision transformers with contrastive representation learning for this open world setting. We then introduce a simple yet effective semi-supervised $k$-means method to cluster the unlabelled data into seen and unseen classes automatically, substantially outperforming the baselines. Finally, we also propose a new approach to estimate the number of classes in the unlabelled data. We thoroughly evaluate our approach on public datasets for generic object classification including CIFAR10, CIFAR100 and ImageNet-100, and for fine-grained visual recognition including CUB, Stanford Cars and Herbarium19, benchmarking on this new setting to foster future research.

Code for our paper 'Generalized Category Discovery'

Related tags

Overview

Generalized Category Discovery

Code Coming Soon!

Owner

Large scale PTM - PPI relation extraction

DrNAS: Dirichlet Neural Architecture Search

a short visualisation script for pyvideo data

TensorFlow Implementation of Unsupervised Cross-Domain Image Generation

CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation

PyTorch implementation of 1712.06087 "Zero-Shot" Super-Resolution using Deep Internal Learning

RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

Pre-trained model, code, and materials from the paper "Impact of Adversarial Examples on Deep Learning Models for Biomedical Image Segmentation" (MICCAI 2019).

ShapeGlot: Learning Language for Shape Differentiation

This is an official implementation of the High-Resolution Transformer for Dense Prediction.

PyKaldi GOP-DNN on Epa-DB

(ICCV 2021) ProHMR - Probabilistic Modeling for Human Mesh Recovery

This is a custom made virus code in python, using tkinter module.

A PyTorch implementation for Unsupervised Domain Adaptation by Backpropagation(DANN), support Office-31 and Office-Home dataset

novel deep learning research works with PaddlePaddle

Face Mask Detection system based on computer vision and deep learning using OpenCV and Tensorflow/Keras

Mail classification with tensorflow and MS Exchange Server (ham or spam).

pixelNeRF: Neural Radiance Fields from One or Few Images

Tensorflow 2.x implementation of Panoramic BlitzNet for object detection and semantic segmentation on indoor panoramic images.

Code for the paper "Controllable Video Captioning with an Exemplar Sentence"