[TPAMI 2021] iOD: Incremental Object Detection via Meta-Learning

Last update: Jan 04, 2023

Overview

Incremental Object Detection via Meta-Learning

To appear in an upcoming issue of the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

arXiv paper: https://arxiv.org/abs/2003.08798

Abstract

In a real-world setting, object instances from new classes can be continuously encountered by object detectors. When existing object detectors are applied to such scenarios, their performance on old classes deteriorates significantly. A few efforts have been reported to address this limitation, all of which apply variants of knowledge distillation to avoid catastrophic forgetting.

We note that although distillation helps to retain previous learning, it obstructs fast adaptability to new tasks, which is a critical requirement for incremental learning. In this pursuit, we propose a meta-learning approach that learns to reshape model gradients, such that information across incremental tasks is optimally shared. This ensures a seamless information transfer via a meta-learned gradient preconditioning that minimizes forgetting and maximizes knowledge transfer. In comparison to existing meta-learning methods, our approach is task-agnostic, allows incremental addition of new-classes and scales to high-capacity models for object detection.

We evaluate our approach on a variety of incremental learning settings defined on PASCAL-VOC and MS COCO datasets, where our approach performs favourably well against state-of-the-art methods.

Installation and setup

Install the Detectron2 library that is packages along with this code base. See INSTALL.md.
Download and extract Pascal VOC 2007 to ./datasets/VOC2007/
Use the starter script: run.sh

Trained Models and Logs

Setting	Reported mAP	Reproduced mAP	Commands	Models and logs
19+1	70.2	70.4	run.sh	Google Drive
15+5	67.8	69.6	run.sh	Google Drive
10+10	66.3	67.3	run.sh	Google Drive

Configurations with which the above results were reproduced:

Python version: 3.6.7
PyTorch version: 1.3.0
CUDA version: 11.0
GPUs: 4 x NVIDIA GTX 1080-ti

Acknowledgement

The code is build on top of Detectron2 library.

Citation

If you find our research useful, please consider citing us:

@article{joseph2021incremental,
  title={Incremental object detection via meta-learning},
  author={Joseph, KJ and Rajasegaran, Jathushan and Khan, Salman and Khan, Fahad Shahbaz and Balasubramanian, Vineeth},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2021}
}

[TPAMI 2021] iOD: Incremental Object Detection via Meta-Learning

Related tags

Overview

Incremental Object Detection via Meta-Learning

To appear in an upcoming issue of the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Abstract

Installation and setup

Trained Models and Logs

Configurations with which the above results were reproduced:

Acknowledgement

Citation

Owner

Joseph K J

Code for the paper "Query Embedding on Hyper-relational Knowledge Graphs"

Official PyTorch Implementation of Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity

Comp445 project - Data Communications & Computer Networks

Code for the SIGGRAPH 2022 paper "DeltaConv: Anisotropic Operators for Geometric Deep Learning on Point Clouds."

This repository contains the source code of our work on designing efficient CNNs for computer vision

Stacked Generative Adversarial Networks

Code for CVPR2021 paper "Robust Reflection Removal with Reflection-free Flash-only Cues"

3D-aware GANs based on NeRF (arXiv).

Camera ready code repo for the NeuRIPS 2021 paper: "Impression learning: Online representation learning with synaptic plasticity".

This repo is about implementing different approaches of pose estimation and also is a sub-task of the smart hospital bed project :smile:

PyZebrascope - an open-source Python platform for brain-wide neural activity imaging in behaving zebrafish

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

This repository contains the scripts for downloading and validating scripts for the documents

A CNN implementation using only numpy. Supports multidimensional images, stride, etc.

Extremely easy multi instancing software for minecraft speedrunning.

A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).

Visualizer using audio and semantic analysis to explore BigGAN (Brock et al., 2018) latent space.

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

Fast Differentiable Matrix Sqrt Root

Constructing Neural Network-Based Models for Simulating Dynamical Systems