PartImageNet: A Large, High-Quality Dataset of Parts

We will release our dataset and scripts soon after cleaning and approval.

Introduction

PartImageNet is a large, high-quality dataset with part segmentation annotations. It consists of 158 classes from ImageNet with approximately 24,000 images. The classes are grouped into 11 super-categories, and the part splits are designed per super-category as shown below. The number in brackets after each category name is the total number of classes in that category.

Category	Annotated Parts
Quadruped (46)	Head, Body, Foot, Tail
Biped (17)	Head, Body, Hand, Foot, Tail
Fish (10)	Head, Body, Fin, Tail
Bird (14)	Head, Body, Wing, Foot, Tail
Snake (15)	Head, Body
Reptile (20)	Head, Body, Foot, Tail
Car (23)	Body, Tire, Side Mirror
Bicycle (6)	Head, Body, Seat, Tire
Boat (4)	Body, Sail
Aeroplane (2)	Head, Body, Wing, Engine, Tail
Bottle (5)	Body, Mouth
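
As a rough illustration of how the table above could be used in code, the sketch below encodes the super-category to part mapping as a dictionary and assigns each (super-category, part) pair an integer label. The names `SUPER_CATEGORY_PARTS` and `build_part_label_map` and the label ordering are assumptions made for this example; the released annotations may use different identifiers and a different ordering.

```python
# Super-category -> annotated parts, transcribed from the table above.
# The dictionary name and the label ordering are illustrative assumptions,
# not the official PartImageNet annotation format.
SUPER_CATEGORY_PARTS = {
    "Quadruped": ["Head", "Body", "Foot", "Tail"],
    "Biped": ["Head", "Body", "Hand", "Foot", "Tail"],
    "Fish": ["Head", "Body", "Fin", "Tail"],
    "Bird": ["Head", "Body", "Wing", "Foot", "Tail"],
    "Snake": ["Head", "Body"],
    "Reptile": ["Head", "Body", "Foot", "Tail"],
    "Car": ["Body", "Tire", "Side Mirror"],
    "Bicycle": ["Head", "Body", "Seat", "Tire"],
    "Boat": ["Body", "Sail"],
    "Aeroplane": ["Head", "Body", "Wing", "Engine", "Tail"],
    "Bottle": ["Body", "Mouth"],
}


def build_part_label_map(parts_by_category):
    """Assign a consecutive integer label to every (super-category, part) pair."""
    label_map = {}
    for category, parts in parts_by_category.items():
        for part in parts:
            label_map[(category, part)] = len(label_map)
    return label_map


PART_LABELS = build_part_label_map(SUPER_CATEGORY_PARTS)
print(f"{len(SUPER_CATEGORY_PARTS)} super-categories, {len(PART_LABELS)} part labels")
```

Under this scheme the 11 super-categories yield 40 distinct part labels, since a part name such as Head is counted separately for each super-category.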

The statistics of the train/val/test split are shown below.

Split	Number of classes	Number of images
Train	109	16540
Val	19	2957
Test	30	4598
Total	158	24095
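
The split table can be checked with a few lines of arithmetic. The sketch below is only a sanity check on the numbers listed above; the `SPLITS` dictionary is a name introduced for this example. Because the per-split class counts add up to the full 158 classes, the table appears to describe a split over classes in which no class is shared between splits.

```python
# (number of classes, number of images) per split, copied from the table above.
SPLITS = {
    "train": (109, 16540),
    "val": (19, 2957),
    "test": (30, 4598),
}

total_classes = sum(n_classes for n_classes, _ in SPLITS.values())
total_images = sum(n_images for _, n_images in SPLITS.values())

# Report each split's share of the images.
for name, (n_classes, n_images) in SPLITS.items():
    share = n_images / total_images
    print(f"{name}: {n_classes} classes, {n_images} images ({share:.1%})")

print(f"total: {total_classes} classes, {total_images} images")
```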

For more detailed statistics, please check out our paper.

Possible Usage

PartImageNet has broad potential and can benefit numerous research fields; in the paper we explore its use only in Part Discovery, Few-shot Learning, and Semantic Segmentation. We hope that by proposing PartImageNet we can attract more attention to part-based models and inspire more interesting work. We will release our implementation later as well.

Owner

Ju He
