Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Last update: Apr 04, 2022

Related tags

Deep Learning FSAC

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

torch >= 1.0

torchvision >= 0.2.0

Python 3

Environmental settings

This repository is developed using python 3.6.12 on Ubuntu 16.04.5 LTS. The CUDA and pytorch version is 11.2 and 1.7.1. We use one NVIDIA 3090 GPU card for training and testing.

Dataset

PASCAL VOC, Watercolor, Cityscapes, Foggycityscapes -> Please follow the instructions in [Link] to prepare the datasets.

Daytime-Sunny, Dusk-Rainy, and Night-Rainy -> Dataset preparation instruction link [Link].

Code

Faster R-CNN -> Thanks for jwyang [Link]; Fourier Domain Adaptation -> Thanks for Yanchao Yang [Link].

Our Augmentation (Mix+Replace+Extend+Disorder).

Train

To train a faster R-CNN model with vgg16 on pascal_voc:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py --dataset pascal_voc --net vgg16 --bs 1 --cuda

And you need to add augmentated data in the loadpath by creating a new dataset_name variable.

Test

To test:

python test_net.py --dataset pascal_voc --net vgg16 --modelpath your modelpath --cuda

Augmentation

Daytime-Sunny -> Dusk-Rainy

Daytime-Sunny -> Night-Rainy

Result

Results on adaptation from Cityscapes to FoggyCityscapes. ‘prsn’, ‘mcycl’, and ‘bcycl’ separately denote ‘person’, ‘motorcycle’, and ‘bicycle’ category.

Results on adaptation from Daytime-sunny to Duskrainy. Here, we directly run the released codes of the compared methods to obtain the results.

Results on Daytime-sunny → Night-rainy.

Results on the compound target domain.

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Related tags

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

Environmental settings

Dataset

Code

Train

Test

Augmentation

Result

Owner

Official implementation of "Implicit Neural Representations with Periodic Activation Functions"

(ICCV 2021 Oral) Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation.

A tool for making map images from OpenTTD save games

Portfolio asset allocation strategies: from Markowitz to RNNs

Self-Supervised Monocular DepthEstimation with Internal Feature Fusion(arXiv), BMVC2021

3.8% and 18.3% on CIFAR-10 and CIFAR-100

Lightweight mmm - Lightweight (Bayesian) Media Mix Model

Adversarial Reweighting for Partial Domain Adaptation

A lightweight python AUTOmatic-arRAY library.

A playable implementation of Fully Convolutional Networks with Keras.

The project page of paper: Architecture disentanglement for deep neural networks [ICCV 2021, oral]

[CVPR 2022 Oral] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Point Cloud Registration using Representative Overlapping Points.

Generative Adversarial Networks(GANs)

prior-based-losses-for-medical-image-segmentation

Use evolutionary algorithms instead of gridsearch in scikit-learn

AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

ScriptProfilerPy - Module to visualize where your python script is slow