Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Last update: Apr 04, 2022

Related tags

Deep Learning FSAC

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

torch >= 1.0

torchvision >= 0.2.0

Python 3

Environmental settings

This repository is developed using python 3.6.12 on Ubuntu 16.04.5 LTS. The CUDA and pytorch version is 11.2 and 1.7.1. We use one NVIDIA 3090 GPU card for training and testing.

Dataset

PASCAL VOC, Watercolor, Cityscapes, Foggycityscapes -> Please follow the instructions in [Link] to prepare the datasets.

Daytime-Sunny, Dusk-Rainy, and Night-Rainy -> Dataset preparation instruction link [Link].

Code

Faster R-CNN -> Thanks for jwyang [Link]; Fourier Domain Adaptation -> Thanks for Yanchao Yang [Link].

Our Augmentation (Mix+Replace+Extend+Disorder).

Train

To train a faster R-CNN model with vgg16 on pascal_voc:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py --dataset pascal_voc --net vgg16 --bs 1 --cuda

And you need to add augmentated data in the loadpath by creating a new dataset_name variable.

Test

To test:

python test_net.py --dataset pascal_voc --net vgg16 --modelpath your modelpath --cuda

Augmentation

Daytime-Sunny -> Dusk-Rainy

Daytime-Sunny -> Night-Rainy

Result

Results on adaptation from Cityscapes to FoggyCityscapes. ‘prsn’, ‘mcycl’, and ‘bcycl’ separately denote ‘person’, ‘motorcycle’, and ‘bicycle’ category.

Results on adaptation from Daytime-sunny to Duskrainy. Here, we directly run the released codes of the compared methods to obtain the results.

Results on Daytime-sunny → Night-rainy.

Results on the compound target domain.

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Related tags

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

Environmental settings

Dataset

Code

Train

Test

Augmentation

Result

Owner

SegNet including indices pooling for Semantic Segmentation with tensorflow and keras

Repo for "Event-Stream Representation for Human Gaits Identification Using Deep Neural Networks"

Instant Real-Time Example-Based Style Transfer to Facial Videos

TensorFlow-LiveLessons - "Deep Learning with TensorFlow" LiveLessons

Code for the upcoming CVPR 2021 paper

ICLR 2021, Fair Mixup: Fairness via Interpolation

Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos

Robot Servers and Server Manager software for robo-gym

YOLOX Win10 Project

PyStan, a Python interface to Stan, a platform for statistical modeling. Documentation: https://pystan.readthedocs.io

Code for the paper "Multi-task problems are not multi-objective"

Understanding Hyperdimensional Computing for Parallel Single-Pass Learning

pip install python-office

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper

Network Enhancement implementation in pytorch

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

A Joint Video and Image Encoder for End-to-End Retrieval

Supervised multi-SNE (S-multi-SNE): Multi-view visualisation and classification

Moer Grounded Image Captioning by Distilling Image-Text Matching Model