[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Last update: Dec 12, 2022

Related tags

Overview

Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Code for Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion. To acquire dataset, please contact [email protected].

Introduction

We proposed a unified network called CorrFusionNet for scene change detection. The proposed CorrFusionNet firstly extracts the features of the bi-temporal inputs with deep convolutional networks. Then the extracted features will be projected into a lower dimension space to computed the instance level canonical correlation. The cross-temporal fusion will be performed based on the computed correlation in the CorrFusion module. The final scene classification and scene change results are obtained with softmax activation layers. In the objective function, we introduced a new formulation for calculating the temporal correlation. The visual results and quantitative assessments both demonstrated that our proposed CorrFusionNet could outperform other scene change detection methods and some state-of-the-art methods for image classification.

CorrFusion Module

The proposed CorrFusion module:

The proposed CorrFusionNet:

Requirements

scipy==1.1.0
matplotlib==3.0.3
h5py==2.8.0
numpy==1.16.3
tensorflow_gpu==1.8.0
Pillow==6.2.1
scikit_learn==0.21.3

Data

Overview of our Wuhan dataset

The images are stored in npz format.

├─trn
│      0-5000.npz
│      10000-15000.npz
│      15000-16488.npz
│      5000-10000.npz
│
├─tst
│      0-4712.npz
│
└─val
       0-2355.npz

Usage

Install the requirements

pip install -r requirements.txt

Run the training code

python train_cnn.py [-h] [-g GPU] [-b BATCH_SIZE] [-e EPOCHES]
                    [-n NUM_CLASSES] [-tb USE_TFBOARD] [-sm SAVE_MODEL]
                    [-log SAVE_LOG] [-trn TRN_DIR] [-tst TST_DIR]
                    [-val VAL_DIR] [-lpath LOG_PATH] [-mpath MODEL_PATH]
                    [-tbpath TB_PATH] [-rpath RESULT_PATH]

(see parser.py)

Evaluate on a trained model:

Download a trained model here.
Evaluation

python evaluate_model.py [-h] [-g GPU] [-m MODEL_DIR] [-tst TST_DIR]
                         [-val VAL_DIR]

optional arguments:
  -h, --help            show this help message and exit
  -g GPU, --gpu GPU     gpu device ID
  -m MODEL_DIR, --model_dir MODEL_DIR
                        model directory
  -tst TST_DIR, --tst_dir TST_DIR
                        testing file dir
  -val VAL_DIR, --val_dir VAL_DIR
                        validation file dir

Results

The results of quantitative assessments:

Predictions on our dataset:

Contact

For any questions, you're welcomed to contact Lixiang Ru.

[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Related tags

Overview

Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Introduction

CorrFusion Module

Requirements

Data

Usage

Install the requirements

Run the training code

Evaluate on a trained model:

Results

Contact

Owner

Lixiang Ru

TensorFlow implementation of ENet

Full body anonymization - Realistic Full-Body Anonymization with Surface-Guided GANs

General Vision Benchmark, a project from OpenGVLab

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

📚 Papermill is a tool for parameterizing, executing, and analyzing Jupyter Notebooks.

MGFN: Multi-Graph Fusion Networks for Urban Region Embedding was accepted by IJCAI-2022.

AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)

Data and Code for paper Outlining and Filling: Hierarchical Query Graph Generation for Answering Complex Questions over Knowledge Graph is available for research purposes.

Notebooks for my "Deep Learning with TensorFlow 2 and Keras" course

Combining Diverse Feature Priors

Language-Driven Semantic Segmentation

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

PyTorch implementation of paper "StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement" (ICCV 2021 Oral)

Implementation of Pix2Seq in PyTorch

Latent Network Models to Account for Noisy, Multiply-Reported Social Network Data

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

CS583: Deep Learning

Baseline of DCASE 2020 task 4

STMTrack: Template-free Visual Tracking with Space-time Memory Networks