From this paper "SESNet: A Semantically Enhanced Siamese Network for Remote Sensing Change Detection"

Last update: May 24, 2022

Related tags

Overview

SESNet for remote sensing image change detection

It is the implementation of the paper: "SESNet: A Semantically Enhanced Siamese Network for Remote Sensing Change Detection". Here, we provide the pytorch implementation of this paper.

Prerequisites

windows or Linux
PyTorch-1.4.0
Python 3.6
CPU or NVIDIA GPU

Training

You can run a demo to start training.

python train.py

The network with the highest F1 score in the validation set will be saved in the folder tmp.

testing

You can run a demo to start testing.

python test.py

The F1_score, precision, recall, IoU and OA are displayed in order. Of course, you can slightly modify the code in the test.py file to save the confusion matrix.

Prepare Datasets

download the change detection dataset

SVCD is from the paper CHANGE DETECTION IN REMOTE SENSING IMAGES USING CONDITIONAL ADVERSARIAL NETWORKS, You could download the dataset at https://drive.google.com/file/d/1GX656JqqOyBi_Ef0w65kDGVto-nHrNs9;

LEVIR-CD is from the paper A Spatial-Temporal Attention-Based Method and a New Dataset for Remote Sensing Image Change Detection, You could download the dataset at https://justchenhao.github.io/LEVIR/;

Take SVCD as an example, the path list in the downloaded folder is as follows:

├SVCD:
├  ├─train
├  │  ├─A
├  │  ├─B
├  │  ├─OUT
├  ├─val
├  │  ├─A
├  │  ├─B
├  │  ├─OUT
├  ├─test
├  │  ├─A
├  │  ├─B
├  │  ├─OUT

where A contains images of pre-phase, B contains images of post-phase, and OUT contains label maps.

When using the LEVIR-CD dataset, simply change the folder name from SVCD to LEVIR. The location of the dataset can be set in dataset_dir in the file metadata.json.

cut bitemporal image pairs (LEVIR-CD)

The original image in LEVIR-CD has a size of 1024 * 1024, which will consume too much memory when training. In our paper, we cut the original image into patches of 256 * 256 size without overlapping.

When running our code, please make sure that the file path of the cut image matches ours.

Define hyperparameters

The hyperparameters and dataset paths can be set in the file metadata.json.


"augmentation":  Data Enhancements
"num_gpus":      Number of simultaneous GPUs
"num_workers":   Number of simultaneous processes

"image_chanels": Number of channels of the image (3 for RGB images)
"init_channels": Adjust the overall number of channels in the network, the default is 32
"epochs":        Number of rounds of training
"batch_size":    Number of pictures in the same batch
"learning_rate": Learning Rate
"loss_function": The loss function is specified in the file `./utils/helpers.py`
"bilinear":      Up-sampling method of decoder feature maps, `False` means deconvolution, `True` means bilinear up-sampling

"dataset_dir":   Dataset path, "../SVCD/" means that the dataset `SVCD` is in the same directory as the folder `SESNet`.

From this paper "SESNet: A Semantically Enhanced Siamese Network for Remote Sensing Change Detection"

Related tags

Overview

SESNet for remote sensing image change detection

Prerequisites

Training

testing

Prepare Datasets

download the change detection dataset

cut bitemporal image pairs (LEVIR-CD)

Define hyperparameters

Owner

Testing the Facial Emotion Recognition (FER) algorithm on animations

Implementation of "Semi-supervised Domain Adaptive Structure Learning"

Cmsc11 arcade - Final Project for CMSC11

Neural models of common sense. 🤖

Simple renderer for use with MuJoCo (>=2.1.2) Python Bindings.

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Simple Python application to transform Serial data into OSC messages

Code for "Human Pose Regression with Residual Log-likelihood Estimation", ICCV 2021 Oral

Fast image augmentation library and an easy-to-use wrapper around other libraries

DeepLabv3+：Encoder-Decoder with Atrous Separable Convolution语义分割模型在tensorflow2当中的实现

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

Code for the paper "Reinforced Active Learning for Image Segmentation"

Air Pollution Prediction System using Linear Regression and ANN

Vision Transformer for 3D medical image registration (Pytorch).

Metric learning algorithms in Python

Rethinking Transformer-based Set Prediction for Object Detection

TorchX is a library containing standard DSLs for authoring and running PyTorch related components for an E2E production ML pipeline.

In this project we combine techniques from neural voice cloning and musical instrument synthesis to achieve good results from as little as 16 seconds of target data.

RL Algorithms with examples in Python / Pytorch / Unity ML agents

Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.