Pytorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

Last update: Nov 25, 2021

Related tags

Overview

This is a Pytorch Lightning version PSMNet which is based on JiaRenChang/PSMNet.

use python main.py to start training.

PSM-Net

Pytorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network" paper (CVPR 2018) by Jia-Ren Chang and Yong-Sheng Chen.

Official repository: JiaRenChang/PSMNet

Usage

1) Requirements

Python3.5+
Pytorch0.4
Opencv-Python
Matplotlib
TensorboardX
Tensorboard

All dependencies are listed in requirements.txt, you execute below command to install the dependencies.

pip install -r requirements.txt

2) Train

usage: train.py [-h] [--maxdisp MAXDISP] [--logdir LOGDIR] [--datadir DATADIR]
                [--cuda CUDA] [--batch-size BATCH_SIZE]
                [--validate-batch-size VALIDATE_BATCH_SIZE]
                [--log-per-step LOG_PER_STEP]
                [--save-per-epoch SAVE_PER_EPOCH] [--model-dir MODEL_DIR]
                [--lr LR] [--num-epochs NUM_EPOCHS]
                [--num-workers NUM_WORKERS]

PSMNet

optional arguments:
  -h, --help            show this help message and exit
  --maxdisp MAXDISP     max diparity
  --logdir LOGDIR       log directory
  --datadir DATADIR     data directory
  --cuda CUDA           gpu number
  --batch-size BATCH_SIZE
                        batch size
  --validate-batch-size VALIDATE_BATCH_SIZE
                        batch size
  --log-per-step LOG_PER_STEP
                        log per step
  --save-per-epoch SAVE_PER_EPOCH
                        save model per epoch
  --model-dir MODEL_DIR
                        directory where save model checkpoint
  --lr LR               learning rate
  --num-epochs NUM_EPOCHS
                        number of training epochs
  --num-workers NUM_WORKERS
                        num workers in loading data

For example:

python train.py --batch-size 16 \
                --logdir log/exmaple \
                --num-epochs 500

3) Visualize result

This repository uses tensorboardX to visualize training result. Find your log directory and launch tensorboard to look over the result. The default log directory is /log.

tensorboard --logdir <your_log_dir>

Here are some of my training results (have been trained for 1000 epochs on KITTI2015):

4) Inference

usage: inference.py [-h] [--maxdisp MAXDISP] [--left LEFT] [--right RIGHT]
                    [--model-path MODEL_PATH] [--save-path SAVE_PATH]

PSMNet inference

optional arguments:
  -h, --help            show this help message and exit
  --maxdisp MAXDISP     max diparity
  --left LEFT           path to the left image
  --right RIGHT         path to the right image
  --model-path MODEL_PATH
                        path to the model
  --save-path SAVE_PATH
                        path to save the disp image

For example:

python inference.py --left test/left.png \
                    --right test/right.png \
                    --model-path checkpoint/08/best_model.ckpt \
                    --save-path test/disp.png

5) Pretrained model

A model trained for 1000 epochs on KITTI2015 dataset can be download here. (I choose the best model among the 1000 epochs)

state {
    'epoch': 857,
    '3px-error': 3.466
}

Task List

Contact

Email: [email protected]

Welcome for any discussions!

Pytorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

Related tags

Overview

PSM-Net

Usage

1) Requirements

2) Train

3) Visualize result

4) Inference

5) Pretrained model

Task List

Contact

Owner

XIAOTIAN LIU

Training vision models with full-batch gradient descent and regularization

Scheme for training and applying a label propagation framework

Official pytorch implementation of the IrwGAN for unaligned image-to-image translation

URIE: Universal Image Enhancementfor Visual Recognition in the Wild

Pytorch implementation of the paper SPICE: Semantic Pseudo-labeling for Image Clustering

The code release of paper 'Domain Generalization for Medical Imaging Classification with Linear-Dependency Regularization' NIPS 2020.

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Image Segmentation Evaluation

Low-code/No-code approach for deep learning inference on devices

以孤立语假设和宽度优先搜索为基础，构建了一种多通道堆叠注意力Transformer结构的斗地主ai

Adaptation through prediction: multisensory active inference torque control

Official code for the CVPR 2021 paper "How Well Do Self-Supervised Models Transfer?"

Random Walk Graph Neural Networks

This repository is for DSA and CP scripts for reference.

Code for the paper "Can Active Learning Preemptively Mitigate Fairness Issues?" presented at RAI 2021.

Code for our paper "Sematic Representation for Dialogue Modeling" in ACL2021

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

SeqFormer: a Frustratingly Simple Model for Video Instance Segmentation

Learning hierarchical attention for weakly-supervised chest X-ray abnormality localization and diagnosis

Pytorch Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic