SatelliteSfM - A library for solving the satellite structure from motion problem

Last update: Dec 08, 2022

Satellite Structure from Motion

Maintained by Kai Zhang.

Overview

This is a library dedicated to solving the satellite structure from motion problem.
It's a wrapper around the VisSatSatelliteStereo repo for easier use.
The outputs are png images and OpenCV-compatible pinhole cameras that are readily deployable to multi-view stereo pipelines targeting ground-level images.

Installation

Assuming you are on a Linux machine with at least one GPU and have conda installed, install this library by running:

. ./env.sh

Inputs

The inputs are assumed to be a set of .tif images encoding 3-channel uint8 RGB colors, along with metadata such as RPC cameras. This data format follows the public satellite benchmark TRACK 3: MULTI-VIEW SEMANTIC STEREO. Download one example dataset from this Google Drive; the folder structure looks like this:

- examples/inputs
    - images/
        - *.tif
        - *.tif
        - *.tif
        - ...
    - latlonalt_bbx.json

Here, latlonalt_bbx.json specifies the bounding box for the site of interest in the global (latitude, longitude, altitude) coordinate system.

If you are not sure what a reasonably good altitude range is, you can put arbitrary numbers in the json file, but you must then enable the --use_srtm4 option below.
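
Before running the pipeline, it can help to confirm that the input folder matches the layout above. The following is a minimal sketch, not part of this library: the function name check_input_folder is hypothetical, and it only checks that images/ holds at least one .tif file and that latlonalt_bbx.json exists and parses as JSON. It does not validate the keys inside the json file.

```python
import json
from pathlib import Path


def check_input_folder(input_folder):
    """Return a list of layout problems found in input_folder (empty if none).

    Hypothetical helper: it only mirrors the folder structure described above.
    """
    root = Path(input_folder)
    problems = []

    # images/ must exist and contain at least one .tif file
    images_dir = root / "images"
    if not images_dir.is_dir():
        problems.append(f"missing folder: {images_dir}")
    elif not any(images_dir.glob("*.tif")):
        problems.append(f"no .tif images in: {images_dir}")

    # latlonalt_bbx.json must exist and be parseable JSON
    bbx_file = root / "latlonalt_bbx.json"
    if not bbx_file.is_file():
        problems.append(f"missing file: {bbx_file}")
    else:
        try:
            json.loads(bbx_file.read_text())
        except json.JSONDecodeError as err:
            problems.append(f"{bbx_file} is not valid JSON: {err}")

    return problems
```

For example, check_input_folder("examples/inputs") should return an empty list once the example data has been downloaded and unpacked.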

Run Structure from Motion

python satellite_sfm.py --input_folder examples/inputs --output_folder examples/outputs --run_sfm [--use_srtm4] [--enable_debug]

The --enable_debug option outputs visualizations that are helpful for debugging the structure from motion quality.

Outputs

- {output_folder}/images/ contains the png images.
- {output_folder}/cameras_adjusted/ contains the bundle-adjusted pinhole cameras; each camera is represented by a pair of OpenCV-compatible 4x4 matrices, K and W2C.
- {output_folder}/enu_bbx_adjusted.json contains the scene bounding box in the local ENU Euclidean coordinate system.
- {output_folder}/enu_observer_latlonalt.json contains the observer coordinate that defines the local ENU coordinate system; this observer coordinate is needed only for converting between local ENU and global latitude-longitude-altitude coordinates.
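
To illustrate how a K, W2C pair can be used, here is a minimal sketch that projects 3D points in the local ENU frame to pixel coordinates under the usual OpenCV pinhole convention (x right, y down, z forward). It is an assumption that the 4x4 K holds the 3x3 intrinsics in its upper-left block and that W2C maps homogeneous world points to camera coordinates. The function name project_points is hypothetical, and the on-disk file format of the cameras is not assumed here.

```python
import numpy as np


def project_points(K, W2C, points_world):
    """Project Nx3 world points to Nx2 pixels; also return per-point depth.

    Assumes K[:3, :3] is the 3x3 intrinsics and W2C is a 4x4 world-to-camera
    transform (both assumptions, see the text above).
    """
    K = np.asarray(K, dtype=np.float64)
    W2C = np.asarray(W2C, dtype=np.float64)
    points_world = np.asarray(points_world, dtype=np.float64)

    # Append a 1 to each point to get homogeneous coordinates (Nx4)
    ones = np.ones((points_world.shape[0], 1))
    points_h = np.hstack([points_world, ones])

    # World -> camera coordinates (3xN), then camera -> image plane (3xN)
    cam = W2C[:3, :] @ points_h.T
    pix = K[:3, :3] @ cam

    # Perspective division by depth gives pixel coordinates
    uv = (pix[:2] / pix[2:3]).T
    depth = cam[2]
    return uv, depth
```

Points with non-positive depth lie behind the camera and should be discarded by the caller.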

If you turn on the --enable_debug option, you might want to dig into the folder {output_folder}/debug_sfm for visuals, etc.
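
The conversion between global latitude-longitude-altitude and the local ENU frame mentioned above can be sketched with the standard WGS84 formulas. This is a generic illustration, not necessarily the routine this library uses internally. The function names are hypothetical, and the observer values are passed in directly rather than read from enu_observer_latlonalt.json, since the key names in that file are not assumed here.

```python
import numpy as np

WGS84_A = 6378137.0                    # semi-major axis in meters
WGS84_F = 1.0 / 298.257223563          # flattening
WGS84_E2 = WGS84_F * (2.0 - WGS84_F)   # first eccentricity squared


def latlonalt_to_ecef(lat_deg, lon_deg, alt_m):
    """Convert geodetic coordinates (degrees, meters) to ECEF (meters)."""
    lat, lon = np.radians(lat_deg), np.radians(lon_deg)
    # Prime vertical radius of curvature at this latitude
    n = WGS84_A / np.sqrt(1.0 - WGS84_E2 * np.sin(lat) ** 2)
    x = (n + alt_m) * np.cos(lat) * np.cos(lon)
    y = (n + alt_m) * np.cos(lat) * np.sin(lon)
    z = (n * (1.0 - WGS84_E2) + alt_m) * np.sin(lat)
    return np.array([x, y, z])


def latlonalt_to_enu(lat_deg, lon_deg, alt_m, obs_lat_deg, obs_lon_deg, obs_alt_m):
    """Convert geodetic coordinates to (east, north, up) relative to an observer."""
    delta = (latlonalt_to_ecef(lat_deg, lon_deg, alt_m)
             - latlonalt_to_ecef(obs_lat_deg, obs_lon_deg, obs_alt_m))
    lat0, lon0 = np.radians(obs_lat_deg), np.radians(obs_lon_deg)
    # Rows are the east, north and up unit vectors expressed in ECEF
    rot = np.array([
        [-np.sin(lon0), np.cos(lon0), 0.0],
        [-np.sin(lat0) * np.cos(lon0), -np.sin(lat0) * np.sin(lon0), np.cos(lat0)],
        [np.cos(lat0) * np.cos(lon0), np.cos(lat0) * np.sin(lon0), np.sin(lat0)],
    ])
    return rot @ delta
```

The observer itself maps to the ENU origin, and a point directly above the observer maps to a pure "up" offset.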

Citations

@inproceedings{VisSat-2019,
  title={Leveraging Vision Reconstruction Pipelines for Satellite Imagery},
  author={Zhang, Kai and Sun, Jin and Snavely, Noah},
  booktitle={IEEE International Conference on Computer Vision Workshops},
  year={2019}
}

Example results

input images

sparse point cloud output by SfM

homography-warp one view, then average with another by a plane sequence

high-res video

inspect epipolar geometry

python inspect_epipolar_geometry.py
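
For readers who want to check the epipolar geometry between two output cameras by hand, the fundamental matrix can be derived from the K, W2C pairs with the textbook formula F = K2^-T [t]x R K1^-1, where R and t are the relative pose from the first camera to the second. The sketch below is a generic illustration under the same assumptions as for the outputs (K[:3, :3] is the 3x3 intrinsics, W2C is a 4x4 world-to-camera transform). It is not a description of what inspect_epipolar_geometry.py does, and the function names are hypothetical.

```python
import numpy as np


def skew_symmetric(t):
    """Return the 3x3 matrix [t]x such that [t]x @ v equals np.cross(t, v)."""
    return np.array([
        [0.0, -t[2], t[1]],
        [t[2], 0.0, -t[0]],
        [-t[1], t[0], 0.0],
    ])


def fundamental_matrix(K1, W2C1, K2, W2C2):
    """Fundamental matrix F with x2^T F x1 = 0 for homogeneous pixels x1, x2."""
    K1 = np.asarray(K1, dtype=np.float64)[:3, :3]
    K2 = np.asarray(K2, dtype=np.float64)[:3, :3]
    W2C1 = np.asarray(W2C1, dtype=np.float64)
    W2C2 = np.asarray(W2C2, dtype=np.float64)

    # Relative pose taking camera-1 coordinates to camera-2 coordinates
    rel = W2C2 @ np.linalg.inv(W2C1)
    R, t = rel[:3, :3], rel[:3, 3]

    # Essential matrix, then undo the intrinsics on both sides
    E = skew_symmetric(t) @ R
    return np.linalg.inv(K2).T @ E @ np.linalg.inv(K1)
```

Given a pixel x1 in the first image (as a homogeneous 3-vector), F @ x1 gives the coefficients of the corresponding epipolar line in the second image.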

get zero-skew intrinsics matrix

python skew_correct.py --input_folder ./examples/outputs ./examples/outputs_zeroskew
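
One common way to obtain a zero-skew intrinsics matrix is to absorb the skew into a 2D affine warp of the image: if K has skew s, then A = K_zero K^-1 is a pure shear that maps pixels of the original image to pixels of an image whose camera has intrinsics K_zero. The sketch below illustrates that idea only. Whether skew_correct.py works this way is an assumption, and the function name zero_skew_correction is hypothetical.

```python
import numpy as np


def zero_skew_correction(K):
    """Return (K_zero, A): zero-skew intrinsics and the 3x3 pixel warp A.

    Assumes K[:3, :3] is upper triangular with last row [0, 0, 1].
    A satisfies A @ K[:3, :3] == K_zero, so warping the image by A makes
    K_zero the valid intrinsics for the warped image.
    """
    K = np.asarray(K, dtype=np.float64)[:3, :3]
    fx, _skew, cx = K[0]
    fy, cy = K[1, 1], K[1, 2]

    # Same focal lengths and principal point, with the skew term set to zero
    K_zero = np.array([
        [fx, 0.0, cx],
        [0.0, fy, cy],
        [0.0, 0.0, 1.0],
    ])

    # Shear that converts original pixel coordinates to zero-skew ones
    A = K_zero @ np.linalg.inv(K)
    return K_zero, A
```

The image itself then has to be resampled with the same transform, for instance by passing A[:2] to an affine warping routine such as OpenCV's warpAffine.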

More handy scripts are coming

Stay tuned :-)
