Pathdreamer: A World Model for Indoor Navigation

Last update: Jan 04, 2023

Related tags

Deep Learning pathdreamer

Overview

Pathdreamer: A World Model for Indoor Navigation

This repository hosts the open source code for Pathdreamer, to be presented at ICCV 2021.

Paper | Project Webpage | Colab Demo

Setup instructions

Environment

Set up virtualenv, and install required libraries:

virtualenv venv
source venv/bin/activate
pip install -r requirements.txt

Add the Pathdreamer library to PYTHONPATH:

export PYTHONPATH=$PYTHONPATH:/home/path/to/pathdreamer_root/

Downloading Pretrained Checkpoints

We provide a pretrained checkpoint which can be acquired by running:

wget https://storage.googleapis.com/gresearch/pathdreamer/ckpt.tar -P data/
tar -xf data/ckpt.tar --directory data/

The results will be extracted to the data/ckpt directory. Two checkpoints are provided, one for the Stage 1 model (Structure Generator), and another for the Stage 2 model (Image Generator).

Colab Demo

Pathdreamer_Example_Colab.ipynb [click to launch in Google Colab] shows how to setup and run the pretrained Pathdreamer model for inference. It includes examples on synthesizing image sequences and continuous video sequences for arbitrary navigation trajectories.

Citation

If you find this work useful, please consider citing:

@inproceedings{koh2021pathdreamer,
  title={Pathdreamer: A World Model for Indoor Navigation},
  author={Koh, Jing Yu and Lee, Honglak and Yang, Yinfei and Baldridge, Jason and Anderson, Peter},
  journal={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  year={2021}
}

License

Pathdreamer is released under the Apache 2.0 license. The Matterport3D dataset is governed by the Matterport3D Terms of Use.

Disclaimer

Not an official Google product.

Pathdreamer: A World Model for Indoor Navigation

Related tags

Overview

Pathdreamer: A World Model for Indoor Navigation

Setup instructions

Environment

Downloading Pretrained Checkpoints

Colab Demo

Citation

License

Disclaimer

Owner

Google Research

Pytorch Implementation for CVPR2018 Paper: Learning to Compare: Relation Network for Few-Shot Learning

This is a demo app to be used in the video streaming applications

This project is the official implementation of our accepted ICLR 2021 paper BiPointNet: Binary Neural Network for Point Clouds.

A CROSS-MODAL FUSION NETWORK BASED ON SELF-ATTENTION AND RESIDUAL STRUCTURE FOR MULTIMODAL EMOTION RECOGNITION

Tightness-aware Evaluation Protocol for Scene Text Detection

MLSpace: Hassle-free machine learning & deep learning development

Implementation of ConvMixer-Patches Are All You Need? in TensorFlow and Keras

Dynamic Slimmable Network (CVPR 2021, Oral)

Multiview 3D object detection on MultiviewC dataset through moft3d.

Unsupervised Representation Learning by Invariance Propagation

PyTorch Implementation of SSTNs for hyperspectral image classifications from the IEEE T-GRS paper "Spectral-Spatial Transformer Network for Hyperspectral Image Classification: A FAS Framework."

Sequence-tagging using deep learning

Collections for the lasted paper about multi-view clustering methods (papers, codes)

Consensus Learning from Heterogeneous Objectives for One-Class Collaborative Filtering

Repository for the Bias Benchmark for QA dataset.

Implementation of the paper Recurrent Glimpse-based Decoder for Detection with Transformer.

This project aims to segment 4 common retinal lesions from Fundus Images.

This is my codes that can visualize the psnr image in testing videos.

AI assistant built in python.the features are it can display time,say weather,open-google,youtube,instagram.

Fibonacci Method Gradient Descent