PyTorch implementation of Value Iteration Networks (NIPS 2016 best paper)

Last update: Dec 26, 2022

Overview

VIN: Value Iteration Networks

A quick thank you

A few others have released amazing related work which helped inspire and improve my own implementation. It goes without saying that this release would not be nearly as good if it were not for all of the following:

Why another VIN implementation?

The PyTorch VIN model in this repository is, in my opinion, more readable and closer to the original Theano implementation than others I have found (both TensorFlow and PyTorch).
This is not simply an implementation of the VIN model in PyTorch; it is also a full Python implementation of the gridworld environments used in the original MATLAB implementation.
It provides a more extensible research base for others to build on, without needing to get past a possible MATLAB paywall.

Installation

This repository requires the following packages:

SciPy >= 0.19.0
Python >= 2.7 (if using Python 3.x: python3-tk should be installed)
Numpy >= 1.12.1
Matplotlib >= 2.0.0
PyTorch >= 0.1.11

Use pip to install the necessary dependencies:

pip install -U -r requirements.txt

Note that PyTorch cannot be installed directly from PyPI; refer to http://pytorch.org/ for custom installation instructions specific to your needs.
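
To check an existing environment against the minimum versions listed above, something like the following sketch may help. It is not part of this repository: the helper names are hypothetical, the distribution names (scipy, numpy, matplotlib, torch) are assumed to match what is installed, and importlib.metadata needs Python 3.8 or newer.

```python
import re

# Minimum versions copied from the package list above.
MINIMUMS = {"scipy": "0.19.0", "numpy": "1.12.1", "matplotlib": "2.0.0", "torch": "0.1.11"}


def version_tuple(text, width=3):
    """Turn the leading dotted-number part of a version string into a tuple."""
    match = re.match(r"\d+(?:\.\d+)*", text)
    parts = [int(p) for p in match.group(0).split(".")] if match else []
    return tuple((parts + [0] * width)[:width])


def meets_minimum(installed, required):
    """True if the installed version is at least the required one."""
    return version_tuple(installed) >= version_tuple(required)


def check_installed():
    """Map each package to True/False, or None when it is not installed."""
    from importlib import metadata  # Python 3.8+ (assumption)

    report = {}
    for name, minimum in MINIMUMS.items():
        try:
            report[name] = meets_minimum(metadata.version(name), minimum)
        except metadata.PackageNotFoundError:
            report[name] = None
    return report


if __name__ == "__main__":
    for package, ok in check_installed().items():
        print(package, "missing" if ok is None else ("ok" if ok else "too old"))
```

Only the leading numeric part of each version string is compared, so a local build suffix such as the one in 0.1.12_1 is ignored.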

How to train

8x8 gridworld

python train.py --datafile dataset/gridworld_8x8.npz --imsize 8 --lr 0.005 --epochs 30 --k 10 --batch_size 128

16x16 gridworld

python train.py --datafile dataset/gridworld_16x16.npz --imsize 16 --lr 0.002 --epochs 30 --k 20 --batch_size 128

28x28 gridworld

python train.py --datafile dataset/gridworld_28x28.npz --imsize 28 --lr 0.002 --epochs 30 --k 36 --batch_size 128

Flags:

datafile: The path to the data files.
imsize: The size of input images. One of: [8, 16, 28]
lr: Learning rate with RMSProp optimizer. Recommended: [0.01, 0.005, 0.002, 0.001]
epochs: Number of epochs to train. Default: 30
k: Number of Value Iterations. Recommended: [10 for 8x8, 20 for 16x16, 36 for 28x28]
l_i: Number of channels in input layer. Default: 2, i.e. obstacles image and goal image.
l_h: Number of channels in first convolutional layer. Default: 150, described in paper.
l_q: Number of channels in q layer (~actions) in VI-module. Default: 10, described in paper.
batch_size: Batch size. Default: 128
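
For orientation, the flags above could be declared with argparse roughly as follows. This is a sketch rather than the contents of train.py: the defaults for epochs, l_i, l_h, l_q, and batch_size come from the list above, while the defaults for datafile, imsize, lr, and k are simply borrowed from the 8x8 example command and are assumptions.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(description="VIN training flags (illustrative sketch)")
    # Assumed defaults (taken from the 8x8 example command, not from train.py).
    parser.add_argument("--datafile", type=str, default="dataset/gridworld_8x8.npz",
                        help="Path to the data file")
    parser.add_argument("--imsize", type=int, default=8, choices=[8, 16, 28],
                        help="Size of input images")
    parser.add_argument("--lr", type=float, default=0.005,
                        help="Learning rate for the RMSProp optimizer")
    parser.add_argument("--k", type=int, default=10,
                        help="Number of value iterations")
    # Defaults stated in the flag list above.
    parser.add_argument("--epochs", type=int, default=30, help="Number of epochs to train")
    parser.add_argument("--l_i", type=int, default=2, help="Channels in the input layer")
    parser.add_argument("--l_h", type=int, default=150, help="Channels in the first conv layer")
    parser.add_argument("--l_q", type=int, default=10, help="Channels in the q layer")
    parser.add_argument("--batch_size", type=int, default=128, help="Batch size")
    return parser


# Parse the 16x16 example settings; unspecified flags fall back to their defaults.
config = build_parser().parse_args(
    ["--datafile", "dataset/gridworld_16x16.npz", "--imsize", "16", "--lr", "0.002", "--k", "20"]
)
print(config.imsize, config.k, config.epochs, config.l_h)
```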

How to test / visualize paths (requires training first)

8x8 gridworld

python test.py --weights trained/vin_8x8.pth --imsize 8 --k 10

16x16 gridworld

python test.py --weights trained/vin_16x16.pth --imsize 16 --k 20

28x28 gridworld

python test.py --weights trained/vin_28x28.pth --imsize 28 --k 36

To visualize the optimal and predicted paths simply pass:

--plot

Flags:

weights: Path to trained weights.
imsize: The size of input images. One of: [8, 16, 28]
plot: If supplied, the optimal and predicted paths will be plotted
k: Number of Value Iterations. Recommended: [10 for 8x8, 20 for 16x16, 36 for 28x28]
l_i: Number of channels in input layer. Default: 2, i.e. obstacles image and goal image.
l_h: Number of channels in first convolutional layer. Default: 150, described in paper.
l_q: Number of channels in q layer (~actions) in VI-module. Default: 10, described in paper.

Results

[Figure: two sample path visualizations (Sample One, Sample Two) for each of the 8x8, 16x16, and 28x28 gridworlds]

Datasets

Each data sample consists of an obstacle image and a goal image, followed by the (x, y) coordinates of the current state in the gridworld.

Dataset size	8x8	16x16	28x28
Train set	81337	456309	1529584
Test set	13846	77203	251755

The datasets (8x8, 16x16, and 28x28) included in this repository can be reproduced using the dataset/make_training_data.py script. Note that this script is not optimized and runs rather slowly (also uses a lot of memory :D)
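
A quick way to see which arrays a dataset file contains, without loading hundreds of megabytes into memory, is to read the archive listing. A .npz file is a zip archive with one .npy member per array, and np.savez names positional arrays arr_0, arr_1, and so on. The sketch below relies only on that; the helper names are hypothetical, and the expected array names are an illustration, not a statement about this repository's dataset layout.

```python
import os
import zipfile


def list_npz_arrays(path):
    """Return the array names stored in a .npz archive without loading any data."""
    with zipfile.ZipFile(path) as archive:
        return sorted(
            name[: -len(".npy")] for name in archive.namelist() if name.endswith(".npy")
        )


def missing_arrays(path, expected):
    """Return the expected array names that are absent from the archive."""
    present = set(list_npz_arrays(path))
    return [name for name in expected if name not in present]


if __name__ == "__main__":
    datafile = "dataset/gridworld_8x8.npz"
    if os.path.exists(datafile):
        print("arrays:", list_npz_arrays(datafile))
        # Illustrative expectation only; adjust to whatever the loader reads.
        print("missing:", missing_arrays(datafile, ["arr_0", "arr_1"]))
```

If an array the loader expects is reported as missing, the file was probably produced by a different version of the generation script than the one the loader was written for.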

Performance: Success Rate

This is the success rate from rollouts of the learned policy in the environment, measured over 5000 randomly generated domains.

Success Rate	8x8	16x16	28x28
PyTorch	99.69%	96.99%	91.07%

Performance: Test Accuracy

NOTE: This is the accuracy on the test set. It differs from the table in the paper, which reports the success rate from rollouts of the learned policy in the environment.

Test Accuracy	8x8	16x16	28x28
PyTorch	99.83%	94.84%	88.54%
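
The difference between the two metrics can be illustrated with a toy example. The sketch below is not the evaluation code from this repository: it uses a hypothetical 1-D corridor with states 0 to 4 and the goal at state 4. Test accuracy counts individual states where the predicted action matches the optimal one; success rate counts whole rollouts that reach the goal, so a single wrong action can fail every rollout that passes through it.

```python
def step_accuracy(predicted, optimal, states):
    """Fraction of individual states where the predicted action equals the label."""
    matches = sum(predicted[s] == optimal[s] for s in states)
    return matches / len(states)


def rollout_success(policy, start, goal, max_steps=20):
    """Follow the policy from start; True if the goal is reached within max_steps."""
    state = start
    for _ in range(max_steps):
        if state == goal:
            return True
        state += policy[state]  # action is +1 (right) or -1 (left)
    return state == goal


states = [0, 1, 2, 3]
goal = 4
optimal = {0: 1, 1: 1, 2: 1, 3: 1}     # always move right
predicted = {0: 1, 1: 1, 2: -1, 3: 1}  # one wrong action, at state 2

accuracy = step_accuracy(predicted, optimal, states)
success_rate = sum(rollout_success(predicted, s, goal) for s in states) / len(states)
print(accuracy, success_rate)  # 0.75 0.25
```

Here three of four actions are correct, yet only the rollout that starts at state 3 reaches the goal, because every other start has to pass through state 2.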

Comments

testing accuracy fairly low

I just tried to follow the instructions in the repo and tested the trained models, but got a fairly low accuracy. I'm using PyTorch 0.1.12_1. Is there anything I should pay attention to?

opened by xinleipan 10

Prebuilt Dataset Generation

Hello,

I was wondering how you generated the prebuilt datasets that are downloaded when running download_weights_and_datasets.sh, i.e. what were the max_obs and max_obs_size parameters?

Did you follow this file in the original repo? https://github.com/avivt/VIN/blob/master/scripts/make_data_gridworld_nips.m

Thanks, Emilio

opened by eparisotto 5

The rollout accuracy in the test script is lower than the test accuracy in the train script

Hello!

I have a small question. Does the rollout accuracy indicate the success rate? If so, why is it lower than the prediction accuracy? In Aviv's implementation, the success rate of the 8x8 gridworld was as high as 99.6%. Why is the success rate in your experiment relatively low?

Thanks!

opened by albzni 4

RUN ERROR

When I run 'python train.py --datafile dataset/gridworld_8x8.npz --imsize 8 --lr 0.005 --epochs 30 --k 10 --batch_size 128', it works. But when I then ran 'python train.py --datafile dataset/gridworld_16x16.npz --imsize 16 --lr 0.002 --epochs 30 --k 20 --batch_size 128', the following error occurred:

~/pytorch-value-iteration-networks$ python train.py --datafile dataset/gridworld_16x16.npz --imsize 16 --lr 0.002 --epochs 10 --k 20 --batch_size 128
Traceback (most recent call last):
  File "train.py", line 135, in <module>
    config.datafile, imsize=config.imsize, train=True, transform=transform)
  File "/home/ni/pytorch-value-iteration-networks/dataset/dataset.py", line 22, in __init__
    self._process(file, self.train)
  File "/home/ni/pytorch-value-iteration-networks/dataset/dataset.py", line 58, in _process
    images = images.astype(np.float32)
MemoryError

opened by N-Kingsley 3

Problem of running the test script

Hello,

I downloaded the data with the .sh downloading script you provided, and I also got an nps weights file after training. When I ran the testing command, I got the following error:

Traceback (most recent call last):
  File "/home/research/DL/VIN/pytorch-value-iteration-networks/test.py", line 158, in <module>
    main(config)
  File "/home/research/DL/VIN/pytorch-value-iteration-networks/test.py", line 85, in main
    _, predictions = vin(X_in, S1_in, S2_in, config)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 357, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/research/DL/VIN/pytorch-value-iteration-networks/model.py", line 64, in forward
    return logits, self.sm(logits)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 352, in __call__
    for hook in self._forward_pre_hooks.values():
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 398, in __getattr__
    type(self).__name__, name))
AttributeError: 'Softmax' object has no attribute '_forward_pre_hooks'

Thanks for helping!

opened by YantianZha 3

Improved readability of the VIN model, in addition to minor changes

My main modification is in the forward method of the model, where you extract q_out from the q values, so that q = F.conv2d(...) is no longer repeated in two places. I also made minor improvements, such as adding argparse to the dataset creation script and changing .cuda() to .to(device) in test.py.

opened by shuishida 2

Inconsistent tensor sizes when starting training

Hey there. I'm trying to run

python train.py --datafile dataset/gridworld_8x8.npz --imsize 8 --lr 0.005 --epochs 30 --k 10 --batch_size 128

But I get the following error

Number of Train Samples: 103926
Number of Test Samples: 17434
     Epoch | Train Loss | Train Error | Epoch Time
Traceback (most recent call last):
  File "train.py", line 147, in <module>
    train(net, trainloader, config, criterion, optimizer, use_GPU)
  File "train.py", line 40, in train
    outputs, predictions = net(X, S1, S2, config)
  File "/home/j1k1000o/anaconda3/lib/python3.6/site-packages/torch/nn/modules/module.py", line 224, in __call__
    result = self.forward(*input, **kwargs)
  File "/media/user_home2/j1k1000o/j1k/VINs/pytorch-value-iteration-networks/model.py", line 44, in forward
    q = F.conv2d(torch.cat([r, v], 1), 
  File "/home/j1k1000o/anaconda3/lib/python3.6/site-packages/torch/autograd/variable.py", line 897, in cat
    return Concat.apply(dim, *iterable)
  File "/home/j1k1000o/anaconda3/lib/python3.6/site-packages/torch/autograd/_functions/tensor.py", line 317, in forward
    return torch.cat(inputs, dim)
RuntimeError: inconsistent tensor sizes at /opt/conda/conda-bld/pytorch_1502009910772/work/torch/lib/THC/generic/THCTensorMath.cu:141

I've executed

./download_weights_and_datasets.sh

as well as

python ./dataset/make_training_data.py

And I'm running it on an Ubuntu 16.04, python 3.6 and with all the requirements installed.

Can you help me out?

opened by juancprzs 2

Don't understand VIN last step

    slice_s1 = S1.long().expand(config.imsize, 1, config.l_q, q.size(0))
    slice_s1 = slice_s1.permute(3, 2, 1, 0)
    q_out = q.gather(2, slice_s1).squeeze(2)

What do these three lines do?
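
These lines appear to select, for each sample in the batch, the row of q given by that sample's S1 index. A minimal sketch to check this reading, assuming q has shape (batch, l_q, imsize, imsize) and S1 is a 1-D tensor holding one row index per sample (the shapes and toy sizes here are assumptions, not taken from the repository):

```python
import torch

batch, l_q, imsize = 2, 3, 4  # toy sizes, chosen only for illustration
q = torch.arange(batch * l_q * imsize * imsize, dtype=torch.float32)
q = q.reshape(batch, l_q, imsize, imsize)
S1 = torch.tensor([1, 3])  # assumed: one row index per sample

# Broadcast S1 to shape (imsize, 1, l_q, batch) ...
slice_s1 = S1.long().expand(imsize, 1, l_q, q.size(0))
# ... then reorder the axes to (batch, l_q, 1, imsize).
slice_s1 = slice_s1.permute(3, 2, 1, 0)
# Gather along dim 2 (rows): q_out[n, c, w] = q[n, c, S1[n], w].
q_out = q.gather(2, slice_s1).squeeze(2)

# The same selection written as plain indexing, one sample at a time.
expected = torch.stack([q[n, :, int(S1[n]), :] for n in range(batch)])
print(tuple(q_out.shape))             # (batch, l_q, imsize)
print(torch.equal(q_out, expected))
```

Under these assumptions the result has shape (batch, l_q, imsize): the row dimension has been indexed away, leaving one vector of q values per column for each sample.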

opened by QiXuanWang 1

KeyError: 'arr_1 is not a file in the archive'

python3 train.py --datafile dataset/gridworld_8x8.npz --imsize 8 --lr 0.005 --epochs 30 --k 10 --batch_size 128
Traceback (most recent call last):
  File "train.py", line 135, in <module>
    config.datafile, imsize=config.imsize, train=True, transform=transform)
  File "/home/user/pytorch/tutorials/valueiterationnetworks/pytorch-value-iteration-networks/dataset/dataset.py", line 22, in __init__
    self._process(file, self.train)
  File "/home/user/pytorch/tutorials/valueiterationnetworks/pytorch-value-iteration-networks/dataset/dataset.py", line 49, in _process
    S1 = f['arr_1']
  File "/home/user/miniconda3/lib/python3.6/site-packages/numpy/lib/npyio.py", line 255, in __getitem__
    raise KeyError("%s is not a file in the archive" % key)
KeyError: 'arr_1 is not a file in the archive'

I got this error, could you please

opened by derelearnro 1

Problem of running dataset/make_training_data.py script

Hi

When I tried to run the make_training_data.py script to generate the gridworld.npz file, I got the following error:

FileNotFoundError: [Errno 2] No such file or directory: 'dataset/gridworld_28x28.npz'

And I found that line 101 should be modified as follows:

save_path = "gridworld_{0}x{1}".format(dom_size[0], dom_size[1])
opened by ruqing00 0

Releases(v1.1)

v1.1(Apr 18, 2018)
Patches for newest PyTorch releases.

Fixes #3 #4 #5

Source code(tar.gz)
Source code(zip)
gridworld_16x16.npz(10.21 MB)
gridworld_28x28.npz(253.55 MB)
gridworld_8x8.npz(566.12 KB)
vin_16x16.pth(13.54 KB)
vin_28x28.pth(13.54 KB)
vin_8x8.pth(13.54 KB)
v1.0(Apr 21, 2017)

This release includes pre-built datasets and pre-trained models for the 8x8, 16x16, and 28x28 gridworlds.
Source code(tar.gz)
Source code(zip)
gridworld_16x16.npz(33.58 MB)
gridworld_28x28.npz(253.55 MB)
gridworld_8x8.npz(1.28 MB)
vin_16x16.pth(24.37 KB)
vin_28x28.pth(24.42 KB)
vin_8x8.pth(24.37 KB)

Owner

Kent Sommer

Software Engineer @ Toyota Research Institute (SF Bay Area)
