Reference implementation for Structured Prediction with Deep Value Networks

Last update: Feb 02, 2022

Related tags

Overview

Deep Value Network (DVN)

This code is a python reference implementation of DVNs introduced in

Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs. Michael Gygli, Mohammad Norouzi, Anelia Angelova. ICML 2017. PDF

Note: This code implements the multi-layer perceptron version used for the multi-label classification experiments only (Section 5.1). The segmentation code was written while inside Google and thus not available.

Requirements

To run this code you need to have tensorflow, numpy, liac-arff, scikit-learn and torchfile installed. Install with

pip install -r requirements.txt

Playing around with a pre-trained Value Net

The pre-trained model for the Bibtex dataset is included in this repository. This allows you do play around with it and it's predictions, using our jupyter notebook.

Replicating the experiments in the paper

Bibtex

To replicate the numbers for bibtex provided in the paper, run:

import reproduce_results
# Reproduce results on the bibtex dataset
reproduce_results.run_bibtex()

By default, the model weights and logs are stored to ./bibtex_dvn. You can monitor the process using tensorboard with

tensorboard --logdir ./bibtex_dvn/

In order to understand the training process two quantities are important:

loss: The loss in estimating the true value of an output hypothesis
gt_f1_scores: The true f1 scores of the generated output hypothesis.

As training progresses, the generated output hypothesis should get better and better. As such, the validation performance reported here closely matches the performance of the test set. The curve should look something like this:

Bookmarks

For Bookmarks the splits are not provided on http://mulan.sourceforge.net/datasets-mlc.html. Thus, we use the splits provided by SPEN. To get the data, run:

cd mlc_datasets
wget http://www.cics.umass.edu/~belanger/icml_mlc_data.tar.gz
tar -xvf icml_mlc_data.tar.gz
cd ..

Then, you can reproduce the results with

import reproduce_results
# Reproduce results on the bookmarks dataset
reproduce_results.run_bookmarks()

The model weights and logs are stored to ./bookmarks_dvn/.

Contributors

Michael Gygli, Mohammad Norouzi, Anelia Angelova

Code by Michael Gygli

Reference implementation for Structured Prediction with Deep Value Networks

Related tags

Overview

Deep Value Network (DVN)

Requirements

Playing around with a pre-trained Value Net

Replicating the experiments in the paper

Contributors

Owner

Michael Gygli

A deep learning network built with TensorFlow and Keras to classify gender and estimate age.

Degree-Quant: Quantization-Aware Training for Graph Neural Networks.

paper: Hyperspectral Remote Sensing Image Classification Using Deep Convolutional Capsule Network

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Implementation of the HMAX model of vision in PyTorch

TilinGNN: Learning to Tile with Self-Supervised Graph Neural Network (SIGGRAPH 2020)

A Demo server serving Bert through ONNX with GPU written in Rust with <3

LSUN Dataset Documentation and Demo Code

Pseudo lidar - (CVPR 2019) Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

Delving into Localization Errors for Monocular 3D Object Detection, CVPR'2021

Yolact-keras实例分割模型在keras当中的实现

A graph neural network (GNN) model to predict protein-protein interactions (PPI) with no sample features

Official pytorch implementation of the AAAI 2021 paper Semantic Grouping Network for Video Captioning

Learning to trade under the reinforcement learning framework

Aerial Imagery dataset for fire detection: classification and segmentation (Unmanned Aerial Vehicle (UAV))

This repository contains the re-implementation of our paper deSpeckNet: Generalizing Deep Learning Based SAR Image Despeckling

C3D is a modified version of BVLC caffe to support 3D ConvNets.

Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

catch-22: CAnonical Time-series CHaracteristics

Very Deep Convolutional Networks for Large-Scale Image Recognition