Caffe implementation for Hu et al. Segmentation for Natural Language Expressions

Last update: Jul 27, 2021

Related tags

Overview

Segmentation from Natural Language Expressions

This repository contains the Caffe reimplementation of the following paper:

R. Hu, M. Rohrbach, T. Darrell, Segmentation from Natural Language Expressions. in arXiv:1603.06180, 2016. (PDF)

@article{hu2016segmentation,
  title={Segmentation from Natural Language Expressions},
  author={Hu, Ronghang and Rohrbach, Marcus and Darrell, Trevor},
  journal={arXiv preprint arXiv:1603.06180},
  year={2016}
}

Project Page: http://ronghanghu.com/text_objseg

Installation

Install Caffe following the instructions here.
Download this repository or clone with Git, and then cd into the root directory of the repository.

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Download ReferIt dataset:

./referit/referit-dataset/download_referit_dataset.sh

Download the caffemodel for VGG-16 network parameters trained on ImageNET 1000 classes.

Training

You may need to add the repository root directory to Python's module path:

export PYTHONPATH=/path/to/text_objseg_caffe/:$PYTHONPATH

Build training batches for bounding boxes:

python referit/build_training_batches_det.py

Build training batches for segmentation:

python referit/build_training_batches_seg.py

Configure the config.py file in the directory det_model and train the language-based bounding box localization model:

python det_model/train_det_model.py

Configure the config.py file in the directory seg_low_res_model and train the low resolution language-based segmentation model (from the previous bounding box localization model):

python seg_low_res_model/train_low_res_model.py

Configure the config.py file in the directory seg_model and train the high resolution language-based segmentation model (from the previous low resolution segmentation model):

python seg_model/train_seg_model.py

Evaluation

You may need to add the repository root directory to Python's module path:

export PYTHONPATH=path/to/text_objseg_caffe:$PYTHONPATH

Configure the test_config.py file in the directory seg_model and run evaluation for the high resolution language-based segmentation model:

python seg_model/test_seg_model.py

This should reproduce the results in the paper. You may also evaluate the language-based bounding box localization model:

python det_model/test_det_model.py

The results can be compared to this paper.

Demo

There is a demo that you can try! Run the demo in ./demo/text_objseg_demo.ipynb with Jupyter Notebook (IPython Notebook).

Caffe implementation for Hu et al. Segmentation for Natural Language Expressions

Related tags

Overview

Segmentation from Natural Language Expressions

Installation

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Training

Evaluation

Demo

Owner

Target Propagation via Regularized Inversion

🔥 Cannlytics-powered artificial intelligence 🤖

OverFeat is a Convolutional Network-based image classifier and feature extractor.

A simple program for training and testing vit

Tensorflow-seq2seq-tutorials - Dynamic seq2seq in TensorFlow, step by step

Focal and Global Knowledge Distillation for Detectors

Code for the Active Speakers in Context Paper (CVPR2020)

Code for "On Memorization in Probabilistic Deep Generative Models"

A python comtrade load library accelerated by go

Random Walk Graph Neural Networks

The Body Part Regression (BPR) model translates the anatomy in a radiologic volume into a machine-interpretable form.

Face Recognition & AI Based Smart Attendance Monitoring System.

This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is accepted to ICCV2021.

Official code for our ICCV paper: "From Continuity to Editability: Inverting GANs with Consecutive Images"

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Explanatory Learning: Beyond Empiricism in Neural Networks

Net2net - Network-to-Network Translation with Conditional Invertible Neural Networks

A simple python program that can be used to implement user authentication tokens into your program...

source code for 'Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge' by A. Shah, K. Shanmugam, K. Ahuja

Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations