Caffe implementation for Hu et al. Segmentation for Natural Language Expressions

Last update: Jul 27, 2021

Related tags

Overview

Segmentation from Natural Language Expressions

This repository contains the Caffe reimplementation of the following paper:

R. Hu, M. Rohrbach, T. Darrell, Segmentation from Natural Language Expressions. in arXiv:1603.06180, 2016. (PDF)

@article{hu2016segmentation,
  title={Segmentation from Natural Language Expressions},
  author={Hu, Ronghang and Rohrbach, Marcus and Darrell, Trevor},
  journal={arXiv preprint arXiv:1603.06180},
  year={2016}
}

Project Page: http://ronghanghu.com/text_objseg

Installation

Install Caffe following the instructions here.
Download this repository or clone with Git, and then cd into the root directory of the repository.

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Download ReferIt dataset:

./referit/referit-dataset/download_referit_dataset.sh

Download the caffemodel for VGG-16 network parameters trained on ImageNET 1000 classes.

Training

You may need to add the repository root directory to Python's module path:

export PYTHONPATH=/path/to/text_objseg_caffe/:$PYTHONPATH

Build training batches for bounding boxes:

python referit/build_training_batches_det.py

Build training batches for segmentation:

python referit/build_training_batches_seg.py

Configure the config.py file in the directory det_model and train the language-based bounding box localization model:

python det_model/train_det_model.py

Configure the config.py file in the directory seg_low_res_model and train the low resolution language-based segmentation model (from the previous bounding box localization model):

python seg_low_res_model/train_low_res_model.py

Configure the config.py file in the directory seg_model and train the high resolution language-based segmentation model (from the previous low resolution segmentation model):

python seg_model/train_seg_model.py

Evaluation

You may need to add the repository root directory to Python's module path:

export PYTHONPATH=path/to/text_objseg_caffe:$PYTHONPATH

Configure the test_config.py file in the directory seg_model and run evaluation for the high resolution language-based segmentation model:

python seg_model/test_seg_model.py

This should reproduce the results in the paper. You may also evaluate the language-based bounding box localization model:

python det_model/test_det_model.py

The results can be compared to this paper.

Demo

There is a demo that you can try! Run the demo in ./demo/text_objseg_demo.ipynb with Jupyter Notebook (IPython Notebook).

Caffe implementation for Hu et al. Segmentation for Natural Language Expressions

Related tags

Overview

Segmentation from Natural Language Expressions

Installation

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Training

Evaluation

Demo

Owner

Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer"

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Train DeepLab for Semantic Image Segmentation

[SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal

A simple pygame dino game which can also be trained and played by a NEAT KI

BEGAN in PyTorch

Trained on Simulated Data, Tested in the Real World

Self-Supervised CNN-GCN Autoencoder

This is the winning solution of the Endocv-2021 grand challange.

Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

SwinIR: Image Restoration Using Swin Transformer

A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization

phylotorch-bito is a package providing an interface to BITO for phylotorch

Not Suitable for Work (NSFW) classification using deep neural network Caffe models.

Empowering journalists and whistleblowers

Repo for CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning

An improvement of FasterGICP: Acceptance-rejection Sampling based 3D Lidar Odometry