Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016

Last update: May 24, 2022

Related tags

Overview

Segmentation from Natural Language Expressions

This repository contains the code for the following paper:

R. Hu, M. Rohrbach, T. Darrell, Segmentation from Natural Language Expressions. in ECCV, 2016. (PDF)

@article{hu2016segmentation,
  title={Segmentation from Natural Language Expressions},
  author={Hu, Ronghang and Rohrbach, Marcus and Darrell, Trevor},
  journal={Proceedings of the European Conference on Computer Vision (ECCV)},
  year={2016}
}

Project Page: http://ronghanghu.com/text_objseg

Installation

Install Google TensorFlow (v1.0.0 or higher) following the instructions here.
Download this repository or clone with Git, and then cd into the root directory of the repository.

Demo

Download the trained models:
exp-referit/tfmodel/download_trained_models.sh.
Run the language-based segmentation model demo in ./demo/text_objseg_demo.ipynb with Jupyter Notebook (IPython Notebook).

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Download ReferIt dataset:
exp-referit/referit-dataset/download_referit_dataset.sh.
Download VGG-16 network parameters trained on ImageNET 1000 classes:
models/convert_caffemodel/params/download_vgg_params.sh.

Training

You may need to add the repository root directory to Python's module path: export PYTHONPATH=.:$PYTHONPATH.
Build training batches for bounding boxes:
python exp-referit/build_training_batches_det.py.
Build training batches for segmentation:
python exp-referit/build_training_batches_seg.py.
Select the GPU you want to use during training:
export GPU_ID=<gpu id>. Use 0 for <gpu id> if you only have one GPU on your machine.
Train the language-based bounding box localization model:
python exp-referit/exp_train_referit_det.py $GPU_ID.
Train the low resolution language-based segmentation model (from the previous bounding box localization model):
python exp-referit/init_referit_seg_lowres_from_det.py && python exp-referit/exp_train_referit_seg_lowres.py $GPU_ID.
Train the high resolution language-based segmentation model (from the previous low resolution segmentation model):
python exp-referit/init_referit_seg_highres_from_lowres.py && python exp-referit/exp_train_referit_seg_highres.py $GPU_ID.

Alternatively, you may skip the training procedure and download the trained models directly:
exp-referit/tfmodel/download_trained_models.sh.

Evaluation

Select the GPU you want to use during testing: export GPU_ID=<gpu id>. Use 0 for <gpu id> if you only have one GPU on your machine. Also, you may need to add the repository root directory to Python's module path: export PYTHONPATH=.:$PYTHONPATH.
Run evaluation for the high resolution language-based segmentation model:
python exp-referit/exp_test_referit_seg.py $GPU_ID
This should reproduce the results in the paper.
You may also evaluate the language-based bounding box localization model:
python exp-referit/exp_test_referit_det.py $GPU_ID
The results can be compared to this paper.

Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016

Related tags

Overview

Segmentation from Natural Language Expressions

Installation

Demo

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Training

Evaluation

Owner

Ronghang Hu

Repo for parser tensorflow(.pb) and tflite(.tflite)

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Here is the diagnostic tool for BMVC 2021 paper Diagnosing Errors in Video Relation Detectors.

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

SwinIR: Image Restoration Using Swin Transformer

Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices

MarcoPolo is a clustering-free approach to the exploration of bimodally expressed genes along with group information in single-cell RNA-seq data

Advancing mathematics by guiding human intuition with AI

Fast image augmentation library and an easy-to-use wrapper around other libraries

Official repository of "DeepMIH: Deep Invertible Network for Multiple Image Hiding", TPAMI 2022.

Pytorch implemenation of Stochastic Multi-Label Image-to-image Translation (SMIT)

face property detection pytorch

TorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision

ConvMAE: Masked Convolution Meets Masked Autoencoders

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Rewrite ultralytics/yolov5 v6.0 opencv inference code based on numpy, no need to rely on pytorch

[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"

An open-source outlier detection package by Getcontact Data Team

Use unsupervised and supervised learning to predict stocks

An open source Python package for plasma science that is under development