Code release for Hu et al., Segmentation from Natural Language Expressions, in ECCV, 2016

Last update: May 24, 2022

Segmentation from Natural Language Expressions

This repository contains the code for the following paper:

R. Hu, M. Rohrbach, T. Darrell, Segmentation from Natural Language Expressions. In ECCV, 2016.

@inproceedings{hu2016segmentation,
  title={Segmentation from Natural Language Expressions},
  author={Hu, Ronghang and Rohrbach, Marcus and Darrell, Trevor},
  booktitle={Proceedings of the European Conference on Computer Vision (ECCV)},
  year={2016}
}

Project Page: http://ronghanghu.com/text_objseg

Installation

Install TensorFlow (v1.0.0 or higher) following the official TensorFlow installation instructions.
Download this repository or clone with Git, and then cd into the root directory of the repository.
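
To confirm that the installed TensorFlow meets the stated minimum, a check along the following lines may help. This is a minimal sketch and not part of the repository; the helper names (parse_version, meets_minimum, installed_tensorflow_version) are made up for illustration, and the only library attribute it relies on is tensorflow.__version__.

```python
import re

MIN_TF_VERSION = (1, 0, 0)  # minimum version stated in the installation step


def parse_version(version):
    """Return the leading major.minor.patch numbers of a version string.

    Missing components count as 0, and suffixes such as "rc1" are ignored.
    """
    match = re.match(r"(\d+)(?:\.(\d+))?(?:\.(\d+))?", version)
    if match is None:
        raise ValueError("unrecognized version string: %r" % version)
    return tuple(int(g) if g is not None else 0 for g in match.groups())


def meets_minimum(version, minimum=MIN_TF_VERSION):
    """True if the version string is at least the required minimum."""
    return parse_version(version) >= minimum


def installed_tensorflow_version():
    """Return tensorflow.__version__, or None if TensorFlow cannot be imported."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    return tf.__version__
```

Calling meets_minimum(installed_tensorflow_version()) after a None check tells you whether the installation step is satisfied. The sketch only compares version numbers; it does not verify that a given release is compatible with the code in this repository.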

Demo

Download the trained models:
exp-referit/tfmodel/download_trained_models.sh
Run the language-based segmentation model demo in ./demo/text_objseg_demo.ipynb with Jupyter Notebook (IPython Notebook).

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Download the ReferIt dataset:
exp-referit/referit-dataset/download_referit_dataset.sh
Download the VGG-16 network parameters trained on the 1000 ImageNet classes:
models/convert_caffemodel/params/download_vgg_params.sh
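
Both download scripts are given relative to the repository root, so they will not be found if the commands are run from another directory. The following sketch, with a hypothetical helper name (missing_scripts), checks that the two script paths listed above exist under a given root before you run them. It makes no assumption about what the scripts download or where they store it.

```python
import os

# Script paths exactly as listed in this README, relative to the repository root.
DOWNLOAD_SCRIPTS = [
    "exp-referit/referit-dataset/download_referit_dataset.sh",
    "models/convert_caffemodel/params/download_vgg_params.sh",
]


def missing_scripts(repo_root=".", scripts=DOWNLOAD_SCRIPTS):
    """Return the listed scripts that are not present under repo_root."""
    return [
        script
        for script in scripts
        if not os.path.isfile(os.path.join(repo_root, script))
    ]
```

An empty result from missing_scripts(".") suggests the current directory is the repository root; a non-empty result lists the scripts that could not be found.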

Training

You may need to add the repository root directory to Python's module path:
export PYTHONPATH=.:$PYTHONPATH
Build training batches for bounding boxes:
python exp-referit/build_training_batches_det.py
Build training batches for segmentation:
python exp-referit/build_training_batches_seg.py
Select the GPU you want to use during training (use 0 for <gpu id> if your machine has only one GPU):
export GPU_ID=<gpu id>
Train the language-based bounding box localization model:
python exp-referit/exp_train_referit_det.py $GPU_ID
Train the low resolution language-based segmentation model (initialized from the previous bounding box localization model):
python exp-referit/init_referit_seg_lowres_from_det.py && python exp-referit/exp_train_referit_seg_lowres.py $GPU_ID
Train the high resolution language-based segmentation model (initialized from the previous low resolution segmentation model):
python exp-referit/init_referit_seg_highres_from_lowres.py && python exp-referit/exp_train_referit_seg_highres.py $GPU_ID
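
The training steps above have to run in order, because each stage starts from the output of the previous one. As a rough sketch, the sequence could be driven from a small Python script like the one below. The script paths and the GPU id argument are taken from this README; the helper names (training_commands, repo_env, run_training) are hypothetical, the helper itself assumes Python 3.5+ for subprocess.run, and the python parameter is whichever interpreter has TensorFlow installed.

```python
import os
import subprocess

# (script, takes_gpu_id) pairs in the order given in this README.
TRAINING_STAGES = [
    ("exp-referit/build_training_batches_det.py", False),
    ("exp-referit/build_training_batches_seg.py", False),
    ("exp-referit/exp_train_referit_det.py", True),
    ("exp-referit/init_referit_seg_lowres_from_det.py", False),
    ("exp-referit/exp_train_referit_seg_lowres.py", True),
    ("exp-referit/init_referit_seg_highres_from_lowres.py", False),
    ("exp-referit/exp_train_referit_seg_highres.py", True),
]


def training_commands(gpu_id, python="python"):
    """Build one argv list per stage; only the training scripts get the GPU id."""
    commands = []
    for script, takes_gpu_id in TRAINING_STAGES:
        argv = [python, script]
        if takes_gpu_id:
            argv.append(str(gpu_id))
        commands.append(argv)
    return commands


def repo_env(repo_root=".", base_env=None):
    """Copy the environment and prepend the repository root to PYTHONPATH."""
    env = dict(os.environ if base_env is None else base_env)
    root = os.path.abspath(repo_root)
    existing = env.get("PYTHONPATH")
    env["PYTHONPATH"] = root + os.pathsep + existing if existing else root
    return env


def run_training(gpu_id, repo_root=".", python="python"):
    """Run every stage in order and stop at the first failure, like `&&`."""
    env = repo_env(repo_root)
    for argv in training_commands(gpu_id, python=python):
        # check=True raises CalledProcessError if a stage exits non-zero.
        subprocess.run(argv, cwd=repo_root, env=env, check=True)
```

Calling run_training(0) from the repository root would then correspond to running the commands above with GPU_ID=0. Stopping at the first failing stage is a deliberate choice here, since a later stage has nothing valid to initialize from if an earlier one did not finish.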

Alternatively, you may skip the training procedure and download the trained models directly:
exp-referit/tfmodel/download_trained_models.sh

Evaluation

Select the GPU you want to use during testing (use 0 for <gpu id> if your machine has only one GPU):
export GPU_ID=<gpu id>
You may also need to add the repository root directory to Python's module path:
export PYTHONPATH=.:$PYTHONPATH
Run evaluation for the high resolution language-based segmentation model:
python exp-referit/exp_test_referit_seg.py $GPU_ID
This should reproduce the results in the paper.
You may also evaluate the language-based bounding box localization model:
python exp-referit/exp_test_referit_det.py $GPU_ID
The results can be compared to this paper.
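
If you prefer to launch the evaluation from Python, the two commands above can be assembled as in the sketch below. The script paths and the GPU_ID variable come from this README; the helper names (gpu_id_from_env, eval_command) and the task labels "seg" and "det" are made up for illustration, and falling back to GPU 0 when GPU_ID is unset is an assumption based on the single-GPU advice above.

```python
import os

# Evaluation scripts as listed in this README; the keys are illustrative labels.
EVAL_SCRIPTS = {
    "seg": "exp-referit/exp_test_referit_seg.py",
    "det": "exp-referit/exp_test_referit_det.py",
}


def gpu_id_from_env(environ=None, default="0"):
    """Read GPU_ID from the environment, falling back to the default."""
    environ = os.environ if environ is None else environ
    value = environ.get("GPU_ID", default)
    if not value.isdigit():
        raise ValueError("GPU_ID must be a non-negative integer, got %r" % value)
    return value


def eval_command(task, gpu_id, python="python"):
    """Return the argv list for evaluating the given task on the given GPU."""
    if task not in EVAL_SCRIPTS:
        raise ValueError("unknown task %r, expected one of %s" % (task, sorted(EVAL_SCRIPTS)))
    return [python, EVAL_SCRIPTS[task], str(gpu_id)]
```

The returned list can be passed to subprocess.run(..., check=True) from the repository root, with PYTHONPATH set as described above.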

Owner

Ronghang Hu
