Generic Foreground Segmentation in Images

Last update: Nov 21, 2022

Overview

Pixel Objectness

The following repository contains pretrained model for pixel objectness.

Please visit our project page for the paper and visual results.

If you use this in your research, please cite the following paper:

@article{pixelobjectness,
  Author = {Jain, Suyog and Xiong, Bo and Grauman, Kristen},
  Journal = {arXiv preprint arXiv:1701.05349},
  Title = {Pixel Objectness},
  Year = {2017}
}

These models are freely available for research and academic purposes. However it's patent pending, so please contact us for any commercial use.

Using the pretrained models:

This model is trained using Deeplab-v1 caffe library. Please cite [1] and [2] if you use the code.

Setup: Download and install Deeplab-v1 from here
Refer to demo.py for step-by-step instruction on how to run the code.
Store the images that you want to process in the images folder.
Update the caffe binary path and image extension variable in demo.py
Running demo.py will produce three files 1) image_list.txt : contains list of of input images, 2) output_list.txt: contains names to be used to store the output of pixel objectness 3) test.protoxt: prototxt file required for loading the pretrained model.
Please resize your images so that the maximum side is < 513, otherwise update the crop_size value in test_template.prototxt. Bigger crop sizes require larger gpu memory.

Visualizing the results:

After execution demo.py will store pixel objectness results as matlab files.

Please refer to show_results.m to see how to visualize and extract foreground masks.

Please cite these too if you use the code:

[1] Caffe:

@article{jia2014caffe,
Author = {Jia, Yangqing and Shelhamer, Evan and Donahue, Jeff and Karayev, Sergey and Long, Jonathan and Girshick, Ross and Guadarrama, Sergio and Darrell, Trevor},
Journal = {arXiv preprint arXiv:1408.5093},
Title = {Caffe: Convolutional Architecture for Fast Feature Embedding},
Year = {2014}
}

[2] Deeplab-v1:

@inproceedings{chen14semantic,
title={Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs},
author={Liang-Chieh Chen and George Papandreou and Iasonas Kokkinos and Kevin Murphy and Alan L Yuille},
booktitle={ICLR},
url={http://arxiv.org/abs/1412.7062},
year={2015}
}

Generic Foreground Segmentation in Images

Related tags

Overview

Pixel Objectness

Using the pretrained models:

Visualizing the results:

Please cite these too if you use the code:

Owner

Suyog Jain

Listing arxiv - Personalized list of today's articles from ArXiv

Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition"

Leveraging OpenAI's Codex to solve cornerstone problems in Music

Contains code for Deep Kernelized Dense Geometric Matching

Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.

Tensorflow implementation of Swin Transformer model.

DeLiGAN - This project is an implementation of the Generative Adversarial Network

Implementation of the method proposed in the paper "Neural Descriptor Fields: SE(3)-Equivariant Object Representations for Manipulation"

Faune proche - Retrieval of Faune-France data near a google maps location

LF-YOLO (Lighter and Faster YOLO) is used to detect defect of X-ray weld image.

Differentiable Surface Triangulation

A collection of Google research projects related to Federated Learning and Federated Analytics.

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging

TensorFlow implementation of "Variational Inference with Normalizing Flows"

Official repository for "On Improving Adversarial Transferability of Vision Transformers" (2021)

Original Pytorch Implementation of FLAME: Facial Landmark Heatmap Activated Multimodal Gaze Estimation

PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules

PointCloud Annotation Tools, support to label object bound box, ground, lane and kerb