The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Last update: Dec 04, 2022

Related tags

Overview

Feedback Convolutional Neural Network for Visual Localization and Segmentation

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation. The code is written in PyTorch, very simple to understand.

There is also a Caffe implementation, please check it if you use Caffe and Matlab.

Requirement:

Python 3
Pytorch 0.4.0

How to run:

open the ipython notebooks with jupyter notebook

then open vgg_fr.ipynb or vgg_fsp.ipynb, these are the two main files for demonstrate feedback idea.

How it looks:

If you run vgg_fsp.ipynb without modification of code, you are supposed to see below visualization:

Input image:

Image gradient with respect to the target label:

Image gradient with respect to the target label after 4 iterations of feedback selective pruning (FSP):

Files explanation:

vgg_fr.ipynb: the main file that defines the vgg feedback network with the feedback recovering mechanism and run a feedback visualization on examplar images.
vgg_fsp.ipynb: the main file that defines the vgg feedback network with the feedback selective pruning mechanism and run a feedback visualization on examplar images.
images: storing exmaplar images
imagenet1000_clsid_to_human.txt: storing image net 1000 class names, for visualization and understanding purpose
test/simple_test.ipynb: unit test for a simple feedback network, using a simple fully connected structure
test/vgg_test.ipynb: unit test for the loading of a pretrained vgg network, then check the weights copying from pretrained network to a new defined network interface

Citation

Please consider citing in your publications if it helps your research:

@inproceedings{cao2015look,
  title={Look and think twice: Capturing top-down visual attention with feedback convolutional neural networks},
  author={Cao, Chunshui and Liu, Xianming and Yang, Yi and Yu, Yinan and Wang, Jiang and Wang, Zilei and Huang, Yongzhen and Wang, Liang and Huang, Chang and Xu, Wei and others},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  pages={2956--2964},
  year={2015}
}

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Related tags

Overview

Feedback Convolutional Neural Network for Visual Localization and Segmentation

Requirement:

How to run:

How it looks:

Files explanation:

Citation

Owner

Collections for the lasted paper about multi-view clustering methods (papers, codes)

Code for "Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation" ICCV'21

Pytorch implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Tool cek opsi checkpoint facebook!

This repository contains the DendroMap implementation for scalable and interactive exploration of image datasets in machine learning.

Pose Transformers: Human Motion Prediction with Non-Autoregressive Transformers

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

[NeurIPS-2021] Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data

FS2KToolbox FS2K Dataset Towards the translation between Face

Robot Servers and Server Manager software for robo-gym

Multimodal Co-Attention Transformer (MCAT) for Survival Prediction in Gigapixel Whole Slide Images

CRNN With PyTorch

Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather

Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank

A collection of metrics for evaluating timbre dissimilarity using the TorchMetrics API

U-Net Implementation: Convolutional Networks for Biomedical Image Segmentation" using the Carvana Image Masking Dataset in PyTorch

MaskTrackRCNN for video instance segmentation based on mmdetection

Turi Create simplifies the development of custom machine learning models.

Calling Julia from Python - an experiment on data loading

RCT-ART is an NLP pipeline built with spaCy for converting clinical trial result sentences into tables through jointly extracting intervention, outcome and outcome measure entities and their relations.