Unifying Global-Local Representations in Salient Object Detection with Transformer

Last update: Aug 24, 2022

GLSTR (Global-Local Saliency Transformer)

This is the official implementation of the paper "Unifying Global-Local Representations in Salient Object Detection with Transformer" by Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, and Shengfeng He.

Prerequisites

The whole training process can be run on eight RTX 2080 Ti or four RTX 3090 GPUs.

PyTorch 1.6

Datasets

Training Set

We use the training set of DUTS (DUTS-TR) to train our model.

/path/to/DUTS-TR/
   img/
      img1.jpg
   label/
      label1.png
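
The layout above can be checked before training with a short script. This is a minimal sketch, not part of the released code: the function name `list_training_pairs` is hypothetical, and it assumes that images are `.jpg` files, labels are `.png` files, and both folders sort into the same order so that the i-th image matches the i-th label.

```python
from pathlib import Path


def list_training_pairs(root):
    """Return (image, label) path pairs from a DUTS-TR style folder.

    Assumption: img/ holds .jpg files, label/ holds .png files, and the
    two folders sort into matching order.
    """
    root = Path(root)
    img_dir = root / "img"
    label_dir = root / "label"
    for folder in (img_dir, label_dir):
        if not folder.is_dir():
            raise FileNotFoundError(f"missing folder: {folder}")

    images = sorted(img_dir.glob("*.jpg"))
    labels = sorted(label_dir.glob("*.png"))
    if len(images) != len(labels):
        raise ValueError(
            f"found {len(images)} images but {len(labels)} labels"
        )
    return list(zip(images, labels))
```

Calling `list_training_pairs("/path/to/DUTS-TR/")` returns a list of path pairs, or raises if a folder is missing or the file counts differ.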

Testing Set

We evaluate our model on the testing set of DUTS (DUTS-TE) and on ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD.

Training

Download the transformer backbone pretrained on ImageNet.

# set the paths to the training data and the pretrained backbone in train.sh
bash train.sh

Testing

Download the pretrained model from Baidu Pan (code: uo0a) or Google Drive, and put it in ./ckpt/

python test.py

Evaluation

The precomputed saliency maps (DUTS-TE, ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD) can be found at Baidu Pan (code: uo0a) or Google Drive.

After the paper was submitted, we retrained the model and its performance improved. Feel free to use either the results reported in our paper or the precomputed saliency maps.
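
To compare a saliency map against its ground-truth mask, one commonly reported metric in salient object detection is the mean absolute error (MAE). This README does not state which metrics or evaluation script the authors used, so the following is only a sketch under assumptions: the function name `mean_absolute_error` is hypothetical, and both inputs are assumed to be 8-bit grayscale arrays of the same shape with values in 0 to 255.

```python
import numpy as np


def mean_absolute_error(pred, gt):
    """MAE between a saliency map and a ground-truth mask.

    Assumption: both inputs are 8-bit grayscale arrays (0 to 255) of the
    same shape; they are rescaled to [0, 1] before comparison.
    """
    pred = np.asarray(pred, dtype=np.float64) / 255.0
    gt = np.asarray(gt, dtype=np.float64) / 255.0
    if pred.shape != gt.shape:
        raise ValueError(f"shape mismatch: {pred.shape} vs {gt.shape}")
    # Average the per-pixel absolute difference over the whole map.
    return float(np.abs(pred - gt).mean())
```

A lower value means the map is closer to the mask. Averaging this value over every image in a test set gives one score per dataset.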

Contact

If you have any questions, feel free to email Sucheng Ren :) ([email protected])

Citation

Please cite our paper if you find the code and paper helpful.

@article{ren2021unifying,
  title={Unifying Global-Local Representations in Salient Object Detection with Transformer},
  author={Ren, Sucheng and Wen, Qiang and Zhao, Nanxuan and Han, Guoqiang and He, Shengfeng},
  journal={arXiv preprint arXiv:2108.02759},
  year={2021}
}
