This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 2019.

Last update: Aug 19, 2022

Overview

Code-and-Dataset-for-CapSal

This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 2019. Paper link

Our code is built on the Mask RCNN implementation in TensorFlow and Keras. First install maskrcnn by following the instruction or INSTALL.md.

COCO-CapSal Dataset

The COCO-CapSal dataset provides the saliency ground truth and the image captions for each image. It contains 5265 training images and 1459 validation images. The annotations can be downloaded from BaiduYun or GoogleDrive. The 'capsal' folder contains the images, ground-truth maps, and captions (JSON file) for both the training and validation sets.
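After downloading, the caption file can be inspected without assuming anything about its schema. The following is a minimal sketch only: the path `capsal/captions_train.json` is a hypothetical file name, and `summarize_json` is a helper introduced here for illustration, not part of this repository.

```python
import json
from pathlib import Path


def summarize_json(path):
    """Return (top-level type name, number of entries) for a JSON file."""
    with Path(path).open("r", encoding="utf-8") as f:
        data = json.load(f)
    # Only lists and dicts have a meaningful entry count; scalars report 1.
    size = len(data) if isinstance(data, (list, dict)) else 1
    return type(data).__name__, size


if __name__ == "__main__":
    # Hypothetical path: replace it with the actual caption file in 'capsal'.
    caption_file = Path("capsal") / "captions_train.json"
    if caption_file.exists():
        kind, size = summarize_json(caption_file)
        print(f"{caption_file}: top-level {kind} with {size} entries")
    else:
        print(f"{caption_file} not found; adjust the path to your download")
```

Comparing the reported entry count with the image counts stated above is a quick way to spot an incomplete download, provided the file stores one entry per image (an assumption that should be checked against the actual file).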

Evaluation

To test the CapSal model, first download the trained model from BaiduYun or Google and put it under ./model. Then run test_capsal.py to obtain the saliency maps for the different datasets. Our saliency maps are also available at Google or BaiduYun.
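This README does not say how the resulting saliency maps are scored. One metric commonly reported in salient object detection is the mean absolute error (MAE) between a predicted map and its ground truth. The sketch below is an assumption about how that could be computed, not the authors' evaluation code. `mean_absolute_error` is a name introduced here, and the maps are assumed to be plain nested lists with values already normalized to [0, 1].

```python
def mean_absolute_error(pred, gt):
    """Mean absolute error between two equally sized 2D maps in [0, 1]."""
    same_rows = len(pred) == len(gt)
    if not same_rows or any(len(p) != len(g) for p, g in zip(pred, gt)):
        raise ValueError("prediction and ground truth must have the same shape")
    count = sum(len(row) for row in pred)
    if count == 0:
        raise ValueError("maps must contain at least one pixel")
    # Sum the per-pixel absolute differences, then average over all pixels.
    total = sum(
        abs(p - g)
        for pred_row, gt_row in zip(pred, gt)
        for p, g in zip(pred_row, gt_row)
    )
    return total / count


if __name__ == "__main__":
    # Toy 2x2 example (made-up values, not real model output).
    prediction = [[0.0, 1.0], [0.5, 0.5]]
    ground_truth = [[0.0, 1.0], [1.0, 0.0]]
    print(mean_absolute_error(prediction, ground_truth))  # 0.25
```

Lower values mean the predicted map is closer to the ground truth. Real saliency maps stored as 8-bit images would first need to be scaled from [0, 255] to [0, 1].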

Train

To train the model, run train.py.

Citation

    @InProceedings{Zhang_2019_CVPR,
            author = {Zhang, Lu and Zhang, Jianming and Lin, Zhe and Lu, Huchuan and He, You},
            title = {CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection},
            booktitle = {CVPR},
            year = {2019}}

Owner

lu zhang
