Visualizing Yolov5's layers using GradCam

Last update: Jan 01, 2023

Overview

YOLO-V5 GRADCAM

I constantly desired to know to which part of an object the object-detection models pay more attention. So I searched for it, but I didn't find any for Yolov5. Here is my implementation of Grad-cam for YOLO-v5. To load the model I used the yolov5's main codes, and for computing GradCam I used the codes from the gradcam_plus_plus-pytorch repository. Please follow my GitHub account and star ⭐ the project if this functionality benefits your research or projects.

Installation

pip install -r requirements.txt

Infer

python main.py --model-path yolov5s.pt --img-path images/cat-dog.jpg --output-dir outputs

NOTE: If you don't have any weights and just want to test, don't change the model-path argument. The yolov5s model will be automatically downloaded thanks to the download function from yolov5.

NOTE: For more input arguments, check out the main.py or run the following command:

python main.py -h

Examples

Note

I checked the code, but I couldn't find an explanation for why the truck's heatmap does not show anything. Please inform me or create a pull request if you find the reason.

TO Do

Add GradCam++
Add ScoreCam
Add the functionality to the deep_utils library

References

Citation

Please cite yolov5-gradcam if it helps your research. You can use the following BibTeX entry:

@misc{deep_utils,
	title = {yolov5-gradcam},
	author = {Mohammadi Kazaj, Pooya},
	howpublished = {\url{github.com/pooya-mohammadi/yolov5-gradcam}},
	year = {2021}
}

Visualizing Yolov5's layers using GradCam

Related tags

Overview

YOLO-V5 GRADCAM

Installation

Infer

Examples

Note

TO Do

References

Citation

Owner

Pooya Mohammadi Kazaj

Implements Stacked-RNN in numpy and torch with manual forward and backward functions

Moiré Attack (MA): A New Potential Risk of Screen Photos [NeurIPS 2021]

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators

A Python implementation of the Locality Preserving Matching (LPM) method for pruning outliers in image matching.

Data and Code for paper Outlining and Filling: Hierarchical Query Graph Generation for Answering Complex Questions over Knowledge Graph is available for research purposes.

Losslandscapetaxonomy - Taxonomizing local versus global structure in neural network loss landscapes

RAMA: Rapid algorithm for multicut problem

Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion (CVPR 2021)

A lightweight python AUTOmatic-arRAY library.

On Evaluation Metrics for Graph Generative Models

A CROSS-MODAL FUSION NETWORK BASED ON SELF-ATTENTION AND RESIDUAL STRUCTURE FOR MULTIMODAL EMOTION RECOGNITION

An implementation for `Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction`

Trainable Bilateral Filter Layer (PyTorch)

Editing a Conditional Radiance Field

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Code for STFT Transformer used in BirdCLEF 2021 competition.

This project is for a Twitter bot that monitors a bird feeder in my backyard. Any detected birds are identified and posted to Twitter.

Metadata-Extractor - Metadata Extractor Script can be used to read in exif metadata

Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"