Visualizing Yolov5's layers using GradCam

Last update: Jan 01, 2023

Overview

YOLO-V5 GRADCAM

I have long wanted to know which parts of an object an object-detection model pays the most attention to. I searched for an existing tool but did not find one for YOLOv5, so here is my implementation of Grad-CAM for YOLOv5. The model is loaded with code from the main yolov5 repository, and Grad-CAM is computed with code from the gradcam_plus_plus-pytorch repository. Please follow my GitHub account and star ⭐ the project if this functionality benefits your research or projects.
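For readers new to Grad-CAM, the arithmetic behind a heatmap can be sketched in a few lines. This is a minimal, dependency-free illustration of the general technique, not the code used in this repository: the function name `grad_cam`, the nested-list inputs, and the divide-by-maximum scaling are all assumptions made for the example. In practice the activations and gradients come from a forward and backward pass through a chosen layer of the model.

```python
def grad_cam(activations, gradients):
    """Compute a Grad-CAM map from one layer's activations and gradients.

    Both arguments are nested lists shaped [channels][height][width];
    `gradients` holds d(score)/d(activation) for the chosen detection score.
    (Hypothetical helper for illustration, not part of this repository.)
    """
    height, width = len(activations[0]), len(activations[0][0])

    # Channel weights: average each channel's gradients over H x W.
    weights = [
        sum(sum(row) for row in grad) / (height * width)
        for grad in gradients
    ]

    # Weighted sum of the activation maps, followed by ReLU.
    cam = [
        [
            max(0.0, sum(w * act[y][x] for w, act in zip(weights, activations)))
            for x in range(width)
        ]
        for y in range(height)
    ]

    # Scale to [0, 1] for display; an all-zero map is returned unchanged.
    peak = max(max(row) for row in cam)
    if peak > 0:
        cam = [[value / peak for value in row] for row in cam]
    return cam


# Toy example: 2 channels with 2x2 feature maps.
acts = [[[1.0, 2.0], [3.0, 4.0]], [[4.0, 3.0], [2.0, 1.0]]]
grads = [[[1.0, 1.0], [1.0, 1.0]], [[-1.0, -1.0], [-1.0, -1.0]]]
heatmap = grad_cam(acts, grads)
```

The resulting map is then usually resized to the input image and overlaid as a color heatmap.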

Installation

pip install -r requirements.txt

Infer

python main.py --model-path yolov5s.pt --img-path images/cat-dog.jpg --output-dir outputs

NOTE: If you don't have any weights and just want to test, leave the model-path argument at its default. The yolov5s model will be downloaded automatically by yolov5's download function.

NOTE: For more input arguments, see main.py or run the following command:

python main.py -h

Examples

Note

I checked the code but could not find out why the truck's heatmap shows nothing. If you find the reason, please let me know or create a pull request.
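One thing worth checking when a heatmap comes out blank (a general property of Grad-CAM, not a confirmed diagnosis for the truck example) is whether the weighted sum of activations is non-positive everywhere before the ReLU step, since ReLU then zeroes the entire map. The helper below is a hypothetical debugging sketch. It assumes the pre-ReLU map is available as a nested list of floats, which this repository may or may not expose.

```python
def summarize_pre_relu(cam_before_relu):
    """Report basic statistics of a Grad-CAM map before ReLU is applied.

    `cam_before_relu` is a nested list shaped [height][width].
    (Hypothetical helper for illustration, not part of this repository.)
    """
    values = [value for row in cam_before_relu for value in row]
    positives = sum(1 for value in values if value > 0)
    return {
        "min": min(values),
        "max": max(values),
        # True means ReLU would turn the whole map into zeros.
        "all_non_positive": max(values) <= 0,
        "positive_fraction": positives / len(values),
    }


# Example: every entry is negative, so the displayed heatmap would be empty.
report = summarize_pre_relu([[-0.5, -1.0], [-0.1, -2.0]])
```

If `all_non_positive` is true, the blank heatmap follows from the gradients for that detection rather than from the drawing code.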

To Do

Add GradCam++
Add ScoreCam
Add the functionality to the deep_utils library

References

Citation

Please cite yolov5-gradcam if it helps your research. You can use the following BibTeX entry:

@misc{deep_utils,
	title = {yolov5-gradcam},
	author = {Mohammadi Kazaj, Pooya},
	howpublished = {\url{github.com/pooya-mohammadi/yolov5-gradcam}},
	year = {2021}
}

Owner

Pooya Mohammadi Kazaj
