EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

Last update: Jan 07, 2023

Overview

EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

This repo contains the official Pytorch implementaion code and configuration files of EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network. created by Hu Zhang.

Installation

Requirements

Python 3.6+
PyTorch 1.0+

Our environments

OS: Ubuntu 18.04
CUDA: 10.0
Toolkit: PyTorch 1.0
GPU: Titan RTX

Data preparation

Download and extract ImageNet train and val images from http://image-net.org/. The directory structure is the standard layout for the torchvision datasets.ImageFolder, and the training and validation data is expected to be in the train/ folder and val folder respectively:

/path/to/imagenet/
  train/
    class1/
      img1.jpeg
    class2/
      img2.jpeg
  val/
    class1/
      img3.jpeg
    class/2
      img4.jpeg

Usage

First, clone the repository locally:

git clone https://github.com/murufeng/EPSANet.git
cd EPSANet

Create a conda virtual environment and activate it:

conda create -n epsanet python=3.6 
conda activate epsanet

Install CUDA==10.0 with cudnn7 following the official installation instructions
Install PyTorch==1.0.1 and torchvision==0.2.0 with CUDA==10.0:

conda install -c pytorch pytorch torchvision

Training

To train models on ImageNet with 8 gpus run:

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python main.py -a epsanet50 --data /path/to/imagenet

Model Zoo

Models are trained with 8 GPUs on both ImageNet and MS-COCO 2017 dataset.

Image Classification on ImageNet

Model	Params(M)	FLOPs(G)	Top-1 (%)	Top-5 (%)
EPSANet-50(Small)	22.56	3.62	77.49	93.54
EPSANet-50(Large)	27.90	4.72	78.64	94.18
EPSANet-101(Small)	38.90	6.82	78.43	94.11
EPSANet-101(Large)	49.59	8.97	79.38	94.58

Object Detection on MS-COCO 2017

Faster R-CNN

model	Style	Lr schd	Params(M)	FLOPs(G)	box AP	AP_50	AP_75
EPSANet-50(small)	pytorch	1x	38.56	197.07	39.2	60.3	42.3
EPSANet-50(large)	pytorch	1x	43.85	219.64	40.9	62.1	44.6

Mask R-CNN

model	Style	Lr schd	Params(M)	FLOPs(G)	box AP	AP_50	AP_75
EPSANet-50(small)	pytorch	1x	41.20	248.53	40.0	60.9	43.3
EPSANet-50(large)	pytorch	1x	46.50	271.10	41.4	62.3	45.3

RetinaNet

model	Style	Lr schd	Params(M)	FLOPs(G)	box AP	AP_50	AP_75
EPSANet-50(small)	pytorch	1x	34.78	229.32	38.2	58.1	40.6
EPSANet-50(large)	pytorch	1x	40.07	251.89	39.6	59.4	42.3

Instance segmentation with Mask R-CNN on MS-COCO 2017

model	Params(M)	FLOPs(G)	AP	AP_50	AP_75
EPSANet-50(small)	41.20	248.53	35.9	57.7	38.1
EPSANet-50(Large)	46.50	271.10	37.1	59.0	39.5

Citing EPSANet

You can cite the paper as:

@article{hu2021epsanet,
  title={EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network},
  author={Hu Zhang and Keke Zu and Jian Lu and Yuru Zou and Deyu Meng},
  journal={arXiv preprint arXiv:2105.14447},
  year={2021}
}

EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

Related tags

Overview

EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

Installation

Requirements

Our environments

Data preparation

Usage

Training

Model Zoo

Image Classification on ImageNet

Object Detection on MS-COCO 2017

Faster R-CNN

Mask R-CNN

RetinaNet

Instance segmentation with Mask R-CNN on MS-COCO 2017

Citing EPSANet

Owner

Hu Zhang

automatic color-grading

PyTorch implementation of the paper: Label Noise Transition Matrix Estimation for Tasks with Lower-Quality Features

ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos

Sequence-tagging using deep learning

[ICLR 2022] Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics

ML-Decoder: Scalable and Versatile Classification Head

a generic C++ library for image analysis

基于Paddle框架的arcface复现

Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation

codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"

An implementation of quantum convolutional neural network with MindQuantum. Huawei, classifying MNIST dataset

PyTorch code for SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised DA

Object recognition using Azure Custom Vision AI and Azure Functions

A package, and script, to perform imaging transcriptomics on a neuroimaging scan.

PyTorch(Geometric) implementation of G^2GNN in "Imbalanced Graph Classification via Graph-of-Graph Neural Networks"

Planner_backend - Academic planner application designed for students and counselors.

This is an official implementation for "PlaneRecNet".

joint detection and semantic segmentation, based on ultralytics/yolov5,

Additional environments compatible with OpenAI gym

Cross-platform CLI tool to generate your Github profile's stats and summary.