CenterNet: Objects as Points, an object detection model implemented in PyTorch

Last update: Dec 29, 2022

Overview

Performance

Training dataset	Weights file	Test dataset	Input image size	mAP 0.5:0.95	mAP 0.5
VOC07+12	centernet_resnet50_voc.pth	VOC-Test07	512x512	-	77.1
COCO-Train2017	centernet_hourglass_coco.pth	COCO-Val2017	512x512	38.4	56.8

Requirements

torch==1.2.0

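Notes

centernet_resnet50_voc.pth in this repository was trained on the VOC dataset.
centernet_hourglass_coco.pth in this repository was trained on the COCO dataset.
Do not use Chinese labels, and make sure no folder name contains spaces!
Before training, you must create a txt file under model_data that lists the classes to detect, and point classes_path in train.py to that file.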

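Downloads

The weights needed for training, centernet_resnet50_voc.pth and centernet_hourglass_coco.pth, can be downloaded from Baidu Netdisk.
Link: https://pan.baidu.com/s/1QBBgRb_TH8kJdSCQGgcXmQ Extraction code: phnc

centernet_resnet50_voc.pth contains the weights for the VOC dataset.
centernet_hourglass_coco.pth contains the weights for the COCO dataset.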

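Prediction steps

a. Using the pretrained weights

Download and unzip the repository, download centernet_resnet50_voc.pth or centernet_hourglass_coco.pth from Baidu Netdisk, and place it in model_data. Then run predict.py and enter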

img/street.jpg

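Use video.py for webcam detection.

b. Using your own trained weights

Train a model by following the training steps.
In centernet.py, edit model_path and classes_path in the section shown below so that they match your trained files: model_path points to a weights file under the logs folder, and classes_path points to the class list that those weights were trained on.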

_defaults = {
    "model_path"        : 'model_data/centernet_resnet50_voc.pth',
    "classes_path"      : 'model_data/voc_classes.txt',
    # "model_path"        : 'model_data/centernet_hourglass_coco.pth',
    # "classes_path"      : 'model_data/coco_classes.txt',
    "backbone"          : "resnet50",
    "image_size"        : [512,512,3],
    "confidence"        : 0.3,
    # Setting nms to True is recommended when backbone is resnet50
    # Setting nms to False is recommended when backbone is hourglass
    # You can also choose based on the detection results
    "nms"               : True,
    "nms_threhold"      : 0.3,
    "cuda"              : True
}
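
As a minimal sketch of how a settings dictionary like this is commonly consumed, the example below copies the defaults onto an instance and lets keyword arguments override them. The Detector class and the logs/my_weights.pth path are hypothetical and only illustrate the pattern; the repository's own class may work differently.

```python
class Detector:
    """Hypothetical holder for detection settings (not the repository's class)."""

    _defaults = {
        "model_path": "model_data/centernet_resnet50_voc.pth",
        "classes_path": "model_data/voc_classes.txt",
        "backbone": "resnet50",
        "confidence": 0.3,
        "nms": True,
    }

    def __init__(self, **kwargs):
        # Start from the defaults, then apply any overrides passed by the caller.
        self.__dict__.update(self._defaults)
        for name, value in kwargs.items():
            if name not in self._defaults:
                raise KeyError(f"unknown setting: {name}")
            setattr(self, name, value)


# Point the detector at self-trained weights and the matching class list.
detector = Detector(
    model_path="logs/my_weights.pth",            # hypothetical file name
    classes_path="model_data/new_classes.txt",
)
print(detector.model_path, detector.classes_path, detector.confidence)
```

Rejecting unknown keys early helps catch misspelled setting names before any weights are loaded.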

Run predict.py and enter

img/street.jpg

Use video.py for webcam detection.

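Training steps

Training uses the VOC format.
Before training, place the annotation files in VOCdevkit/VOC2007/Annotations.
Before training, place the image files in VOCdevkit/VOC2007/JPEGImages.
Before training, run voc2centernet.py to generate the corresponding txt files.
Then run voc_annotation.py in the repository root. Before running it, change classes to your own classes. Do not use Chinese labels, and make sure no folder name contains spaces!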

classes = ["aeroplane", "bicycle", "bird", "boat", "bottle", "bus", "car", "cat", "chair", "cow", "diningtable", "dog", "horse", "motorbike", "person", "pottedplant", "sheep", "sofa", "train", "tvmonitor"]

This generates 2007_train.txt, in which each line gives the path of an image followed by the positions of its ground-truth boxes.
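
The following sketch shows how one such line could be parsed. It assumes a layout of the form "image_path x_min,y_min,x_max,y_max,class_id ..." with fields separated by spaces, which is an assumption about the file rather than something stated above; the function name and the sample line are made up for illustration. A space-separated layout like this is also one reason paths containing spaces cause trouble.

```python
def parse_annotation_line(line):
    """Split one annotation line into an image path and a list of boxes.

    Assumed layout: "path x_min,y_min,x_max,y_max,class_id [more boxes...]".
    """
    parts = line.strip().split()
    image_path = parts[0]
    boxes = []
    for item in parts[1:]:
        x_min, y_min, x_max, y_max, class_id = (int(v) for v in item.split(","))
        boxes.append((x_min, y_min, x_max, y_max, class_id))
    return image_path, boxes


# Made-up sample line with two boxes.
example = "VOCdevkit/VOC2007/JPEGImages/000001.jpg 48,240,195,371,11 8,12,352,498,14"
path, boxes = parse_annotation_line(example)
print(path)
print(boxes)
```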
Before training, you must create a txt file under model_data that lists the classes to detect, and point classes_path in train.py to that file, for example:

classes_path = 'model_data/new_classes.txt'

The contents of model_data/new_classes.txt are:

cat
dog
...
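
A class file in this one-name-per-line form can be read with a few lines of Python. This is a sketch under assumptions: load_class_names is a hypothetical helper, not necessarily the function the repository uses, and the ASCII check simply mirrors the advice above about avoiding Chinese labels.

```python
import os
import tempfile


def load_class_names(classes_path):
    """Read one class name per line, skipping blank lines."""
    with open(classes_path, encoding="utf-8") as f:
        names = [line.strip() for line in f if line.strip()]
    for name in names:
        # Mirror the note about labels: reject non-ASCII class names early.
        if not name.isascii():
            raise ValueError(f"class name should be ASCII: {name!r}")
    return names, len(names)


# Demonstrate with a temporary file instead of touching model_data.
with tempfile.TemporaryDirectory() as tmp_dir:
    classes_path = os.path.join(tmp_dir, "new_classes.txt")
    with open(classes_path, "w", encoding="utf-8") as f:
        f.write("cat\ndog\n")
    class_names, num_classes = load_class_names(classes_path)

print(class_names, num_classes)
```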

Run train.py to start training.

mAP evaluation update

Updated get_gt_txt.py, get_dr_txt.py, and get_map.py.
get_map.py is cloned from https://github.com/Cartucho/mAP
For details of the mAP calculation, see: https://www.bilibili.com/video/BV1zE411u7Vw
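
For orientation, the sketch below shows one common way to turn a per-class precision/recall curve into an AP value, using all-point interpolation (the area under the monotone precision envelope). Whether get_map.py uses exactly this variant is an assumption; the function name and the sample numbers are illustrative only. mAP is then the mean of the per-class AP values.

```python
def average_precision(recall, precision):
    """All-point interpolated AP for one class.

    recall must be non-decreasing; both lists are ordered by descending
    detection confidence.
    """
    # Add sentinel points at recall 0 and recall 1.
    mrec = [0.0] + list(recall) + [1.0]
    mpre = [0.0] + list(precision) + [0.0]

    # Build the precision envelope: each value becomes the maximum
    # precision found at that recall level or any higher one.
    for i in range(len(mpre) - 2, -1, -1):
        mpre[i] = max(mpre[i], mpre[i + 1])

    # Sum the rectangle areas wherever recall increases.
    ap = 0.0
    for i in range(1, len(mrec)):
        if mrec[i] != mrec[i - 1]:
            ap += (mrec[i] - mrec[i - 1]) * mpre[i]
    return ap


# Three detections for a class with two ground-truth boxes:
# true positive, false positive, true positive.
recall = [0.5, 0.5, 1.0]
precision = [1.0, 0.5, 2.0 / 3.0]
ap = average_precision(recall, precision)
print(ap)
```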

Reference

https://github.com/xuannianz/keras-CenterNet
https://github.com/see--/keras-centernet
https://github.com/xingyizhou/CenterNet

Comments

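mAP metric

Hi Bubbliiiing, when I use get_map.py with your default confidence of 0.02, I get an mAP result as expected. But after I lower confidence to 0.001, as I do for other networks, I no longer get an mAP result. Why is that? Also, since confidence should always be set very low when computing mAP on VOC, do 0.02 and 0.001 give similar results? Thanks.

opened by ChristmasLee 2
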
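Training does not normalize but prediction does. Is this a problem?

During training, data is loaded at line 222 of dataloader.py, where no mean/std normalization is applied to the image. During prediction, however, the path predict.py -> centernet.py -> util/util.py -> preprocess_input does apply mean/std normalization. Isn't this a problem?

opened by seven-linglx 2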
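
To make the concern in this question concrete, here is a small sketch of what mean/std normalization does to a pixel compared with plain scaling to [0, 1]. The statistics are the widely used ImageNet values and are only an illustrative assumption; the numbers in the repository's preprocess_input may differ, and the function names here are hypothetical.

```python
# Illustrative per-channel statistics (common ImageNet values), assumed here
# for demonstration and not taken from the repository.
MEAN = (0.485, 0.456, 0.406)
STD = (0.229, 0.224, 0.225)


def scale_only(rgb):
    """Map 8-bit channel values to the range [0, 1]."""
    return tuple(c / 255.0 for c in rgb)


def scale_and_normalize(rgb):
    """Map to [0, 1], then subtract the mean and divide by the std per channel."""
    return tuple((c / 255.0 - m) / s for c, m, s in zip(rgb, MEAN, STD))


pixel = (128, 64, 255)
print(scale_only(pixel))
print(scale_and_normalize(pixel))
```

The two functions produce values in different ranges for the same pixel, so a model generally expects the same preprocessing at training and prediction time.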

Error: no module named 'past'

Traceback (most recent call last):
  File "train.py", line 15, in <module>
    from utils.callbacks import LossHistory
  File "/root/centernet/centernet-pytorch-main/utils/callbacks.py", line 9, in <module>
    from torch.utils.tensorboard import SummaryWriter
  File "/root/.local/lib/python3.7/site-packages/torch/utils/tensorboard/__init__.py", line 6, in <module>
    from .writer import FileWriter, SummaryWriter  # noqa F401
  File "/root/.local/lib/python3.7/site-packages/torch/utils/tensorboard/writer.py", line 18, in <module>
    from ._convert_np import make_np
  File "/root/.local/lib/python3.7/site-packages/torch/utils/tensorboard/_convert_np.py", line 12, in <module>
    from caffe2.python import workspace
  File "/root/.local/lib/python3.7/site-packages/caffe2/python/workspace.py", line 15, in <module>
    from past.builtins import basestring

opened by buloseshi 1

After switching to mobilenetv3, training stops automatically at the 7th batch. What is going on?

Finish Validation
0%| | 0/119 [00:00<?, ?it/s]Get map.
0%| | 0/119 [00:00<?, ?it/s]
Traceback (most recent call last):
  File "/home/linux/data2/sun/centernet-pytorch-main/train.py", line 491, in <module>
    epoch_step, epoch_step_val, gen, gen_val, UnFreeze_Epoch, Cuda, fp16, scaler, backbone, save_period, save_dir, local_rank)
  File "/home/linux/data2/sun/centernet-pytorch-main/utils/utils_fit.py", line 161, in fit_one_epoch
    eval_callback.on_epoch_end(epoch + 1, model_train)
  File "/home/linux/data2/sun/centernet-pytorch-main/utils/callbacks.py", line 211, in on_epoch_end
    self.get_map_txt(image_id, image, self.class_names, self.map_out_path)
  File "/home/linux/data2/sun/centernet-pytorch-main/utils/callbacks.py", line 145, in get_map_txt
    outputs = decode_bbox(outputs[0], outputs[1], outputs[2], self.confidence, self.cuda)
IndexError: list index out of range

opened by sunsn1997 2

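First-time beginner question

Following the steps in the README:
1. Extracted the VOC dataset into the project root and placed the pth file in the model_data directory.
2. Set annotation_mode to 2 in voc_annotation.py.
3. Ran train.py.

Environment: pytorch 1.2 + cuda 10.0 + python 3.6 on Ubuntu. I started with newer versions of torch and python, then also tried python 3.6 + torch 1.2, and the same problem occurred.

opened by Xie-Muxi 1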

Releases(v3.0)

v3.0(Apr 22, 2022)
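Major updates

Support for step and cos learning rate decay schedules.

Support for choosing between the adam and sgd optimizers.

Support for automatically adjusting the learning rate according to batch_size.

Support for multiple prediction modes: single-image prediction, folder prediction, video prediction, image cropping, heatmap, and per-class object counting.

Updated summary.py, which is used to inspect the network structure.

Added multi-GPU training.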
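
The learning-rate features in this list can be sketched as follows. This is an illustration under assumptions, not the repository's code: the function names, the nominal batch size of 64, and the step parameters are all hypothetical choices made for the example.

```python
import math


def scale_lr(base_lr, batch_size, nominal_batch_size=64):
    """Scale the learning rate linearly with batch size (assumed nominal size)."""
    return base_lr * batch_size / nominal_batch_size


def cosine_lr(epoch, total_epochs, max_lr, min_lr):
    """Cosine decay from max_lr at epoch 0 to min_lr at the last epoch."""
    progress = epoch / max(total_epochs - 1, 1)
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


def step_lr(epoch, max_lr, decay_rate=0.5, step_size=10):
    """Multiply the learning rate by decay_rate every step_size epochs."""
    return max_lr * decay_rate ** (epoch // step_size)


max_lr = scale_lr(1e-3, batch_size=16)
for epoch in (0, 50, 99):
    print(epoch, cosine_lr(epoch, 100, max_lr, max_lr * 0.01), step_lr(epoch, max_lr))
```

Cosine decay lowers the rate smoothly, while step decay keeps it constant between drops; which one works better depends on the dataset and training length.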

Source code(tar.gz)
Source code(zip)
v2.0(Mar 4, 2022)
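Major updates

Updated train.py: added extensive comments and several adjustable parameters.

Updated predict.py: added extensive comments and features such as fps measurement, video prediction, and batch prediction.

Updated centernet.py: added extensive comments and parameters such as prior box selection, confidence, and non-maximum suppression.

Merged get_dr_txt.py, get_gt_txt.py, and get_map.py so that dataset evaluation is done with a single file.

Updated voc_annotation.py: added several adjustable parameters.

Updated summary.py, which is used to inspect the network structure.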

Source code(tar.gz)
Source code(zip)
v1.0(Dec 17, 2020)

Source code(tar.gz)
Source code(zip)
centernet_hourglass_coco.pth(730.32 MB)
centernet_resnet50_voc.pth(124.87 MB)

Owner

Bubbliiiing
