PyTorch version repo for CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes

Last update: Mar 01, 2022

Study-CSRNet-pytorch

This repository is a PyTorch implementation of CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes (CVPR 2018), which delivered a state-of-the-art, straightforward, end-to-end architecture for crowd counting.
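As background on the dilated convolutions named in the title, here is a minimal sketch of how dilation enlarges a kernel's receptive field without adding weights, and how much padding keeps the feature map size unchanged. The helper names are hypothetical and not taken from this repository, and the dilation value 2 is only an example.

```python
def effective_kernel_size(kernel_size, dilation):
    """Span covered by a dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def conv_output_size(input_size, kernel_size, dilation=1, padding=0, stride=1):
    """Output length of a convolution along one spatial dimension."""
    k_eff = effective_kernel_size(kernel_size, dilation)
    return (input_size + 2 * padding - k_eff) // stride + 1


if __name__ == "__main__":
    # A 3x3 kernel with dilation 2 covers a 5x5 window.
    print(effective_kernel_size(3, 2))
    # With padding equal to the dilation, a 3x3 stride-1 layer keeps the size.
    print(conv_output_size(64, 3, dilation=2, padding=2))
```

For a 3x3 kernel with stride 1, setting the padding equal to the dilation rate preserves the spatial size, which is why dilated layers can widen context without further downsampling.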

Dataset download

ShanghaiTech Dataset: Google Drive

Requirements

Python: 3.7.1

PyTorch: 1.9.0

CUDA: 10.2

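Generating ground truth

First run make_dataset.py to generate the .h5 ground-truth files for parts A and B. They are written into the dataset's ground_truth folder, so download the dataset first.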
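The README does not show what make_dataset.py does internally. As a rough illustration of the general technique behind density-map ground truth, the sketch below places a normalized Gaussian at each annotated head position so that the map sums to the head count. It is an assumption-laden, pure-Python sketch: the function name, the fixed sigma, and the list-of-rows output are hypothetical, and the actual script may use a different kernel and writes HDF5 files instead.

```python
import math


def gaussian_density_map(points, height, width, sigma=4.0):
    """Build a height x width density map (list of rows).

    Each in-bounds (x, y) point contributes a Gaussian that is normalized
    over the pixels it covers, so the map sums to the number of in-bounds points.
    """
    density = [[0.0] * width for _ in range(height)]
    radius = int(3 * sigma)  # truncate the kernel at three standard deviations
    for x, y in points:
        cx, cy = int(round(x)), int(round(y))
        if not (0 <= cx < width and 0 <= cy < height):
            continue  # skip annotations that fall outside the image
        weights = []
        for row in range(max(0, cy - radius), min(height, cy + radius + 1)):
            for col in range(max(0, cx - radius), min(width, cx + radius + 1)):
                d2 = (col - cx) ** 2 + (row - cy) ** 2
                weights.append((row, col, math.exp(-d2 / (2 * sigma ** 2))))
        total = sum(w for _, _, w in weights)
        for row, col, w in weights:
            density[row][col] += w / total  # each head adds exactly 1 in total
    return density


if __name__ == "__main__":
    heads = [(10, 10), (0, 0), (30.4, 20.6)]  # hypothetical annotations
    dm = gaussian_density_map(heads, height=40, width=50)
    print(sum(sum(row) for row in dm))
```

Because every kernel is normalized after clipping at the image border, summing the map recovers the count, which is the quantity a crowd-counting network is trained to reproduce.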

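Training

In the console, run python train.py part_A_train.json 0 0 to train on part A.

In the console, run python train.py part_B_train.json 0 0 to train on part B.

Before training, remember to update the paths inside the .json files (I changed the dataset path with a replace-all in Notepad).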
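The path replacement can also be scripted instead of done by hand. This is a minimal sketch, assuming each .json file is a flat JSON list of image path strings (the README does not confirm the file layout); the function names and the example prefixes are hypothetical.

```python
import json


def rewrite_prefix(paths, old_prefix, new_prefix):
    """Swap old_prefix for new_prefix on every path that starts with it."""
    return [
        new_prefix + p[len(old_prefix):] if p.startswith(old_prefix) else p
        for p in paths
    ]


def rewrite_json_file(json_path, old_prefix, new_prefix):
    """Rewrite a JSON list of paths in place and return the updated list."""
    with open(json_path, "r", encoding="utf-8") as f:
        paths = json.load(f)
    updated = rewrite_prefix(paths, old_prefix, new_prefix)
    with open(json_path, "w", encoding="utf-8") as f:
        json.dump(updated, f, indent=2)
    return updated


if __name__ == "__main__":
    # Hypothetical prefixes; substitute the old and new dataset roots.
    example = ["/old/root/part_A/images/IMG_1.jpg"]
    print(rewrite_prefix(example, "/old/root", "/new/data"))
```

Paths that do not start with the old prefix are left untouched, so running the script twice is harmless.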

Testing the model

Run val.py to see the accuracy of the trained model. You need to set the image path and the path to the trained model first.

Testing a single image or checking model accuracy

Run test_single-image.py. After updating the paths, you can test any image you like.

Converting the .pth model to ONNX

Run pth转onnx.py to convert the model, which makes it easier to port to other deep learning frameworks.
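The contents of the conversion script are not shown here. As a hedged sketch of what such a conversion typically looks like with torch.onnx.export, the helper below exports an already-loaded model. The function names, tensor names, input size, and opset version are all assumptions for illustration, not the repository's actual settings.

```python
def default_export_options():
    """Keyword arguments passed to torch.onnx.export in this sketch."""
    return {
        "export_params": True,  # store the trained weights in the ONNX file
        "opset_version": 11,    # assumed opset; adjust for the target runtime
        "input_names": ["image"],
        "output_names": ["density_map"],
        # Let batch size and image size vary at inference time.
        "dynamic_axes": {
            "image": {0: "batch", 2: "height", 3: "width"},
            "density_map": {0: "batch", 2: "height", 3: "width"},
        },
    }


def export_to_onnx(model, onnx_path, height=768, width=1024):
    """Export a model whose weights are already loaded to an ONNX file."""
    import torch  # imported lazily so the options helper works without PyTorch

    model = model.cpu().eval()  # trace on CPU in inference mode
    dummy_input = torch.randn(1, 3, height, width)  # example RGB input
    torch.onnx.export(model, dummy_input, onnx_path, **default_export_options())
```

The caller is expected to build the network and load the .pth weights before calling export_to_onnx; how the checkpoint is structured depends on how train.py saved it.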

Results

ShanghaiA MAE: 66.4 (Google Drive)

ShanghaiB MAE: 10.6 (Google Drive)
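MAE here is the mean absolute error between predicted and annotated per-image head counts. A minimal sketch of the metric follows; the function name and the sample counts are hypothetical, and in practice each predicted count would be the sum of the model's output density map.

```python
def mean_absolute_error(predicted_counts, true_counts):
    """Average of |predicted - true| over all test images."""
    if len(predicted_counts) != len(true_counts):
        raise ValueError("both sequences must have the same length")
    if not predicted_counts:
        raise ValueError("at least one image is required")
    total = sum(abs(p - t) for p, t in zip(predicted_counts, true_counts))
    return total / len(predicted_counts)


if __name__ == "__main__":
    # Hypothetical counts for two images.
    print(mean_absolute_error([100.0, 52.5], [90.0, 60.0]))
```

Lower is better: an MAE of 10.6 means the predicted count is off by about 10.6 people per image on average.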

Citation

@inproceedings{li2018csrnet, title={CSRNet: Dilated convolutional neural networks for understanding the highly congested scenes}, author={Li, Yuhong and Zhang, Xiaofan and Chen, Deming}, booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition}, pages={1091--1100}, year={2018} }

@inproceedings{zhang2016single, title={Single-image crowd counting via multi-column convolutional neural network}, author={Zhang, Yingying and Zhou, Desen and Chen, Siqin and Gao, Shenghua and Ma, Yi}, booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition}, pages={589--597}, year={2016} }
