A PyTorch Image-Classification With AlexNet And ResNet50.

Last update: Feb 22, 2022

Overview

PyTorch 图像分类

依赖库的下载与安装

在终端中执行 pip install -r -requirements.txt 完成项目依赖库的安装

使用方式

数据集的准备

STL10 数据集
- 下载：STL-10 Dataset
- 存储位置：将下载后的数据集中 train_X.bin,train_y.bin,test_X.bin,test_y.bin 四个文件存入项目根目录下的 dataset\STL10 子目录内
自制数据集
- 重新设置 config.py 中训练集与测试集图像与标签的读取路径与标签类别的列表
- 重新设置 data_load.py 中的 Dataset 类中的数据读取方式

训练模型

训练模型或进行模型预测时，设置 config.py 中的变量 CONTINUE_TRAIN 为 False ，若需要进行断点续训，设置该变量为 True

模型可以选择使用 ResNet50 与 AlexNet 两种网络之一进行训练，在 train.py 中设置训练模型的参数变量 model 来选择想要训练的模型

模型的训练重要超参数存储在 config.py 中，可根据实际需要进行修改

模型训练完成后参数的读取

模型训练完毕后，在项目文件根目录的 model_data 子目录下会生成两个文件，其中 last_model_state_dict.pth 存储了最后一次模型训练的学习率与模型参数信息，用于断点续训；另一个文件为 best_model_state_dict.pth 存储了模型训练过程中验证集的最高准确率所对应的模型参数信息，可以用来预测

测试模型

运行 test.py ,得到测试集预测准确率与混淆矩阵可视化图像

图片预测

将要预测的图片存储在项目根目录 imgs 文件夹下，运行 predict.py 中的 image_classification 函数，将图像名作为参数传递，即可得到预测结果

A PyTorch Image-Classification With AlexNet And ResNet50.

Related tags

Overview

PyTorch 图像分类

依赖库的下载与安装

使用方式

数据集的准备

训练模型

模型训练完成后参数的读取

测试模型

图片预测

相关链接

Owner

FYH

Implementation supporting the ICCV 2017 paper "GANs for Biological Image Synthesis"

Neural Style and MSG-Net

Source Code For Template-Based Named Entity Recognition Using BART

PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing

Website for D2C paper

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Implementation based on Paper - Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling

A TikTok-like recommender system for GitHub repositories based on Gorse

Source codes of CenterTrack++ in 2021 ICME Workshop on Big Surveillance Data Processing and Analysis

Hashformers is a framework for hashtag segmentation with transformers.

Python Interview Questions

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning"

Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"

FCN (Fully Convolutional Network) is deep fully convolutional neural network architecture for semantic pixel-wise segmentation

PyTorch Live is an easy to use library of tools for creating on-device ML demos on Android and iOS.

pytorch implementation of GPV-Pose

A PyTorch implementation for PyramidNets (Deep Pyramidal Residual Networks)

Tf alloc - Simplication of GPU allocation for Tensorflow2

A general, feasible, and extensible framework for classification tasks.