Source code of article "Towards Toxic and Narcotic Medication Detection with Rotated Object Detector"

Last update: Oct 29, 2022

Towards Toxic and Narcotic Medication Detection with Rotated Object Detector

Introduction

This repository contains the source code for the article "Towards Toxic and Narcotic Medication Detection with Rotated Object Detector".
The repository is organized as follows:

.
├── configs 
│   ├── cfg_ro.yml # main config file for rotated yolo-v5
│   ├── cfg.yml    # main config file for yolo-v5
│   ├── model_pt   # model config files
│   │   ├── yolov5s_ro.yml
│   │   └── yolov5s.yml
│   ├── nms        # config file for nms
│   │   └── extra_filter.json
│   └── pipeline   # config file for data augmentation
│       └── aug_cfg.yml
├── pipeline       # Analogous to Dataset in PyTorch
│   ├── augment.py 
│   └── dataset.py
├── pt             # PyTorch-specific implementation
│   ├── common.py  # Basic deep learning modules
│   ├── loss.py    # Loss functions for yolo-v5
│   ├── loss_ro.py # Loss functions for rotated yolo-v5
│   ├── metric.py  # Evaluation-related code
│   ├── server.py  # Main classes for training, validation, and inference
│   ├── utils.py   # PyTorch-specific utilities
│   ├── yolo.py    # Model classes of yolo-v5
│   ├── yolo_ro.py # Model classes of rotated yolo-v5
│   └── log
│       └── ...    # Where the trained parameters (.pt) are saved
├── tools          # Helper functions
│   ├── colormap.py
│   ├── compress.py
│   ├── const.py
│   ├── plot.py
│   └── utils.py   # Framework-independent utilities
├── plot4latex.ipynb # How the figures in the article were generated
├── train.py       # Entry point for training
└── infer.py       # Sets up an inference HTTP server

How to Get Started

Prerequisite

The class id in each .txt label file has already been converted to the index number used in training and inference. A yolo_label_id2name.json file should be present to store this mapping.
All tunable arguments are listed in configs/cfg_ro.yml for rotated yolo-v5 and configs/cfg.yml for yolo-v5. They are mostly self-explanatory, so feel free to experiment with them.
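
The relationship between the label files and yolo_label_id2name.json can be illustrated with a minimal sketch. It rests on two assumptions that this README does not confirm: that the JSON file is an object mapping class ids to names, and that the class id is the first whitespace-separated token on each label line (the usual YOLO convention). The class names and coordinates below are made-up sample data, and the helper names are hypothetical.

```python
import json


def load_id2name(json_text):
    """Parse an id -> name mapping (assumed layout of yolo_label_id2name.json).

    JSON object keys are always strings, so they are cast to int here.
    """
    return {int(k): v for k, v in json.loads(json_text).items()}


def class_names_in_label(label_text, id2name):
    """Return the class name for every non-empty line of a label file.

    Assumes the first whitespace-separated token on a line is the class id.
    """
    names = []
    for line in label_text.splitlines():
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        names.append(id2name[int(tokens[0])])
    return names


# Hypothetical sample data, not taken from the real dataset.
mapping_json = '{"0": "drug_a", "1": "drug_b"}'
label_txt = "1 0.50 0.40 0.20 0.10\n0 0.25 0.75 0.10 0.30\n"

id2name = load_id2name(mapping_json)
print(class_names_in_label(label_txt, id2name))
```

A check like this can be run over the label files before training to catch class ids that are missing from the mapping, since a missing id raises a KeyError.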

Training

For rotated yolo-v5:
python train.py --cfg=configs/cfg_ro.yml

For yolo-v5:
python train.py --cfg=configs/cfg.yml
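
Both commands above pass the config path through a single --cfg flag. As a rough sketch of how a script could accept such a flag with the standard argparse module (the actual train.py may be implemented differently, and parse_args is a hypothetical helper):

```python
import argparse


def parse_args(argv=None):
    """Parse a --cfg option; illustrative only, not the repo's real parser."""
    parser = argparse.ArgumentParser(description="Hypothetical training entry point")
    parser.add_argument("--cfg", required=True, help="path to the main YAML config file")
    return parser.parse_args(argv)


# argparse accepts both "--cfg=path" and "--cfg path".
args = parse_args(["--cfg=configs/cfg_ro.yml"])
print(args.cfg)
```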

Inference

For rotated yolo-v5:
python infer.py --cfg=configs/cfg_ro.yml

For yolo-v5:
python infer.py --cfg=configs/cfg.yml

This starts an inference HTTP server that uses the best trained parameters.

Development Environment

RTX 3060 (12GB GPU Memory)
CUDA 11.2
Python 3.8
Python packages: requirements.txt

Acknowledgment

This work draws heavily on ultralytics/yolov5 and BossZard/rotation-yolov5. We deeply appreciate their contributions to the community.

Citation

Bibtex

@article{adam,
  title={Towards Toxic and Narcotic Medication Detection with Rotated Object Detector},
  author={Peng, Jiao and Wang, Feifan and Fu, Zhongqiang and Hu, Yiying and Chen, Zichen and Zhou, Xinghan and Wang, Lijun},
  journal={arXiv preprint arXiv:2110.09777},
  year={2021},
  url={https://arxiv.org/abs/2110.09777}
}

Owner

Woody. Wang
