A three-stage detection and recognition pipeline of complex meters in wild

This is the first released system towards detection and recognition of complex meters in wild. The system can be divided into three moduels. Fisrtly, a yolo-based detector is applied to get pure meter region. Secondly, a spatial transformer module is eatablished to rectify the position of meter. Lastly, an end-to-end network is to read meter values, which is implemented by pointer/dail predcition and key number learning.

Visulization results

Left row is the original image, middle row is the process of meter rectification, right row is the result of meter value reading.

ToDo List

Installation

Requirements:

Python3 (Python3.7 is recommended)
PyTorch >= 1.0
torchvision from master
numpy
skimage
OpenCV==3.0.x
CUDA >= 9.0 (10.0 is recommended)

Models

Download Trained model

Please put distro_net.pt into meter_distro/weight.
put textgraph_vgg_450.pth into model/meter_data.

Demo

You can run a demo script for a single image inference by two steps.

python get_meter_area.py. and the detected meter will be stored in scene_image_data/deteced_meter

python predict.py to get distored meter and final result.

This is the first released system towards complex meters` detection and recognition, which is implemented by computer vision techniques.

Related tags

Overview

A three-stage detection and recognition pipeline of complex meters in wild

Visulization results

ToDo List

Installation

Requirements:

Models

Demo

Owner

Yan Shu

Exploring Image Deblurring via Blur Kernel Space (CVPR'21)

Python-experiments - A Repository which contains python scripts to automate things and make your life easier with python

Deep Learning Pipelines for Apache Spark

Certified Patch Robustness via Smoothed Vision Transformers

Self Governing Neural Networks (SGNN): the Projection Layer

TensorFlow for Raspberry Pi

ChebLieNet, a spectral graph neural network turned equivariant by Riemannian geometry on Lie groups.

Predicting Price of house by considering ,house age, Distance from public transport

This is the pytorch code for the paper Curious Representation Learning for Embodied Intelligence.

Neural network chess engine trained on Gary Kasparov's games.

Towards Rolling Shutter Correction and Deblurring in Dynamic Scenes (CVPR2021)

Colab notebook and additional materials for Python-driven analysis of redlining data in Philadelphia

A collection of SOTA Image Classification Models in PyTorch

上海交通大学全自动抢课脚本，支持准点开抢与抢课后持续捡漏两种模式。2021/06/08更新。

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Learning nonlinear operators via DeepONet

This repository introduces a short project about Transfer Learning for Classification of MRI Images.

SiamMOT is a region-based Siamese Multi-Object Tracking network that detects and associates object instances simultaneously.

PyTorch code for our ECCV 2020 paper "Single Image Super-Resolution via a Holistic Attention Network"