The 袋鼯麻麻 smart shopping platform accurately locates and identifies every product

Last update: Jan 05, 2023

Overview

袋鼯麻麻: Smart Shopping Platform

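Project background

In day-to-day retail operations, labor is a major cost: shopping guides, cleaning staff, cashiers, and so on. A particularly large share of that labor and time goes into identifying products and ringing up their prices, and customers have to queue while this happens. The result is high labor costs and low efficiency for the retailer, and a worse shopping experience for the customer.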

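With advances in computer vision and the rise of unmanned, automated supermarket concepts, there is growing demand for using image recognition and object detection to identify products and settle the bill automatically, that is, an automatic checkout (ACO) system. A computer-vision-based ACO system can lower a retailer's operating costs and speed up checkout, which in turn improves the customer's shopping experience.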

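Features

This project implements automatic checkout of the products a customer buys. It uses image recognition and object detection from computer vision to locate the customer's products and then prices them automatically. When the customer places the selected products in the designated area, the 袋鼯麻麻 smart shopping platform locates and identifies each product and returns a complete shopping list together with the total amount due. This reduces the labor cost of retail operations and moves retail toward unmanned, automated, intelligent operation.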
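The last step described above, turning recognized products into a shopping list and a total, can be sketched as follows. This is a minimal illustration, not the project's actual billing code: the `PRICES` table, the product labels, and the `build_receipt` function are all hypothetical names introduced here, and a real deployment would load prices from its own database.

```python
from collections import Counter
from decimal import Decimal

# Hypothetical price table (assumption for illustration only).
PRICES = {
    "cola_330ml": Decimal("3.50"),
    "potato_chips": Decimal("6.00"),
}


def build_receipt(recognized_labels, prices=PRICES):
    """Group recognized product labels into line items and sum the total."""
    counts = Counter(recognized_labels)
    items = []
    total = Decimal("0")
    for label, qty in sorted(counts.items()):
        if label not in prices:
            # Fail loudly rather than silently giving a product away for free.
            raise KeyError(f"no price on file for {label!r}")
        subtotal = prices[label] * qty
        items.append(
            {"label": label, "qty": qty, "unit_price": prices[label], "subtotal": subtotal}
        )
        total += subtotal
    return items, total


# Labels as they might come back from the recognition step.
items, total = build_receipt(["cola_330ml", "potato_chips", "cola_330ml"])
for item in items:
    print(item["label"], item["qty"], item["subtotal"])
print("total:", total)
```

`Decimal` is used instead of `float` so that prices add up exactly, which matters for anything shown to a customer as an amount due.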

Overall architecture

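Technical approach

The 袋鼯麻麻 smart shopping platform is built mainly on PaddleClas and its open-source image recognition pipeline. The model is deployed to Jetson Nano through PaddleInference and packaged as a Windows .exe with QPT. The goal is an industrial-grade smart retail shopping platform that meets real-world requirements.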

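Image recognition overview

The image recognition system works in three steps:
(1) an object detection model finds candidate object regions in the image;
(2) a feature vector is extracted from each candidate region;
(3) each feature vector is matched against the images in the retrieval gallery to produce the recognition result.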
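The three steps can be sketched in plain Python as below. This is a toy illustration under stated assumptions, not PaddleClas code: `detect` and `extract` are hypothetical callables standing in for the detection and feature-extraction models, the gallery is a plain list of `(label, vector)` pairs, and cosine similarity with a fixed threshold is assumed as the matching rule.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recognize(image, detect, extract, gallery, threshold=0.5):
    """Run detect -> extract -> match; return (box, label, score) for each matched region."""
    results = []
    for box in detect(image):              # step 1: candidate regions
        feature = extract(image, box)      # step 2: feature vector per region
        label, score = max(                # step 3: best match in the gallery
            ((lbl, cosine_similarity(feature, vec)) for lbl, vec in gallery),
            key=lambda pair: pair[1],
            default=(None, 0.0),
        )
        if label is not None and score >= threshold:
            results.append((box, label, score))
    return results


# Toy stand-ins so the sketch runs without any model (all values are made up).
gallery = [("cola_330ml", [1.0, 0.0, 0.0]), ("potato_chips", [0.0, 1.0, 0.0])]
fake_features = {(0, 0, 10, 10): [0.9, 0.1, 0.0], (20, 0, 30, 10): [0.1, 0.8, 0.1]}
detect = lambda image: list(fake_features)
extract = lambda image, box: fake_features[box]

matches = recognize(None, detect, extract, gallery)
labels = [label for _, label, _ in matches]
print(labels)
```

Regions whose best score falls below the threshold are dropped instead of being forced onto the nearest label, which is one simple way to avoid charging for an object that is not in the gallery.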

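A new, previously unseen category does not require retraining the model. Adding images of that category to the retrieval gallery and rebuilding the gallery index is enough for the system to recognize it.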
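To make this concrete, here is a small sketch of a gallery that learns a new product simply by having a feature vector added to it. The `Gallery` class, the product labels, and the vectors are hypothetical. In the real system the features would come from the trained extractor and the index would be rebuilt with the project's own tooling, whereas this toy version just appends to a list.

```python
import math


class Gallery:
    """Toy in-memory retrieval gallery: a list of (label, unit-length feature) pairs."""

    def __init__(self):
        self._entries = []

    def add(self, label, feature):
        """Register one reference feature for a label; no model training involved."""
        norm = math.sqrt(sum(x * x for x in feature))
        if norm == 0:
            raise ValueError("feature vector must be non-zero")
        self._entries.append((label, [x / norm for x in feature]))

    def query(self, feature, threshold=0.8):
        """Return the best-matching label, or None if nothing is similar enough."""
        norm = math.sqrt(sum(x * x for x in feature))
        if norm == 0 or not self._entries:
            return None
        unit = [x / norm for x in feature]
        label, score = max(
            ((lbl, sum(a * b for a, b in zip(unit, vec))) for lbl, vec in self._entries),
            key=lambda pair: pair[1],
        )
        return label if score >= threshold else None


gallery = Gallery()
gallery.add("cola_330ml", [1.0, 0.0, 0.0])

new_product = [0.0, 0.1, 0.9]              # feature of a product not yet in the gallery
before = gallery.query(new_product)         # not recognized yet

gallery.add("green_tea_500ml", [0.0, 0.0, 1.0])
after = gallery.query(new_product)          # recognized once the gallery contains it
print(before, after)
```

The feature extractor is untouched between the two queries. Only the gallery changes, which is the property the paragraph above describes.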

Datasets

Dataset 1: Products-10K Large Scale Product Recognition Dataset

Dataset 2: RP2K: A Large-Scale Retail Product Dataset for Fine-Grained Image Classification

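The 袋鼯麻麻 smart shopping platform is built on these two datasets, both of which were adapted for this project.

The processed dataset has been open-sourced on AIStudio.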

Deployment

This project runs on Jetson Nano, Windows, and Linux.

Baidu Netdisk link for the QPT-packaged build: https://pan.baidu.com/s/1pVr4zSZB6qV10VIPvgWCsA (extraction code: mpq2)

After extracting the archive, run 启动程序.exe.

Server deployment

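Install the Python dependencies: pip install -r requestment.txt

Run python manage.py makemigrations

Run python manage.py migrate

Run python manage.py runserver (serves on port 8000 by default)

WeChat mini program: open the WeChat developer tools, import the wx_mini_app folder under the system folder, and run it to start the mini program client.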
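Before pointing the mini program at the backend, it can help to confirm that the server started by `python manage.py runserver` is actually listening. The helper below is an assumption-level convenience script, not part of the project: `is_port_open` is a hypothetical name, and the host `127.0.0.1` and port `8000` assume the default runserver settings mentioned above.

```python
import socket


def is_port_open(host="127.0.0.1", port=8000, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    if is_port_open():
        print("server reachable on port 8000")
    else:
        print("server not reachable on port 8000")
```

This only checks that something accepts TCP connections on the port. It does not verify that the application itself responds correctly.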


Owner

thomas-yanxin
