Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Last update: Jun 05, 2022

Related tags

Overview

👁️ Hindsight AI: Crime Classification With Clip

About

For Educational Purposes Only This is a recursive neural net trained to classify specific crime classes based on the UCF-Crime dataset UCF-CRIME or to perform general anomaly detection. The model uses images that have been encoded into the CLIP image embedding space.

Introducing CLIP

The model we are utilizing in our application, CLIP (developed by OpenAI), is a generalized image classification model which can take any image and produce word embeddings for the purpose of matching raw text strings to the contents of the image. The design and training of the model allows for high zero-shot performance in classifying images (i.e. image classification problems outside of the training set). The following image provides a summary of the model (taken from A. Radford et al.):

While typical image classification models train an image feature extractor and a linear classifier to predict a label, CLIP trains an image encoder and text encoder to predict the correct pairings of a batch of (image, text) training examples. At test time the learned text encoder synthesizes a zero-shot linear classifier by embedding the names or descriptions of the target dataset’s classes.

Installation

Clone the repo and the required packages can be found in the required.txt file. Running classifier.py will start an interactive application that will attempt to perform anomaly detection or multi-class classification on videos found in the 'Videos' directory.

The scripts that were used to create the image sequence database from the video files of the UCF-Crime dataset as well as the training scripts and models can be found in the src directory.

Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Related tags

Overview

👁️ Hindsight AI: Crime Classification With Clip

About

Introducing CLIP

Installation

Owner

Miles Tweed

CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability

A distributed, plug-n-play algorithm for multi-robot applications with a priori non-computable objective functions

Confidence Propagation Cluster aims to replace NMS-based methods as a better box fusion framework in 2D/3D Object detection

Malware Analysis Neural Network project.

code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"

Official PyTorch implementation of the ICRA 2021 paper: Adversarial Differentiable Data Augmentation for Autonomous Systems.

Hand tracking demo for DIY Smart Glasses with a remote computer doing the work

Interactive Visualization to empower domain experts to align ML model behaviors with their knowledge.

ML powered analytics engine for outlier detection and root cause analysis.

YKKDetector For Python

Crowd-sourced Annotation of Human Motion.

OpenAi's gym environment wrapper to vectorize them with Ray

Optimizers-visualized - Visualization of different optimizers on local minimas and saddle points.

这是一个yolox-keras的源码，可以用于训练自己的模型。

Code for the paper "Curriculum Dropout", ICCV 2017

Code accompanying the paper Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs (Chen et al., CVPR 2020, Oral).

DeepProbLog is an extension of ProbLog that integrates Probabilistic Logic Programming with deep learning by introducing the neural predicate.

Implementation for Panoptic-PolarNet (CVPR 2021)

Merlion: A Machine Learning Framework for Time Series Intelligence

[ACM MM 2021] Yes, "Attention is All You Need", for Exemplar based Colorization