Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Last update: Jun 05, 2022

Related tags

Overview

👁️ Hindsight AI: Crime Classification With Clip

About

For Educational Purposes Only This is a recursive neural net trained to classify specific crime classes based on the UCF-Crime dataset UCF-CRIME or to perform general anomaly detection. The model uses images that have been encoded into the CLIP image embedding space.

Introducing CLIP

The model we are utilizing in our application, CLIP (developed by OpenAI), is a generalized image classification model which can take any image and produce word embeddings for the purpose of matching raw text strings to the contents of the image. The design and training of the model allows for high zero-shot performance in classifying images (i.e. image classification problems outside of the training set). The following image provides a summary of the model (taken from A. Radford et al.):

While typical image classification models train an image feature extractor and a linear classifier to predict a label, CLIP trains an image encoder and text encoder to predict the correct pairings of a batch of (image, text) training examples. At test time the learned text encoder synthesizes a zero-shot linear classifier by embedding the names or descriptions of the target dataset’s classes.

Installation

Clone the repo and the required packages can be found in the required.txt file. Running classifier.py will start an interactive application that will attempt to perform anomaly detection or multi-class classification on videos found in the 'Videos' directory.

The scripts that were used to create the image sequence database from the video files of the UCF-Crime dataset as well as the training scripts and models can be found in the src directory.

Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Related tags

Overview

👁️ Hindsight AI: Crime Classification With Clip

About

Introducing CLIP

Installation

Owner

Miles Tweed

A sequence of Jupyter notebooks featuring the 12 Steps to Navier-Stokes

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

This is the reference implementation for "Coresets via Bilevel Optimization for Continual Learning and Streaming"

PyTorch Implementation of CycleGAN and SSGAN for Domain Transfer (Minimal)

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Repo for paper "Dynamic Placement of Rapidly Deployable Mobile Sensor Robots Using Machine Learning and Expected Value of Information"

Gradient-free global optimization algorithm for multidimensional functions based on the low rank tensor train format

This code is an implementation for Singing TTS.

Unofficial pytorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions'

TResNet: High Performance GPU-Dedicated Architecture

Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking

The Most Efficient Temporal Difference Learning Framework for 2048

This is a Python Module For Encryption, Hashing And Other stuff

Autonomous Ground Vehicle Navigation and Control Simulation Examples in Python

Invertible conditional GANs for image editing

Implementation of "DeepOrder: Deep Learning for Test Case Prioritization in Continuous Integration Testing".

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

💊 A 3D Generative Model for Structure-Based Drug Design (NeurIPS 2021)

Official implementation of "Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform", ICCV 2021

Group Activity Recognition with Clustered Spatial Temporal Transformer