A implemetation of the LRCN in mxnet

Last update: Aug 25, 2022

Overview

A implemetation of the LRCN in mxnet

##Abstract LRCN is a combination of CNN and RNN

##Installation

Download UCF101 dataset
./avi2jpg.sh to split the video of UCF101
Download the pretrained model of vgg16
python cnn_predict.py to get the pretrained data
python train_ucf101.py

##Reference Donahue, J., Hendricks, L. A., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., & Darrell, T. (2014). Long-term recurrent convolutional networks for visual recognition and description. arXiv preprint arXiv:1411.4389.Chicago

Owner

GitHub Repository

Implementation of SSMF: Shifting Seasonal Matrix Factorization

SSMF Implementation of SSMF: Shifting Seasonal Matrix Factorization, Koki Kawabata, Siddharth Bhatia, Rui Liu, Mohit Wadhwa, Bryan Hooi. NeurIPS, 2021

9 Jun 10, 2022

Code for the Active Speakers in Context Paper (CVPR2020)

Active Speakers in Context This repo contains the official code and models for the "Active Speakers in Context" CVPR 2020 paper. Before Training The c

43 Oct 14, 2022

Spatio-Temporal Entropy Model (STEM) for end-to-end leaned video compression.

Spatio-Temporal Entropy Model A Pytorch Reproduction of Spatio-Temporal Entropy Model (STEM) for end-to-end leaned video compression. More details can

16 Nov 28, 2022

A simple Rock-Paper-Scissors game using CV in python

ML18_Rock-Paper-Scissors-using-CV A simple Rock-Paper-Scissors game using CV in python For IITISOC-21 Rules and procedure to play the interactive game

3 Aug 08, 2021

PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English

PASTRIE Official release of the corpus described in the paper: Michael Kranzlein, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, and Nathan Schn

4 Dec 02, 2021

A Python 3 package for state-of-the-art statistical dimension reduction methods

direpack: a Python 3 library for state-of-the-art statistical dimension reduction techniques This package delivers a scikit-learn compatible Python 3

32 Dec 14, 2022

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

546 Final Project: Masked Autoencoder Haoran Tang, Qirui Wu 1. Training To train the network, please run mae_pretraining.py. Please modify folder path

0 Apr 22, 2022

Chainer implementation of recent GAN variants

Chainer-GAN-lib This repository collects chainer implementation of state-of-the-art GAN algorithms. These codes are evaluated with the inception score

399 Oct 23, 2022

Official implementation of the paper Label-Efficient Semantic Segmentation with Diffusion Models

Label-Efficient Semantic Segmentation with Diffusion Models Official implementation of the paper Label-Efficient Semantic Segmentation with Diffusion

355 Jan 06, 2023

Towards Interpretable Deep Metric Learning with Structural Matching

DIML Created by Wenliang Zhao*, Yongming Rao*, Ziyi Wang, Jiwen Lu, Jie Zhou This repository contains PyTorch implementation for paper Towards Interpr

75 Nov 11, 2022

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

ESRGAN (Enhanced SRGAN) [ 🚀 BasicSR] [Real-ESRGAN] ✨ New Updates. We have extended ESRGAN to Real-ESRGAN, which is a more practical algorithm for rea

4.7k Jan 02, 2023

A implemetation of the LRCN in mxnet

Related tags

Overview

A implemetation of the LRCN in mxnet

Owner

Implementation of SSMF: Shifting Seasonal Matrix Factorization

Code for the Active Speakers in Context Paper (CVPR2020)

Spatio-Temporal Entropy Model (STEM) for end-to-end leaned video compression.

A simple Rock-Paper-Scissors game using CV in python

PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English

A Python 3 package for state-of-the-art statistical dimension reduction methods

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

Chainer implementation of recent GAN variants

Official implementation of the paper Label-Efficient Semantic Segmentation with Diffusion Models

Towards Interpretable Deep Metric Learning with Structural Matching

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

Code for our paper "Sematic Representation for Dialogue Modeling" in ACL2021

Tutorial on active learning with the Nvidia Transfer Learning Toolkit (TLT).

python debugger and anti-vm that checks if you're in a virtual machine or if someones trying to debug your file

Computer Vision is an elective course of MSAI, SCSE, NTU, Singapore

Matplotlib Image labeller for classifying images

Time should be taken seer-iously

BboxToolkit is a tiny library of special bounding boxes.

Code release for BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

Compact Bidirectional Transformer for Image Captioning