STRIVE: Scene Text Replacement In Videos

Dataset Types:

RoboText
SynthText
RealWorld videos

RoboText : Videos of texts collected using navigation robot in indoor environment. The overall duration of these videos is 10hrs+ Each text's background can be extracted from the bottom rectangle of its text rectangle. The orginial unprocessed data is stored as RoboText-OriginalZip.7z. Around 200 preprocessed videos are stored as RoboTextZip1.7z

SynthText : Using unity, we have created paired videos from synthetic scenes. These videos are stored with similar naming convention in drive. File name : SynthText7Zip.7z

Note: Unity bbox are recorded as mirror values, hence the bbox extraction process will be different than other two video types.

Real World videos: We have collected videos using high resolution mobile camera to capture texts in different lighting conditions and motion blur. File name: RealWorld.7z

Preparing data

We have extracted text bounding box from RoboText and Real world videos using AWS Rekognition API. The code available as runAWS.py file. Synthetic videos bbox is recorded in unity environment

Data Preprocessing

Refer to the preprocessing python file for each dataset type to get crop images of text.

Data download

Data can be downloaded from here

Please contact Jeyasri Subramanian( [email protected] ) for any data queries

STRIVE: Scene Text Replacement In Videos

Related tags

Overview

STRIVE: Scene Text Replacement In Videos

Dataset Types:

Preparing data

Data Preprocessing

Data download

Owner

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

Complete U-net Implementation with keras

Causal estimators for use with WhyNot

Code for the Active Speakers in Context Paper (CVPR2020)

Internship Assessment Task for BaggageAI.

Task-based end-to-end model learning in stochastic optimization

AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages

General neural ODE and DAE modules for power system dynamic modeling.

Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥

Unsupervised Feature Loss (UFLoss) for High Fidelity Deep learning (DL)-based reconstruction

Pyeventbus: a publish/subscribe event bus

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation

This repository provides an efficient PyTorch-based library for training deep models.

Leibniz is a python package which provide facilities to express learnable partial differential equations with PyTorch

A framework for analyzing computer vision models with simulated data

A framework for multi-step probabilistic time-series/demand forecasting models

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale

(Python, R, C/C++) Isolation Forest and variations such as SCiForest and EIF, with some additions (outlier detection + similarity + NA imputation)