STRIVE: Scene Text Replacement In Videos

Dataset Types:

RoboText
SynthText
RealWorld videos

RoboText : Videos of texts collected using navigation robot in indoor environment. The overall duration of these videos is 10hrs+ Each text's background can be extracted from the bottom rectangle of its text rectangle. The orginial unprocessed data is stored as RoboText-OriginalZip.7z. Around 200 preprocessed videos are stored as RoboTextZip1.7z

SynthText : Using unity, we have created paired videos from synthetic scenes. These videos are stored with similar naming convention in drive. File name : SynthText7Zip.7z

Note: Unity bbox are recorded as mirror values, hence the bbox extraction process will be different than other two video types.

Real World videos: We have collected videos using high resolution mobile camera to capture texts in different lighting conditions and motion blur. File name: RealWorld.7z

Preparing data

We have extracted text bounding box from RoboText and Real world videos using AWS Rekognition API. The code available as runAWS.py file. Synthetic videos bbox is recorded in unity environment

Data Preprocessing

Refer to the preprocessing python file for each dataset type to get crop images of text.

Data download

Data can be downloaded from here

Please contact Jeyasri Subramanian( [email protected] ) for any data queries

STRIVE: Scene Text Replacement In Videos

Related tags

Overview

STRIVE: Scene Text Replacement In Videos

Dataset Types:

Preparing data

Data Preprocessing

Data download

Owner

This is a simple plugin for Vim that allows you to use OpenAI Codex.

The modify PyTorch version of Siam-trackers which are speed-up by TensorRT.

Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation" by Shizhe Diao et al.

Nodule Generation Algorithm Baseline and template code for node21 generation track

A Learning-based Camera Calibration Toolbox

Implementation of the Swin Transformer in PyTorch.

A gesture recognition system powered by OpenPose, k-nearest neighbours, and local outlier factor.

Digan - Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks

Code to use Augmented Shapiro Wilks Stopping, as well as code for the paper "Statistically Signifigant Stopping of Neural Network Training"

Generating Anime Images by Implementing Deep Convolutional Generative Adversarial Networks paper

mmdetection version of TinyBenchmark.

This repo contains the code for the paper "Efficient hierarchical Bayesian inference for spatio-temporal regression models in neuroimaging" that has been accepted to NeurIPS 2021.

Riemannian Geometry for Molecular Surface Approximation (RGMolSA)

Official implementation of NLOS-OT: Passive Non-Line-of-Sight Imaging Using Optimal Transport (IEEE TIP, accepted)

pytorch bert intent classification and slot filling

Cmsc11 arcade - Final Project for CMSC11

Text-Based Ideal Points

Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight

CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking