Character Grounding and Re-Identification in Story of Videos and Text Descriptions

Last update: Dec 09, 2022

Related tags

Overview

Character in Story Identification Network (CiSIN)

This project hosts the code for our paper.

Youngjae Yu, Jongseok Kim, Heeseung Yun, Jiwan Chung and Gunhee Kim. Character Grounding and Re-Identification inStory of Videos and Text Descriptions. In ECCV (spotlight), 2020.

This project is an Winning Solution in LSMDC 19 "Fill-in the Characters" task. For more information about the LSMDC visit the Large Scale Movie Description Challenge (LSMDC) 2019

Reference

If you use this code as part of any published research, please refer following paper,

@inproceedings{yu:2020:ECCV,
    title="{Character Grounding and Re-Identification inStory of Videos and Text Descriptions}",
    author={Yu, Youngjae and Kim, Jongseok and Yun, Heeseung and Chung Jiwan and Kim, Gunhee},
    booktitle={ECCV},
    year=2020
}

System Requirements

The following dependencies should be installed:

Python 3.6
Pytorch 1.4.0
torchvision 0.5.0
CUDA 10.0 supported GPU with at least 12GB memory
see requirements.txt for more details

Data Setup

Coming soon,

CiSIN

To train our model,

python train.py

Acknowledgement

We thank SNUVL lab members for helpful comments. This research was supported by Seoul National University, Brain Research Program by National Research Foundation of Korea (NRF) (2017M3C7A1047860), and AIR Lab (AI Research Lab) in Hyundai Motor Company through HMC-SNU AI Consortium Fund.

License

LICENSE.md.

Character Grounding and Re-Identification in Story of Videos and Text Descriptions

Related tags

Overview

Character in Story Identification Network (CiSIN)

Reference

System Requirements

Data Setup

CiSIN

Acknowledgement

License

Owner

The official implementation of the IEEE S&P`22 paper "SoK: How Robust is Deep Neural Network Image Classification Watermarking".

Various operations like path tracking, counting, etc by using yolov5

A testcase generation tool for Persistent Memory Programs.

Utilizes Pose Estimation to offer sprinters cues based on an image of their running form.

This repository contains the official code of the paper Equivariant Subgraph Aggregation Networks (ICLR 2022)

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

This script runs neural style transfer against the provided content image.

SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification

A toolkit for document-level event extraction, containing some SOTA model implementations

Code for paper 'Hand-Object Contact Consistency Reasoning for Human Grasps Generation' at ICCV 2021

This is the second place solution for : UmojaHack Africa 2022: African Snake Antivenom Binding Challenge

Deep Networks with Recurrent Layer Aggregation

MATLAB codes of the book "Digital Image Processing Fourth Edition" converted to Python

Calibrate your listeners! Robust communication-based training for pragmatic speakers. Findings of EMNLP 2021.

This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness".

Code for Blind Image Decomposition (BID) and Blind Image Decomposition network (BIDeN).

style mixing for animation face

TAug :: Time Series Data Augmentation using Deep Generative Models

Object tracking implemented with YOLOv4, DeepSort, and TensorFlow.

Tom-the-AI - A compound artificial intelligence software for Linux systems.