Predicting Event Memorability from Contextual Visual Semantics

Last update: Oct 06, 2021

This repository contains the PyTorch implementation of the five configurations described in our paper "Predicting Event Memorability from Contextual Visual Semantics".

Raw images should be placed in '../datasets/r3/images/'.
Train and validation (val) splits for the different configurations are under '../datasets/r3/splits/'; the files train_1.txt, val_1.txt, etc. contain the image names and memorability scores for the respective split.
Each ablation-study configuration has its own folder, e.g., './no_face', './no_activity', etc. './full_set' holds the full configuration, with no features removed.
The complete extrinsic features and the memory test outcomes are available in the 'R3_data.csv' file. The features are described in 'R3_data_notes.txt'. Both can be downloaded, together with the original image cues, from https://drive.google.com/drive/folders/1Bx_ePv7ui6DCIXkESCpoyuvd0H3B9o6d?usp=sharing
The AMNet implementation is adapted from https://github.com/ok1zjf/AMNet
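
The split files mentioned above pair each image name with a memorability score. The exact file layout is not documented here, so the following is only a minimal sketch that assumes one sample per line in the form `<image_name> <score>`, separated by whitespace. The helper names `parse_split_lines` and `load_split` are hypothetical and are not part of this repository.

```python
from typing import Iterable, List, Tuple


def parse_split_lines(lines: Iterable[str]) -> List[Tuple[str, float]]:
    """Parse lines of the ASSUMED form '<image_name> <score>'."""
    samples = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split on the last run of whitespace so that image names
        # containing spaces are kept intact.
        name, score = line.rsplit(None, 1)
        samples.append((name, float(score)))
    return samples


def load_split(path: str) -> List[Tuple[str, float]]:
    """Read a split file such as '../datasets/r3/splits/train_1.txt'."""
    with open(path, "r") as f:
        return parse_split_lines(f)
```

If the real files use a different delimiter (for example commas), only the `rsplit` call would need to change.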

########################################################################################

To train AMNet and CEMNet_wt_AMNet:

python3 main.py --train-batch-size 128 --test-batch-size 128 --cnn ResNet50FC --dataset lamem --train-split train_1 --val-split val_1
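
Because the splits are numbered (train_1, val_1, etc.), it may be convenient to generate the command above for each split. The sketch below only rebuilds the command line shown here with a different split index; the helper names `build_train_command` and `train_all` are hypothetical, and how many splits exist is an assumption left to the caller.

```python
import subprocess


def build_train_command(split_id, batch_size=128, cnn="ResNet50FC", dataset="lamem"):
    """Return the training command from this README for one split index."""
    return [
        "python3", "main.py",
        "--train-batch-size", str(batch_size),
        "--test-batch-size", str(batch_size),
        "--cnn", cnn,
        "--dataset", dataset,
        "--train-split", "train_{}".format(split_id),
        "--val-split", "val_{}".format(split_id),
    ]


def train_all(split_ids):
    """Run training once per split; stops at the first failing run."""
    for split_id in split_ids:
        subprocess.run(build_train_command(split_id), check=True)
```

For example, `train_all([1, 2])` would launch two consecutive training runs from the folder that contains main.py, provided that the train_2 and val_2 splits exist.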

To predict:

python3 main.py --cnn ResNet50FC --model-weights /path/to/model/weights_xx.pkl --eval-images /path/to/evl_images --csv-out memorabilities.txt

To train other models (ICNet, MLP, CEMNet_wt_ICNet):

[Go to the respective folder, e.g., '../ICNet']

python main.py

To predict (please select corresponding splits and model in predict.py):

python predict.py

[Where necessary, edit Dataset.py so that it points to the directory of the corresponding split]

########################################################################################

System configuration:

Platform: Ubuntu 16.04

GPU: GeForce GTX 1080

CUDA: 9.0

########################################################################################

Python packages:

Python 3.5.6

PyTorch 0.2.0

Torchvision 0.1.9

NumPy 1.15.2

OpenCV 3.1.0

PIL (Pillow) 6.1.0
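
Since the listed versions are old, it can help to compare them with what is installed before running anything. The following is a minimal sketch: the module names (`torch`, `torchvision`, `numpy`, `cv2`, `PIL`) are the usual import names for the packages above, and the helpers `installed_version` and `report` are hypothetical. Whether newer versions work with this code is not stated in this README.

```python
import importlib
import sys

# Versions listed in this README, keyed by the usual import name.
EXPECTED = {
    "torch": "0.2.0",
    "torchvision": "0.1.9",
    "numpy": "1.15.2",
    "cv2": "3.1.0",
    "PIL": "6.1.0",
}


def python_version():
    """Return the running interpreter version as 'major.minor.micro'."""
    return "{}.{}.{}".format(*sys.version_info[:3])


def installed_version(module_name):
    """Return the module's __version__, 'unknown' if absent, None if not importable."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", "unknown")


def report(expected=EXPECTED):
    """Return (name, expected_version, installed_version) rows, sorted by name."""
    return [
        (name, wanted, installed_version(name))
        for name, wanted in sorted(expected.items())
    ]
```

Printing the rows from `report()` next to `python_version()` gives a quick side-by-side view of the expected and installed versions.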

########################################################################################

To cite the paper: Xu Q., Fang F., del Molino A.G., Subbaraju V., Lim J.H., Predicting Event Memorability from Contextual Visual Semantics, NeurIPS 2021.

If you have any questions, please feel free to contact Dr Xu Qianli: [email protected]
