Predicting Event Memorability from Contextual Visual Semantics

Last update: Oct 06, 2021

Overview

Predicting-Event-Memorability-from-Contextual-Visual-Semantics

This repository contains pytorch implementation of five configurations in our paper "Predicting Event Memorability from Contextual Visual Semantics".

Raw images are to be put in '../datasets/r3/images/'
Train and validation (val) splits for different configurations are under '../datasets/r3/splits/'; the set of train_1.txt, val_1.txt, etc. contains image names and memorability scores for the respective split.
Configurations of ablation study are with individual folders, e.g., './no_face', './no_activity', etc. './full_set' is for full configuration without removing features.
Complete extrinsic features and the memory test outcome is available in 'R3_data.csv' file. Description of the features is given in 'R3_data_notes.txt'. Both can be downloaded together with the original image cues @ https://drive.google.com/drive/folders/1Bx_ePv7ui6DCIXkESCpoyuvd0H3B9o6d?usp=sharing
The AMNet implementation is adpated from https://github.com/ok1zjf/AMNet

########################################################################################

To train AMNet and CEMNet_wt_AMNet:

python3 main.py --train-batch-size 128 --test-batch-size 128 --cnn ResNet50FC --dataset lamem --train-split train_1 --val-split val_1

To predict:

python3 main.py --cnn ResNet50FC --model-weights /path/to/model/weights_xx.pkl --eval-images /path/to/evl_images --csv-out memorabilities.txt

To train other models (ICNet, MLP, CEMNet_wt_ICNet):

[Go the the respective folder, e.g., '../ICNet']

python main.py

To predict (please select corresponding splits and model in predict.py):

python predict.py

[Where necessary, change Dataset.py to the corresponding directory of split]

########################################################################################

System configuration:

platform: UBUNTU 16.04

GPU: GeForce GTX 1080

CUDA:9.0

########################################################################################

Python packages:

python 3.5.6

pytorch 0.2.0

Torchvison 0.1.9

Numpy 1.15.2

Opencv 3.1.0

PIL 6.1.0

########################################################################################

To cite the paper: Xu Q., Fang F., del Molino A.G, Subbaraju V., Lim J.H., Predicting Event Memorability from Contextual Visual Semantics, NeurIPS 2021.

If you have any questions, please feel free to contact Dr Xu Qianli: [email protected]

Predicting Event Memorability from Contextual Visual Semantics

Related tags

Overview

Predicting-Event-Memorability-from-Contextual-Visual-Semantics

Owner

Multi-task head pose estimation in-the-wild

A Tensorflow implementation of CapsNet based on Geoffrey Hinton's paper Dynamic Routing Between Capsules

This project is used for the paper Differentiable Programming of Isometric Tensor Network

Rule Based Classification Project

This repository contains the official implementation code of the paper Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis

A collection of models for image<->text generation in ACM MM 2021.

The challenge for Quantum Coalition Hackathon 2021

Docker containers of baseline agents for the Crafter environment

Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)

Deep Inertial Prediction (DIPr)

A method to perform unsupervised cross-region adaptation of crop classifiers trained with satellite image time series.

PINN(s): Physics-Informed Neural Network(s) for von Karman vortex street

End-to-end Temporal Action Detection with Transformer. [Under review]

Source code of our TTH paper: Targeted Trojan-Horse Attacks on Language-based Image Retrieval.

DeepOBS: A Deep Learning Optimizer Benchmark Suite

This is the code for HOI Transformer

Explanatory Learning: Beyond Empiricism in Neural Networks

This repository contains a re-implementation of the code for the CVPR 2021 paper "Omnimatte: Associating Objects and Their Effects in Video."

A collection of pre-trained StyleGAN2 models trained on different datasets at different resolution.

Improving Contrastive Learning by Visualizing Feature Transformation, ICCV 2021 Oral