This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

Last update: Dec 26, 2022

Overview

MultiModal-InfoMax

🔥 If you would be interested in other multimodal works in our DeCLaRe Lab, welcome to visit the clustered repository

Introduction

Multimodal-informax (MMIM) synthesizes fusion results from multi-modality input through a two-level mutual information (MI) maximization. We use BA (Barber-Agakov) lower bound and contrastive predictive coding as the target function to be maximized. To facilitate the computation, we design an entropy estimation module with associated history data memory to facilitate the computation of BA lower bound and the training process.

Usage

Download the CMU-MOSI and CMU-MOSEI dataset from Google Drive or Baidu Disk (extraction code: g3m2). Place them under the folder Multimodal-Infomax/datasets
Set up the environment (need conda prerequisite)

conda env create -f environment.yml
conda activate MMIM

Start training

python main.py --dataset mosi --contrast

Citation

Please cite our paper if you find our work useful for your research:

@article{han2021improving,
  title={Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis},
  author={Han, Wei and Chen, Hui and Poria, Soujanya},
  journal={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  year={2021}
}

Contact

Should you have any question, feel free to contact me through [email protected]

This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

Related tags

Overview

MultiModal-InfoMax

Introduction

Usage

Citation

Contact

Owner

Deep Cognition and Language Research (DeCLaRe) Lab

Multi-Modal Machine Learning toolkit based on PyTorch.

PyTorch implementation of our ICCV 2019 paper: Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis

Open-Set Recognition: A Good Closed-Set Classifier is All You Need

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.

Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification"

Differentiable Optimizers with Perturbations in Pytorch

ICCV2021 - A New Journey from SDRTV to HDRTV.

Photo2cartoon - 人像卡通化探索项目 (photo-to-cartoon translation project)

Losslandscapetaxonomy - Taxonomizing local versus global structure in neural network loss landscapes

This is a Python Module For Encryption, Hashing And Other stuff

Structured Data Gradient Pruning (SDGP)

Underwater industrial application yolov5m6

Fast SHAP value computation for interpreting tree-based models

Self-Supervised Generative Style Transfer for One-Shot Medical Image Segmentation

Public implementation of the Convolutional Motif Kernel Network (CMKN) architecture

Implementation of Neural Style Transfer in Pytorch

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

PrimitiveNet: Primitive Instance Segmentation with Local Primitive Embedding under Adversarial Metric (ICCV 2021)

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

Predicting Price of house by considering ,house age, Distance from public transport