PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Last update: Dec 08, 2022

MICRO

PyTorch implementation for the paper "Latent Structures Mining with Contrastive Modality Fusion for Multimedia Recommendation".

Dependencies

Python 3.6
torch==1.5.0
scikit-learn==0.24.2
torch-scatter==2.0.8

Dataset Preparation

Download the 5-core review data, metadata, and image features from the Amazon product dataset. Put the data into the directory data/meta-data/.
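The downloaded review and metadata files are gzip-compressed, with one record per line. The following is a minimal sketch for inspecting them before preprocessing; it is not taken from build_data.py. The function name `read_records` is hypothetical, and the fallback to `ast.literal_eval` rests on the assumption that some lines may be Python dict literals rather than strict JSON.

```python
import ast
import gzip
import json


def read_records(path):
    """Yield one dict per non-empty line of a gzip-compressed text file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            try:
                yield json.loads(line)
            except json.JSONDecodeError:
                # Assumption: a line that is not strict JSON is a Python
                # dict literal, which literal_eval parses without running code.
                yield ast.literal_eval(line)
```

For example, `sum(1 for _ in read_records(path))` counts the records in a file without loading it all into memory.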

Install sentence-transformers and download a pretrained model to extract textual features. Unzip the pretrained model into the directory sentence-transformers/:

├─ data/
    ├── sports/
        ├── meta-data/
            ├── image_features_Sports_and_Outdoors.b
            ├── meta-Sports_and_Outdoors.json.gz
            ├── reviews_Sports_and_Outdoors_5.json.gz
    ├── sentence-transformers/
        ├── stsb-roberta-large
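Before running the preprocessing step, it can help to confirm that the files are where the tree above places them. The sketch below is a hypothetical helper, not part of the repository: the paths are copied from the tree for the sports dataset, and the default root `data` assumes the script is run from the directory that contains data/.

```python
from pathlib import Path

# Relative paths copied from the directory tree above (sports dataset).
EXPECTED = [
    "sports/meta-data/image_features_Sports_and_Outdoors.b",
    "sports/meta-data/meta-Sports_and_Outdoors.json.gz",
    "sports/meta-data/reviews_Sports_and_Outdoors_5.json.gz",
    "sentence-transformers/stsb-roberta-large",
]


def missing_inputs(data_root="data", expected=EXPECTED):
    """Return the expected paths that do not exist under data_root."""
    root = Path(data_root)
    return [rel for rel in expected if not (root / rel).exists()]


if __name__ == "__main__":
    missing = missing_inputs()
    if missing:
        print("Missing inputs:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected inputs found.")
```

An empty result from `missing_inputs()` only means the paths exist; it does not validate file contents.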

Run python build_data.py to preprocess data.
Run python cold_start.py to build cold-start data.
We also provide processed data on Baidu Yun (access code: m37q) and Google Drive.

Usage

Start training and inference as:

cd codes
python main.py --dataset {DATASET}

For cold-start settings:

python main.py --dataset {DATASET} --core 0 --verbose 1 --lr 1e-5
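To script the two commands above, for instance to run both settings in sequence, a small wrapper along these lines may be convenient. It is a sketch under assumptions: `build_command` and `run` are hypothetical names, the flags are copied verbatim from the commands above without interpreting them, and the cold-start command is assumed to be run from the same codes directory.

```python
import shlex
import subprocess


def build_command(dataset, cold_start=False):
    """Return the argument list for one run of main.py."""
    cmd = ["python", "main.py", "--dataset", dataset]
    if cold_start:
        # Flags copied verbatim from the cold-start command above.
        cmd += ["--core", "0", "--verbose", "1", "--lr", "1e-5"]
    return cmd


def run(dataset, cold_start=False, workdir="codes", dry_run=True):
    """Print the command and, unless dry_run is set, execute it in workdir."""
    cmd = build_command(dataset, cold_start)
    print(" ".join(shlex.quote(part) for part in cmd))
    if not dry_run:
        # check=True raises CalledProcessError if main.py exits with an error.
        subprocess.run(cmd, cwd=workdir, check=True)
    return cmd
```

With the default `dry_run=True` the wrapper only prints the command, so it can be checked before anything is launched; `run("sports", cold_start=True, dry_run=False)` would start the cold-start run for the sports dataset.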

Citation

If you use our code in your research, please cite:

@article{MICRO21,
  title     = {Latent Structures Mining with Contrastive Modality Fusion for Multimedia Recommendation},
  author    = {Zhang, Jinghao and 
               Zhu, Yanqiao and 
               Liu, Qiang and
               Zhang, Mengqi and
               Wu, Shu and 
               Wang, Liang},
  journal = {arXiv.org},
  year={2021},
  eprint={2111.00678},
  archivePrefix={arXiv},
  primaryClass={cs.IR}
}

Acknowledgement

The structure of this code is largely based on LightGCN. Thanks for their work.

Owner

Big Data and Multi-modal Computing Group, CRIPAC
