The source code and dataset for the RecGURU paper (WSDM 2022)

Last update: Jan 07, 2023

RecGURU

About The Project

Source code and baselines for the paper "RecGURU: Adversarial Learning of Generalized User Representations for Cross-Domain Recommendation" (WSDM 2022).

Code Structure

RecGURU
├── README.md                                 Read me file
├── data_process                              Data processing methods
│   ├── __init__.py                           Package initialization file
│   ├── amazon_csv.py                         Code for processing the Amazon data (in .csv format)
│   ├── business_process.py                   Code for processing the collected data
│   ├── item_frequency.py                     Calculates item frequency in each domain
│   └── run.sh                                Shell script to perform data processing
├── GURU                                      Scripts for modeling, training, and testing
│   ├── data                                  Dataloader package
│   │   ├── __init__.py                       Package initialization file
│   │   └── data_loader.py                    Customized dataloaders
│   ├── tools                                 Tools such as loss functions, evaluation metrics, etc.
│   │   ├── __init__.py                       Package initialization file
│   │   ├── lossfunction.py                   Customized loss functions
│   │   ├── metrics.py                        Evaluation metrics
│   │   ├── plot.py                           Plot function
│   │   └── utils.py                          Other tools
│   ├── Transformer                           Transformer package
│   │   ├── __init__.py                       Package initialization file
│   │   └── transformer.py                    Transformer module
│   ├── AutoEnc4Rec.py                        Autoencoder-based sequential recommender
│   ├── AutoEnc4Rec_cross.py                  Cross-domain recommender modules
│   ├── config_auto4rec.py                    Model configuration file
│   ├── gan_training.py                       Training methods of the GAN framework
│   ├── train_auto.py                         Main function for training and testing the single-domain sequential recommender
│   └── train_gan.py                          Main function for training and testing the cross-domain sequential recommender
└── .gitignore                                gitignore file

Dataset

Public dataset: the Amazon review dataset, available at https://nijianmo.github.io/amazon/index.html
Collected datasets: https://drive.google.com/file/d/1NbP48emGPr80nL49oeDtPDR3R8YEfn4J/view
Data processing:

Amazon dataset:

```shell
cd ../data_process
python amazon_csv.py   
```

Collected dataset:

```shell
cd ../data_process
python business_process.py --rate 0.1  # fraction of overlapping users = 0.1
```
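The `--rate` flag sets the fraction of users shared by both domains that is kept as overlapping. The sketch below illustrates that idea only; it is not taken from `business_process.py`. The function name `sample_overlap`, its list-of-IDs inputs, the rounding rule, and the fixed seed are all assumptions made for illustration.

```python
import random


def sample_overlap(users_a, users_b, rate, seed=0):
    """Return a sorted list of shared user IDs kept as overlapping.

    users_a, users_b: iterables of user IDs (hypothetical inputs).
    rate: fraction of the shared users to keep, between 0 and 1.
    """
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must be in [0, 1]")
    # Sort the intersection so the sample is reproducible for a given seed.
    shared = sorted(set(users_a) & set(users_b))
    k = round(len(shared) * rate)
    rng = random.Random(seed)
    return sorted(rng.sample(shared, k))


# Toy example: 100 users per domain, 50 of them shared.
domain_a = [f"u{i}" for i in range(100)]      # u0 .. u99
domain_b = [f"u{i}" for i in range(50, 150)]  # u50 .. u149
kept = sample_overlap(domain_a, domain_b, rate=0.1)
print(len(kept))  # 5, i.e. 10% of the 50 shared users
```

What happens to the shared users that are not sampled is decided by the actual processing script, so check `business_process.py` before relying on any particular behavior.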

After data processing, each cross-domain scenario has its own dataset folder:

."a_domain"-"b_domain"
├── a_only.pickle         # users in domain a only
├── b_only.pickle         # users in domain b only
├── a.pickle              # all users in domain a
├── b.pickle              # all users in domain b
└── a_b.pickle            # overlapped users of domain a and b

Note: see the code for processing details and modify it as needed.
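To inspect a processed folder, something like the following may help. It is a minimal sketch that assumes only the five file names listed above. The internal structure of each pickle is not documented here, so the code reports the object type and length rather than assuming a schema. `load_splits` and `describe` are hypothetical helper names, not part of the repository.

```python
import pickle
from pathlib import Path

# File names follow the dataset folder layout listed above.
SPLIT_FILES = ("a_only", "b_only", "a", "b", "a_b")


def load_splits(folder):
    """Load every split file found in `folder` into a dict keyed by split name."""
    splits = {}
    for name in SPLIT_FILES:
        path = Path(folder) / f"{name}.pickle"
        if path.exists():  # skip missing files instead of failing
            with path.open("rb") as fh:
                splits[name] = pickle.load(fh)
    return splits


def describe(splits):
    """Return {split name: (type name, length or None)} without assuming a schema."""
    summary = {}
    for name, obj in splits.items():
        size = len(obj) if hasattr(obj, "__len__") else None
        summary[name] = (type(obj).__name__, size)
    return summary
```

Calling `describe(load_splits("path/to/dataset_folder"))` gives a quick overview of what each file contains before wiring the folder into the dataloaders.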

Run

Single-domain Methods:

# SAS
python train_auto.py --sas "True"
# AutoRec (ours)
python train_auto.py

Cross-Domain Methods:

# RecGURU
python train_gan.py --cross "True"
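The commands above pass booleans as quoted strings (`--sas "True"`, `--cross "True"`). How `train_auto.py` and `train_gan.py` parse these values is not shown here. One common pattern, sketched below as an assumption, is an argparse `type` converter, since `type=bool` would treat any non-empty string (including `"False"`) as true. The helper `str2bool`, the function `build_parser`, and the default values are illustrative and not the repository's code.

```python
import argparse


def str2bool(value):
    """Convert strings such as "True" or "false" to bool for argparse."""
    text = str(value).strip().lower()
    if text in ("true", "1", "yes"):
        return True
    if text in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser():
    parser = argparse.ArgumentParser()
    # Flag names mirror the commands above; the defaults are assumptions.
    parser.add_argument("--sas", type=str2bool, default=False)
    parser.add_argument("--cross", type=str2bool, default=False)
    return parser
```

With this parser, `build_parser().parse_args(["--sas", "True"])` yields `sas=True` and leaves `cross` at its default.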

Owner

Chenglin Li
