A Lighting Pytorch Framework for Recommendation System, Easy-to-use and Easy-to-extend.

Last update: Jan 04, 2023

Overview

Torch-RecHub

A Lighting Pytorch Framework for Recommendation Models, Easy-to-use and Easy-to-extend.

安装

pip install torch-rechub

主要特性

scikit-learn风格易用的API（fit、predict），即插即用
训练过程与模型定义解耦，易拓展，可针对不同类型的模型设置不同的训练机制
使用Pytorch原生Dataset、DataLoader，易修改，自定义数据
高度模块化，支持常见Layer（MLP、FM、FFM、target-attention、self-attention、transformer等），容易调用组装成新模型
支持常见排序模型（WideDeep、DeepFM、DIN、DCN、xDeepFM等）
支持常见召回模型（DSSM、YoutubeDNN、MIND、SARSRec等）
丰富的多任务学习支持
- SharedBottom、ESMM、MMOE、PLE、AITM等模型
- GradNorm、UWL等动态loss加权机制
聚焦更生态化的推荐场景
- 冷启动
- 延迟反馈
- 去偏
支持丰富的训练机制（对比学习、蒸馏学习等）
第三方高性能开源Trainer支持（Pytorch Lighting等）
更多模型正在开发中

快速使用

from torch_rechub.rmodels.ranking import WideDeep, DeepFM, DIN
from torch_rechub.trainers import CTRTrainer
from torch_rechub.basic.utils import DataGenerator

dg = DataGenerator(x, y)
train_dataloader, val_dataloader, test_dataloader = dg.generate_dataloader()

model = DeepFM(deep_features=deep_features, fm_features=fm_features, mlp_params={"dims": [256, 128], "dropout": 0.2, "activation": "relu"})

ctr_trainer = CTRTrainer(model)
ctr_trainer.fit(train_dataloader, val_dataloader)
auc = ctr_trainer.evaluate(ctr_trainer.model, test_dataloader)

多任务学习

from torch_rechub.models.multi_task import SharedBottom, ESMM, MMOE, PLE, AITM
from torch_rechub.trainers import MTLTrainer

model = MMOE(features, task_types, n_expert=3, expert_params={"dims": [64,32,16]}, tower_params_list=[{"dims": [8]}, {"dims": [8]}])

ctr_trainer = MTLTrainer(model)
ctr_trainer.fit(train_dataloader, val_dataloader)
auc = ctr_trainer.evaluate(ctr_trainer.model, test_dataloader)

Note:

所有模型均在大多数论文提及的多个知名公开数据集中测试，达到或者接近论文性能。

使用案例：Examples

每个数据集将会提供

一个使用脚本，包含样本生成、模型训练与测试，并提供一套测评所用参数。

一个预处理脚本，将原始数据进行预处理，转化成csv。

数据格式参考文件（100条）。

全量数据，统一的csv文件，提供高速网盘下载链接和原始数据链接。

初步规划TODO清单

A Lighting Pytorch Framework for Recommendation System, Easy-to-use and Easy-to-extend.

Related tags

Overview

Torch-RecHub

安装

主要特性

快速使用

Owner

Mincai Lai

Rainbow DQN implementation that outperforms the paper's results on 40% of games using 20x less data 🌈

Paper Title: Heterogeneous Knowledge Distillation for Simultaneous Infrared-Visible Image Fusion and Super-Resolution

Train DeepLab for Semantic Image Segmentation

PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility

PyTorch code for the ICCV'21 paper: "Always Be Dreaming: A New Approach for Class-Incremental Learning"

SCAAML is a deep learning framwork dedicated to side-channel attacks run on top of TensorFlow 2.x.

Dados coletados e programas desenvolvidos no processo de iniciação científica

Implementation for the paper SMPLicit: Topology-aware Generative Model for Clothed People (CVPR 2021)

Revealing and Protecting Labels in Distributed Training

Application of K-means algorithm on a music dataset after a dimensionality reduction with PCA

ZeroGen: Efficient Zero-shot Learning via Dataset Generation

Learning Features with Parameter-Free Layers (ICLR 2022)

UCSD Oasis platform

Implementation of Segnet, FCN, UNet , PSPNet and other models in Keras.

Implementation for Panoptic-PolarNet (CVPR 2021)

计算机视觉中用到的注意力模块和其他即插即用模块PyTorch Implementation Collection of Attention Module and Plug&Play Module

🕺Full body detection and tracking

[EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

A list of all named GANs!