CPM

CPM is an open-source program on large-scale pre-trained models, which is conducted by Beijing Academy of Artificial Intelligence and Tsinghua University, with the goal of building large-scale Chinese-centered pre-trained models. The open-source models can be widely used in Chinese natural language understanding, generative tasks, and all of them are free and open for download for research use.

CPM-1

CPM: A Large-scale Generative Chinese Pre-trained Language Model. [paper]

Codes:

Models: [Download]

CPM-2

CPM-2: Large-scale Cost-effective Pre-trained Language Models. [paper]

Codes:

Models: [Download]

PLM Survey

Pre-Trained Models: Past, Present and Future. [paper]

Useful Links

Article Generation with CPM-1. [link]

Cite

@article{cpm-v1,
  title={CPM: A Large-scale Generative Chinese Pre-trained Language Model},
  author={Zhang, Zhengyan and Han, Xu, and Zhou, Hao, and Ke, Pei, and Gu, Yuxian and Ye, Deming and Qin, Yujia and Su, Yusheng and Ji, Haozhe and Guan, Jian and Qi, Fanchao and Wang, Xiaozhi and Zheng, Yanan and Zeng, Guoyang and Cao, Huanqi and Chen, Shengqi and Li, Daixuan and Sun, Zhenbo and Liu, Zhiyuan and Huang, Minlie and Han, Wentao and Tang, Jie and Li, Juanzi and Sun, Maosong},
  year={2020}
}

@article{cpm-v2,
  title={CPM-2: Large-scale Cost-efficient Pre-trained Language Models},
  author={Zhang, Zhengyan and Gu, Yuxian and Han, Xu and Chen, Shengqi and Xiao, Chaojun and Sun, Zhenbo and Yao, Yuan and Qi, Fanchao and Guan, Jian and Ke, Pei and Cai, Yanzheng and Zeng, Guoyang and Tan, Zhixing and Liu, Zhiyuan and Huang, Minlie and Han, Wentao and Liu, Yang and Zhu, Xiaoyan and Sun, Maosong},
  year={2021}
}

@article{han2021pretrained,
      title={Pre-Trained Models: Past, Present and Future}, 
      author={Xu Han and Zhengyan Zhang and Ning Ding and Yuxian Gu and Xiao Liu and Yuqi Huo and Jiezhong Qiu and Liang Zhang and Wentao Han and Minlie Huang and Qin Jin and Yanyan Lan and Yang Liu and Zhiyuan Liu and Zhiwu Lu and Xipeng Qiu and Ruihua Song and Jie Tang and Ji-Rong Wen and Jinhui Yuan and Wayne Xin Zhao and Jun Zhu},
      year={2021}
}

Introduction to CPM

Related tags

Overview

CPM

CPM-1

CPM-2

PLM Survey

Useful Links

Cite

Owner

Tsinghua AI

[WACV21] Code for our paper: Samuel, Atzmon and Chechik, "From Generalized zero-shot learning to long-tail with class descriptors"

Unsupervised Pre-training for Person Re-identification (LUPerson)

An Efficient Implementation of Analytic Mesh Algorithm for 3D Iso-surface Extraction from Neural Networks

Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"

A curated list of programmatic weak supervision papers and resources

codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"

PyMatting: A Python Library for Alpha Matting

OSLO: Open Source framework for Large-scale transformer Optimization

Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Image Deraining"

This repository is maintained for the scientific paper tittled " Study of keyword extraction techniques for Electric Double Layer Capacitor domain using text similarity indexes: An experimental analysis "

Keras like implementation of Deep Learning architectures from scratch using numpy.

Code for C2-Matching (CVPR2021). Paper: Robust Reference-based Super-Resolution via C2-Matching.

Solver for Large-Scale Rank-One Semidefinite Relaxations

Deep learning PyTorch library for time series forecasting, classification, and anomaly detection

Training Cifar-10 Classifier Using VGG16

Codes for CyGen, the novel generative modeling framework proposed in "On the Generative Utility of Cyclic Conditionals" (NeurIPS-21)

This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf

Spatial Sparse Convolution Library

Losslandscapetaxonomy - Taxonomizing local versus global structure in neural network loss landscapes

Python library for science observations from the James Webb Space Telescope