Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."

Last update: Dec 19, 2022

DialogLM

Code for AAAI 2022 paper: DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization.

Pre-trained Models

We release two versions of pre-trained models.

DialogLM is based on UniLMv2. It comes in two versions, with and without sparse attention, so that dialogues of different lengths can be processed.
DialogLED builds on the Longformer-Encoder-Decoder (LED) architecture and is further pre-trained on a large amount of long dialogue data, using window-based denoising as the pre-training task. Its base and large versions can be used directly through HuggingFace.
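As a rough illustration of using DialogLED through HuggingFace, the sketch below loads a checkpoint with the `transformers` library and generates text for a dialogue. Several things here are assumptions rather than details from this README: the Hub ID `MingZhong/DialogLED-base-16384`, the `speaker: utterance` input format, the helper names `format_dialogue` and `summarize`, and the generation settings. Please check the Hub for the exact checkpoint names before relying on it.

```python
from typing import List, Tuple

# Assumed Hub checkpoint ID (not stated in this README); verify it on the Hub.
MODEL_ID = "MingZhong/DialogLED-base-16384"


def format_dialogue(turns: List[Tuple[str, str]]) -> str:
    """Flatten (speaker, utterance) pairs into one string, one turn per line.

    The "speaker: utterance" layout is an illustrative assumption.
    """
    return "\n".join(f"{speaker}: {utterance.strip()}" for speaker, utterance in turns)


def summarize(
    turns: List[Tuple[str, str]],
    model_id: str = MODEL_ID,
    max_input_tokens: int = 4096,
    max_summary_tokens: int = 128,
) -> str:
    """Generate text for a dialogue with an LED-style checkpoint (hypothetical helper)."""
    # Imported lazily so format_dialogue works without torch/transformers installed.
    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    inputs = tokenizer(
        format_dialogue(turns),
        return_tensors="pt",
        truncation=True,
        max_length=max_input_tokens,
    )

    # LED-style models use local attention plus a few global tokens;
    # here only the first token attends globally.
    global_attention_mask = torch.zeros_like(inputs["input_ids"])
    global_attention_mask[:, 0] = 1

    with torch.no_grad():
        output_ids = model.generate(
            inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            global_attention_mask=global_attention_mask,
            max_new_tokens=max_summary_tokens,
            num_beams=4,
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize([("Alice", "Shall we start?"), ("Bob", "Yes, first item is the budget.")])` downloads the checkpoint on first use. The released checkpoints are pre-trained only, so useful summaries should be expected only after fine-tuning on a downstream dataset.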

Datasets

Please download the five datasets used in our paper (AMI, ICSI, QMSum, ForeverDreaming, TVMegaSite) here.

Finetuning for Downstream Tasks

To apply the pre-trained models to downstream tasks on long dialogues, please go to the model-specific folders in this repository.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos is subject to those third parties' policies.
