code for modular summarization work published in ACL2021 by Krishna et al

Last update: Nov 24, 2022

Related tags

Overview

This repository contains the code for running modular summarization pipelines as described in the publication
Krishna K, Khosla K, Bigham J, Lipton ZC. Generating SOAP Notes from Doctor-Patient Conversations." ACL 2021.

Instructions

Although we can not release models trained on the confidential medical data, we have released models trained on the publicly available AMI dataset.
To reproduce the results on the AMI dataset, you need to follow the steps listed below. For convenience, we have also created a Google Colab notebook here that runs these steps on Google's servers (free-of-cost as of June 2021) and produces the summaries and their rouge scores.

Step1: Set up the environment by installing the required packages mentioned in requirements.txt using pip.

Step2: Download the ami_models folder from this link and put it at the root of the repository:

Step3: Run the following 3 commands to prepare data, run summary generation pipelines, and show the achieved rouge scores.

# command1: downloads and preprocesses AMI dataset  
./prepare_data.sh  
  
 # command2: runs the summarization pipelines on the data and computes rouge scores  
 # (before running this command, you need to download the models as shown above)  
./predict_ami.sh  
  
# command3: print the results  
python show_results.py

code for modular summarization work published in ACL2021 by Krishna et al

Related tags

Overview

Instructions

Owner

Approximately Correct Machine Intelligence (ACMI) Lab

Chinese Named Entity Recognization (BiLSTM with PyTorch)

STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

Text to speech for Vietnamese, ez to use, ez to update

pyupbit 라이브러리를 활용하여 upbit에서 비트코인을 자동매매하는 코드입니다. 조코딩 유튜브 채널에서 자세한 강의 영상을 보실 수 있습니다.

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"

Generate vector graphics from a textual caption

NSFW A chatbot based on GPT2-chitchat

Refactored version of FastSpeech2

Nmt - TensorFlow Neural Machine Translation Tutorial

A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaches for achieving this in this repo.

Practical Machine Learning with Python

Implementation of paper Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa.

Mycroft Core, the Mycroft Artificial Intelligence platform.

JaQuAD: Japanese Question Answering Dataset

The training code for the 4th place model at MDX 2021 leaderboard A.

Code for Discovering Topics in Long-tailed Corpora with Causal Intervention.

Crie tokens de autenticação íntegros e seguros com UToken.

code for modular summarization work published in ACL2021 by Krishna et al

Related tags

Overview

Instructions

Owner

Approximately Correct Machine Intelligence (ACMI) Lab

Chinese Named Entity Recognization (BiLSTM with PyTorch)

STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

Text to speech for Vietnamese, ez to use, ez to update

pyupbit 라이브러리를 활용하여 upbit에서 비트코인을 자동매매하는 코드입니다. 조코딩 유튜브 채널에서 자세한 강의 영상을 보실 수 있습니다.

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"

Generate vector graphics from a textual caption

**NSFW** A chatbot based on GPT2-chitchat

Refactored version of FastSpeech2

Nmt - TensorFlow Neural Machine Translation Tutorial

A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaches for achieving this in this repo.

Practical Machine Learning with Python

Implementation of paper Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa.

Mycroft Core, the Mycroft Artificial Intelligence platform.

JaQuAD: Japanese Question Answering Dataset

The training code for the 4th place model at MDX 2021 leaderboard A.

Code for Discovering Topics in Long-tailed Corpora with Causal Intervention.

Crie tokens de autenticação íntegros e seguros com UToken.

NSFW A chatbot based on GPT2-chitchat