AMUSE

AMUSE - financial summarization

Unzip data.zip

Train new model:

python FinAnalyze.py --task train --start 0 --count --modelpath data/models/new_model.h5 --train data/train --gold data/gold

data/train = dir where the text files are data/gold = dir where the gold summaries are

Trains new AMUSE prediction model for given files and stores it in an .h5 file

Generate summaries with existing model:

python FinAnalyze.py --task generate-summaries --start 0 --count --modelpath data/models/new_model.h5 --test data/test/ --summarydir data/summaries

Also stored:

a model trained on 3000 files named model.training.muse.3000.all.h5

If you use this code, please cite:

Litvak M, Vanetik N. Summarization of financial reports with AMUSE. In Proceedings of the 3rd Financial Narrative Processing Workshop 2021 (pp. 31-36).

@inproceedings{litvak2021summarization, title={Summarization of financial reports with AMUSE}, author={Litvak, Marina and Vanetik, Natalia}, booktitle={Proceedings of the 3rd Financial Narrative Processing Workshop}, pages={31--36}, year={2021} }

AMUSE - financial summarization

Related tags

Overview

AMUSE

Owner

SentimentArcs: a large ensemble of dozens of sentiment analysis models to analyze emotion in text over time

A python wrapper around the ZPar parser for English.

A 10000+ hours dataset for Chinese speech recognition

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

Understand Text Summarization and create your own summarizer in python

Pipelines de datos, 2021.

Curso práctico: NLP de cero a cien 🤗

DLO8012: Natural Language Processing & CSL804: Computational Lab - II

Open-World Entity Segmentation

Deduplication is the task to combine different representations of the same real world entity.

Mycroft Core, the Mycroft Artificial Intelligence platform.

Journey is a NLP-Powered Developer assistant

Contains links to publicly available datasets for modeling health outcomes using speech and language.

Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks

Transformers and related deep network architectures are summarized and implemented here.

DeepAmandine is an artificial intelligence that allows you to talk to it for hours, you won't know the difference.

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

👄 The most accurate natural language detection library for Python, suitable for long and short text alike

문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.