Ecco is a python library for exploring and explaining Natural Language Processing models using interactive visualizations.

Last update: Dec 25, 2022

Overview

Ecco is a python library for exploring and explaining Natural Language Processing models using interactive visualizations.

Ecco provides multiple interfaces to aid the explanation and intuition of Transformer-based language models. Read: Interfaces for Explaining Transformer Language Models.

Ecco runs inside Jupyter notebooks. It is built on top of pytorch and transformers.

Ecco is not concerned with training or fine-tuning models. Only exploring and understanding existing pre-trained models. The library is currently an alpha release of a research project. You're welcome to contribute to make it better!

Documentation: ecco.readthedocs.io

Features

Support for a wide variety of language models (GPT2, BERT, RoBERTA, T5, T0, and others).
Ability to add your own local models (if they're based on Hugging Face pytorch models).
Feature attribution (IntegratedGradients, Saliency, InputXGradient, DeepLift, DeepLiftShap, GuidedBackprop, GuidedGradCam, Deconvolution, and LRP via Captum)
Capture neuron activations in the FFNN layer in the Transformer block
Identify and visualize neuron activation patterns (via Non-negative Matrix Factorization)
Examine neuron activations via comparisons of activations spaces using SVCCA, PWCCA, and CKA
Visualizations for:
- Evolution of processing a token through the layers of the model (Logit lens)
- Candidate output tokens and their probabilities (at each layer in the model)

Examples:

What is the sentiment of this film review?

Use a large language model (T5 in this case) to detect text sentiment. In addition to the sentiment, see the tokens the model broke the text into (which can help debug some edge cases).

Which words in this review lead the model to classify its sentiment as "negative"?

Feature attribution using Integrated Gradients helps you explore model decisions. In this case, switching "weakness" to "inclination" allows the model to correctly switch the prediction to positive.

Explore the world knowledge of GPT models by posing fill-in-the blank questions.

Does GPT2 know where Heathrow Airport is? Yes. It does.

What other cities/words did the model consider in addition to London?

Visualize the candidate output tokens and their probability scores.

Which input words lead it to think of London?

At which layers did the model gather confidence that London is the right answer?

The model chose London by making the highest probability token (ranking it #1) after the last layer in the model. How much did each layer contribute to increasing the ranking of London? This is a logit lens visualizations that helps explore the activity of different model layers.

What are the patterns in BERT neuron activation when it processes a piece of text?

A group of neurons in BERT tend to fire in response to commas and other punctuation. Other groups of neurons tend to fire in response to pronouns. Use this visualization to factorize neuron activity in individual FFNN layers or in the entire model.

Read the paper:

Ecco: An Open Source Library for the Explainability of Transformer Language Models Association for Computational Linguistics (ACL) System Demonstrations, 2021

Tutorials

Video: Take A Look Inside Language Models With Ecco. [Colab Notebook]

How-to Guides

API Reference

The API reference and the architecture page explain Ecco's components and how they work together.

Gallery & Examples

Predicted Tokens: View the model's prediction for the next token (with probability scores). See how the predictions evolved through the model's layers. [Notebook] [Colab]

Rankings across layers: After the model picks an output token, Look back at how each layer ranked that token. [Notebook] [Colab]

Layer Predictions:Compare the rankings of multiple tokens as candidates for a certain position in the sequence. [Notebook] [Colab]

Primary Attributions: How much did each input token contribute to producing the output token? [Notebook] [Colab]

Detailed Primary Attributions: See more precise input attributions values using the detailed view. [Notebook] [Colab]

Neuron Activation Analysis: Examine underlying patterns in neuron activations using non-negative matrix factorization. [Notebook] [Colab]

Getting Help

Having trouble?

The Discussion board might have some relevant information. If not, you can post your questions there.
Report bugs at Ecco's issue tracker

Bibtex for citations:

@inproceedings{alammar-2021-ecco,
    title = "Ecco: An Open Source Library for the Explainability of Transformer Language Models",
    author = "Alammar, J",
    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations",
    year = "2021",
    publisher = "Association for Computational Linguistics",
}

Comments

Support for T5-like Seq2SeqLM

Hello, I was wondering if there are any plans for explicit encoder-decoder models like T5. Although T5 was not pre-trained with auto-regressive LM objective it is a pretty good candidate for ecco's generate method. I tried running t5 as it was listed in model-config.yaml but soon ran into issues because the current implementation is very much suited to gpt like models.

I made some changes on a fork to get attribution working, but not sure if I did it correctly https://colab.research.google.com/drive/1zahIWgOCySoQXQkAaEAORZ5DID11qpkH?usp=sharing https://github.com/chiragjn/ecco/tree/t5_exp

I would love to contribute to add support with some help, especially on the overall implementation design

opened by chiragjn 8
Adds a model config field use_causal_lm and config entries for gpt-neo

Adding gpt-neo models to model-config.yaml failed because the model needs to be loaded using AutoModelForCausalLM, but init identified such models by looking for gpt2 in the name. A TODO comment in init mentioned using config instead. I refactored config loading slightly to enable this - not sure if that is the direction you intended or not.

opened by stprior 8
Add a `conda` install option for `ecco`
A conda install option for ecco could be helpful for two reasons:

Easy installation with version management with conda.

For other libraries, which if depend on ecco, if you want them on conda-forge channel as well, ecco must be available on conda-forge.

:bulb: I have already have started work on this. PR: https://github.com/conda-forge/staged-recipes/pull/17388

Once, the PR gets merged, you will be able to install ecco as:

conda install -c conda-forge ecco

I will send a PR to update your documentation, once the PR gets merged.
opened by sugatoray 7
Add support for PEGASUS model

I would like to add the support of PEGASUS in model-config.yaml.

PEGASUS model is an encoder-decoder type and the implementation is completely inherited from BartForConditionalGeneration. So the config is similar to the BART model.

Notes: This is my first time making a pull request on an open-source project, but hope this helps!

opened by thomas-chong 6
Add support for Integrated Gradients explainability method
In this PR, me and @SSamDav add support for the IG algorithm and make use of the same visualization plots used for input saliency. Besides, we also fix a saliency visualization bug for enc-dec models that was not addressed in the previous PR.

Notes:

The generate method became even slower with the IG method. We added an option to choose which attribution method to calculate, but it can be further improved. Maybe the visualization could be coupled with the generation itself.

The IG score has a convergence delta error that could be shown in the plot or, for example, be used to change the IG default parameters when a minimum error is not met.
opened by JoaoLages 5
attention head

Hi @jalammar, I tested some examples with Ecco, and I wanted to know if it is possible to change the head to view the activations for each one and for each layer?

opened by afcarvallo 5
Add support for more attribution methods

Hi, Currently, the project seems to be relying on grad-norm and grad-x-input to obtain the attributions. However, there are other arguably better (as discussed in recent work) methods to obtain saliency maps. Integrating them in this project would also provide a good way to compare them on the same input examples.

Some of these methods from the top of my head are- integrated gradients, gradient shapley, and LIME. Perhaps support for visualizing the attention map from the model being interpreted itself could also be added. Methods based on feature ablation are also possible but they might need more work to integrate.

There is support for these aforementioned methods on Captum, but it takes effort to get them working for NLP tasks, especially those based on language modeling. Thus, I feel this would be a useful addition here.
enhancement help wanted

opened by RachitBansal 5
token prefix in roberta model?

Trying to use a custom trained Roberta model by loading the config file but getting the error the token prefix is not present in the config. Any idea how to fix it?

opened by sarthusarth 4

output.saliency() displays nothing

I am trying to visualize saliency maps from a custom GPT model. Since I am concerned only about saliency maps, I just do the following:

out = OutputSeq(token_ids = input_ids, n_input_tokens = n_input_tokens, tokens = tokens, attribution = attr)
out.saliency()

I get no errors and nothing is displayed in the jupyter notebook, but when I open Chrome's Javascript console, I see the following thing.


(unknown) Ecco initialize.

  | l | @ | storage.googleapis.c…ust=1610606118793:1
-- | -- | -- | --
  | (anonymous) | @ | storage.googleapis.c…ust=1610606118793:1
  | autoTextColor | @ | storage.googleapis.c…ust=1610606118793:1
  | (anonymous) | @ | storage.googleapis.c…ust=1610606118793:1
  | (anonymous) | @ | d3js.org/d3.v5.min.j…ust=1610606118793:2
  | each | @ | d3js.org/d3.v5.min.j…ust=1610606118793:2
  | style | @ | d3js.org/d3.v5.min.j…ust=1610606118793:2
  | enter | @ | storage.googleapis.c…ust=1610606118793:1
  | (anonymous) | @ | storage.googleapis.c…ust=1610606118793:1
  | join | @ | d3js.org/d3.v5.min.j…ust=1610606118793:2
  | setupTokenBoxes | @ | storage.googleapis.c…ust=1610606118793:1
  | init | @ | storage.googleapis.c…ust=1610606118793:1
  | eval
  | execCb | @ | require.js:1693
  | check | @ | require.js:881
  | enable | @ | require.js:1173
  | init | @ | require.js:786
  | (anonymous) | @ | require.js:1457

DevTools failed to load SourceMap: Could not load content for http://localhost:8888/static/notebook/js/main.min.js.map: HTTP error: status code 404, net::ERR_HTTP_RESPONSE_CODE_FAILURE
DevTools failed to load SourceMap: Could not load content for https://storage.googleapis.com/wandb-cdn/production/d4e2434e6/raven.min.js.map: HTTP error: status code 404, net::ERR_HTTP_RESPONSE_CODE_FAILURE

How do I resolve this issue? Btw, I am running this notebook by sshing into my institute's remote machine.

opened by VirajBagal 4

Tell pip to install from setup.py

Forces pip install -r requirements.txt to install the same package versions specified in setup.py.

For details, see this comment.

Confirmed that tests pass locally after merging this and #13 . (Since #13 fixes tests, they won't pass until it is merged.)

opened by nostalgebraist 4
Memory management and tweaks
Hello Jay, thanks for all your work on GPT interpretation!

This PR contains changes I made in a personal fork while attempting to use ecco with a 1.5B-size GPT-2 model. There are 3 kinds of changes:

Attempts to plug memory leaks / otherwise reduce memory footprint

Bug fixes

Usability tweaks and new features

In retrospect, I wish I had made distinct branches for these 3 types of change, as together they now make up a pretty large PR. I can still go back and do that, if (say) you want to merge the bug fixes without the other ones.

Context: I am using ecco on a 1.5B-size GPT-2 model, using a Tesla T4 GPU (~15GB memory) on Colab.

I am using version 3.4.0 of transformers, which is the max version consistent with ecco's setup.py and hence the one I got on installation.

1. Memory management

Running lm.generate with this large model, I ran out of GPU memory. This surprised me, because memory has not been an issue for me using the same model in tensorflow.

After looking into it, I found a few places where use of GPU memory could be lowered:

past, which we don't use here, was still being computed on each step.

More importantly, python garbage collection was not (as far as I could tell) freeing the values of past produced on previous steps, so generating N tokens required enough memory to store the N pasts emitted from steps 1, 2, ..., N.

Mitigation: pass use_cache=False to the model's forward pass, so it doesn't return pasts

Saliency calculations all used retain_graph=True, so the backward graphs were never cleared.

Mitigation: when we do several gradient calculations per step, pass retain_graph=False to the last one

hidden_states were stored on the GPU during generation.

They don't need to be on the GPU at that time (because they aren't used in generation).

And, since we have a low CPU memory footprint otherwise, we have plenty of CPU memory to store them in.

Mitigation: call .cpu() on hidden states emitted from each step. If we want to calculate with them later on, move them back to self.device.

(Minor) Memory allocated for logit matrices from each step was not freed after sampling

Mitigation: output['logits']=None after rolling a sample

With these changes, I can run lm.generate for many 100s of steps, where previously I could only manage a small number, maybe ~10.

2. Bug fixes

activations_dict_to_array would fail in the edge case where we only have a single token in the prompt.

Issue: np.squeeze would wrongly eliminate the position axis (because its size was 1).

Mitigation: use np.concatenate, which doesn't add an unwanted singleton dimension, so we don't have to squeeze

top-p sampling did not work

Issue: top_k_top_p_filtering apparently expects a position axis in its input, even if that axis only has length 1

Mitigation: replace [-1, :] with [-1: ,:] and then squeeze after rolling a sample

3. Usability tweaks and new features

Added an option to not track hidden_states. This feels consistent with the way you can choose whether or not to track other things (activations, attn).

To help this work properly, switched from position-based indexing into the CausalLMOutputWithPast objects to key lookup, so we're robust to changes in the length/order of these objects.

Added the option to only track hidden states for a user-defined subset of layers, through the new kwarg collect_activations_layer_nums.

This is valuable with a large model where you may be only interested in a specific layer, and storing activations from all layers has high memory cost.

NMF now takes this kwarg and (if not None) uses it to map between row indices in activations and actual layer numbers. For example, if we are tracking layers 7 and 23, we will have an activation matrix with 2 rows. If passed from_layer=7, to_layer=8, we should retrieve the row slice [:1, :], not [7:8, :].

I realize this PR is unwieldy -- I just wanted to get my changes up in some form, since at least some of them seemed unambiguously helpful (bug fixes).

Let me know if you want me to break it down into smaller pieces, or if it needs other work, or if it is generally unhelpful for your goals, or whatever.

Did not run tox tests because I could not get them to run properly on my machine, even after downloading the tox.ini from one of the CI-related branches.
opened by nostalgebraist 4
AttributeError: 'OutputSeq' object has no attribute 'saliency'
captum 0.5.0 torch 1.13.0+cu117

Language_Models_and_Ecco_PyData_Khobar.ipynb

text= "The countries of the European Union are:\n1. Austria\n2. Belgium\n3. Bulgaria\n4." output_3 = lm.generate(text, generate=20, do_sample=True) output_3.saliency()

AttributeError Traceback (most recent call last) Cell In [13], line 1 ----> 1 output_3.saliency()

AttributeError: 'OutputSeq' object has no attribute 'saliency'
opened by Claus1 1
Rankings_watch displaying wrong sequence

Hello, I have a problem with the rankings_watch() function. I used a predefined GPT2 model and gave it the input "Today, the weather is". However, in the visualization, only the first token is shown although the model creates the output correctly:

Thank you for your help :D
bug

opened by MiriUll 1
Running Eccomap for Pre Trained BertForMaskedLM

Hi, I was trying to run my pretrained model for which i had used BERTForMaskedLM model class from hugging face but its giving me this error. Plese help me in resolving this error. Thanks in advance.

opened by iamakshay1 1
Remove `tokenizer_config` usage from the library

This config parameter was made to easily package config to send to the Javascript components. Ecco now handles all tokenization on the Python side to separate the concerns between the python and JS components. Subsequently, this needs to be removed.

opened by jalammar 0

Tokenizer has partial token suffix instead of prefix

Following your guide for identifying model configuration

MODEL_ID = "vinai/bertweet-base"

from transformers import AutoModelForSequenceClassification, AutoTokenizer
model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID)
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, normalization=True, use_fast=False)

ids= tokenizer('tokenization')
ids

returns:

{'input_ids': [0, 969, 6186, 6680, 2], 'token_type_ids': [0, 0, 0, 0, 0], 'attention_mask': [1, 1, 1, 1, 1]}

Then

tokenizer.convert_ids_to_tokens(ids['input_ids'])

returns:

['<s>', 'to@@', 'ken@@', 'ization', '</s>']

Here I noticed that the tokenizer adds a partial token suffix instead of partial token prefix. Having a suffix instead of prefix is not configurable in the config.

opened by guustfranssensEY 1

Releases(v0.1.2)

v0.1.2(Jan 9, 2022)

Hotfix for T5 on later versions of transformers.
Source code(tar.gz)
Source code(zip)
ecco-0.1.2-py2.py3-none-any.whl(69.06 KB)
ecco-0.1.2.tar.gz(64.03 KB)
v0.1.1(Jan 4, 2022)

Hotfixes. Closes #57 and #56.
Source code(tar.gz)
Source code(zip)
ecco-0.1.1-py2.py3-none-any.whl(69.07 KB)
ecco-0.1.1.tar.gz(64.04 KB)
v0.1.0(Dec 29, 2021)
Big update to Ecco! Massive contributions from @JoaoLages and @SSamDav.

Added support of encoder-decoder models like T5

Using Captum for feature attribution adds support to these new methods: (IntegratedGradients, Saliency, InputXGradient, DeepLift, DeepLiftShap, GuidedBackprop, GuidedGradCam, Deconvolution, LRP). This replaces the previous implementation within Ecco for Saliency and InputXGradients.

Added support for Beam Search generation

Added support for importing local models. Very useful for analyzing finetuned models.

Added better support for various tokenizers in visualizations. Some more work needed on this front, still.

Source code(tar.gz)
Source code(zip)
ecco-0.1.0-py2.py3-none-any.whl(68.94 KB)
ecco-0.1.0.tar.gz(63.90 KB)
v0.0.15(Aug 2, 2021)
Added support for similarity score of activation matrices through CKA (centered kernel alignment), and Canonical Correlation Analysis methods: CCA, SVCCA, and PWCCA.

Better loading of configs and model support (thanks @stprior)

Source code(tar.gz)
Source code(zip)
v0.0.14(Feb 25, 2021)
Adds a documentation portal

LM now has a call() function so MLMs like BERT can be supported (without requiring text generation). Closes #18 call() and other functions now all support a batch dimension. The exception is "generate" which works on a single input sequence and not a batch. Closes #19

Set up ground-work towards #6. BERT is now supported for activation collection and an earlier version of NMF factorization. EccoJS needs to clean up partial token characters like (##). Or, better yet, eccojs should remain dumb and we give it the tokens cleaned up and it only worries about displaying them.

Part of the groundwork to support additional models is the model-config.yml file which should lay out how to connect ecco.LM with the underlying language model

Source code(tar.gz)
Source code(zip)
ecco-0.0.14-py2.py3-none-any.whl(75.03 KB)
ecco-0.0.14.tar.gz(54.24 KB)
v0.0.13(Feb 8, 2021)
Added Hugging Face Transformers v4 support. Closing #30.

Source code(tar.gz)
Source code(zip)
v0.0.12(Jan 5, 2021)
Larger GPT2 models can now work with long sequences in GPU without running out of memory.

Neuron activations: ability to specify capturing activations from certain layers

Thanks to contributor @nostalgebraist
Source code(tar.gz)
Source code(zip)
v0.0.10(Dec 16, 2020)

Alpha version
Source code(tar.gz)
Source code(zip)
v0.0.9RC(Dec 1, 2020)
Started working on docs

More tests. Github actions CI/CD

Source code(tar.gz)
Source code(zip)
v0.0.8(Nov 20, 2020)

Source code(tar.gz)
Source code(zip)

Owner

Jay Alammar

ML Research Engineer. Focused on NLP language models and visualization. @cohere-ai. Ex ML content dev @ Udacity.

GitHub Repository https://ecco.readthedocs.io

It analyze the sentiment of the user, whether it is postive or negative.

Sentiment-Analyzer-Tool It analyze the sentiment of the user, whether it is postive or negative. It uses streamlit library for creating this sentiment

18 Dec 17, 2022

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

NAME inflect.py - Correctly generate plurals, singular nouns, ordinals, indefinite articles; convert numbers to words. SYNOPSIS import inflect p = in

762 Dec 29, 2022

A Fast Sequence Transducer Implementation with PyTorch Bindings

transducer A Fast Sequence Transducer Implementation with PyTorch Bindings. The corresponding publication is Sequence Transduction with Recurrent Neur

184 Dec 18, 2022

What are the best Systems? New Perspectives on NLP Benchmarking

What are the best Systems? New Perspectives on NLP Benchmarking In Machine Learning, a benchmark refers to an ensemble of datasets associated with one

12 Nov 03, 2022

Korean extractive summarization. 2021 AI 텍스트 요약 온라인 해커톤 화성갈끄니까팀 코드

korean extractive summarization 2021 AI 텍스트 요약 온라인 해커톤 화성갈끄니까팀 코드 Leaderboard Notice Text Summarization with Pretrained Encoders에 나오는 bertsumext모델(ext

3 Aug 10, 2022

本插件是pcrjjc插件的重置版，可以独立于后端api运行

pcrjjc2 本插件是pcrjjc重置版，不需要使用其他后端api，但是需要自行配置客户端本项目基于AGPL v3协议开源，由于项目特殊性，禁止基于本项目的任何商业行为配置方法环境需求：.net framework 4.5及以上 jre8 别忘了装jre8 别忘了装jre8 别忘了装jre8

132 Dec 26, 2022

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

Non-Autoregressive Translation with Layer-Wise Prediction and Deep Supervision Training Efficiency We show the training efficiency of our DSLP model b

37 Jan 04, 2023

Jarvis is a simple Chatbot with a GUI capable of chatting and retrieving information and daily news from the internet for it's user.

J.A.R.V.I.S Kindly consider starring this repository if you like the program :-) What/Who is J.A.R.V.I.S? J.A.R.V.I.S is an chatbot written that is bu

50 Dec 31, 2022

An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.

Extracting OpenAI CLIP (Global/Grid) Features from Image and Text This repo aims at providing an easy to use and efficient code for extracting image &

13 Jan 06, 2023

Lumped-element impedance calculator and frequency-domain plotter.

fastZ: Lumped-Element Impedance Calculator fastZ is a small tool for calculating and visualizing electrical impedance in Python. Features include: Sup

47 Nov 18, 2022

News-Articles-and-Essays - NLP (Topic Modeling and Clustering)

NLP T5 Project proposal Topic Modeling and Clustering of News-Articles-and-Essays Students: Nasser Alshehri Abdullah Bushnag Abdulrhman Alqurashi OVER

2 Jan 18, 2022

Basic Utilities for PyTorch Natural Language Processing (NLP)

Basic Utilities for PyTorch Natural Language Processing (NLP) PyTorch-NLP, or torchnlp for short, is a library of basic utilities for PyTorch NLP. tor

2.1k Jan 01, 2023

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

Auto-Research A no-code utility to generate a detailed well-cited survey with topic clustered sections (draft paper format) and other interesting arti

20 Dec 14, 2022

PyTorch original implementation of Cross-lingual Language Model Pretraining.

XLM NEW: Added XLM-R model. PyTorch original implementation of Cross-lingual Language Model Pretraining. Includes: Monolingual language model pretrain

2.7k Dec 27, 2022

A Python script that compares files in directories

compare-files A Python script that compares files in different directories, this is similar to the command filecmp.cmp(f1, f2). I made this script in

1 Oct 15, 2021

Anomaly Detection 이상치 탐지 전처리 모듈

Anomaly Detection 시계열 데이터에 대한 이상치 탐지 1. Kernel Density Estimation을 활용한 이상치 탐지 train_data_path와 test_data_path에 존재하는 시점 정보를 포함하고 있는 csv 형태의 train data와

43 Nov 28, 2022

official ( API ) for the zAmericanEnglish app in [ Google play ] and [ App store ]

3 Jan 12, 2022

This repository details the steps in creating a Part of Speech tagger using Trigram Hidden Markov Models and the Viterbi Algorithm without using external libraries.

POS-Tagger This repository details the creation of a Part-of-Speech tagger using Trigram Hidden Markov Models to predict word tags in a word sequence.

1 Dec 09, 2021

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

textgenrnn Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code, or quickly tr

4.8k Dec 30, 2022

Pretrained Japanese BERT models

Pretrained Japanese BERT models This is a repository of pretrained Japanese BERT models. The models are available in Transformers by Hugging Face. Mod

387 Dec 30, 2022

Ecco is a python library for exploring and explaining Natural Language Processing models using interactive visualizations.

Related tags

Overview

Features

Examples:

What is the sentiment of this film review?

Which words in this review lead the model to classify its sentiment as "negative"?

Explore the world knowledge of GPT models by posing fill-in-the blank questions.

What other cities/words did the model consider in addition to London?

Which input words lead it to think of London?

At which layers did the model gather confidence that London is the right answer?

What are the patterns in BERT neuron activation when it processes a piece of text?

Tutorials

How-to Guides

API Reference

Gallery & Examples

Getting Help

Comments

Releases(v0.1.2)

v0.1.2(Jan 9, 2022)

v0.1.1(Jan 4, 2022)

v0.1.0(Dec 29, 2021)

v0.0.15(Aug 2, 2021)

v0.0.14(Feb 25, 2021)

v0.0.13(Feb 8, 2021)

v0.0.12(Jan 5, 2021)

v0.0.10(Dec 16, 2020)

v0.0.9RC(Dec 1, 2020)

v0.0.8(Nov 20, 2020)

Owner

Jay Alammar

It analyze the sentiment of the user, whether it is postive or negative.

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

A Fast Sequence Transducer Implementation with PyTorch Bindings

What are the best Systems? New Perspectives on NLP Benchmarking

Korean extractive summarization. 2021 AI 텍스트 요약 온라인 해커톤 화성갈끄니까팀 코드

本插件是pcrjjc插件的重置版，可以独立于后端api运行

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

Jarvis is a simple Chatbot with a GUI capable of chatting and retrieving information and daily news from the internet for it's user.

An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.

Lumped-element impedance calculator and frequency-domain plotter.

News-Articles-and-Essays - NLP (Topic Modeling and Clustering)

Basic Utilities for PyTorch Natural Language Processing (NLP)

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

PyTorch original implementation of Cross-lingual Language Model Pretraining.

A Python script that compares files in directories

Anomaly Detection 이상치 탐지 전처리 모듈

official ( API ) for the zAmericanEnglish app in [ Google play ] and [ App store ]

This repository details the steps in creating a Part of Speech tagger using Trigram Hidden Markov Models and the Viterbi Algorithm without using external libraries.

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

Pretrained Japanese BERT models