NLTK Source

Last update: Jan 04, 2023

Overview

Natural Language Toolkit (NLTK)

NLTK -- the Natural Language Toolkit -- is a suite of open source Python modules, data sets, and tutorials supporting research and development in Natural Language Processing. NLTK requires Python version 3.5, 3.6, 3.7, or 3.8.
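
The version requirement above can be checked before installing. The following is a minimal sketch, not part of NLTK itself: the helper name `is_supported` and the `SUPPORTED` set are hypothetical, introduced only for illustration, and the version list is copied from the sentence above.

```python
import sys

# Hypothetical constant: the (major, minor) pairs listed in this README.
SUPPORTED = {(3, 5), (3, 6), (3, 7), (3, 8)}


def is_supported(version_info=None):
    """Return True if the interpreter's (major, minor) is in SUPPORTED.

    `version_info` defaults to the running interpreter; any sequence
    starting with (major, minor) is accepted, which makes testing easy.
    """
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) in SUPPORTED
```

Calling `is_supported()` with no arguments checks the interpreter that is currently running; a script could print a warning or exit early when it returns `False`.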

For documentation, please visit nltk.org.

Contributing

Do you want to contribute to NLTK development? Great! Please read CONTRIBUTING.md for more details.

Donate

Have you found the toolkit helpful? Please support NLTK development by donating to the project via PayPal, using the link on the NLTK homepage.

Citing

If you publish work that uses NLTK, please cite the NLTK book, as follows:

Bird, Steven, Edward Loper, and Ewan Klein (2009).
Natural Language Processing with Python. O'Reilly Media Inc.

Copyright

For license information, see LICENSE.txt.

AUTHORS.md contains a list of everyone who has contributed to NLTK.

Redistributing

NLTK source code is distributed under the Apache 2.0 License.
NLTK documentation is distributed under the Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States license.
NLTK corpora are provided under the terms given in the README file for each corpus; all are redistributable and available for non-commercial use.
NLTK may be freely redistributed, subject to the provisions of these licenses.

