Natural Language Processing Specialization

Last update: Oct 06, 2022

Overview

Natural Language Processing Specialization

In this folder, Natural Language Processing Specialization projects and notes can be found.

WHAT I LEARNED

Use logistic regression, naïve Bayes, and word vectors to implement sentiment analysis, complete analogies & translate words.
Use dynamic programming, hidden Markov models, and word embeddings to implement autocorrect, autocomplete & identify part-of-speech tags for words.
Use recurrent neural networks, LSTMs, GRUs & Siamese networks in Trax for sentiment analysis, text generation & named entity recognition.
Use encoder-decoder, causal, & self-attention to machine translate complete sentences, summarize text, build chatbots & question-answering.

There are 4 Courses in this Specialization

Course 1 - Natural Language Processing with Classification and Vector Spaces

In the first course of the Natural Language Processing Specialization
I performed sentiment analysis of tweets using logistic regression and then naïve Bayes,
I used vector space models to discover relationships between words and used PCA to reduce the dimensionality of the vector space and visualize those relationships, and
I wrote a simple English to French translation algorithm using pre-computed word embeddings and locality-sensitive hashing to relate words via approximate k-nearest neighbor search.

Projects

Course 2 - Natural Language Processing with Probabilistic Models

In the second course of the Natural Language Processing Specialization
I wrote a simple auto-correct algorithm using minimum edit distance and dynamic programming,
I applied the Viterbi Algorithm for part-of-speech (POS) tagging, which is vital for computational linguistics,
I wrote a better auto-complete algorithm using an N-gram language model, and
I wrote my own Word2Vec model that uses a neural network to compute word embeddings using a continuous bag-of-words model.

Projects

Course 3 - Natural Language Processing with Sequence Models

In the third course of the Natural Language Processing Specialization
I trained a neural network with GLoVe word embeddings to perform sentiment analysis of tweets,
I generated synthetic Shakespeare text using a Gated Recurrent Unit (GRU) language model,
I trained a recurrent neural network to perform named entity recognition (NER) using LSTMs with linear layers, and
I used so-called ‘Siamese’ LSTM models to compare questions in a corpus and identify those that are worded differently but have the same meaning.

Projects

Course 4 - Natural Language Processing with Attention Models

In the fourth course of the Natural Language Processing Specialization
I translated complete English sentences into German using an encoder-decoder attention model,
I built a Transformer model to summarize text,
I used T5 and BERT models to perform question-answering, and
I built a chatbot using a Reformer model.

Projects

Disclaimer

DeepLearning.AI makes course notes available for educational purposes.
Project solutions are just for educational purposes. I highly recommend trying and solving project/program assignments on your own.

All the best 🤘

Natural Language Processing Specialization

Related tags

Overview

Natural Language Processing Specialization

WHAT I LEARNED

There are 4 Courses in this Specialization

Course 1 - Natural Language Processing with Classification and Vector Spaces

Projects

Course 2 - Natural Language Processing with Probabilistic Models

Projects

Course 3 - Natural Language Processing with Sequence Models

Projects

Course 4 - Natural Language Processing with Attention Models

Projects

Disclaimer

Owner

Kaan BOKE

This is Assignment1 code for the Web Data Processing System.

Conditional Transformer Language Model for Controllable Generation

Generate vector graphics from a textual caption

ADCS - Automatic Defect Classification System (ADCS) for SSMC

CoSENT 比Sentence-BERT更有效的句向量方案

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation

BERTopic is a topic modeling technique that leverages 🤗 transformers and c-TF-IDF to create dense clusters allowing for easily interpretable topics whilst keeping important words in the topic descriptions

Python implementation of TextRank for phrase extraction and summarization of text documents

The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywords

Binaural Speech Synthesis

Maha is a text processing library specially developed to deal with Arabic text.

Türkçe küfürlü içerikleri bulan bir yapay zeka kütüphanesi / An ML library for profanity detection in Turkish sentences

Training RNNs as Fast as CNNs

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation

QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries

An attempt to map the areas with active conflict in Ukraine using open source twitter data.

precise iris segmentation

In this repository we have tested 3 VQA models on the ImageCLEF-2019 dataset.

KoBERTopic은 BERTopic을 한국어 데이터에 적용할 수 있도록 토크나이저와 BERT를 수정한 코드입니다.

Data manipulation and transformation for audio signal processing, powered by PyTorch