Natural Language Processing

Here you will find the teaching materials for the "Natural Language Processing" course at EDHEC Business School, 2022

What is the course about?

The course is designed as an introduction to the basics of natural language processing for analyzing unstructured, user-generated content. It is for beginners to the topic (and NLP in general), but it will be helpful to have basic knowledge of Python and a familarity with data science techniques.

Topics covered include:

text preprocessing in Python,
collecting your own data from Twitter and Reddit,
content analysis,
text embeddings, and
supervised learning with text data.

What materials are available here?

The sildes will be posted on the course BlackBoard page. They mostly serve as a high-level introduction to the examples and exercies (in Colab notebooks), which are linked to from the slides themselves. Copies of the Colab notebooks can also be found in the folder called /colab in this repository.

Can I work through the material on my own?

If you didn't attend the class, you can certainly work through the materials on your own (the Colab notebooks are designed to be readable and doable for individuals working at their own pace). The slides posted on BlackBoard will guide you through the content. The notebooks are intendend to be worked through in order. Each one will have examples to view and 1 or 2 practice exercises to complete.

Aknowledgements

I would like to aknowledge Steve Wilson at Oakland University for making his DS3 workshop materials publically available with an MIT license.

Natural Language Processing at EDHEC, 2022

Related tags

Overview

Natural Language Processing

What is the course about?

What materials are available here?

Can I work through the material on my own?

Aknowledgements

Owner

使用pytorch+transformers复现了SimCSE论文中的有监督训练和无监督训练方法

Beyond the Imitation Game collaborative benchmark for enormous language models

Natural Language Processing Specialization

2021 AI CUP Competition on Traditional Chinese Scene Text Recognition - Intermediate Contest

Lattice methods in TensorFlow

End-to-end image captioning with EfficientNet-b3 + LSTM with Attention

Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Open source annotation tool for machine learning practitioners.

OCR을 이용하여 인원수를 인식 후 줌을 Kill 해줍니다

Official PyTorch Implementation of paper "NeLF: Neural Light-transport Field for Single Portrait View Synthesis and Relighting", EGSR 2021.

Python Implementation of ``Modeling the Influence of Verb Aspect on the Activation of Typical Event Locations with BERT'' (Findings of ACL: ACL 2021)

PUA Programming Language written in Python.

The proliferation of disinformation across social media has led the application of deep learning techniques to detect fake news.

Repository for the paper "Optimal Subarchitecture Extraction for BERT"

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Smart discord chatbot integrated with Dialogflow to manage different classrooms and assist in teaching!

Python library to make development of portfolio analysis faster and easier

CodeBERT: A Pre-Trained Model for Programming and Natural Languages.

In this workshop we will be exploring NLP state of the art transformers, with SOTA models like T5 and BERT, then build a model using HugginFace transformers framework.