Text Classification in Turkish Texts with Bert

Last update: Dec 31, 2022

Overview

You can watch the details of the project on my youtube channel

Project Interface

Project Second Interface

Goal= Correctly guessing the classification of texts and audios

BERT_Text_Classification

It is a text classification task implementation transformers (by HuggingFace) with BERT. It contains several parts:

--Data pre-processing

--BERT tokenization and input formating

--Train with BERT

--Evaluation

--Save and load saved model

Text-classification-transformers

Text classification tasks are most easily encountered in the area of natural language processing and can be used in various ways.

However, the given data needs to be preprocessed and the model's data pipeline must be created according to the preprocessing.

The purpose of this Repository is to allow text classification to be easily performed with Transformers (BERT)-like models if text classification data has been preprocessed into a specific structure.

Implemented based on Huggingfcae transformers for quick and convenient implementation.

Text Classification in Turkish Texts with Bert

Related tags

Overview

You can watch the details of the project on my youtube channel

Project Interface

Project Second Interface

BERT_Text_Classification

Text-classification-transformers

📝 read_dataset

Unique Categories

☄️ Available models

🏴‍☠️ Model Performance

Predictions Vs Actuals

🃏 predictor

97.22 📈

Owner

DeepPavlov Tutorials

Unsupervised Language Modeling at scale for robust sentiment classification

Grover is a model for Neural Fake News -- both generation and detectio

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

NewsMTSC: (Multi-)Target-dependent Sentiment Classification in News Articles

Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT)

iBOT: Image BERT Pre-Training with Online Tokenizer

Pipeline for fast building text classification TF-IDF + LogReg baselines.

SpikeX - SpaCy Pipes for Knowledge Extraction

pyupbit 라이브러리를 활용하여 upbit에서 비트코인을 자동매매하는 코드입니다. 조코딩 유튜브 채널에서 자세한 강의 영상을 보실 수 있습니다.

PhoNLP: A BERT-based multi-task learning toolkit for part-of-speech tagging, named entity recognition and dependency parsing

Ukrainian TTS (text-to-speech) using Coqui TTS

A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion

JaQuAD: Japanese Question Answering Dataset

Script to generate VAD dataset used in Asteroid recipe

Code for Text Prior Guided Scene Text Image Super-Resolution

Reproduction process of BERT on SST2 dataset

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition"

Code for Findings at EMNLP 2021 paper: "Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning"

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.