PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Last update: Nov 12, 2021

Overview

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Abstract

NLP applications for code-mixed (CM) or mix-lingual text have gained a significant momentum recently, the main reason being the prevalence of language mixing in social media communications in multi-lingual societies like India, Mexico, Europe, parts of USA etc. Word embeddings are basic building blocks of any NLP system today, yet, word embedding for CM languages is an unexplored territory. The major bottleneck for CM word embeddings is switching points, where the language switches. These locations lack in contextually and statistical systems fail to model this phenomena due to high variance in the seen examples. In this paper we present our initial observations on applying switching point based positional encoding techniques for CM language, specifically Hinglish (Hindi - English). Results are only marginally better than SOTA, but it is evident that positional encoding could be an effective way to train position sensitive language models for CM text.

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

@inproceedings{ali-etal-relative,
title = {PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages},
author = {Mohsin Ali and Kandukuri Sai Teja and Sumanth Manduru and Parth Patwa and Amitava Das}
booktitle =  {Proceedings of the AAAI Conference on Artificial Intelligence},
year = {2022},}

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Related tags

Overview

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Abstract

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

Owner

Mohsin Ali, Mohammed

Social Distancing Detector

This is a TensorFlow implementation for C2-Rec

Official code for the CVPR 2022 (oral) paper "Extracting Triangular 3D Models, Materials, and Lighting From Images".

The Illinois repository for Climatehack (https://climatehack.ai/). We won 1st place!

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

Malmo Collaborative AI Challenge - Team Pig Catcher

This is the repository for CVPR2021 Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales

Official PyTorch implemention of our paper "Learning to Rectify for Robust Learning with Noisy Labels".

VideoGPT: Video Generation using VQ-VAE and Transformers

A system used to detect whether a person is wearing a medical mask or not.

LieTransformer: Equivariant Self-Attention for Lie Groups

FS-Mol: A Few-Shot Learning Dataset of Molecules

PAIRED in PyTorch 🔥

Interactive Image Generation via Generative Adversarial Networks

TraSw for FairMOT - A Single-Target Attack example (Attack ID: 19; Screener ID: 24):

[NeurIPS2021] Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks

Unofficial implementation of PatchCore anomaly detection

Magisk module to enable hidden features on Android 12 Developer Preview 1.

Prompt-BERT: Prompt makes BERT Better at Sentence Embeddings

FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks