PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Last update: Nov 12, 2021

Overview

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Abstract

NLP applications for code-mixed (CM) or mix-lingual text have gained a significant momentum recently, the main reason being the prevalence of language mixing in social media communications in multi-lingual societies like India, Mexico, Europe, parts of USA etc. Word embeddings are basic building blocks of any NLP system today, yet, word embedding for CM languages is an unexplored territory. The major bottleneck for CM word embeddings is switching points, where the language switches. These locations lack in contextually and statistical systems fail to model this phenomena due to high variance in the seen examples. In this paper we present our initial observations on applying switching point based positional encoding techniques for CM language, specifically Hinglish (Hindi - English). Results are only marginally better than SOTA, but it is evident that positional encoding could be an effective way to train position sensitive language models for CM text.

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

@inproceedings{ali-etal-relative,
title = {PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages},
author = {Mohsin Ali and Kandukuri Sai Teja and Sumanth Manduru and Parth Patwa and Amitava Das}
booktitle =  {Proceedings of the AAAI Conference on Artificial Intelligence},
year = {2022},}

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Related tags

Overview

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Abstract

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

Owner

Mohsin Ali, Mohammed

PyTorch code for training MM-DistillNet for multimodal knowledge distillation

ServiceX Transformer that converts flat ROOT ntuples into columnwise data

Density-aware Single Image De-raining using a Multi-stream Dense Network (CVPR 2018)

https://sites.google.com/cornell.edu/recsys2021tutorial

Educational 2D SLAM implementation based on ICP and Pose Graph

Code for Blind Image Decomposition (BID) and Blind Image Decomposition network (BIDeN).

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

This is an example of a reproducible modelling project

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Python版OpenCVのTracking APIのサンプルです。DaSiamRPNアルゴリズムまで対応しています。

68 keypoint annotations for COFW test data

Scientific Computation Methods in C and Python (Open for Hacktoberfest 2021)

Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation

YOLO-v5 기반 단안 카메라의 영상을 활용해 차간 거리를 일정하게 유지하며 주행하는 Adaptive Cruise Control 기능 구현

Experiments for Fake News explainability project

This is a Keras implementation of a CNN for estimating age, gender and mask from a camera.

PyTorch and GPyTorch implementation of the paper "Conditioning Sparse Variational Gaussian Processes for Online Decision-making."

This is the PyTorch implementation of GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation

PyTorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision.

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"