PyTorch Seq2Seq Intent Parsing

Reframing intent parsing as a human - machine translation task. Work in progress successor to torch-seq2seq-intent-parsing

The command language

This is a simple command language developed for the "home assistant" Maia living in my apartment. She's designed as a collection of microservices with services for lights (Hue), switches (WeMo), and info such as weather and market prices.

A command consists of a "service", a "method", and some number of arguments.

lights setState office_light on
switches getState teapot
weather getWeather "San Francisco"
price getPrice TSLA

These can be represented with variable placeholders:

lights setState $device $state
switches getState $device
weather getWeather $location
price getPrice $symbol

We can imagine a bunch of human sentences that would map to a single command:

"Turn the office light on."
"Please turn on the light in the office."
"Maia could you set the office light on, thank you."

Which could similarly be represented with placeholders.

TODO: Specific vs. freeform variables

A shortcoming of the approach so far is that the model has to learn translations of specific values, for example mapping all of the device names to their equivalent device_name. If we added a "basement light" the model would have no basement_light in the output vocabulary unless it was re-trained.

The bigger the potential input space, the more obvious the problem - consider the getWeather command, where the model would need to be trained with every possible location we might ask about. Worse yet, consider a playMusic command that could take any song or artist name...

This can be solved with a technique which I have implemented in Torch here. The training pairs have "variable placeholders" in the output translation, which the model generates during an intial pass. Then the network fills in the values of these placeholders with an additional pass over the input.

Intent parsing and slot filling in PyTorch with seq2seq + attention

Related tags

Overview

PyTorch Seq2Seq Intent Parsing

The command language

TODO: Specific vs. freeform variables

Owner

Sean Robertson

Test-Time Personalization with a Transformer for Human Pose Estimation, NeurIPS 2021

MVGCN: a novel multi-view graph convolutional network (MVGCN) framework for link prediction in biomedical bipartite networks.

Attentive Implicit Representation Networks (AIR-Nets)

Romanian Automatic Speech Recognition from the ROBIN project

PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"

code for our ECCV 2020 paper "A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation"

Learning with Noisy Labels via Sparse Regularization, ICCV2021

pytorch implementation of GPV-Pose

Numerai tournament example scripts using NN and optuna

[CVPR 2022] Structured Sparse R-CNN for Direct Scene Graph Generation

High-Fidelity Pluralistic Image Completion with Transformers (ICCV 2021)

FedML: A Research Library and Benchmark for Federated Machine Learning

ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.

This project deploys a yolo fastest model in the form of tflite on raspberry 3b+. The model is from another repository of mine called -Trash-Classification-Car

Unofficial PyTorch reimplementation of the paper Swin Transformer V2: Scaling Up Capacity and Resolution

Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather

PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and reinforcement learning

A program to recognize fruits on pictures or videos using yolov5

An executor that loads ONNX models and embeds documents using the ONNX runtime.

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]