Deep learning based state estimation: incorporating Transformer and LSTM to Kalman Filter with EM algorithm

Overview

Kalman Filter requires the true parameters of the model and solves optimal state estimation recursively. Expectation Maximization (EM) algorithm is applicable for estimating the parameters of the model that are not available before Kalman filtering, which is EM-KF algorithm.
To improve the preciseness of EM-KF algorithm, the author presents a state estimation method by combining the Long-Short Term Memory network (LSTM), Transformer and EM-KF algorithm in the framework of Encoder-Decoder in Sequence to Sequence (seq2seq).
Simulation on a linear mobile robot model demonstrates that the new method is more accurate.
Please read our paper on arXiv: Incorporating Transformer and LSTM to Kalman Filter with EM algorithm for state estimation, for understanding the details w.r.t. theoretical analysis and experiment in our method.

Usage

python main.py

Requirements

The code has been tested running under Python3, with package PyTorch, NumPy, Matplotlib, PyKalman and their dependencies installed.

Methodology

We proposed encoder-decoder framework in seq2seq for state estimation, that state estimation is equivalent to encode and decode observation.

Previous works incorporating LSTM to KF, are adopting LSTM encoder and KF decoder. We proposed LSTM-KF adopting LSTM encoder and EM-KF decoder.
Before EM-KF decoder, replace LSTM encoder by Transformer encoder, we call this Transformer-KF.
Integrating Transformer and LSTM, we call this TL-KF.

Integrating Transformer and LSTM to encode observation before filtering, makes it easier for EM algorithm to estimate parameters.

Conclusions

Combining Transformer and LSTM as an encoder-decoder framework for observation, can depict state more effectively, attenuate noise interference, and weaken the assumption of Markov property of states, and conditional independence of observations. This can enhance the preciseness and robustness of state estimation.
Transformer, based on multi-head self attention and residual connection, can capture long-term dependency, while LSTM-encoder can model time-series. TL-KF, a combination of Transformer, LSTM and EM-KF, is precise for state estimation in systems with unknown parameters.
Kalman smoother can ameliorate Kalman filter, but in TL-KF, filtering is precise enough. Therefore, after offline training for parameter estimation, KF for online estimation can be adopted.

Citation

@article{shi2021kalman,
    author={Zhuangwei Shi},
    title={Incorporating Transformer and LSTM to Kalman Filter with EM algorithm for state estimation},
    journal={arXiv preprint arXiv:2105.00250},
    year={2021},
}

Incorporating Transformer and LSTM to Kalman Filter with EM algorithm

Related tags

Overview

Deep learning based state estimation: incorporating Transformer and LSTM to Kalman Filter with EM algorithm

Overview

Usage

Requirements

Methodology

Conclusions

Citation

Owner

zshicode

Official repository for the ISBI 2021 paper Transformer Assisted Convolutional Neural Network for Cell Instance Segmentation

SuperSonic, a new open-source framework to allow compiler developers to integrate RL into compilers easily, regardless of their RL expertise

SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches

Scalable implementation of Lee / Mykland (2012) and Ait-Sahalia / Jacod (2012) Jump tests for noisy high frequency data

Official implementation of the paper Label-Efficient Semantic Segmentation with Diffusion Models

1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection

A toolkit for developing and comparing reinforcement learning algorithms.

The code of “Similarity Reasoning and Filtration for Image-Text Matching” [AAAI2021]

A tight inclusion function for continuous collision detection

Stochastic Normalizing Flows

OrienMask: Real-time Instance Segmentation with Discriminative Orientation Maps

OneShot Learning-based hotword detection.

Mini-hmc-jax - A simple implementation of Hamiltonian Monte Carlo in JAX

Code repository for Semantic Terrain Classification for Off-Road Autonomous Driving

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

CSD: Consistency-based Semi-supervised learning for object Detection

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?

CryptoFrog - My First Strategy for freqtrade

Implementation of the famous Image Manipulation\Forgery Detector "ManTraNet" in Pytorch