Winning solution of the Indoor Location & Navigation Kaggle competition

Last update: Dec 28, 2022

Overview

This repository contains the code to generate the winning solution of the Kaggle competition on indoor location and navigation organized by Microsoft Research.

Our team name: "Track me if you can".

Authors:

Are Haartveit
Dmitry Gordeev
Tom Van de Wiele

References

Steps to obtain the approximate winning submission

Clone the repository, it doesn't matter where you clone it to since the source code and data are disentangled.
Create a project folder on a disk with at least 150GB of free space. Create a "Data" subfolder in your project folder. This will be referred to as "your data folder" in what follows.
Download the raw text data from here and extract it into your data folder.
Download the cleaned raw data from here and extract it into the "reference_preprocessed" subfolder of your data folder.
Add your data folder to line 19 in src/utils.py.
Run main.py.

If all goes well, the pipeline should create a "final_submissions" subfolder in your data folder with two final submissions. Note that these are likely slightly different from our actual submissions due to inherent training stochasticity. When you make a late submit of these submissions to the leaderboard, you should obtain a private score around 1.5, which can be further reduced to about 1.3 after fixing the private test floor predictions (not part of this repository).

Main script parameters

Mode ("-m" or "--mode"). Default: 'test'. Select from ('valid', 'test').
Suppress multipricessing ("-s"). Default: no suppression of multiprocessing.
Fast (and bad) sensor models ("-f"). Default: no fast sensor models. Mostly useful for verifying that all dependencies are in place. Ignored when copying sensor models (next parameter).
Copy sensor predictions ("-c"). Default: no copying of pretrained sensor predictions. Useful if you want to speed up the pipeline since training sensor models is the slowest part.

Hardware requirements

Due to the size of the data set, you need at least 32 GB RAM to be able to run the pipeline successfully.

Known issues

If you run out of memory, try running the pipeline again. It should continue where it left it in the previous run.

Winning solution of the Indoor Location & Navigation Kaggle competition

Related tags

Overview

References

Steps to obtain the approximate winning submission

Main script parameters

Hardware requirements

Known issues

Owner

Tom Van de Wiele

Reading list for research topics in Masked Image Modeling

Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)

PyTorch inference for "Progressive Growing of GANs" with CelebA snapshot

DANet for Tabular data classification/ regression.

Code accompanying the paper "How Tight Can PAC-Bayes be in the Small Data Regime?"

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data.

A copy of Ares that costs 30 fucking dollars.

Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization

Vision-and-Language Navigation in Continuous Environments using Habitat

Implementation of ConvMixer in TensorFlow and Keras

make ASCII Art by Deep Learning

The open source code of SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation.

Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

Temporal-Relational CrossTransformers

Turi Create simplifies the development of custom machine learning models.

PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks

A high-level Python library for Quantum Natural Language Processing

Lepard: Learning Partial point cloud matching in Rigid and Deformable scenes

The reference baseline of final exam for XMU machine learning course

Forecasting directional movements of stock prices for intraday trading using LSTM and random forest