Winning solution of the Indoor Location & Navigation Kaggle competition

Last update: Dec 28, 2022

Overview

This repository contains the code to generate the winning solution of the Kaggle competition on indoor location and navigation organized by Microsoft Research.

Our team name: "Track me if you can".

Authors:

Are Haartveit
Dmitry Gordeev
Tom Van de Wiele

References

Steps to obtain the approximate winning submission

Clone the repository, it doesn't matter where you clone it to since the source code and data are disentangled.
Create a project folder on a disk with at least 150GB of free space. Create a "Data" subfolder in your project folder. This will be referred to as "your data folder" in what follows.
Download the raw text data from here and extract it into your data folder.
Download the cleaned raw data from here and extract it into the "reference_preprocessed" subfolder of your data folder.
Add your data folder to line 19 in src/utils.py.
Run main.py.

If all goes well, the pipeline should create a "final_submissions" subfolder in your data folder with two final submissions. Note that these are likely slightly different from our actual submissions due to inherent training stochasticity. When you make a late submit of these submissions to the leaderboard, you should obtain a private score around 1.5, which can be further reduced to about 1.3 after fixing the private test floor predictions (not part of this repository).

Main script parameters

Mode ("-m" or "--mode"). Default: 'test'. Select from ('valid', 'test').
Suppress multipricessing ("-s"). Default: no suppression of multiprocessing.
Fast (and bad) sensor models ("-f"). Default: no fast sensor models. Mostly useful for verifying that all dependencies are in place. Ignored when copying sensor models (next parameter).
Copy sensor predictions ("-c"). Default: no copying of pretrained sensor predictions. Useful if you want to speed up the pipeline since training sensor models is the slowest part.

Hardware requirements

Due to the size of the data set, you need at least 32 GB RAM to be able to run the pipeline successfully.

Known issues

If you run out of memory, try running the pipeline again. It should continue where it left it in the previous run.

Winning solution of the Indoor Location & Navigation Kaggle competition

Related tags

Overview

References

Steps to obtain the approximate winning submission

Main script parameters

Hardware requirements

Known issues

Owner

Tom Van de Wiele

A transformer which can randomly augment VOC format dataset (both image and bbox) online.

Photo2cartoon - 人像卡通化探索项目 (photo-to-cartoon translation project)

Autonomous Perception: 3D Object Detection with Complex-YOLO

Group-Free 3D Object Detection via Transformers

Create images and texts with the First Order Generative Adversarial Networks

A sketch extractor for anime/illustration.

Data Augmentation Using Keras and Python

Official repository for the ISBI 2021 paper Transformer Assisted Convolutional Neural Network for Cell Instance Segmentation

A Partition Filter Network for Joint Entity and Relation Extraction EMNLP 2021

GraphGT: Machine Learning Datasets for Graph Generation and Transformation

PyTorch Implement for Path Attention Graph Network

The first dataset on shadow generation for the foreground object in real-world scenes.

Using Machine Learning to Create High-Res Fine Art

Python library to receive live stream events like comments and gifts in realtime from TikTok LIVE.

Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.

DeepLearning Anomalies Detection with Bluetooth Sensor Data

Decorators for maximizing memory utilization with PyTorch & CUDA

KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch

Official implementation for ICDAR 2021 paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer"

Optimizaciones incrementales al problema N-Body con el fin de evaluar y comparar las prestaciones de los traductores de Python en el ámbito de HPC.