Flight Delay Prediction

Our objective is to predict arrival delays of commercial flights. According to the US Department of Transportation, about 21% of commercial flights scheduled between June 2003 and October 2021 have experienced some form of delay. It is critical for airlines to estimate flight delays as accurately as possible in order to improve customer satisfaction and optimize the income of airline agencies. This project will be evaluated on the basis of arrival delay prediction accuracy for flights

Contributors

Jordan Silke
Jonas Bacareza

Understanding the problem

In an effort to understand some common causes of commercial flight delays, a number of sources were consulted including government agencies and flight-focused blog posts. A brief overview of findings can be found in the Research directory. These common causes will inform feature selection and engineering decisions.

Data description

Data was sourced from a LHL PostgreSQL database and descriptions were provided for each table. We used a custom script to extract the feature names from these description files and the raw data can be found here. The rationale behind missing value processing can be reviewed and reproduced by reading and executing the data_overview notebook. The data from the flights table included in this repository is a randomly sampled subset of the source table.

Recommended exploration

Task	Status
Test the hypothesis that the arrival delay is from Normal distribution and that mean of the delay is 0. Be careful about the outliers.	✅
Is average/median monthly delay different during the year? If so, which months have the biggest delays and what could be the reason?	✅
Does the weather affect the delay?	🧰
How are taxi times changing during the day? Does higher traffic lead to longer taxi times?	✅
What is the average percentage of delays that exist prior to departure (i.e. are arrival delays caused by departure delays)? Are airlines able to lower the delay during the flights?	✅
How many states cover 50% of US air traffic?	✅
Test the hypothesis that planes fly faster when there is a departure delay.	✅
When (which hour) do most 'LONG', 'SHORT', 'MEDIUM' haul flights take off?	🔳
Find the top 10 the bussiest airports. Does the greatest number of flights mean that the majority of passengers went through a given airport? How much traffic do these 10 airports cover?	🔳
Do bigger delays lead to bigger fuel consumption per passenger?	🔳

🔳 - To do.
✅ - Core task 'complete' (at least a first pass).
🧰 - Work in progress.

Exploration task results can be found here

Predicting the duration of arrival delays for commercial flights.

Related tags

Overview

Flight Delay Prediction

Contributors

Understanding the problem

Data description

Recommended exploration

Owner

Jordan Silke

public repo for ESTER dataset and modeling (EMNLP'21)

A Real-World Benchmark for Reinforcement Learning based Recommender System

A distributed, plug-n-play algorithm for multi-robot applications with a priori non-computable objective functions

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

TiP-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling

Point-NeRF: Point-based Neural Radiance Fields

Lucid Sonic Dreams syncs GAN-generated visuals to music.

Robotic Process Automation in Windows and Linux by using Driagrams.net BPMN diagrams.

A library of scripts that interact with the PythonTurtle module to create games, drawings, and more

PyTorch implementation for Graph Contrastive Learning with Augmentations

Continual World is a benchmark for continual reinforcement learning

U-Net: Convolutional Networks for Biomedical Image Segmentation

Keras + Hyperopt: A very simple wrapper for convenient hyperparameter optimization

Readings for "A Unified View of Relational Deep Learning for Polypharmacy Side Effect, Combination Therapy, and Drug-Drug Interaction Prediction."

KeypointDeformer: Unsupervised 3D Keypoint Discovery for Shape Control

Detect roadway lanes using Python OpenCV for project during the 5th semester at DHBW Stuttgart for lecture in digital image processing.

fcn by tensorflow

A program that can analyze videos according to the weights you select

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.