Geospatial data-science analysis on reasons behind delay in Grab ride-share services

Last update: Jun 07, 2022

Overview

Grab x Pulis

Detailed analysis done to investigate possible reasons for delay in Grab services for NUS Data Analytics Competition 2022, to be found in here and here.

Our main tech-stack:

Vahalla, a C++ implementation for map matching.
ipyleaflet, for very interactive visualizations of geospatial data analysis
geopandas
Dask
matplotlib & seaborn

We've shortlisted the reasons to be:

Traffic bottlenecks at popular shopping malls due to narrow infrastructures of pickup points. We comparatively found out that pickup speeds at Changi Airport with optimized pick-up and drop-off pioints are much faster at the initial and end-timings of each trip, compared to popular shopping malls with narrow queues at their pick-up and drop-off locations.
Drivers picking inefficient routes, as we compare the actual driver routes taken with popular Google Maps and Open Street Map routes which we pulled using Google Maps API and osmnx. We found out that drivers's supposed "shortcuts" are more often slower, albeit, there were in-fact expert-curated routes which were actually even faster than Google Maps and Open Street Maps. These insights could be used to augment Grab-Nav!

Team:

Keng Hwee Lead @kenghweeng
Russell Saerang @RussellDash332
Sean Gee Zhing @pikasean
Terry Lim @terrylimxc
Jonathan Chen @cysjonathan

Geospatial data-science analysis on reasons behind delay in Grab ride-share services

Related tags

Overview

Grab x Pulis

Owner

Keng Hwee

Mortgage-loan-prediction - Show how to perform advanced Analytics and Machine Learning in Python using a full complement of PyData utilities

Clean and reusable data-sciency notebooks.

PyPDC is a Python package for calculating asymptotic Partial Directed Coherence estimations for brain connectivity analysis.

Analyzing Earth Observation (EO) data is complex and solutions often require custom tailored algorithms.

[CVPR2022] This repository contains code for the paper "Nested Collaborative Learning for Long-Tailed Visual Recognition", published at CVPR 2022

Python utility to extract differences between two pandas dataframes.

Open-Domain Question-Answering for COVID-19 and Other Emergent Domains

Common bioinformatics database construction

Deep universal probabilistic programming with Python and PyTorch

API>local_db>AWS_RDS - Disclaimer! All data used is for educational purposes only.

A Python adaption of Augur to prioritize cell types in perturbation analysis.

An implementation of the largeVis algorithm for visualizing large, high-dimensional datasets, for R

First and foremost, we want dbt documentation to retain a DRY principle. Every time we repeat ourselves, we waste our time. Second, we want to understand column level lineage and automate impact analysis.

Fancy data functions that will make your life as a data scientist easier.

A tax calculator for stocks and dividends activities.

Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods

A multi-platform GUI for bit-based analysis, processing, and visualization

The Dash Enterprise App Gallery "Oil & Gas Wells" example

Example Of Splunk Search Query With Python And Splunk Python SDK

My first Python project is a simple Mad Libs program.