Learn machine learning the fun way, with Oracle and RedBull Racing

Overview

Red Bull Racing Analytics Hands-On Labs

Introduction

Are you interested in learning machine learning (ML)? How about doing this in the context of the exciting world of F1 racing?! Get your ML skills bootstrapped here with Oracle and Red Bull Racing!

Red Bull F1 Race Car

This tutorial teaches ML analytics with a series of hands-on labs (HOLs) using the Data Science service in Oracle Cloud Infrastructure.

You'll learn how to pull data from public data sources, analyze it with current ML techniques, and, in the process, build ML models and test them out in a predictor app.
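
To give a feel for what the labs build up to, here is a minimal sketch of training a toy classifier on historical race data. It is only an illustration, not the lab code: the file name race_results.csv, the column names, and the podium-finish target are placeholder assumptions, and the labs walk you through their own datasets and notebooks.

    import pandas as pd
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.metrics import accuracy_score
    from sklearn.model_selection import train_test_split

    # Hypothetical input: a CSV of historical race results (file and columns are placeholders).
    df = pd.read_csv("race_results.csv")
    X = df[["grid_position", "qualifying_position"]]   # placeholder feature columns
    y = (df["finish_position"] <= 3).astype(int)       # 1 = podium finish, 0 = otherwise

    # Hold out 20% of the rows to check how well the model generalizes.
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
    model = RandomForestClassifier(n_estimators=100, random_state=42)
    model.fit(X_train, y_train)
    print("hold-out accuracy:", accuracy_score(y_test, model.predict(X_test)))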

Getting Started

There is some infrastructure that must be deployed before you can enjoy this tutorial. See the Terraform documentation for more information.

After the OCI infrastructure is deployed, proceed to the beginner's tutorial to work through the ML labs.

Prerequisites

You must have an OCI account; create a new cloud account if you don't already have one.

This solution is designed to work with several OCI services so you can be up and running quickly.

The tutorial requires a number of OCI resources; see the Terraform documentation for more information.

Notes/Issues

None at this time.

URLs

Contributing

This project is open source. Please contribute by forking this repository and submitting a pull request! Oracle appreciates any contributions from the open source community.

License

Copyright (c) 2021 Oracle and/or its affiliates.

Licensed under the Universal Permissive License (UPL), Version 1.0.

See LICENSE for more details.

Comments
  • Refactored Terraform code

    • Compatible with ORM, Cloud Shell and Terraform CLI
    • Updated README to include instructions for all three methods
    • Refactored, removing unnecessary resources (Vault, public Subnet, etc.).
    • Added a nerd knob so that it could use an existing Group (rather than create a new one)
    • Fixed ORM regex filters to allow dashes (-) and underscores (_) in names
    opened by timclegg 2
  • Issue with hands on lab guide - launchapp.sh missing

    https://github.com/oracle-devrel/redbull-analytics-hol/tree/main/beginners#beginners-hands-on-lab

    In "Starting The Web Application" it reads:

        cd /home/opc/redbull-analytics-hol/beginners/web
        ./launchapp.sh start

    However, launchapp.sh is missing, for example:

        (redbullenv) cd /home/opc/redbull-analytics-hol/beginners/web
        (redbullenv) ./launchapp.sh start
        bash: ./launchapp.sh: No such file or directory

    opened by raekins 1
  • fix: Updating schema.yaml syntax

    Making the variable notation follow what the doc syntax shows (https://docs.oracle.com/en-us/iaas/Content/ResourceManager/Concepts/terraformconfigresourcemanager_topic-schema.htm)

    opened by timclegg 1
  • Exploratory Data Analysis Merge Issue

    Hello, I have been encountering an issue while running the lab. The Jupyter notebook 03.f1_analysis_EDA.ipynb fails at cell 5 with the following error:


    ValueError                                Traceback (most recent call last)
    ----> 1 df1 = pd.merge(races,results,how='inner',on=['raceId'])
          2 df2 = pd.merge(df1,quali,how='inner',on=['raceId','driverId','constructorId'])
          3 df3 = pd.merge(df2,drivers,how='inner',on=['driverId'])
          4 df4 = pd.merge(df3,constructors,how='inner',on=['constructorId'])
          5 df5 = pd.merge(df4,circuit,how='inner',on=['circuitId'])

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in merge(left, right, how, on, left_on, right_on, left_index, right_index, sort, suffixes, copy, indicator, validate)
         85     copy=copy,
         86     indicator=indicator,
    ---> 87     validate=validate,
         88 )
         89 return op.get_result()

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in __init__(self, left, right, how, on, left_on, right_on, axis, left_index, right_index, sort, suffixes, copy, indicator, validate)
        654 # validate the merge keys dtypes. We may need to coerce
        655 # to avoid incompatible dtypes
    --> 656 self._maybe_coerce_merge_keys()
        657
        658 # If argument passed to validate,

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in _maybe_coerce_merge_keys(self)
       1163     inferred_right in string_types and inferred_left not in string_types
       1164 ):
    -> 1165     raise ValueError(msg)
       1166
       1167 # datetimelikes must match exactly

    ValueError: You are trying to merge on object and int64 columns. If you wish to proceed you should use pd.concat

    I’m using an automated deployment provided by Oracle as part of their environment. I don’t have a lot of experience with Python, but one possible solution is to read the numeric values from the CSV files as integers or floats, though I’m almost certain the fix is a little more elaborate than that 😉 (see the dtype-coercion sketch after this comment list). Anyway, thanks for your time. I’m really excited to test your solution and finish the lab. Thanks again.

    opened by yankodavila 2
  • Has the PAR for the stack deploy image expired?

    Cannot deploy the stack; I'm getting a PAR-expired message.

    2021/11/07 10:50:11[TERRAFORM_CONSOLE] [INFO] Error Message: work request did not succeed, workId: ocid1.coreservicesworkrequest.oc1.eu-amsterdam-1.abqw2ljrwz2n7qqj7ghdwtnlrqol355oumc7a6coushvgdrebskspaewh7ea, entity: image, action: CREATED. Message: Import image not found: PAR is invalid (maybe is expired or deleted), please check.

    PAR in stack file is https://objectstorage.eu-frankfurt-1.oraclecloud.com/p/khhPjc_IMuyBOMfZUcJajIzCpoZ5aC-D7VMCU__GVZRlIQueXLIIcaaqLOZIuT1a/n/emeasespainsandbox/b/publichol/o/redbullhol-20210809-1523

    opened by Mel-A-M 1
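
For the merge error in the Exploratory Data Analysis issue above, the ValueError means a join key (for example raceId) was read as strings (object dtype) on one side and as int64 on the other. One common workaround is to coerce every join key to a common numeric dtype before merging. The sketch below is a hedged illustration, not an official fix: it assumes the DataFrames races, results, quali, drivers, constructors and circuit have already been loaded in the earlier cells of 03.f1_analysis_EDA.ipynb, and it requires a pandas version with nullable integer support. Check df.dtypes first to confirm which side is the object column.

    import pandas as pd

    # races, results, quali, drivers, constructors and circuit are assumed to be
    # the DataFrames loaded in the earlier cells of 03.f1_analysis_EDA.ipynb.
    join_keys = ("raceId", "driverId", "constructorId", "circuitId")

    for frame in (races, results, quali, drivers, constructors, circuit):
        for key in join_keys:
            if key in frame.columns:
                # Coerce to a nullable integer dtype; non-numeric entries become <NA>.
                frame[key] = pd.to_numeric(frame[key], errors="coerce").astype("Int64")

    # The notebook's original merges should now agree on key dtypes.
    df1 = pd.merge(races, results, how='inner', on=['raceId'])
    df2 = pd.merge(df1, quali, how='inner', on=['raceId', 'driverId', 'constructorId'])

If a coerced key column comes back mostly <NA>, inspect the source CSV before merging, since that usually points at a parsing problem rather than a simple dtype mismatch.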