Learn machine learning the fun way, with Oracle and Red Bull Racing

Last update: Oct 24, 2022

Overview

Red Bull Racing Analytics Hands-On Labs

Introduction

Are you interested in learning machine learning (ML)? How about doing this in the context of the exciting world of F1 racing?! Get your ML skills bootstrapped here with Oracle and Red Bull Racing!

This tutorial teaches ML analytics with a series of hands-on labs (HOLs) using the Data Science service in Oracle Cloud Infrastructure.

You'll learn how to get data from some public data sources, then how to analyze this data using some of the latest ML techniques. In the process you'll build ML models and test them out in a predictor app.

Getting Started

There is some infrastructure that must be deployed before you can enjoy this tutorial. See the Terraform documentation for more information.

After the OCI infrastructure is deployed, proceed with the beginner's tutorial to start working through the ML labs.

Prerequisites

You must have an OCI account. If you do not have one, create a new cloud account first.

This solution is designed to work with several OCI services, allowing you to be up and running quickly.

The tutorial requires certain OCI resources; see the Terraform documentation for more information.

Notes/Issues

None at this time.

URLs

Oracle and Red Bull partnership announcement

Contributing

This project is open source. Please submit your contributions by forking this repository and submitting a pull request! Oracle appreciates any contributions that are made by the open source community.

License

Licensed under the Universal Permissive License (UPL), Version 1.0.

See LICENSE for more details.

Comments

Refactored Terraform code
Compatible with ORM, Cloud Shell and Terraform CLI

Updated README to include instructions for all three methods

Refactored, removing unnecessary resources (Vault, public Subnet, etc.).

Added a nerd knob so that it could use an existing Group (rather than create a new one)

Fixed ORM RegEx filters to allow dashes (-) and underscores (_), for the names
opened by timclegg 2
Issue with hands on lab guide - launchapp.sh missing

https://github.com/oracle-devrel/redbull-analytics-hol/tree/main/beginners#beginners-hands-on-lab

In Starting The Web Application it reads:

    cd /home/opc/redbull-analytics-hol/beginners/web
    ./launchapp.sh start

However, launchapp.sh is missing. For example:

    (redbullenv) cd /home/opc/redbull-analytics-hol/beginners/web
    (redbullenv) ./launchapp.sh start
    bash: ./launchapp.sh: No such file or directory

opened by raekins 1
fix: Updating schema.yaml syntax

Making the variable notation follow what the doc syntax shows (https://docs.oracle.com/en-us/iaas/Content/ResourceManager/Concepts/terraformconfigresourcemanager_topic-schema.htm)

opened by timclegg 1
Exploratory Data Analysis Merge Issue

Hello I have been encountering an issue while running the lab. The Jupyter notebook 03.f1_analysis_EDA.ipynb has the following issue on cell number 5:

    ValueError                                Traceback (most recent call last)
    ----> 1 df1 = pd.merge(races,results,how='inner',on=['raceId'])
          2 df2 = pd.merge(df1,quali,how='inner',on=['raceId','driverId','constructorId'])
          3 df3 = pd.merge(df2,drivers,how='inner',on=['driverId'])
          4 df4 = pd.merge(df3,constructors,how='inner',on=['constructorId'])
          5 df5 = pd.merge(df4,circuit,how='inner',on=['circuitId'])

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in merge(left, right, how, on, left_on, right_on, left_index, right_index, sort, suffixes, copy, indicator, validate)
         85         copy=copy,
         86         indicator=indicator,
    ---> 87         validate=validate,
         88     )
         89     return op.get_result()

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in __init__(self, left, right, how, on, left_on, right_on, axis, left_index, right_index, sort, suffixes, copy, indicator, validate)
        654         # validate the merge keys dtypes. We may need to coerce
        655         # to avoid incompatible dtypes
    --> 656         self._maybe_coerce_merge_keys()
        657
        658         # If argument passed to validate,

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in _maybe_coerce_merge_keys(self)
       1163                 inferred_right in string_types and inferred_left not in string_types
       1164             ):
    -> 1165                 raise ValueError(msg)
       1166
       1167             # datetimelikes must match exactly

    ValueError: You are trying to merge on object and int64 columns. If you wish to proceed you should use pd.concat

I’m using an automatic deployment provided by Oracle as part of their environment. I do not have a lot of experience with Python, but one possible solution is to read the numeric values from the CSV file as integer or float, though I’m almost certain the solution might be a little more elaborate than that 😉. Anyway, thanks for your time. I’m really excited to test your solution and finish the lab. Thanks again.
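The idea suggested above, making the key columns numeric before merging, can be sketched roughly as follows. This is only an illustration and not the lab's actual fix: the two tiny frames, the `align_key` helper, and the guess that a key column was read as text because it contains a non-numeric placeholder (shown here as `\N`) are all assumptions made for the example.

```python
import pandas as pd

# Hypothetical stand-ins for the lab's data; the real notebook loads CSV files.
races = pd.DataFrame({"raceId": [1, 2], "circuitId": [10, 11]})
# Assumed cause: one non-numeric value makes pandas read the column as object (text).
results = pd.DataFrame({"raceId": ["1", "2", "\\N"], "driverId": [7, 8, 9]})


def align_key(df, key):
    """Return a copy of df whose key column is int64.

    Values that cannot be parsed as numbers are treated as missing,
    and the rows holding them are dropped.
    """
    keys = pd.to_numeric(df[key], errors="coerce")  # unparseable -> NaN
    valid = keys.notna()
    out = df.loc[valid].copy()
    out[key] = keys[valid].astype("int64")
    return out


# Both sides now share the same key dtype, so the merge no longer mixes
# object and int64 columns.
merged = pd.merge(
    align_key(races, "raceId"),
    align_key(results, "raceId"),
    how="inner",
    on=["raceId"],
)
print(merged)
```

Dropping unparseable rows is a design choice that may not suit every table, so it is worth comparing row counts before and after to see how much data the coercion discards.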

opened by yankodavila 2
Has the PAR for the stack deploy image expired?

Cannot deploy the stack; getting a PAR expired message.

2021/11/07 10:50:11[TERRAFORM_CONSOLE] [INFO] Error Message: work request did not succeed, workId: ocid1.coreservicesworkrequest.oc1.eu-amsterdam-1.abqw2ljrwz2n7qqj7ghdwtnlrqol355oumc7a6coushvgdrebskspaewh7ea, entity: image, action: CREATED. Message: Import image not found: PAR is invalid (maybe is expired or deleted), please check.

PAR in stack file is https://objectstorage.eu-frankfurt-1.oraclecloud.com/p/khhPjc_IMuyBOMfZUcJajIzCpoZ5aC-D7VMCU__GVZRlIQueXLIIcaaqLOZIuT1a/n/emeasespainsandbox/b/publichol/o/redbullhol-20210809-1523

opened by Mel-A-M 1

Releases(v0.1.8)

v0.1.8(Feb 18, 2022)

Optimized the models generation for Quickstarts

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.7...v0.1.8
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(20.78 KB)
v0.1.7(Feb 17, 2022)

add quickstart configuration by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/43

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.6...v0.1.7
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(17.20 KB)
v0.1.6(Feb 17, 2022)
What's Changed

add quickstart configuration by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/43

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.5...v0.1.6
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(17.20 KB)
v0.1.5(Feb 16, 2022)
What's Changed

Livelabs02162022 by @jasperan in https://github.com/oracle-devrel/redbull-analytics-hol/pull/41

fix: updated Alyssa Cotton's changes by @jasperan in https://github.com/oracle-devrel/redbull-analytics-hol/pull/42

New Contributors

@jasperan made their first contribution in https://github.com/oracle-devrel/redbull-analytics-hol/pull/41

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.4...v0.1.5
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.33 KB)
v0.1.4(Jan 25, 2022)
What's Changed

Update Port for Jupyter Lab. Changed with last Stack script by @operard in https://github.com/oracle-devrel/redbull-analytics-hol/pull/38

automatically set the latest Oracle Linux 7.9 image build number as default OS image by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/40

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.3...v0.1.4
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.33 KB)
v0.1.3(Nov 10, 2021)
What's Changed

fix: ORM zip file not being generated properly

Fixed it so that ORM can be used to deploy the lab.

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.2...v0.1.3
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.21 KB)
v0.1.0(Nov 9, 2021)
The lab has been refactored to not use a custom compute image, but rather to build out the compute instance.

What's Changed

feat: removing custom image usage by @timclegg in https://github.com/oracle-devrel/redbull-analytics-hol/pull/34

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.0.12...v0.1.0
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.62 KB)
v0.0.12(Sep 6, 2021)

Redbull HOL Beginner Extension Period to access Image
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(9.01 KB)
v0.0.11(Aug 10, 2021)

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.06 KB)
v0.0.10(Aug 10, 2021)

The SSH public key is optional, but present in the ORM dialog. Happy deploying!
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.06 KB)
v0.0.9(Aug 9, 2021)

The SSH key isn't directly needed for the hands-on lab, so making this optional. Also some doc updates.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.83 KB)
v0.0.8(Aug 9, 2021)

Updated docs and a bug in the deployment.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.83 KB)
v0.0.7(Aug 6, 2021)

This release has a refactored "one-click" (or really close to it!) hands-on lab.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.82 KB)
v0.0.6(Aug 4, 2021)

This repo now can build its own ZIP files for ORM deployments. These are automatically built and stored in the release (as it's made).
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.19 KB)
v0.0.5(Jul 28, 2021)

Fixing situations where the group name and/or dynamic group name creation would fail, if it already existed. This might occur in situations where the HoL would be deployed more than once in the same tenancy. This eliminates the potential for collision with the same group names being used.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.40 KB)
v0.0.4(Jul 23, 2021)

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(10.23 KB)
v0.0.3(Jul 15, 2021)

Fixed home region detection.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.26 KB)
v0.2(Jul 14, 2021)
This release makes it easier to deploy the infrastructure, whether using ORM, Cloud Shell or Terraform CLI.

Added DevRel defined tags (and ignored the default tags)

Compatible with ORM, Cloud Shell and Terraform CLI

Updated README to include instructions for all three methods

Refactored, removing unnecessary resources (Vault, public Subnet, etc.).

Added a nerd knob so that it could use an existing Group (rather than create a new one)

Fixed ORM RegEx filters to allow dashes (-) and underscores (_), for the names

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.19 KB)
v0.1(Jun 21, 2021)

This release includes the beginner series of tutorials, along with the Terraform stack to create the required OCI resources.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(9.24 KB)

Owner

Oracle DevRel
