✨ Real-life Data Analysis and Model Training Workshop by Global AI Hub.

Overview

🎓 Data Analysis and Model Training Course by Global AI Hub

Syllabus:

Day 1

  • What is Data?

  • Multimedia

  • Structured and Unstructured Data

  • Data Types

  • Data Visualization

    • What is Visualization?
    • Tufte's 6 Principles
    • Visualization Types
      • Line Plot
      • Scatter Plot
      • Bar Plot
      • Histogram
      • Pie Charts
      • Heatmap
      • Box Plot
      • What Is a Quartile? How Is It Calculated?
      • Joint Plot
      • KDE (Kernel Density Estimate)
  • Statistics

    • Descriptive Statistics Concepts
    • The Concept of Skewness
    • Correlation and Correlation Matrix
    • Simpson's Paradox
    • Anscombe's Quartet
    • Data Distribution and Hypothesis Testing
  • Data Distribution

    • Data and Distribution
    • Gaussian (Normal) Distribution
    • t-Distribution
    • Degrees of Freedom
    • Bernoulli Distribution
    • Exponential Distribution
  • Application

    • Pandas Revision
    • Introduction to Data Preprocessing with Pandas
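
As a small illustration of the quartile and box plot topics from Day 1, the sketch below computes quartiles, the interquartile range (IQR), and the usual 1.5 × IQR box plot fences. It is not taken from the workshop materials: the sample values are made up, and the "inclusive" quantile method is one of several conventions, so other tools may give slightly different quartiles.

```python
import statistics

# Hypothetical sample; 40 is deliberately far from the rest.
data = [2, 4, 4, 5, 7, 9, 10, 12, 40]

# Quartiles split the sorted data into four equal parts.
# method="inclusive" treats the data as the whole population
# (an assumption; other quantile conventions exist).
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")

# The interquartile range is the width of the box in a box plot.
iqr = q3 - q1

# Points beyond 1.5 * IQR from the box are drawn as outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in data if x < lower_fence or x > upper_fence]

print("Q1, median, Q3:", q1, q2, q3)
print("IQR:", iqr)
print("Outliers:", outliers)
```

Here the median (Q2) is barely affected by the extreme value, which is why box plots are a common first check for outliers.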

Day 2

  • Hypothesis Tests

    • Basic Hypothesis Testing
    • P-value
    • T-test
    • Z-test
    • Chi-Square Test
    • Errors in Hypothesis Testing
  • Data Cleaning

    • The 68-95-99.7 Rule and 3 Sigma
    • Outlier, Missing and Duplicate Data and their Detection
    • Z-Score
    • Handling missing values
    • Null vs NaN
    • Pandas Functions for missing values
    • Dimensionality Reduction
    • PCA (Principal Component Analysis)
    • Collinearity (Multicollinearity)
  • Data Transformation

    • Data Conversion Techniques
      • round
      • Scaling
      • Label Encoding
      • One Hot Encoding
      • Stack
      • melt
      • Shorts
      • Feature Engineering
  • Data Augmentation

    • Aggregation Functions
  • Application

    • Data Visualization with Seaborn
    • Data Preprocessing with Pandas
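
To make the Z-score and 3-sigma topics from Day 2 concrete, here is a minimal sketch that flags values more than three standard deviations from the mean. The data is invented for illustration, and the choice of the population standard deviation and a threshold of 3 are assumptions, not the workshop's prescribed settings.

```python
import statistics

# Hypothetical measurements: twenty values near 10 plus one extreme value.
values = [9, 10, 11, 10] * 5 + [100]

mean = statistics.mean(values)
# Population standard deviation; the sample version (statistics.stdev)
# is an equally common choice.
std = statistics.pstdev(values)

def z_score(x):
    """Return how many standard deviations x lies from the mean."""
    return (x - mean) / std

# The 68-95-99.7 rule says about 99.7% of normally distributed data falls
# within 3 standard deviations, so |z| > 3 is a common outlier cut-off.
THRESHOLD = 3.0
outliers = [x for x in values if abs(z_score(x)) > THRESHOLD]
cleaned = [x for x in values if abs(z_score(x)) <= THRESHOLD]

print("Outliers:", outliers)
print("Remaining values:", len(cleaned))
```

Note that the extreme value also inflates the mean and standard deviation it is judged against, so on very small samples no point can reach a Z-score of 3 at all.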

Day 3

  • ML Review

    • What is Machine Learning?
    • Supervised Learning
    • Unsupervised Learning
    • Errors That May Be Encountered in Model Training
    • Tools Used in Data Analysis and Machine Learning
    • End-to-End Machine Learning Project Steps
  • Application

    • Training An End-to-End ML Model with a Real Dataset
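
The end-to-end steps listed for Day 3 can be sketched in miniature as: prepare data, split it, train a model, and evaluate on held-out data. The example below is only an assumption-laden illustration: it uses synthetic data rather than the workshop's real dataset, and a hand-written least-squares line fit instead of any particular library. The helper names `fit_line` and `mean_squared_error` are hypothetical.

```python
import random

def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def mean_squared_error(xs, ys, slope, intercept):
    """Average squared difference between predictions and targets."""
    return sum((slope * x + intercept - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# 1. Prepare data: synthetic points around y = 2x + 1 with Gaussian noise.
rng = random.Random(0)  # fixed seed so the run is repeatable
xs = [float(i) for i in range(50)]
ys = [2.0 * x + 1.0 + rng.gauss(0, 0.5) for x in xs]

# 2. Split: shuffle the indices, then hold out 20% for testing.
indices = list(range(len(xs)))
rng.shuffle(indices)
split = int(0.8 * len(indices))
train_idx, test_idx = indices[:split], indices[split:]

# 3. Train on the training portion only.
slope, intercept = fit_line([xs[i] for i in train_idx], [ys[i] for i in train_idx])

# 4. Evaluate on data the model has not seen.
test_mse = mean_squared_error(
    [xs[i] for i in test_idx], [ys[i] for i in test_idx], slope, intercept
)

print("slope:", slope, "intercept:", intercept)
print("test MSE:", test_mse)
```

Evaluating only on the held-out portion is what lets the error estimate say something about unseen data rather than about memorized training points.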

Certification

Participants receive a certificate upon completing the course.

Owner
Global AI Hub
Where AI Talent Meets Opportunity