Automated machine learning: Review of the state-of-the-art and opportunities for healthcare

Overview

AutoML in Healthcare Review

Automated machine learning: Review of the state-of-the-art and opportunities for healthcare

Selected highlights from the 2020 AutoML Review [https://doi.org/10.1016/j.artmed.2020.101822] that reviewed over 2,160 works related to the field of automated machine learning.

The curated list of automated feature engineering tools for Automated Machine Learning

Full details in https://www.sciencedirect.com/science/article/pii/S0933365719310437?via%3Dihub#tbl0005

Method Work Feature Engineering Technique Used by how many works
Deep Feature Synthesis LINK Expand-Reduce 151
Explore Kit LINK Expand-Reduce 53
One Button Machine LINK Expand-Reduce 32
AutoLearn LINK Expand-Reduce 16
GP Feature Construction LINK Genetic Programming 68
Cognito LINK Hierarchical Greedy Search 38
RLFE LINK Reinforcement Learning 21
LFE LINK Meta-Learning 34

Automated machine learning pipeline optimizers

Full details in https://www.sciencedirect.com/science/article/pii/S0933365719310437?via%3Dihub#tbl0010

Method Work Optimization Algorithm Data Pre-Processing Feature Engineering Model Selection Hyperparameter Optimization Ensemble Learning Meta-Learning Used by how many works
Auto-Weka LINK Bayesian Optimization (SMAC) ✔️ ✔️ ✔️ 703
Auto-Sklearn LINK Joint Bayesian Optimization and Bandit Search (BOHB) ✔️ ✔️ ✔️ ✔️ ✔️ 542
TPOT LINK Evolutionary Algorithm ✔️ ✔️ ✔️ ✔️ 84
TuPAQ LINK Bandit Search ✔️ ✔️ 94
ATM LINK Joint Bayesian Optimization and Bandit Search ✔️ ✔️ ✔️ 29
Automatic Frankensteining LINK Bayesian Optimization ✔️ ✔️ ✔️ 12
ML-Plan LINK Hierarchical Task Networks (HTN) ✔️ ✔️ ✔️ 24
Autostacker LINK Evolutionary Algorithm ✔️ ✔️ ✔️ 18
AlphaD3M LINK Reinforcement Learning/Monte Carlo Tree Search ✔️ ✔️ ✔️ 8
Collaborative Filtering LINK Probabilistic Matrix Factorization ✔️ ✔️ ✔️ ✔️ 29

Neural Architecture Search algorithms, based on performance on the CIFAR-10 dataset

Full details in https://www.sciencedirect.com/science/article/pii/S0933365719310437?via%3Dihub#tbl0015

NAS Algorithm Work Search Space Search Strategy Performance Estimation Strategy Number of Parameters Search Time (GPU-days) Test Error (%)
Large-scale Evolution LINK Feed-Forward Networks Evolutionary Algorithm Naive Training and Validation 5.4M 2600 5.4
EAS LINK Feed-Forward Networks Reinforcement Learning and Network Morphism Short Training and Validation 23.4M 10 4.23
Hierarchical Evolution LINK Cell Motifs Evolutionary Algorithm Training and Validation on proposed CNN Cell 15.7M 300 3.75
NAS v3 LINK Multi-branched Networks Reinforcement Learning Naive Training and Validation 37.4M 22400 3.65
PNAS LINK Cell Motifs Sequential Model-Based Optimization (SMBO) Performance Prediction 3.2M 225 3.41
ENAS LINK Cell Motifs Reinforcement Learning One Shot 4.6M 0.45 2.89
ResNet + Regularization LINK HUMAN BASELINE HUMAN BASELINE HUMAN BASELINE 26.2M - 2.86
DARTS LINK Cell Motifs Gradient-Based Optimization Training and Validation on proposed CNN Cell 3.4M 4 2.83
NASNet-A LINK Cell Motifs Reinforcement Learning Naive Training and Validation 3.3M 2000 2.65
EENA LINK Cell Motifs Evolutionary Algorithm Performance Prediction 8.5M 0.65 2.56
Path-Level EAS LINK Cell Motifs Reinforcement Learning Short Training and Validation 14.3M 200 2.30
NAO LINK Cell Motifs Gradient-Based Optimization Performance Prediction 128M 200 2.11
Probabilistic time series modeling in Python

GluonTS - Probabilistic Time Series Modeling in Python GluonTS is a Python toolkit for probabilistic time series modeling, built around Apache MXNet (

Amazon Web Services - Labs 3.3k Jan 03, 2023
ArviZ is a Python package for exploratory analysis of Bayesian models

ArviZ (pronounced "AR-vees") is a Python package for exploratory analysis of Bayesian models. Includes functions for posterior analysis, data storage, model checking, comparison and diagnostics

ArviZ 1.3k Jan 05, 2023
Implementation of K-Nearest Neighbors Algorithm Using PySpark

KNN With Spark Implementation of KNN using PySpark. The KNN was used on two separate datasets (https://archive.ics.uci.edu/ml/datasets/iris and https:

Zachary Petroff 4 Dec 30, 2022
Python package for machine learning for healthcare using a OMOP common data model

This library was developed in order to facilitate rapid prototyping in Python of predictive machine-learning models using longitudinal medical data from an OMOP CDM-standard database.

Sontag Lab 75 Jan 03, 2023
mlpack: a scalable C++ machine learning library --

a fast, flexible machine learning library Home | Documentation | Doxygen | Community | Help | IRC Chat Download: current stable version (3.4.2) mlpack

mlpack 4.2k Jan 01, 2023
虚拟货币(BTC、ETH)炒币量化系统项目。在一版本的基础上加入了趋势判断

🎉 第二版本 🎉 (现货趋势网格) 介绍 在第一版本的基础上 趋势判断,不在固定点位开单,选择更优的开仓点位 优势: 🎉 简单易上手 安全(不用将api_secret告诉他人) 如何启动 修改app目录下的authorization文件

幸福村的码农 250 Jan 07, 2023
Skforecast is a python library that eases using scikit-learn regressors as multi-step forecasters

Skforecast is a python library that eases using scikit-learn regressors as multi-step forecasters. It also works with any regressor compatible with the scikit-learn API (pipelines, CatBoost, LightGBM

Joaquín Amat Rodrigo 297 Jan 09, 2023
A repository to index and organize the latest machine learning courses found on YouTube.

📺 ML YouTube Courses At DAIR.AI we ❤️ open education. We are excited to share some of the best and most recent machine learning courses available on

DAIR.AI 9.6k Jan 01, 2023
LiuAlgoTrader is a scalable, multi-process ML-ready framework for effective algorithmic trading

LiuAlgoTrader is a scalable, multi-process ML-ready framework for effective algorithmic trading. The framework simplify development, testing, deployment, analysis and training algo trading strategies

Amichay Oren 458 Dec 24, 2022
MLReef is an open source ML-Ops platform that helps you collaborate, reproduce and share your Machine Learning work with thousands of other users.

The collaboration platform for Machine Learning MLReef is an open source ML-Ops platform that helps you collaborate, reproduce and share your Machine

MLReef 1.4k Dec 27, 2022
Python implementation of Weng-Lin Bayesian ranking, a better, license-free alternative to TrueSkill

Python implementation of Weng-Lin Bayesian ranking, a better, license-free alternative to TrueSkill This is a port of the amazing openskill.js package

Open Debates Project 156 Dec 14, 2022
A high-performance topological machine learning toolbox in Python

giotto-tda is a high-performance topological machine learning toolbox in Python built on top of scikit-learn and is distributed under the G

giotto.ai 632 Dec 29, 2022
Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production and monitoring them after deployment to production.

Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production and monitoring them after deployment to production.

FINRA 25 Dec 28, 2022
JMP is a Mixed Precision library for JAX.

Mixed precision training [0] is a technique that mixes the use of full and half precision floating point numbers during training to reduce the memory bandwidth requirements and improve the computatio

DeepMind 108 Dec 31, 2022
Decision Weights in Prospect Theory

Decision Weights in Prospect Theory It's clear that humans are irrational, but how irrational are they? After some research into behavourial economics

Cameron Davidson-Pilon 32 Nov 08, 2021
machine learning model deployment project of Iris classification model in a minimal UI using flask web framework and deployed it in Azure cloud using Azure app service

This is a machine learning model deployment project of Iris classification model in a minimal UI using flask web framework and deployed it in Azure cloud using Azure app service. We initially made th

Krishna Priyatham Potluri 73 Dec 01, 2022
Reproducibility and Replicability of Web Measurement Studies

Reproducibility and Replicability of Web Measurement Studies This repository holds additional material to the paper "Reproducibility and Replicability

6 Dec 31, 2022
A GitHub action that suggests type annotations for Python using machine learning.

Typilus: Suggest Python Type Annotations A GitHub action that suggests type annotations for Python using machine learning. This action makes suggestio

40 Sep 18, 2022
Interactive Web App with Streamlit and Scikit-learn that applies different Classification algorithms to popular datasets

Interactive Web App with Streamlit and Scikit-learn that applies different Classification algorithms to popular datasets Datasets Used: Iris dataset,

Samrat Mitra 2 Nov 18, 2021