ELI5 is a Python package which helps to debug machine learning classifiers and explain their predictions

Last update: Dec 17, 2022

Related tags

Machine Learning eli5

Overview

ELI5

ELI5 is a Python package which helps to debug machine learning classifiers and explain their predictions.

It provides support for the following machine learning frameworks and packages:

scikit-learn. Currently ELI5 allows to explain weights and predictions of scikit-learn linear classifiers and regressors, print decision trees as text or as SVG, show feature importances and explain predictions of decision trees and tree-based ensembles. ELI5 understands text processing utilities from scikit-learn and can highlight text data accordingly. Pipeline and FeatureUnion are supported. It also allows to debug scikit-learn pipelines which contain HashingVectorizer, by undoing hashing.
Keras - explain predictions of image classifiers via Grad-CAM visualizations.
xgboost - show feature importances and explain predictions of XGBClassifier, XGBRegressor and xgboost.Booster.
LightGBM - show feature importances and explain predictions of LGBMClassifier, LGBMRegressor and lightgbm.Booster.
CatBoost - show feature importances of CatBoostClassifier, CatBoostRegressor and catboost.CatBoost.
lightning - explain weights and predictions of lightning classifiers and regressors.
sklearn-crfsuite. ELI5 allows to check weights of sklearn_crfsuite.CRF models.

ELI5 also implements several algorithms for inspecting black-box models (see Inspecting Black-Box Estimators):

TextExplainer allows to explain predictions of any text classifier using LIME algorithm (Ribeiro et al., 2016). There are utilities for using LIME with non-text data and arbitrary black-box classifiers as well, but this feature is currently experimental.
Permutation importance method can be used to compute feature importances for black box estimators.

Explanation and formatting are separated; you can get text-based explanation to display in console, HTML version embeddable in an IPython notebook or web dashboards, a pandas.DataFrame object if you want to process results further, or JSON version which allows to implement custom rendering and formatting on a client.

License is MIT.

Check docs for more.

Note

This is the same project as https://github.com/TeamHG-Memex/eli5/, but due to temporary github access issues, 0.11 release is prepared in https://github.com/eli5-org/eli5 (this repo).

ELI5 is a Python package which helps to debug machine learning classifiers and explain their predictions

Related tags

Overview

ELI5

Owner

Flask app to predict daily radiation from the time series of Solcast from Islamabad, Pakistan

This handbook accompanies the course: Machine Learning with Hung-Yi Lee

AutoOED: Automated Optimal Experiment Design Platform

PyHarmonize: Adding harmony lines to recorded melodies in Python

Implementation of deep learning models for time series in PyTorch.

BioPy is a collection (in-progress) of biologically-inspired algorithms written in Python

Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in the form of Jupyter Notebooks.

To-Be is a machine learning challenge on CodaLab Platform about Mortality Prediction

🎛 Distributed machine learning made simple.

InfiniteBoost: building infinite ensembles with gradient descent

ZenML 🙏: MLOps framework to create reproducible ML pipelines for production machine learning.

A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.

Deep Survival Machines - Fully Parametric Survival Regression

Dive into Machine Learning

Ml based project which uses regression technique to predict the price.

LibTraffic is a unified, flexible and comprehensive traffic prediction library based on PyTorch

Lightning ⚡️ fast forecasting with statistical and econometric models.

MCML is a toolkit for semi-supervised dimensionality reduction and quantitative analysis of Multi-Class, Multi-Label data

Add built-in support for quaternions to numpy

决策树分类与回归模型的实现和可视化