liquid_scikit_learn

scikit-learn-style models that account for data drift and concept drift.

This Python library targets data drift and concept drift in production settings, with the goal of reducing how often models must be retrained. It is inspired by the neurons in octopus tentacles, which interact with and adapt to the environment directly, without going through the central nervous system. I designed the weights of these models in a similar way, so that they are learned from both the inputs and accumulated experience. Instead of computing the weights by minimizing a loss function, the models compute the derivatives of the weights (Hasani, Chen). The library also reports model expiration details at the feature level, which can help identify the features the model has a hard time adjusting to.

This library adapts concepts from Neural ODEs to scikit-learn. The models in this library calculate the derivatives of the weights, unlike the standard scikit-learn library, which calculates the weights directly.
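To make the idea of working with weight derivatives concrete, here is a minimal NumPy sketch. It is not this library's implementation. The form of the derivative (a decay term pulling the weights back toward a baseline, plus a data-driven term taken from the log-loss gradient), the names `weight_derivative`, `euler_step`, `decay` and `rate`, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def log_loss(w, X, y):
    """Mean logistic loss of weights w on (X, y)."""
    p = np.clip(sigmoid(X @ w), 1e-12, 1 - 1e-12)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))

def weight_derivative(w, w_base, X, y, decay=0.1, rate=0.5):
    """Hypothetical dw/dt: decay toward w_base plus a data-driven term."""
    grad = X.T @ (sigmoid(X @ w) - y) / len(y)  # gradient of the mean log-loss
    return -decay * (w - w_base) - rate * grad

def euler_step(w, w_base, X, y, dt=1.0, **kwargs):
    """Advance the weights by one explicit Euler step of size dt."""
    return w + dt * weight_derivative(w, w_base, X, y, **kwargs)

# Synthetic data (assumed): labels drawn from a logistic model.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = (rng.uniform(size=200) < sigmoid(X @ np.array([2.0, -1.0, 0.5]))).astype(float)

w_base = np.zeros(3)   # stand-in for weights from an earlier fit
w = w_base.copy()
for _ in range(50):    # integrate the weight dynamics over 50 steps
    w = euler_step(w, w_base, X, y)

print("loss at baseline:", log_loss(w_base, X, y))
print("loss after steps:", log_loss(w, X, y))
```

In this sketch the `decay` term keeps the weights from wandering far from the baseline, while the data-driven term moves them toward whatever the current batch supports.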

There are two training phases. The first is a standard scikit-learn model that provides predictions and a weight for each feature. In standard ML models, training data is typically sent in batches, and inference can run in real time or in batch. In the second training phase, input data is sent in semi-batches and the model adapts over time as data drift and concept drift change. Along with the changing weights, the second phase provides a decay rate for each weight, the contributions from data drift and concept drift, and model failure parameters.

For example, suppose the first training phase uses three months of data so that the model learns the patterns between the provided inputs and outputs. In the second phase, we send weekly batches of inputs and outputs so that the model adapts to changes in the data and the outputs, which typically follow customer behavior. I plan to extend this library to unsupervised learning as well. Currently, liquid logistic regression is available with limited parameter optimization.
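The two-phase workflow in this example can be sketched with plain NumPy. This sketch does not call liquid_scikit_learn, whose function signatures are not documented here. It swaps in ordinary warm-started gradient descent for the second phase, and the helper names (`fit_logistic`, `make_batch`), batch sizes, and simulated drift are assumptions. The per-feature `shift` at the end is only an illustrative indicator, not the library's model failure output.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, y, w=None, lr=0.5, n_iter=200):
    """Gradient descent on the mean log-loss, optionally warm-started from w."""
    w = np.zeros(X.shape[1]) if w is None else w.copy()
    for _ in range(n_iter):
        w -= lr * X.T @ (sigmoid(X @ w) - y) / len(y)
    return w

def make_batch(rng, n, w_true):
    """Simulated data: labels drawn from a logistic model with weights w_true."""
    X = rng.normal(size=(n, len(w_true)))
    y = (rng.uniform(size=n) < sigmoid(X @ w_true)).astype(float)
    return X, y

rng = np.random.default_rng(0)

# Phase one: a single fit on a large historical window ("three months").
w_true = np.array([2.0, -1.0, 0.5])
X_hist, y_hist = make_batch(rng, 3000, w_true)
w_base = fit_logistic(X_hist, y_hist)

# Phase two: weekly batches; the effect of feature 0 drifts a little each week.
w = w_base.copy()
for week in range(8):
    w_true = w_true + np.array([-0.4, 0.0, 0.0])      # simulated concept drift
    X_week, y_week = make_batch(rng, 500, w_true)
    w = fit_logistic(X_week, y_week, w=w, n_iter=20)  # a few warm-started steps

# Per-feature movement away from the phase-one weights.
shift = np.abs(w - w_base)
print("per-feature shift:", shift)
print("feature with largest shift:", int(np.argmax(shift)))
```

Tracking how far each weight has moved since phase one is one simple way to see which feature the model is struggling to keep up with.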

To use this library for now, clone the repository with git clone and provide the path to the library.

To use standard logistic regression

from liquid_scikit_learn.liquid_logistic_regression import logistic_regression

To use liquid logistic regression

from liquid_scikit_learn.liquid_logistic_regression import liquid_logistic_regression

To get model expiration details at a feature level

from liquid_scikit_learn.liquid_logistic_regression import model_failure
