Relevance Vector Machine implementation using the scikit-learn API.

Last update: Nov 18, 2022

Related tags

Overview

scikit-rvm

scikit-rvm is a Python module implementing the Relevance Vector Machine (RVM) machine learning technique using the scikit-learn API.

Quickstart

With NumPy, SciPy and scikit-learn available in your environment, install with:

pip install https://github.com/JamesRitchie/scikit-rvm/archive/master.zip

Regression is done with the RVR class:

>>> from skrvm import RVR
>>> X = [[0, 0], [2, 2]]
>>> y = [0.5, 2.5 ]
>>> clf = RVR(kernel='linear')
>>> clf.fit(X, y)
RVR(alpha=1e-06, beta=1e-06, beta_fixed=False, bias_used=True, coef0=0.0,
coef1=None, degree=3, kernel='linear', n_iter=3000,
threshold_alpha=1000000000.0, tol=0.001, verbose=False)
>>> clf.predict([[1, 1]])
array([ 1.49995187])

Classification is done with the RVC class:

>>> from skrvm import RVC
>>> from sklearn.datasets import load_iris
>>> clf = RVC()
>>> clf.fit(iris.data, iris.target)
RVC(alpha=1e-06, beta=1e-06, beta_fixed=False, bias_used=True, coef0=0.0,
coef1=None, degree=3, kernel='rbf', n_iter=3000, n_iter_posterior=50,
threshold_alpha=1000000000.0, tol=0.001, verbose=False)
>>> clf.score(iris.data, iris.target)
0.97999999999999998

Theory

The RVM is a sparse Bayesian analogue to the Support Vector Machine, with a number of advantages:

It provides probabilistic estimates, as opposed to the SVM's point estimates.
Typically provides a sparser solution than the SVM, which tends to have the number of support vectors grow linearly with the size of the training set.
Does not need a complexity parameter to be selected in order to avoid overfitting.

However it is more expensive to train than the SVM, although prediction is faster and no cross-validation runs are required.

The RVM's original creator Mike Tipping provides a selection of papers offering detailed insight into the formulation of the RVM (and sparse Bayesian learning in general) on a dedicated page, along with a Matlab implementation.

Most of this implementation was written working from Section 7.2 of Christopher M. Bishops's Pattern Recognition and Machine Learning.

Contributors

Future Improvements

Implement the fast Sequential Sparse Bayesian Learning Algorithm outlined in Section 7.2.3 of Pattern Recognition and Machine Learning
Handle ill-conditioning errors more gracefully.
Implement more kernel choices.
Create more detailed examples with IPython notebooks.

Relevance Vector Machine implementation using the scikit-learn API.

Related tags

Overview

scikit-rvm

Quickstart

Theory

Contributors

Future Improvements

Owner

James Ritchie

Code Repository for Machine Learning with PyTorch and Scikit-Learn

A Python library for choreographing your machine learning research.

Required for a machine learning pipeline data preprocessing and variable engineering script needs to be prepared

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。它的特点包括: 效果出色、简单易用、通用、自动化、灵活。

ThunderSVM: A Fast SVM Library on GPUs and CPUs

A toolbox to iNNvestigate neural networks' predictions!

learn python in 100 days, a simple step could be follow from beginner to master of every aspect of python programming and project also include side project which you can use as demo project for your personal portfolio

A simple machine learning package to cluster keywords in higher-level groups.

Made in collaboration with Chris George for Art + ML Spring 2019.

Relevance Vector Machine implementation using the scikit-learn API.

YouTube Spam Detection with python

dirty_cat is a Python module for machine-learning on dirty categorical variables.

A Python Module That Uses ANN To Predict A Stocks Price And Also Provides Accurate Technical Analysis With Many High Potential Implementations!

Simple structured learning framework for python

ParaMonte is a serial/parallel library of Monte Carlo routines for sampling mathematical objective functions of arbitrary-dimensions

Stock Price Prediction Bank Jago Using Facebook Prophet Machine Learning & Python

ClearML - Auto-Magical Suite of tools to streamline your ML workflow. Experiment Manager, MLOps and Data-Management

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark

XGBoost + Optuna

Painless Machine Learning for python based on scikit-learn

Relevance Vector Machine implementation using the scikit-learn API.

Related tags

Overview

scikit-rvm

Quickstart

Theory

Contributors

Future Improvements

Owner

James Ritchie

Code Repository for Machine Learning with PyTorch and Scikit-Learn

A Python library for choreographing your machine learning research.

Required for a machine learning pipeline data preprocessing and variable engineering script needs to be prepared

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。 它的特点包括: 效果出色、简单易用、通用、自动化、灵活。

ThunderSVM: A Fast SVM Library on GPUs and CPUs

A toolbox to iNNvestigate neural networks' predictions!

learn python in 100 days, a simple step could be follow from beginner to master of every aspect of python programming and project also include side project which you can use as demo project for your personal portfolio

A simple machine learning package to cluster keywords in higher-level groups.

Made in collaboration with Chris George for Art + ML Spring 2019.

Relevance Vector Machine implementation using the scikit-learn API.

YouTube Spam Detection with python

dirty_cat is a Python module for machine-learning on dirty categorical variables.

A Python Module That Uses ANN To Predict A Stocks Price And Also Provides Accurate Technical Analysis With Many High Potential Implementations!

Simple structured learning framework for python

ParaMonte is a serial/parallel library of Monte Carlo routines for sampling mathematical objective functions of arbitrary-dimensions

Stock Price Prediction Bank Jago Using Facebook Prophet Machine Learning & Python

ClearML - Auto-Magical Suite of tools to streamline your ML workflow. Experiment Manager, MLOps and Data-Management

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark

XGBoost + Optuna

Painless Machine Learning for python based on scikit-learn

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。它的特点包括: 效果出色、简单易用、通用、自动化、灵活。