Home repository for the Regularized Greedy Forest (RGF) library. It includes the original implementation from the paper and a multithreaded one written in C++, along with various language-specific wrappers.

Last update: Dec 14, 2022

Overview

Regularized Greedy Forest

Regularized Greedy Forest (RGF) is a tree ensemble machine learning method described in this paper. RGF can deliver better results than gradient boosted decision trees (GBDT) on a number of datasets and it has been used to win a few Kaggle competitions. Unlike the traditional boosted decision tree approach, RGF works directly with the underlying forest structure. RGF integrates two ideas: one is to include tree-structured regularization into the learning formulation; and the other is to employ the fully-corrective regularized greedy algorithm.

This repository contains the following implementations of the RGF algorithm:

RGF: original implementation from the paper;
FastRGF: multi-core implementation with some simplifications;
rgf_python: wrapper of both RGF and FastRGF implementations for Python;
R package: wrapper of rgf_python for R.

You can find more information about RGF in the posts collected in Awesome RGF.

Comments

Support wheels

Since rgf_python doesn't have any special requirements (compiler, environment, etc.), I think it's a good idea to publish wheels on PyPI (along with the sources in .tar.gz, of course). I believe providing successfully compiled binaries will prevent many strange errors like the recent ones.

We need wheels for two platforms: first for macOS and Linux and second for Windows.

The final result should be similar to this one:

But each wheel for each platform should have 32bit and 64bit version.

We could get the binaries from Travis and Appveyor as artifacts (I can do this). The one problem I see now is that Travis doesn't have 32-bit machines, but I believe we'll overcome this problem 😃 .

@fukatani When you have time, please look into how to name wheels appropriately for their target platforms and how to publish them on PyPI. Or I can do it later.
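
As background for the naming question: wheel file names follow the pattern name-version-python tag-abi tag-platform tag.whl defined in PEP 427. The sketch below is only an illustration of that pattern. The helper name parse_wheel_name is hypothetical, and the optional build tag is deliberately not handled. The sample file names are taken from the release assets listed further down this page.

```python
from typing import NamedTuple


class WheelTags(NamedTuple):
    name: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename: str) -> WheelTags:
    """Split a wheel file name into its PEP 427 components.

    Hypothetical helper for illustration; the optional build tag is not handled.
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) != 5:
        raise ValueError(f"expected 5 dash-separated fields, got {len(parts)}")
    return WheelTags(*parts)


if __name__ == "__main__":
    for wheel in (
        "rgf_python-3.12.0-py3-none-manylinux1_x86_64.whl",
        "rgf_python-3.9.0-py2.py3-none-win32.whl",
        "rgf_python-3.9.0-py2.py3-none-win_amd64.whl",
    ):
        tags = parse_wheel_name(wheel)
        # Compressed tag sets such as "py2.py3" are separated by dots.
        print(tags.name, tags.version, tags.python_tag.split("."), tags.platform_tag)
```

The platform tag is the part that distinguishes 32-bit from 64-bit builds, for example win32 versus win_amd64, or manylinux1_i686 versus manylinux1_x86_64.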
enhancement

opened by StrikerRUS 35
error:Exception: Model learning result is not found in /tmp/rgf. This is rgf_python error.

How to deal with this error:

Ran 0 examples: 0 success, 0 failure, 0 error

None Ran 0 examples: 0 success, 0 failure, 0 error

None Ran 0 examples: 0 success, 0 failure, 0 error

None
Traceback (most recent call last):
  File "/Users/k.den/Desktop/For_Submission/1_source_code/test.py", line 25, in <module>
    pred = rgf_model.predict_proba(X_eval)[:, 1]
  File "/usr/local/lib/python3.6/site-packages/rgf/sklearn.py", line 652, in predict_proba
    class_proba = clf.predict_proba(X)
  File "/usr/local/lib/python3.6/site-packages/rgf/sklearn.py", line 798, in predict_proba
    'This is rgf_python error.'.format(_TEMP_PATH))
Exception: Model learning result is not found in /tmp/rgf. This is rgf_python error.

Process finished with exit code 1

opened by tianke0711 34
ModuleNotFoundError: No module named 'rgf.sklearn'; 'rgf' is not a package

For bugs and unexpected issues, please provide the following information, so that we could reproduce them on our system.

Environment Info

Operating System: MacOS Sierra 10.12 | Ubuntu 16.04.3 LTS

Python version: 3.6.1

rgf_python version: HEAD (pulled from github)

Whether test.py is passed or not: FAILED (errors=24)

Error Message

ModuleNotFoundError: No module named 'rgf.sklearn'; 'rgf' is not a package

Reproducible Example

from rgf.sklearn import RGFClassifier
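
The wording "'rgf' is not a package" means Python resolved the name rgf to a plain module (a single file) rather than to a package directory. One common cause in general is a local file named rgf.py that comes earlier on sys.path, although the issue text does not confirm that this is what happened here. The sketch below is a minimal diagnostic using only importlib; the helper name describe_import is hypothetical.

```python
import importlib.util


def describe_import(name: str) -> str:
    """Report where a top-level name resolves and whether it is a package."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return f"{name}: not importable from the current sys.path"
    # Packages expose submodule_search_locations; plain modules leave it as None.
    kind = "package" if spec.submodule_search_locations is not None else "plain module"
    return f"{name}: {kind} at {spec.origin}"


if __name__ == "__main__":
    print(describe_import("rgf"))
    print(describe_import("json"))  # a stdlib package, for comparison
```

If the first line reports a plain module, the path it prints shows which file is shadowing the installed package.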

opened by vsedelnik 30
suggestion to integrate the R wrapper in the repository

This issue is related to a previous one. A month ago I wrapped rgf_python using the reticulate package in R. It can be installed on Linux, but installation is somewhat cumbersome on macOS and Windows (on Windows it currently works only from the command prompt). I opened this issue as suggested by @fukatani.

opened by mlampros 20

Model learning result is not found in C:\Users\hp\temp\rgf. This is rgf_python error.

Hello,

I have read the previous thread on the same error, but it does not seem to solve my problem: in the previous case the dataset contained strings, whereas my data contains only numbers. Could you please let me know what the problem could be?

Much appreciated !

skf = StratifiedKFold(n_splits = kfold, random_state=1)
for i, (train_index, test_index) in enumerate(skf.split(X, y)):
    X_train, X_eval = X[train_index], X[test_index]
    y_train, y_eval = y[train_index], y[test_index]
   
    rgf_model = RGFClassifier(max_leaf=400,
                    algorithm="RGF_Sib",
                    test_interval=100,
                    verbose=True).fit( X_train, y_train)
    pred = rgf_model.predict_proba(X_eval)[:,1]
    print( "Gini = ", eval_gini(y_eval, pred) )

and

---------------------------------------------------------------------------
Exception                                 Traceback (most recent call last)
<ipython-input-17-b27ba3506d06> in <module>()
     12                     test_interval=100,
     13                     verbose=True).fit( X_train, y_train)
---> 14     pred = rgf_model.predict_proba(X_eval)[:,1]
     15     print( "Gini = ", eval_gini(y_eval, pred) )

C:\Anaconda3\lib\site-packages\rgf\sklearn.py in predict_proba(self, X)
    644                              % (self._n_features, n_features))
    645         if self._n_classes == 2:
--> 646             y = self._estimators[0].predict_proba(X)
    647             y = _sigmoid(y)
    648             y = np.c_[y, 1 - y]

C:\Anaconda3\lib\site-packages\rgf\sklearn.py in predict_proba(self, X)
    796         if not model_files:
    797             raise Exception('Model learning result is not found in {0}. '
--> 798                             'This is rgf_python error.'.format(_TEMP_PATH))
    799         latest_model_loc = sorted(model_files, reverse=True)[0]
    800 

Exception: Model learning result is not found in C:\Users\hp\temp\rgf. This is rgf_python error.
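
The traceback shows that the exception is raised when the list of model files is empty, and that otherwise the first entry of a reverse-sorted list is used. The sketch below imitates that check with the standard library only. The function name, the prefix argument and the file naming scheme are assumptions made for illustration, not rgf_python's actual internals. Under those assumptions, the error means that no matching model file was present in the temp directory when predict_proba ran.

```python
import glob
import os
import tempfile


def latest_model_file(temp_dir: str, prefix: str) -> str:
    """Return the last model file (by name) for a prefix, or raise if none exist.

    Hypothetical stand-in for the check visible in the traceback.
    """
    model_files = glob.glob(os.path.join(temp_dir, prefix + ".model*"))
    if not model_files:
        raise Exception(
            "Model learning result is not found in {0}. "
            "This is rgf_python error.".format(temp_dir)
        )
    # Same selection rule as in the traceback: reverse sort, take the first.
    return sorted(model_files, reverse=True)[0]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        try:
            latest_model_file(tmp, "demo")
        except Exception as exc:
            print("empty directory ->", exc)
        # Simulate a training run that wrote two model files.
        for suffix in ("-01", "-02"):
            open(os.path.join(tmp, "demo.model" + suffix), "w").close()
        print("picked:", os.path.basename(latest_model_file(tmp, "demo")))
```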

opened by mike-m123 20

migrate from Appveyor to GitHub Actions

Fixed #122. Appveyor offers only 1 parallel job on the free tier, while GitHub Actions offers 20.

Should be considered as a continuation of #328. Same changes as for *nix OSes: latest R version; stop producing 32bit artifacts.

opened by StrikerRUS 16
New release

I suppose it's time to release a new version with the support of warm start.

@fukatani Please release the new Python version, and then @mlampros please upload the new R version to CRAN.

opened by StrikerRUS 16
updated wheels building

@fukatani Please attach Linux i686 executable file to GitHub release - I've just tested replacing files into wheels and it works locally, so should work on Travis too! :-)

Refer to https://github.com/fukatani/rgf_python/issues/81#issuecomment-348662123.

opened by StrikerRUS 15
More Travis tests

Hi @fukatani ! Can you add more platforms (Windows, MacOS) to Travis? I don't know how, but it's possible 😄 : [Screenshot from xgboost repo] Maybe it can help: https://github.com/dmlc/xgboost/blob/master/.travis.yml

If there is a limitation to number of tests, maybe it's better to split Python version tests between platforms: Windows + 2.7, Linux + 3.4, MacOS + 3.5 (I think you understand me).

opened by StrikerRUS 15
Cannot import name 'RGFClassifier'
I am having the above error. I have made rgf1.2 and have tested using rgf1.2's own perl test script. This works. I have installed rgf_python and run the python setup as specified. I have changed the two folder locations to rgf1.2..\rgf executable and a temp folder that exist.

In python when I try to import I get the error Cannot import name 'RGFClassifier'. I tried to run the exact code in the test.py script provided in with rgf_python and this same error occurs.

Strangely, I have /usr/local/lib/python3.5/dist-packages/rgf_sklearn-0.0.0-py3.5.egg/rgf in my path when I run

import sys
sys.path

in python. Also, in /usr/local/lib/python3.5/dist-packages I only have rgf_sklearn-0.0.0-py3.5.egg and no rgf-sklearn, which I would expect since the following appeared towards the end of the setup.py output:

Extracting rgf_sklearn-0.0.0-py3.5.egg to /usr/local/lib/python3.5/dist-packages
Adding rgf-sklearn 0.0.0 to easy-install.pth file
opened by JoshuaC3 15
[rgf_python] add warm-start

Fixed #184.

This PR adds warm-start support to RGF estimators, along with a save_model() method, which is needed to obtain a binary model file that can later be passed via the init_model argument.

Also, this PR adds tests with analysis of exception message (as I promised in https://github.com/RGF-team/rgf/pull/258#issuecomment-439685042).

opened by StrikerRUS 14
Running RGF from R cmd

For bugs and unexpected issues, please provide the following information, so that we could reproduce them on our system.

Environment Info

Operating System: Windows 10

RGF/FastRGF/rgf_python version: 3.5.0-9

Python version (for rgf_python errors): 3.5.0-9

Error Message

Reproducible Example

Error when running RGF from R console as shown in the pic. Installation of RGF should be working fine as shown in the pic. RGF was installed via devtools.
help wanted

opened by similang 2
Python cant find executables
Hi there

I'm trying to install rgf/fastrgf and use the python wrapper to launch the executables.

I've installed using pip install rgf_python

However when i import the rgf module i get a user warning

UserWarning: Cannot find FastRGF executable files. FastRGF estimators will be unavailable for usage.
  warnings.warn("Cannot find FastRGF executable files. FastRGF estimators will be unavailable for usage.")

To fix this issue i've compiled the rgf and fastrgf binaries* and added them to my $PATH variable (confirmed in bash that they are in the PATH) however i still get the same error. I've looked a bit into the rgf/utils get_paths and is_fastrgf_executable functions however i'm not completely sure why it fails?

*binaries: i was not sure which binaries are needed so i've added the following rgf, forest_predict, forest_train, discretized_trainer, discretized_gendata, auc
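
One quick check in a situation like this is whether the Python process itself can see the binaries on PATH, since a PATH set in one bash session is not necessarily inherited by a Python process started elsewhere. The sketch below uses shutil.which with binary names taken from the list in the issue. It only tests PATH visibility and does not reproduce how rgf_python locates or validates its executables; the helper name find_on_path is hypothetical.

```python
import shutil


def find_on_path(names):
    """Map each executable name to its resolved path, or None if not found."""
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    # Names taken from the issue text above.
    for name, path in find_on_path(["rgf", "forest_train", "forest_predict"]).items():
        print(f"{name}: {path or 'NOT FOUND on PATH'}")
```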

System
Python: conda 3.6.1
OS: ubuntu 16.04
opened by casperkaae 29
dump RGF and FastRGF to the JSON file

Initial support for dumping the RGF model is already implemented in #161. At present it's possible to print the model to the console. But it would be a good idea to also support dumping the model to a file (e.g. JSON).

@StrikerRUS:

Really like new features introduced in this PR. But please think about "real dump" of a model. I suppose it'll be more useful than just printing to the console.

@fukatani:

For example dump in JSON format like lightGBM. It's convenient and we may support it in the future, but we should do it with another PR.
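
To make the idea concrete, here is a sketch of what dumping a forest to a JSON file could look like with the standard json module. The nested-dict layout (feature, threshold, left, right, value) and the function names are hypothetical. They reflect neither RGF's model format nor LightGBM's JSON schema.

```python
import json
import os
import tempfile

# Hypothetical forest: a list of trees, each a nested dict of split and leaf nodes.
forest = [
    {
        "feature": 0,
        "threshold": 0.5,
        "left": {"value": -0.2},
        "right": {
            "feature": 1,
            "threshold": 1.5,
            "left": {"value": 0.1},
            "right": {"value": 0.4},
        },
    }
]


def dump_forest(trees, path):
    """Write the forest to a JSON file instead of printing it to the console."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump({"n_trees": len(trees), "trees": trees}, fh, indent=2)


def count_leaves(node):
    """Count leaf nodes (nodes that carry a 'value') in one tree."""
    if "value" in node:
        return 1
    return count_leaves(node["left"]) + count_leaves(node["right"])


if __name__ == "__main__":
    path = os.path.join(tempfile.gettempdir(), "forest_dump_example.json")
    dump_forest(forest, path)
    with open(path, encoding="utf-8") as fh:
        loaded = json.load(fh)
    print(loaded["n_trees"], [count_leaves(t) for t in loaded["trees"]])
```

A file-based dump like this can be reloaded and inspected programmatically, which console output does not allow.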

enhancement

opened by StrikerRUS 6
Support f_ratio?

I found an undocumented parameter, f_ratio, in RGF. It corresponds to LightGBM's feature_fraction and XGBoost's colsample_bytree.

I tried this parameter with the Boston regression example. With a small max_leaf (300), f_ratio=0.9 improves the score from 11.8 to 11.0, but with a large max_leaf (5000), f_ratio=0.95 degrades the score from 10.19810 to 10.34.

So is there no value in using f_ratio < 1.0 after all?
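
For readers unfamiliar with these parameters: feature_fraction and colsample_bytree restrict each tree to a random subset of the columns. The sketch below shows that general idea in plain Python. The function name and the rounding rule are assumptions, and it does not describe how RGF applies f_ratio internally.

```python
import random


def sample_features(n_features: int, ratio: float, seed: int = 0) -> list:
    """Pick a random subset of feature indices of size about ratio * n_features."""
    if not 0.0 < ratio <= 1.0:
        raise ValueError("ratio must be in (0, 1]")
    # Keep at least one feature so a tree always has something to split on.
    k = max(1, int(round(ratio * n_features)))
    rng = random.Random(seed)
    return sorted(rng.sample(range(n_features), k))


if __name__ == "__main__":
    # 13 features, as in the Boston housing data.
    print(sample_features(13, 0.9))
    print(sample_features(13, 1.0))
```

With a ratio close to 1.0 and few features, only one or two columns are dropped per tree, so the effect on the score can be small in either direction.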

opened by fukatani 10
[FastRGF] FastRGF doesn't work for small sample and need to fix integration test for FastRGF

Currently, the sklearn integration tests for FastRGFClassifier and FastRGFClassifier fail.

FastRGF doesn't work well for small samples, which is why the tests fail. I suspect the problem is inside the FastRGF executable: when I inspected FastRGF with a debugger, the discretization boundaries were invalid.

At the very least, we should raise an understandable error from rgf_python if discretization fails.
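
The sketch below shows the kind of guard that last sentence asks for. It is a generic boundary computation, not FastRGF's actual discretization, and the function name and the max_buckets parameter are hypothetical. It raises a readable error when a feature has too few distinct values to form valid boundaries.

```python
def discretization_boundaries(values, max_buckets=4):
    """Return strictly increasing bucket boundaries for one feature.

    Boundaries are midpoints between adjacent distinct values, thinned out
    so that at most max_buckets buckets result.
    """
    if max_buckets < 2:
        raise ValueError("max_buckets must be at least 2")
    distinct = sorted(set(values))
    if len(distinct) < 2:
        raise ValueError(
            f"cannot discretize: only {len(distinct)} distinct value(s), "
            "need at least 2"
        )
    midpoints = [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]
    n_bounds = min(max_buckets - 1, len(midpoints))
    # Pick evenly spaced midpoints; step >= 1 keeps the indices distinct.
    step = len(midpoints) / n_bounds
    return [midpoints[int(i * step)] for i in range(n_bounds)]


if __name__ == "__main__":
    print(discretization_boundaries([1, 2, 3, 4, 5, 6, 7, 8, 9, 10]))
    try:
        discretization_boundaries([5, 5, 5])
    except ValueError as exc:
        print("error:", exc)
```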
bug

opened by fukatani 18

Releases(3.12.0)

3.12.0(Jan 7, 2022)
Support Visual Studio 2022

Support macOS Monterey

Drop support of macOS Mojave

Drop support of Python 3.6

Added a pop-up note during R-package startup with workaround for attempt to apply non-function error

Source code(tar.gz)
Source code(zip)
rgf.exe(785.00 KB)
rgf_python-3.12.0-py3-none-macosx_10_15_x86_64.macosx_11_6_x86_64.macosx_12_0_x86_64.whl(726.19 KB)
rgf_python-3.12.0-py3-none-manylinux1_x86_64.whl(740.02 KB)
rgf_python-3.12.0-py3-none-win_amd64.whl(1.68 MB)
rgf_python-3.12.0.tar.gz(214.97 KB)
3.11.0(Aug 22, 2021)
Support gcc-11

Fixed several CRAN notes for the R-package

Added CITATION info in the R-package

Source code(tar.gz)
Source code(zip)
rgf.exe(785.00 KB)
rgf_python-3.11.0-py3-none-macosx_10_14_x86_64.macosx_10_15_x86_64.macosx_11_0_x86_64.whl(727.56 KB)
rgf_python-3.11.0-py3-none-manylinux1_x86_64.whl(740.01 KB)
rgf_python-3.11.0-py3-none-win_amd64.whl(1.68 MB)
rgf_python-3.11.0.tar.gz(216.27 KB)
3.10.0(Apr 28, 2021)
Drop Python 3.5 support

Drop Python 2.x support

Support macOS Big Sur

Drop support for 32-bit systems

Support R 4.x

Migrate CI infrastructure to GitHub Actions

Support Python 3.9

Fix compatibility with the latest scikit-learn version

Improve loading Python-modules in R-package

Source code(tar.gz)
Source code(zip)
rgf.exe(785.00 KB)
rgf_python-3.10.0-py3-none-macosx_10_14_x86_64.macosx_10_15_x86_64.macosx_11_0_x86_64.whl(727.83 KB)
rgf_python-3.10.0-py3-none-manylinux1_x86_64.whl(739.98 KB)
rgf_python-3.10.0-py3-none-win_amd64.whl(1.68 MB)
rgf_python-3.10.0.tar.gz(216.24 KB)
3.9.0(Aug 10, 2020)

Support gcc 10
Source code(tar.gz)
Source code(zip)
rgf.exe(692.00 KB)
rgf32.exe(575.50 KB)
rgf_python-3.9.0-py2.py3-none-macosx_10_13_x86_64.macosx_10_14_x86_64.macosx_10_15_x86_64.whl(728.54 KB)
rgf_python-3.9.0-py2.py3-none-manylinux1_i686.whl(772.43 KB)
rgf_python-3.9.0-py2.py3-none-manylinux1_x86_64.whl(740.09 KB)
rgf_python-3.9.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.9.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.9.0-py3-none-macosx_10_14_x86_64.macosx_10_15_x86_64.macosx_11_0_x86_64.whl(325.06 KB)
rgf_python-3.9.0-py3-none-manylinux1_x86_64.whl(360.08 KB)
rgf_python-3.9.0.tar.gz(214.90 KB)
3.8.0(Apr 1, 2020)

Support gcc-10
Source code(tar.gz)
Source code(zip)
rgf.exe(692.00 KB)
rgf32.exe(575.50 KB)
rgf_python-3.8.0-py2.py3-none-macosx_10_13_x86_64.macosx_10_14_x86_64.macosx_10_15_x86_64.whl(731.40 KB)
rgf_python-3.8.0-py2.py3-none-manylinux1_i686.whl(772.41 KB)
rgf_python-3.8.0-py2.py3-none-manylinux1_x86_64.whl(778.30 KB)
rgf_python-3.8.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.8.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.8.0.tar.gz(214.82 KB)
3.7.0(Feb 5, 2020)
Support Python 3.8

Install joblib and six separately

Support macOS Catalina

Source code(tar.gz)
Source code(zip)
rgf.exe(692.00 KB)
rgf32.exe(575.50 KB)
rgf_python-3.7.0-py2.py3-none-macosx_10_13_x86_64.macosx_10_14_x86_64.macosx_10_15_x86_64.whl(731.28 KB)
rgf_python-3.7.0-py2.py3-none-manylinux1_i686.whl(772.41 KB)
rgf_python-3.7.0-py2.py3-none-manylinux1_x86_64.whl(778.30 KB)
rgf_python-3.7.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.7.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.7.0.tar.gz(214.86 KB)
3.6.0(Jun 9, 2019)
Drop Python 3.4 support

Support Visual Studio 2019

Remove unused library

Source code(tar.gz)
Source code(zip)
rgf.exe(692.00 KB)
rgf32.exe(575.50 KB)
rgf_python-3.6.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.macosx_10_14_x86_64.whl(729.09 KB)
rgf_python-3.6.0-py2.py3-none-manylinux1_i686.whl(772.43 KB)
rgf_python-3.6.0-py2.py3-none-manylinux1_x86_64.whl(740.08 KB)
rgf_python-3.6.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.6.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.6.0.tar.gz(210.22 KB)
3.5.0(Jan 15, 2019)

Support warm start and refactoring.
Source code(tar.gz)
Source code(zip)
rgf.exe(692.00 KB)
rgf32.exe(575.50 KB)
rgf_python-3.5.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.macosx_10_14_x86_64.whl(709.55 KB)
rgf_python-3.5.0-py2.py3-none-manylinux1_i686.whl(772.34 KB)
rgf_python-3.5.0-py2.py3-none-manylinux1_x86_64.whl(740.00 KB)
rgf_python-3.5.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.5.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.5.0.tar.gz(210.10 KB)
3.4.0(Dec 4, 2018)
Changed location for storing temp files of rgf_python

Moved docker file under rgfteam account

Added AWESOME RGF

Moved RGF changelog to CHANGES file

Added R-package

Python 3.7 support in rgf_python

Converted RGF user guide from PDF to the reStructuredText format

Refined RGF examples

Reduced logs of files copying during installation of rgf_python

Fixed sklearn 0.20 and joblib 0.12 rgf_python compatibility

Renamed flag for RGF executable file in the config file for rgf_python

Refined error messages in rgf_python

Source code(tar.gz)
Source code(zip)
rgf.exe(692.00 KB)
rgf32.exe(575.50 KB)
rgf_python-3.4.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.macosx_10_14_x86_64.whl(708.83 KB)
rgf_python-3.4.0-py2.py3-none-manylinux1_i686.whl(771.61 KB)
rgf_python-3.4.0-py2.py3-none-manylinux1_x86_64.whl(739.28 KB)
rgf_python-3.4.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.4.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.4.0.tar.gz(209.37 KB)
3.3.0(Jul 21, 2018)
Fix feature importances bug

Change repository structure

Change license

Fix doc

Refactoring

Source code(tar.gz)
Source code(zip)
rgf.exe(692.00 KB)
rgf32.exe(575.50 KB)
rgf_python-3.3.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.whl(710.62 KB)
rgf_python-3.3.0-py2.py3-none-manylinux1_i686.whl(770.16 KB)
rgf_python-3.3.0-py2.py3-none-manylinux1_x86_64.whl(737.82 KB)
rgf_python-3.3.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.3.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.3.0.tar.gz(207.75 KB)
3.2.0(Jun 16, 2018)
Added note about gcc version on macOS

Support feature_importances_ and dump_model() of RGF estimators

Source code(tar.gz)
Source code(zip)
forest_predict(563.41 KB)
forest_train(557.71 KB)
rgf(860.39 KB)
rgf_python-3.2.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.whl(711.22 KB)
rgf_python-3.2.0-py2.py3-none-manylinux1_i686.whl(771.29 KB)
rgf_python-3.2.0-py2.py3-none-manylinux1_x86_64.whl(738.70 KB)
rgf_python-3.2.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.2.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.2.0.tar.gz(212.81 KB)
3.1.0(Feb 23, 2018)
FastRGF was updated and a bug with small sample weights was fixed. (Thanks @StrikerRUS!)

RGF supports absolute error loss for regression.

Source code(tar.gz)
Source code(zip)
forest_predict(563.41 KB)
forest_train(557.71 KB)
rgf(864.45 KB)
rgf_python-3.1.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.whl(710.34 KB)
rgf_python-3.1.0-py2.py3-none-manylinux1_i686.whl(774.50 KB)
rgf_python-3.1.0-py2.py3-none-manylinux1_x86_64.whl(741.91 KB)
rgf_python-3.1.0-py2.py3-none-win32.whl(1.50 MB)
rgf_python-3.1.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.1.0.tar.gz(211.63 KB)
3.0.0(Feb 10, 2018)
As of 3.0.0, FastRGF is officially supported and is no longer an alpha version

Installation improvement

Fix COO matrix

All contributions to 3.0.0 were made by @StrikerRUS. Many thanks!
Source code(tar.gz)
Source code(zip)
forest_predict(563.41 KB)
forest_train(557.71 KB)
rgf(859.74 KB)
rgf_python-3.0.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.whl(710.93 KB)
rgf_python-3.0.0-py2.py3-none-manylinux1_i686.whl(777.04 KB)
rgf_python-3.0.0-py2.py3-none-manylinux1_x86_64.whl(743.87 KB)
rgf_python-3.0.0-py2.py3-none-win32.whl(1.51 MB)
rgf_python-3.0.0-py2.py3-none-win_amd64.whl(1.64 MB)
rgf_python-3.0.0.tar.gz(210.86 KB)
2.3.0(Jan 8, 2018)
FastRGF parameter validation (Thank you, @StrikerRUS!).

Rename FastRGF parameters to match sklearn models (Thank you, @StrikerRUS!).

FastRGF on-the-fly compilation.

Add docker image usage.

Fix view in Jupyter notebook (Thank you, @StrikerRUS!).

Improve model input efficiency (Thank you, @StrikerRUS!).

Source code(tar.gz)
Source code(zip)
forest_predict(563.07 KB)
forest_train(557.37 KB)
rgf_python-2.3.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.whl(705.77 KB)
rgf_python-2.3.0-py2.py3-none-manylinux1_i686.whl(1.13 MB)
rgf_python-2.3.0-py2.py3-none-manylinux1_x86_64.whl(741.63 KB)
rgf_python-2.3.0-py2.py3-none-win32.whl(741.74 KB)
rgf_python-2.3.0-py2.py3-none-win_amd64.whl(806.40 KB)
rgf_python-2.3.0.tar.gz(208.07 KB)
2.2.0(Dec 29, 2017)
Support FastRGF as an alpha version. Please see https://github.com/fukatani/rgf_python/blob/master/FastRGF.rst

Overall refactoring and a change of directory configuration (Thank you, @StrikerRUS!)

Enable importing models via from rgf import RGFRegressor (or another model). The conventional import from rgf.sklearn import RGFRegressor still works (Thank you, @StrikerRUS!)

Source code(tar.gz)
Source code(zip)
rgf_python-2.2.0-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.whl(323.44 KB)
rgf_python-2.2.0-py2.py3-none-manylinux1_i686.whl(387.20 KB)
rgf_python-2.2.0-py2.py3-none-manylinux1_x86_64.whl(357.44 KB)
rgf_python-2.2.0-py2.py3-none-win32.whl(291.82 KB)
rgf_python-2.2.0-py2.py3-none-win_amd64.whl(330.54 KB)
rgf_python-2.2.0.tar.gz(168.80 KB)
2.1.2(Dec 4, 2017)

Source code(tar.gz)
Source code(zip)
rgf_python-2.1.2-py2.py3-none-macosx_10_6_x86_64.macosx_10_7_x86_64.macosx_10_8_x86_64.macosx_10_9_x86_64.macosx_10_10_x86_64.macosx_10_11_x86_64.macosx_10_12_x86_64.macosx_10_13_x86_64.whl(318.51 KB)
rgf_python-2.1.2-py2.py3-none-manylinux1_i686.whl(382.27 KB)
rgf_python-2.1.2-py2.py3-none-manylinux1_x86_64.whl(352.50 KB)
rgf_python-2.1.2-py2.py3-none-win32.whl(286.89 KB)
rgf_python-2.1.2-py2.py3-none-win_amd64.whl(325.60 KB)
rgf_python-2.1.2.tar.gz(165.72 KB)
2.1.1(Dec 2, 2017)
Add cleanup API

Source code(tar.gz)
Source code(zip)
rgf(859.80 KB)
rgf_python-2.1.1-py2.py3-none-any.whl(318.51 KB)
rgf_python-2.1.1-py2.py3-none-manylinux1_i686.whl(381.45 KB)
rgf_python-2.1.1-py2.py3-none-manylinux1_x86_64.whl(352.50 KB)
rgf_python-2.1.1-py2.py3-none-win32.whl(286.89 KB)
rgf_python-2.1.1-py2.py3-none-win_amd64.whl(325.61 KB)
rgf_python-2.1.1.tar.gz(165.72 KB)
2.1.0(Nov 10, 2017)
We are extremely pleased to announce the release of rgf_python 2.1.0, many thanks to the contributors and all users!

Support serialization by pickle and joblib.

Changed some error messages and improved debuggability.

Source code(tar.gz)
Source code(zip)
2.0.4(Oct 3, 2017)

We fixed #65. Thanks @StrikerRUS and @abczhqiang!
Source code(tar.gz)
Source code(zip)
2.0.2(Aug 21, 2017)

Hot fix.
Source code(tar.gz)
Source code(zip)
2.0.1(Aug 20, 2017)

Hot fix for installation.
Source code(tar.gz)
Source code(zip)
1.3.0(Jun 29, 2017)
Support on-the-fly compilation when installing with pip or setup.py. You can now install rgf_python with a single command!

Support memory use policy option.

Support parallel fitting when n_classes > 2.

Support displaying weight optimization information with verbose > 5.

All contributions were made by @StrikerRUS. Thanks a lot!
Source code(tar.gz)
Source code(zip)
1.0.0(Jun 3, 2017)
HyperParameter is added.

HyperParameter type and value validation is supported.

Renamed module.

Documentation fix.

The executable file directory is located via PATH. (Thank you @eyadsibai )

Prediction probability is fixed.

Refactoring.

Most contributions were made by @StrikerRUS. Thanks a lot!
Source code(tar.gz)
Source code(zip)
0.2.0(Aug 21, 2016)

This is a hot-fix release that fixes a Linux environment issue ( #4 ).
Linux users are recommended to use this version.
Source code(tar.gz)
Source code(zip)
rgf1.2.zip(1.06 MB)

Owner

RGF-team

GitHub Repository

Python library which makes it possible to dynamically mask/anonymize data using JSON string or python dict rules in a PySpark environment.

pyspark-anonymizer Python library which makes it possible to dynamically mask/anonymize data using JSON string or python dict rules in a PySpark envir

6 Jun 30, 2022

A logistic regression model for health insurance purchasing prediction

Logistic_Regression_Model A logistic regression model for health insurance purchasing prediction This code is using these packages, so please make sur

1 Nov 29, 2021

A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.

5.7k Dec 30, 2022

PySpark + Scikit-learn = Sparkit-learn

Sparkit-learn PySpark + Scikit-learn = Sparkit-learn GitHub: https://github.com/lensacom/sparkit-learn About Sparkit-learn aims to provide scikit-lear

1.1k Jan 04, 2023

Distributed scikit-learn meta-estimators in PySpark

sk-dist: Distributed scikit-learn meta-estimators in PySpark What is it? sk-dist is a Python package for machine learning built on top of scikit-learn

282 Dec 09, 2022

Python-based implementations of algorithms for learning on imbalanced data.

ND DIAL: Imbalanced Algorithms Minimalist Python-based implementations of algorithms for imbalanced learning. Includes deep and representational learn

220 Dec 13, 2022

Machine Learning Algorithms ( Desion Tree, XG Boost, Random Forest )

implementation of machine learning Algorithms such as decision tree and random forest and xgboost on darasets then compare results for each and implement ant colony and genetic algorithms on tsp map,

1 Jan 19, 2022

Bayesian Modeling and Computation in Python

Bayesian Modeling and Computation in Python Open access and Code This repository contains the open access version of the text and the code examples in

339 Jan 02, 2023

This project makes it possible to read out the EVN (Netz Niederösterreich) smart meter via the customer interface.

SmartMeterEVN This project makes it possible to read out the EVN (Netz Niederösterreich) smart meter via the customer interface. Smart meters are

43 Dec 04, 2022

Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Now updated with Dask to handle millions of rows.

Auto_TS: Auto_TimeSeries Automatically build multiple Time Series models using a Single Line of Code. Now updated with Dask. Auto_timeseries is a comp

519 Jan 03, 2023

This project impelemented for midterm of the Machine Learning #Zoomcamp #Alexey Grigorev

MLProject_01 This project impelemented for midterm of the Machine Learning #Zoomcamp #Alexey Grigorev Context Dataset English question data set file F

1 Dec 18, 2021

Tutorials, examples, collections, and everything else that falls into the categories: pattern classification, machine learning, and data mining

**Tutorials, examples, collections, and everything else that falls into the categories: pattern classification, machine learning, and data mining.** S

4k Dec 30, 2022


Predicting diabetes over a five year period using logistic regression and the Pima First-Nation dataset

Module is created to build a spam filter using Python and the multinomial Naive Bayes algorithm.

This is a curated list of medical data for machine learning

Implemented four supervised learning Machine Learning algorithms

MooGBT is a library for Multi-objective optimization in Gradient Boosted Trees.

dirty_cat is a Python module for machine-learning on dirty categorical variables.

Machine learning template for projects based on sklearn library.

An open-source library of algorithms to analyse time series in GPU and CPU.