Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression

Last update: Mar 15, 2022

Related tags

Deep Learning LassoBench

Overview

LassoBench

LassoBench is a library for high-dimensional hyperparameter optimization benchmarks based on Weighted Lasso regression.

Note: LassoBench is under active construction. Follow for more benchmarks soon.

Install and work with the development version

From a console or terminal clone the repository and install LassoBench:

git clone https://github.com/ksehic/LassoBench.git
cd LassoBench/
pip install -e .

Overview

The objective is to optimize the multi-dimensional hyperparameter that balances the least-squares estimation and the penalty term that promotes the sparsity.

The ambient space bounds are defined between [-1, 1].

LassoBench comes with two classes SyntheticBenchmark and RealBenchmark. While RealBenchmark is based on real-world applications found in medicine and finance, SyntheticBenchmark covers synthetic well-defined conditions. The user can select one of the predefined synthetic benchmarks or create a different bechmark.

Each benchmark comes with .evaluate that is used to evaluate the objective function, .test that provides the post-processing metrics (such as MSE on the test data and the F-score for synt benchs) and the argument mf_opt to define the multi-fidelity framework that is evaluated via .fidelity_evaluate.

The results are compared with the baselines LassoCV (.run_LASSOCV), AdaptiveLassoCV (to be implemented soon) and Sparse-HO (.run_sparseho).

Simple experiments are provided in example.py. In hesbo_example.py and alebo_example.py, we demostrate how to use LassoBench with some well-known HPO algorithms for high-dimensional problems.

Please refer to the reference for more details.

.
├── ...
├── example                    # Examples how to use LassoBench for HDBO algorithms
│   ├── alebo_example.py       # ALEBO applied on synt bench
│   ├── example.py             # Simple cases how to run with synt, real, and multifidelity benchs
│   ├── hesbo_example.py        # HesBO applied on synt and real bench
│   ├── hesbo_lib.pu            # HesBO library
│
└── ...

License

LassoBench is distributed under the MIT license. More information on the license can be found here

Simple synthetic bench code

import numpy as np
import LassoBench
synt_bench = LassoBench.SyntheticBenchmark(pick_bench='synt_simple')
d = synt_bench.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
loss = synt_bench.evaluate(random_config)

Real-world bench code

import numpy as np
import LassoBench
real_bench = LassoBench.RealBenchmark(pick_data='rcv1')
d = real_bench.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
loss = real_bench.evaluate(random_config)

Multi-information source bench code

import numpy as np
import LassoBench
real_bench_mf = LassoBench.RealBenchmark(pick_data='rcv1', mf_opt='discrete_fidelity')
d = real_bench_mf.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
fidelity_pick = 0
loss = real_bench_mf.fidelity_evaluate(random_config, index_fidelity=fidelity_pick)

List of synthetic benchmarks

Name	Dimensionality	Axis-aligned Subspace
synt_simple	60	3
synt_medium	100	5
synt_high	300	15
synt_hard	1000	50

List of real world benchmarks

Name	Dimensionality	Approx. Axis-aligned Subspace
breast_cancer	10	3
diabetes	8	5
leukemia	7 129	22
dna	180	43
rcv1	19 959	75

Cite

If you use this code, please cite:


Šehić Kenan, Gramfort Alexandre, Salmon Joseph and Nardi Luigi. "LassoBench: A High-Dimensional Hyperparameter Optimization Benchmark Suite for Lasso", TBD, 2021.

Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression

Related tags

Overview

LassoBench

Install and work with the development version

Overview

License

Simple synthetic bench code

Real-world bench code

Multi-information source bench code

List of synthetic benchmarks

List of real world benchmarks

Cite

Owner

Kenan Šehić

A whale detector design for the Kaggle whale-detector challenge!

Code of the lileonardo team for the 2021 Emotion and Theme Recognition in Music task of MediaEval 2021

Sub-tomogram-Detection - Deep learning based model for Cyro ET Sub-tomogram-Detection

This is a code repository for the paper "Graph Auto-Encoders for Financial Clustering".

OCR-D wrapper for detectron2 based segmentation models

The codes and models in 'Gaze Estimation using Transformer'.

Official PyTorch implementation of PS-KD

A set of tools to pre-calibrate and calibrate (multi-focus) plenoptic cameras (e.g., a Raytrix R12) based on the libpleno.

FaceAnon - Anonymize people in images and videos using yolov5-crowdhuman

I will implement Fastai in each projects present in this repository.

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

Dogs classification with Deep Metric Learning using some popular losses

Local-Global Stratified Transformer for Efficient Video Recognition

Graph neural network message passing reframed as a Transformer with local attention

Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation

Framework that uses artificial intelligence applied to mathematical models to make predictions

A list of multi-task learning papers and projects.

Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models Benchmark and Efficient Evaluation

Official repository for the ICCV 2021 paper: UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model.

Extreme Rotation Estimation using Dense Correlation Volumes