A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Last update: Nov 05, 2022

Related tags

Overview

Feature Forge

This library provides a set of tools that can be useful in many machine learning applications (classification, clustering, regression, etc.), and particularly helpful if you use scikit-learn (although this can work if you have a different algorithm).

Most machine learning problems involve an step of feature definition and preprocessing. Feature Forge helps you with:

Defining and documenting features
Testing your features against specified cases and against randomly generated cases (stress-testing). This helps you making your application more robust against invalid/misformatted input data. This also helps you checking that low-relevance results when doing feature analysis is actually because the feature is bad, and not because there's a slight bug in your feature code.
Evaluating your features on a data set, producing a feature evaluation matrix. The evaluator has a robust mode that allows you some tolerance both for invalid data and buggy features.
Experimentation: running, registering, classifying and reproducing experiments for determining best settings for your problems.

Installation

Just pip install featureforge.

Documentation

Documentation is available at http://feature-forge.readthedocs.org/en/latest/

Contact information

Javier Mansilla <[email protected]> (jmansilla at github)
Daniel Moisset <[email protected]> (dmoisset at github)
Rafael Carrascosa <[email protected]> (rafacarrascosa at github)

Any contributions or suggestions are welcome, the official channel for this is submitting github pull requests or issues.

Changelog

0.1.7:

StatsManager api change (order of arguments swapped)
For experimentation, enabled a way of booking experiments forever.

0.1.6:

Bug fixes related to sparse matrices.
Small documentation improvements.
Reduced default logging verbosity.

0.1.5:

Using sparse numpy matrices by default.

0.1.4:

Discarded the need of using forked version of Schema library.

0.1.3:

Added support for running and generating stats for experiments

0.1.2:

Fixing installer dependencies

0.1.1:

Added support for python 3
Added support for bag-of-words features

0.1:

Initial release

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Related tags

Overview

Feature Forge

Installation

Documentation

Contact information

Changelog

Owner

Machinalis

Toward Spatially Unbiased Generative Models (ICCV 2021)

Code and data for ImageCoDe, a contextual vison-and-language benchmark

Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices

Official Pytorch implementation of Meta Internal Learning

Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"

This repository contains the accompanying code for Deep Virtual Markers for Articulated 3D Shapes, ICCV'21

A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision for Visual Scene Graph Generation''

Artificial intelligence technology inferring issues and logically supporting facts from raw text

Open-CyKG: An Open Cyber Threat Intelligence Knowledge Graph

GeoTransformer - Geometric Transformer for Fast and Robust Point Cloud Registration

Official implementation of "UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer"

The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text"

Pytorch implementation for "Implicit Semantic Response Alignment for Partial Domain Adaptation"

The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.

Synthetic LiDAR sequential point cloud dataset with point-wise annotations

An Implementation of SiameseRPN with Feature Pyramid Networks

A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)

Fuse radar and camera for detection

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Neural implicit reconstruction experiments for the Vector Neuron paper

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Related tags

Overview

Feature Forge

Installation

Documentation

Contact information

Changelog

Owner

Machinalis

Toward Spatially Unbiased Generative Models (ICCV 2021)

Code and data for ImageCoDe, a contextual vison-and-language benchmark

Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices

Official Pytorch implementation of Meta Internal Learning

Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"

This repository contains the accompanying code for Deep Virtual Markers for Articulated 3D Shapes, ICCV'21

A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision for Visual Scene Graph Generation''

Artificial intelligence technology inferring issues and logically supporting facts from raw text

Open-CyKG: An Open Cyber Threat Intelligence Knowledge Graph

GeoTransformer - Geometric Transformer for Fast and Robust Point Cloud Registration

Official implementation of "UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer"

The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text"

Pytorch implementation for "Implicit Semantic Response Alignment for Partial Domain Adaptation"

The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.

Synthetic LiDAR sequential point cloud dataset with point-wise annotations

An Implementation of SiameseRPN with Feature Pyramid Networks

A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)

Fuse radar and camera for detection

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Neural implicit reconstruction experiments for the Vector Neuron paper

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.