WithPipe is a simple utility for functional piping in Python.

Last update: Oct 26, 2021

Overview

WithPipe

Introduction

WithPipe is a simple utility for functional piping in Python. The package exposes a context manager (used with with) called PipeContext, that allows you to access any function in any scope as a partial, meaning that it's naturally pipeable. Here's a contrived example from the test suite:

import numpy as np
from with_pipe import PipeContext
from pipetools import pipe

with PipeContext() as _:
    ret = (
        10 > pipe |
        _.np.ones() |
        _.np.reshape(newshape=(5, 2)) |
        _.np.mean() |
        _.int()
    )
    assert ret == 1

As you can see, we were able to call both numpy and built-in functions on the _ object, and it executed the pipeline similarly to say R's magrittr package.

Installation

pip install git+https://github.com/multimeric/WithPipe.git

Usage

Actually WithPipe doesn't provide an actual piping mechanism, but it does add a useful syntax for use with pipes. For the actual piping mechanism, I suggest that you try pipetools, which this package is actually tested against.

WithPipe provides a single class: PipeContext. The way you use PipeContext is by first using it as a context manager:

with PipeContext() as _:

Then, using the return value of the context manager, which we have named _ (but you could call it anything), you access attributes and items (using .attr or ["key"] or [0]) to locate the function you want and then you finally call it (), which will create the partial. You can use positional and keyword arguments at this point if you need

For more usage information, refer to the test suite.

Tests

Note: you will need poetry installed.

git clone https://github.com/multimeric/WithPipe.git
cd WithPipe
poetry install --extras pipetools
poetry run pytest test/

WithPipe is a simple utility for functional piping in Python.

Related tags

Overview

WithPipe

Introduction

Installation

Usage

Tests

Owner

Michael Milton

Business Intelligence (BI) in Python, OLAP

Includes all files needed to satisfy hw02 requirements

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Synthetic Data Generation for tabular, relational and time series data.

A data analysis using python and pandas to showcase trends in school performance.

AptaMat is a simple script which aims to measure differences between DNA or RNA secondary structures.

PLStream: A Framework for Fast Polarity Labelling of Massive Data Streams

Port of dplyr and other related R packages in python, using pipda.

Repository created with LinkedIn profile analysis project done

:truck: Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

Processo de ETL (extração, transformação, carregamento) realizado pela equipe no projeto final do curso da Soul Code Academy.

Incubator for useful bioinformatics code, primarily in Python and R

PostQF is a user-friendly Postfix queue data filter which operates on data produced by postqueue -j.

PySpark Structured Streaming ROS Kafka ApacheSpark Cassandra

Manage large and heterogeneous data spaces on the file system.

An ETL Pipeline of a large data set from a fictitious music streaming service named Sparkify.

A Python module for clustering creators of social media content into networks

Larch: Applications and Python Library for Data Analysis of X-ray Absorption Spectroscopy (XAS, XANES, XAFS, EXAFS), X-ray Fluorescence (XRF) Spectroscopy and Imaging

VevestaX is an open source Python package for ML Engineers and Data Scientists.