Important dataframe statistics with a single command

Last update: Dec 19, 2021

Overview

quick_eda

Get dataframe statistics with one command

Project description

A Python package for data scientists, students, ML engineers, and anyone else who wants dataframe metadata without having to type numerous commands.

Installation

Install quick-eda with pip by running the following command:

pip install quick-eda

License

This package is licensed under the BSD 3-Clause License.

Example usage

You can import the individual submodules of the package, for example:

import quick_eda.df_eda
import quick_eda.column_eda

This loads the submodules quick_eda.df_eda and quick_eda.column_eda. Their functions must then be referenced by their full dotted names:

quick_eda.df_eda.df_eda(<df>)
quick_eda.column_eda.column_eda(<df>, <column_name>)

An alternative way of importing the submodules is:

from quick_eda import df_eda
from quick_eda import column_eda

This also loads the submodules quick_eda.df_eda and quick_eda.column_eda, and makes them available without the quick_eda package prefix, so they can be used as follows:

df_eda.df_eda(<df>)
column_eda.column_eda(<df>, <column_name>)

Yet another variation is to import the desired functions directly:

from quick_eda.df_eda import df_eda
from quick_eda.column_eda import column_eda

Again, this loads the submodules, but makes the functions themselves directly available:

df_eda(<df>)
column_eda(<df>, <column_name>)
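The three import styles differ only in which name ends up in your namespace; the function behind it is the same object each time. The sketch below illustrates this with the standard-library package urllib and its submodule urllib.parse as stand-ins, so it runs even where quick_eda is not installed. The stand-in package and the variable names are assumptions made for illustration, not part of quick_eda.

```python
import sys

# Style 1: import the submodule and reference the function by its full dotted name.
import urllib.parse
full_name = urllib.parse.urlparse

# Style 2: import the submodule from the package, dropping the package prefix.
from urllib import parse
module_name = parse.urlparse

# Style 3: import the function itself.
from urllib.parse import urlparse

# All three names are bound to the same function object.
same_object = full_name is module_name is urlparse

# In every case the submodule was loaded and cached by the import system.
submodule_loaded = "urllib.parse" in sys.modules

print(same_object, submodule_loaded)
```

Which style to use is therefore a matter of readability: the full dotted name makes the origin of a function obvious, while importing the function directly keeps call sites short.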

Imagine you have a dataframe called pets with the columns name, age and color. You could then compute statistics for either the entire dataframe or a single column, such as age, with:

df_eda(pets)
column_eda(pets, "age")
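The pets example above can be made concrete as follows. This is a minimal sketch that assumes pandas is installed; the sample rows are invented for illustration. If quick_eda is not available, the sketch falls back to plain pandas describe() calls, which are a rough stand-in and not necessarily what quick_eda reports.

```python
import pandas as pd

# Hypothetical sample data with the columns named in the text.
pets = pd.DataFrame({
    "name": ["Rex", "Mia", "Bubbles"],
    "age": [4, 2, 1],
    "color": ["brown", "black", "gold"],
})

try:
    from quick_eda.df_eda import df_eda
    from quick_eda.column_eda import column_eda
except ImportError:
    # quick_eda is not installed; use the pandas fallback below.
    df_eda = column_eda = None

if df_eda is not None:
    df_eda(pets)             # statistics for the whole dataframe
    column_eda(pets, "age")  # statistics for the age column only
else:
    # Assumed stand-in: pandas' own summary statistics.
    print(pets.describe(include="all"))
    print(pets["age"].describe())
```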

Source code & further information

The source code is maintained at https://github.com/sveneschlbeck/quick_eda
The repository also contains further information about the BSD license, the contributing guidelines and more.

Owner

Sven Eschlbeck
