A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas.

Overview

staircase logo

The leading use-case for the staircase package is for the creation and analysis of step functions.

Pretty exciting huh.

But don't hit the close button on the browser just yet. Let us convince you that much of the world around you can be modelled as step functions.

For example, the number of users viewing this page over time can be modelled as a step function. The value of the function increases by 1 every time a user arrives at the page, and decreases by 1 every time a user leaves the page. Let's say we have this data in vector format (i.e. tuple, list, numpy array, pandas series). Specifically, assume arrive and leave are vectors of times, expressed as minutes past midnight, for all page views occuring yesterday. Creating the corresponding step function is simple. To achieve it we use the Stairs class:

>>> import staircase as sc

>>> views = sc.Stairs()
>>> views.layer(arrive,leave)

We can visualise the function with the plot function:

>>> views.plot()

pageviews example

We can find the total time in minutes the page was viewed:

>>> views.clip(0,1440).integral()
9297.94622521079

We can find the average number of viewers:

>>> views.clip(0,1440).mean()
6.4569071008408265

We can find the average number of viewers, per hour of the day, and plot:

>>> views.slice(pd.interval_range(0, periods=24, freq=60)).mean().plot()

mean page views per hour

We can find the maximum concurrent views:

>>> views.clip(0,1440).max()
16

We can create histogram data showing relative frequency of concurrent viewers (and plot it):

>>> views.clip(0,1440).hist().plot.bar()

concurrent viewers histogram

Plotting is based on matplotlib and it requires relatively little effort to take the previous chart and improve the aesthetics:

concurrent viewers histogram (aesthetic)

There is plenty more analysis that could be done. The staircase package provides a rich variety of arithmetic operations, relational operations, logical operations, statistical operations, for use with Stairs, in addition to functions for univariate analysis, aggregations and compatibility with pandas.Timestamp.

Installation

staircase can be installed from PyPI:

python -m pip install staircase

or also with conda:

conda install -c conda-forge staircase

Documentation

The complete guide to using staircase can be found at staircase.dev

Contributing

There are many ways in which contributions can be made - the first and foremost being using staircase and giving feedback.

Bug reports, feature requests and ideas can be submitted via the Github issue tracker.

Additionally, bug fixes. enhancements, and improvements to the code and documentation are also appreciated and can be done via pull requests. Take a look at the current issues and if there is one you would like to work on please leave a comment to that effect.

See this beginner's guide to contributing, or Pandas' guide to contributing, to learn more about the process.

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

License

This project is licensed under the MIT License - see the LICENSE file for details

Acknowledgments

The seeds of staircase began developing at the Hunter Valley Coal Chain Coordinator, where it finds strong application in analysing simulated data. Thanks for the support!

Comments
  • POC _closed value check

    POC _closed value check

    Not complete but seems to do what's desired for the add & subtract operations.

    • [x] closes #96
    • [x] tests added / passed
    • [x] ensure all linting tests pass
    • [x] changelog entry
    opened by amagee 13
  • BUG: error message appearing in debugging mode

    BUG: error message appearing in debugging mode

    • [X] I have checked that this issue has not already been reported.

    • [X] I have confirmed this bug exists on the latest version of staircase.


    Note: Please read this guide detailing how to provide the necessary information for us to reproduce your bug.

    Code Sample, a copy-pastable example

    # This in the code
    import staircase as sc
    import pandas as pd
    
    # this run in debug interactive mode
    pd.DataFrame()
    
    
    

    Problem description

    I'm still trying to make a minimum reproducible example, but for me it seems that somehow the only inclusion of the staircase dependence even without using it, is changing some pandas behaviour and raising an error. The error is non breaking and it appears when debugging code (interactively) that contains pandas dataframes even if staircase is not actually used in any operation and only imported at some point.

    The problem appears with delay and is not breaking, I'm not sure how to find the issue.

    Expected Output

    Nothing

    Dependency Versions``

    staircase 2.4.2 pandas 1.4.1 numpy 1.22.2

    Traceback (most recent call last): File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydevd_bundle\pydevd_comm.py", line 1590, in do_it msg = _pydev_completer.generate_completions_as_xml(frame, self.act_tok) File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydev_bundle_pydev_completer.py", line 172, in generate_completions_as_xml completions = completer.complete(act_tok) File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydev_bundle_pydev_completer.py", line 89, in complete return self.attr_matches(text) File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydev_bundle_pydev_completer.py", line 151, in attr_matches words = dir2(obj, filter=filter) File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydev_bundle_pydev_imports_tipper.py", line 178, in generate_imports_tip_for_module dir_comps = dir(obj_to_complete) File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\accessor.py", line 40, in dir rv = (rv - self._dir_deletions()) | self._dir_additions() File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\generic.py", line 5632, in _dir_additions additions = super()._dir_additions() File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\accessor.py", line 29, in _dir_additions return {accessor for accessor in self._accessors if hasattr(self, accessor)} File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\accessor.py", line 29, in return {accessor for accessor in self._accessors if hasattr(self, accessor)} File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\accessor.py", line 182, in get accessor_obj = self._accessor(obj) File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\staircase\core\arrays\accessor.py", line 14, in init raise TypeError( TypeError: sc accessor only valid for Series with Stairs dtype. Convert using .astype("Stairs").

    I think the problem is related to the word "sc" in the import and:

    @register_series_accessor("sc") class StairsAccessor: ...

    in accessor.

    opened by PabloRuizCuevas 11
  • Typo Update in 05_comparing.rst

    Typo Update in 05_comparing.rst

    Below is the changes made to the file in

    - To answer the question of "what if the step function is not boolean valued" we appeal to the boolean definition that Python applies to numbers;anything not zero is considered *true*, and consequently only zero is false.
    +To answer the question of "what if the step function is not boolean valued" we appeal to the boolean definition that Python applies to numbers: anything not zero is considered *true*, and consequently only zero is false.
    

    Checklist:

    • [x] I have read freeCodeCamp's contribution guidelines.
    • [x] My pull request has a descriptive title (not a vague title like Update index.md)
    • [x] My pull request targets the main branch of freeCodeCamp.
    • [x] I have tested these changes either locally on my machine, or GitPod.

    Closes #99

    The list of files can be found in below link link

    • [x] closes #99
    opened by Pavithradevadiga 4
  • ENH: raise error if operands of binary operations do not have same

    ENH: raise error if operands of binary operations do not have same "closed" value

    Describe the solution you'd like

    Stairs instances have a "closed" property which can be "left" or "right" indicates if they are left-closed, right-open or right-closed left-open.

    When performing binary operations with two Stairs objects, like addition, &, >= etc it is important that both operands have the same value of closed, and an error should be raised if not.

    It would be great to have this functionality implemented as a function decorator, which knows to compare self and other parameters which appear in these binary operator methods.

    enhancement advanced high priority 
    opened by venaturum 4
  • ENH: Add Stairs.from_values method

    ENH: Add Stairs.from_values method

    Is your feature request related to a problem?

    Allow a user who has timeseries data in pandas, which represents step function values, to create a Stairs instance from this data.

    Describe the solution you'd like

    Similar to https://github.com/staircase-dev/staircase/blob/00da1209eec21d3ea11f0f18f538637d2bae5ecf/staircase/core/stairs.py#L73

    Parameters should include

    • initial_value (float or nan)
    • values (pandas.Series)
    • closed (either "left" or "right")

    values should be indexed by change points of step function. The values of values should be step function values when approaching the change points from the right (this is the internal representation used in "value" column of Stairs._data).

    Additional context

    See https://www.staircase.dev/en/latest/development/guide/internals.html for more context

    enhancement beginner high priority 
    opened by venaturum 4
  • BUG: staircase.make_test_data fails with numpy < 1.17

    BUG: staircase.make_test_data fails with numpy < 1.17

    • [ ] I have checked that this issue has not already been reported.

    • [ ] I have confirmed this bug exists on the latest version of staircase.


    Note: Please read this guide detailing how to provide the necessary information for us to reproduce your bug.

    Code Sample, a copy-pastable example

    >>> import staircase as sc
    >>> sc.make_test_data()
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "C:\Users\rclement\Documents\temp11\.venv\lib\site-packages\staircase\test_data.py", line 56, in make_test_data
        return _make_test_data(dates=dates, positive_only=positive_only, seed=seed)
      File "C:\Users\rclement\Documents\temp11\.venv\lib\site-packages\staircase\test_data.py", line 71, in _make_test_data
        rng = np.random.default_rng()
    AttributeError: module 'numpy.random' has no attribute 'default_rng'
    
    

    Problem description

    Expected Output

    Dependency Versions``

    staircase 2.0.0 pandas 1.0.0 numpy 1.16.0

    bug intermediate high priority 
    opened by venaturum 4
  • DOC: Typo in

    DOC: Typo in "How can I compare my step function"

    Location of the documentation

    https://www.staircase.dev/en/latest/getting_started/intro_tutorials/05_comparing.html#logical

    Note: You can check the latest versions of the docs on master here.

    Documentation problem

    Typo below:

    To answer the question of “what if the step function is not boolean valued” we appeal to the boolean definition that Python applies to numbers;anything not zero is considered true, and consequently only zero is false.

    documentation beginner medium priority 
    opened by venaturum 3
  • ENH: Allow Stairs.fillna to take a Stairs parameter

    ENH: Allow Stairs.fillna to take a Stairs parameter

    Is your feature request related to a problem?

    We want the value parameter in Stairs.fillna to be able to take a staircase.Stairs parameter

    current definition https://github.com/staircase-dev/staircase/blob/cb0265d1b182ad3a8452f544c197ce54aef6feea/staircase/core/ops/masking.py#L160

    docstring https://github.com/staircase-dev/staircase/blob/cb0265d1b182ad3a8452f544c197ce54aef6feea/staircase/core/ops/docstrings.py#L459

    Describe the solution you'd like

    See https://www.staircase.dev/en/latest/user_guide/cookbook.html#fill-undefined-intervals-of-one-step-function-with-another for an implementation

    enhancement beginner medium priority 
    opened by venaturum 3
  • Bump mistune from 0.8.4 to 2.0.3

    Bump mistune from 0.8.4 to 2.0.3

    Bumps mistune from 0.8.4 to 2.0.3.

    Release notes

    Sourced from mistune's releases.

    Version 2.0.2

    Fix escape_url via lepture/mistune#295

    Version 2.0.1

    Fix XSS for image link syntax.

    Version 2.0.0

    First release of Mistune v2.

    Version 2.0.0 RC1

    In this release, we have a Security Fix for harmful links.

    Version 2.0.0 Alpha 1

    This is the first release of v2. An alpha version for users to have a preview of the new mistune.

    Changelog

    Sourced from mistune's changelog.

    Changelog

    Here is the full history of mistune v2.

    Version 2.0.4

    
    Released on Jul 15, 2022
    
    • Fix url plugin in &lt;a&gt; tag
    • Fix * formatting

    Version 2.0.3

    Released on Jun 27, 2022

    • Fix table plugin
    • Security fix for CVE-2022-34749

    Version 2.0.2

    
    Released on Jan 14, 2022
    

    Fix escape_url

    Version 2.0.1

    Released on Dec 30, 2021

    XSS fix for image link syntax.

    Version 2.0.0

    
    Released on Dec 5, 2021
    

    This is the first non-alpha release of mistune v2.

    Version 2.0.0rc1

    Released on Feb 16, 2021

    Version 2.0.0a6

    
    </tr></table> 
    

    ... (truncated)

    Commits
    • 3f422f1 Version bump 2.0.3
    • a6d4321 Fix asteris emphasis regex CVE-2022-34749
    • 5638e46 Merge pull request #307 from jieter/patch-1
    • 0eba471 Fix typo in guide.rst
    • 61e9337 Fix table plugin
    • 76dec68 Add documentation for renderer heading when TOC enabled
    • 799cd11 Version bump 2.0.2
    • babb0cf Merge pull request #295 from dairiki/bug.escape_url
    • fc2cd53 Make mistune.util.escape_url less aggressive
    • 3e8d352 Version bump 2.0.1
    • Additional commits viewable in compare view

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 2
  • Test logical xor stairs 1

    Test logical xor stairs 1

    Seems to do what was asked for in the ticket, let me know if you want anything more / different.

    • [x] closes #105
    • [x] tests added / passed
    • [x] ensure all linting tests pass
    • [x] changelog entry
    opened by amagee 2
  • Updated black version, and flake 8

    Updated black version, and flake 8

    • [X] closes #165
    • [ ] tests added / passed
    • [x] ensure all linting tests pass
    • [ ] changelog entry

    No relevant changes, I thought we would have some little style change cause the new version of black, but nothing changed, so the PR is not the greatest contribution.

    Anyway, I set in the pypoetry.toml the line-lenght of black to be 88, which was already the the default, so no also not changes there, it is black and pandas default too, so it makes sense for this project. thought I usually like more 120 or so.

    opened by PabloRuizCuevas 1
  • Bump setuptools from 65.5.0 to 65.5.1

    Bump setuptools from 65.5.0 to 65.5.1

    Bumps setuptools from 65.5.0 to 65.5.1.

    Changelog

    Sourced from setuptools's changelog.

    v65.5.1

    Misc ^^^^

    • #3638: Drop a test dependency on the mock package, always use :external+python:py:mod:unittest.mock -- by :user:hroncok
    • #3659: Fixed REDoS vector in package_index.
    Commits

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 1
  • Bump certifi from 2022.9.24 to 2022.12.7

    Bump certifi from 2022.9.24 to 2022.12.7

    Bumps certifi from 2022.9.24 to 2022.12.7.

    Commits

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 0
  • Bump pillow from 9.2.0 to 9.3.0

    Bump pillow from 9.2.0 to 9.3.0

    Bumps pillow from 9.2.0 to 9.3.0.

    Release notes

    Sourced from pillow's releases.

    9.3.0

    https://pillow.readthedocs.io/en/stable/releasenotes/9.3.0.html

    Changes

    ... (truncated)

    Changelog

    Sourced from pillow's changelog.

    9.3.0 (2022-10-29)

    • Limit SAMPLESPERPIXEL to avoid runtime DOS #6700 [wiredfool]

    • Initialize libtiff buffer when saving #6699 [radarhere]

    • Inline fname2char to fix memory leak #6329 [nulano]

    • Fix memory leaks related to text features #6330 [nulano]

    • Use double quotes for version check on old CPython on Windows #6695 [hugovk]

    • Remove backup implementation of Round for Windows platforms #6693 [cgohlke]

    • Fixed set_variation_by_name offset #6445 [radarhere]

    • Fix malloc in _imagingft.c:font_setvaraxes #6690 [cgohlke]

    • Release Python GIL when converting images using matrix operations #6418 [hmaarrfk]

    • Added ExifTags enums #6630 [radarhere]

    • Do not modify previous frame when calculating delta in PNG #6683 [radarhere]

    • Added support for reading BMP images with RLE4 compression #6674 [npjg, radarhere]

    • Decode JPEG compressed BLP1 data in original mode #6678 [radarhere]

    • Added GPS TIFF tag info #6661 [radarhere]

    • Added conversion between RGB/RGBA/RGBX and LAB #6647 [radarhere]

    • Do not attempt normalization if mode is already normal #6644 [radarhere]

    ... (truncated)

    Commits

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 0
  • DOC: upgrade pydata-sphinx-theme

    DOC: upgrade pydata-sphinx-theme

    Current docs based on 0.8.1 and latest is 0.11.0

    Upgrading will result in several breaking changes which will result in build crashes, and unwanted theming.

    CSS and logos may need to be redone.

    opened by venaturum 0
  • DOC: demonstrate

    DOC: demonstrate "string timestamp" functionality in user guide and case studies

    In v2.4.0 the ability to pass string representations of timestamps into Stairs.clip, Stairs.mask, Stairs.where was introduced. This is so users can do this

    my_stepfunction.clip("2020", "2021")

    instead of my_stepfunction.clip(pd.Timestamp("2020"), pd.Timestamp("2021"))

    There are opportunities to utilise this new feature in examples found in the user guide and case studies. Perhaps the best approach is, for each page, use the longer version first, then shorter version in subsequent code, noting that the feature is being used.

    documentation beginner 
    opened by venaturum 0
Releases(v2.5.0)
  • v2.5.0(Oct 19, 2022)

    This is a minor release in the 2.x series and includes a couple of minor bugfixes. There is no added functionality however support for Python 3.6 has been dropped.

    See the changelogfor details.

    Source code(tar.gz)
    Source code(zip)
  • v2.4.2(May 13, 2022)

  • v2.4.1(May 9, 2022)

    This is a patch release in the 2.4.x series and includes a fix for a bug introduced in v2.4.0 where a pandas warning was generated when constructing/layering Stairs objects with NA-like arguments.

    Source code(tar.gz)
    Source code(zip)
  • v2.4.0(May 2, 2022)

  • v2.3.0(Dec 21, 2021)

  • v2.2.0(Nov 27, 2021)

    This is a minor release in the 2.x series and includes new functionality for pandas extensions including Extension Array, Series accessor, and several new methods.

    See the changelog for details.

    Source code(tar.gz)
    Source code(zip)
  • v2.1.1(Nov 2, 2021)

    This is a patch release in the 2.1.x series and includes a single bug fix. The bug is inherited from a known issue in pandas (https://github.com/pandas-dev/pandas/issues/34251) which causes staircase.Stairs.plot to fail when style="hlines" and numexpr package is installed.

    Source code(tar.gz)
    Source code(zip)
  • v2.1.0(Oct 29, 2021)

  • v2.0.4(Oct 12, 2021)

    This is a patch release in the 2.0.x series and includes a single bug fix:

    • bugfix for array aggregation where array contains a constant value step function over non-float domain (#GH119)
    Source code(tar.gz)
    Source code(zip)
  • v2.0.3(Oct 7, 2021)

  • v2.0.2(Oct 3, 2021)

  • v2.0.1(Sep 13, 2021)

  • v2.0.0(Aug 25, 2021)

    A major release with several enhancements added, namely masking, slicing and the implicit handling of dates. The library is now based on pandas and numpy, resulting in significant speedups. While much of the API has remained the same, or similar, there are many backwards incompatible changes and deprecations. See documentation at www.staircase.dev for details.

    Source code(tar.gz)
    Source code(zip)
Tablexplore is an application for data analysis and plotting built in Python using the PySide2/Qt toolkit.

Tablexplore is an application for data analysis and plotting built in Python using the PySide2/Qt toolkit.

Damien Farrell 81 Dec 26, 2022
Code for the DH project "Dhimmis & Muslims – Analysing Multireligious Spaces in the Medieval Muslim World"

Damast This repository contains code developed for the digital humanities project "Dhimmis & Muslims – Analysing Multireligious Spaces in the Medieval

University of Stuttgart Visualization Research Center 2 Jul 01, 2022
NumPy and Pandas interface to Big Data

Blaze translates a subset of modified NumPy and Pandas-like syntax to databases and other computing systems. Blaze allows Python users a familiar inte

Blaze 3.1k Jan 05, 2023
X-news - Pipeline data use scrapy, kafka, spark streaming, spark ML and elasticsearch, Kibana

X-news - Pipeline data use scrapy, kafka, spark streaming, spark ML and elasticsearch, Kibana

Nguyễn Quang Huy 5 Sep 28, 2022
Created covid data pipeline using PySpark and MySQL that collected data stream from API and do some processing and store it into MYSQL database.

Created covid data pipeline using PySpark and MySQL that collected data stream from API and do some processing and store it into MYSQL database.

2 Nov 20, 2021
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

AWS Data Wrangler Pandas on AWS Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretMana

Amazon Web Services - Labs 3.3k Jan 04, 2023
Learn machine learning the fun way, with Oracle and RedBull Racing

Red Bull Racing Analytics Hands-On Labs Introduction Are you interested in learning machine learning (ML)? How about doing this in the context of the

Oracle DevRel 55 Oct 24, 2022
A Big Data ETL project in PySpark on the historical NYC Taxi Rides data

Processing NYC Taxi Data using PySpark ETL pipeline Description This is an project to extract, transform, and load large amount of data from NYC Taxi

Unnikrishnan 2 Dec 12, 2021
Codes for the collection and predictive processing of bitcoin from the API of coinmarketcap

Codes for the collection and predictive processing of bitcoin from the API of coinmarketcap

Teo Calvo 5 Apr 26, 2022
Containerized Demo of Apache Spark MLlib on a Data Lakehouse (2022)

Spark-DeltaLake-Demo Reliable, Scalable Machine Learning (2022) This project was completed in an attempt to become better acquainted with the latest b

8 Mar 21, 2022
ASTR 302: Python for Astronomy (Winter '22)

ASTR 302, Winter 2022, University of Washington: Python for Astronomy Mario Jurić Location When: 2:30-3:50, Monday & Wednesday, Winter quarter 2022 Wh

UW ASTR 302: Python for Astronomy 4 Jan 12, 2022
Program that predicts the NBA mvp based on data from previous years.

NBA MVP Predictor A machine learning model using RandomForest Regression that predicts NBA MVP's using player data. Explore the docs » View Demo · Rep

Muhammad Rabee 1 Jan 21, 2022
Hg002-qc-snakemake - HG002 QC Snakemake

HG002 QC Snakemake To Run Resources and data specified within snakefile (hg002QC

Juniper A. Lake 2 Feb 16, 2022
VHub - An API that permits uploading of vulnerability datasets and return of the serialized data

VHub - An API that permits uploading of vulnerability datasets and return of the serialized data

André Rodrigues 2 Feb 14, 2022
Statistical Rethinking: A Bayesian Course Using CmdStanPy and Plotnine

Statistical Rethinking: A Bayesian Course Using CmdStanPy and Plotnine Intro This repo contains the python/stan version of the Statistical Rethinking

Andrés Suárez 3 Nov 08, 2022
Pyspark Spotify ETL

This is my first Data Engineering project, it extracts data from the user's recently played tracks using Spotify's API, transforms data and then loads it into Postgresql using SQLAlchemy engine. Data

16 Jun 09, 2022
Lale is a Python library for semi-automated data science.

Lale is a Python library for semi-automated data science. Lale makes it easy to automatically select algorithms and tune hyperparameters of pipelines that are compatible with scikit-learn, in a type-

International Business Machines 293 Dec 29, 2022
Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video. You can chose the cha

2 Jul 22, 2022
Larch: Applications and Python Library for Data Analysis of X-ray Absorption Spectroscopy (XAS, XANES, XAFS, EXAFS), X-ray Fluorescence (XRF) Spectroscopy and Imaging

Larch: Data Analysis Tools for X-ray Spectroscopy and More Documentation: http://xraypy.github.io/xraylarch Code: http://github.com/xraypy/xraylarch L

xraypy 95 Dec 13, 2022
Fancy data functions that will make your life as a data scientist easier.

WhiteBox Utilities Toolkit: Tools to make your life easier Fancy data functions that will make your life as a data scientist easier. Installing To ins

WhiteBox 3 Oct 03, 2022