Statistical Rethinking: A Bayesian Course Using CmdStanPy and Plotnine

Last update: Nov 08, 2022

Intro

This repo contains the Python/Stan version of the Statistical Rethinking course that Professor Richard McElreath taught at the Max Planck Institute for Evolutionary Anthropology in Leipzig during the winter of 2019/2020. The original repo for the course, from which this repo is forked, can be found here. The course consists of 20 lectures spread over 10 weeks, with a series of assignments for each week. It is an excellent introduction to Bayesian modelling in general and to Professor McElreath's wonderful book, Statistical Rethinking.

How to use this repo

There are ten Jupyter notebooks, one for each week of the course. At the beginning of each notebook there are links to the YouTube videos of the lectures, the slides used, and the original homework questions and answers in R.

I suggest using this repo as follows:

Go to the notebook of the week.
Watch the two lecture videos for that week. Their URLs are at the very top of each notebook.
Read the original problems presented to the students and try to solve them on your own.
Follow the exercise solutions in the notebook, which combine my code with Professor McElreath's explanations.

Installing `CmdStanPy`

The Stan code is executed with CmdStanPy, a lightweight pure-Python interface to CmdStan that provides access to the Stan compiler and all inference algorithms. It provides the function `install_cmdstan()`, which downloads CmdStan from GitHub and builds the CmdStan utilities. It can be called from within Python or from the command line.

```python
import cmdstanpy
cmdstanpy.install_cmdstan()
```

You can find more information about the installation process here.
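Once CmdStan is built, a quick way to confirm that everything works is to compile and sample from a very small model. The sketch below is not taken from the notebooks: it assumes the globe-tossing example from the book (6 "water" observations in 9 tosses), and the file name `globe.stan` and the helper names `write_stan_file`, `check_cmdstan` and `fit_globe_model` are hypothetical choices made for illustration.

```python
import tempfile
from pathlib import Path

# A minimal binomial model with a flat prior on p (illustrative, not from the notebooks).
STAN_PROGRAM = """
data {
  int<lower=0> N;             // number of tosses
  int<lower=0, upper=N> W;    // number of "water" observations
}
parameters {
  real<lower=0, upper=1> p;   // proportion of water
}
model {
  p ~ uniform(0, 1);
  W ~ binomial(N, p);
}
"""


def write_stan_file(directory: Path) -> Path:
    """Write the Stan program to `globe.stan` (hypothetical name) and return its path."""
    stan_file = directory / "globe.stan"
    stan_file.write_text(STAN_PROGRAM)
    return stan_file


def check_cmdstan() -> str:
    """Return the path of the CmdStan installation that CmdStanPy will use."""
    import cmdstanpy  # imported lazily so the helpers above work without it

    return cmdstanpy.cmdstan_path()


def fit_globe_model(stan_file: Path, water: int, tosses: int):
    """Compile the model, draw posterior samples, and return the draws of p."""
    from cmdstanpy import CmdStanModel

    model = CmdStanModel(stan_file=str(stan_file))
    fit = model.sample(data={"N": tosses, "W": water}, chains=4, seed=1)
    return fit.stan_variable("p")  # NumPy array of posterior draws
```

Calling `fit_globe_model(write_stan_file(Path(".")), water=6, tosses=9)` should compile the model on first use and return an array of draws for `p`. If `check_cmdstan()` raises an error instead of returning a path, the installation step above probably did not complete.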

Other useful resources

There are many useful resources for Bayesian statistical modelling out there. Among those centered specifically on Professor McElreath's work, I would mention:

Original repo for the course.
Original rethinking package repo

Copyright

The present work is a derivative work of Statistical Rethinking: A Bayesian Course Using python and pymc3 by Gabriel Bosque Chacon and Statistical Rethinking: A Bayesian Course Using Python and NumPyro by Andrés Suárez. I wrote the Stan code, made the plotnine figures, and made slight modifications to their comments.

Owner

Andrés Suárez
