First steps with Python in Life Sciences

Last update: Jan 08, 2023

Overview

This course material is part of the three-day SIB-training course "First Steps with Python in Life Sciences". It is addressed to beginners who want to become familiar with the Python syntax, environment, and most common commands.

It provides an interactive introduction to Python and Jupyter notebooks (a web-based notebook system for creating and sharing computational documents).

prerequisite installation

You can find tips and instructions to ensure you have installed all the required software before starting the course.

course material organization

The course revolves around a series of Jupyter notebooks which take you on the first steps of your Python journey.

Each Jupyter notebook interleaves theory and code examples. We heartily recommend that you execute and play around with these bits of code as you follow along: in programming, perhaps even more than anywhere else, practice makes perfect.

Additionally, each notebook is associated with a number of exercises (often in a separate notebook) of varying difficulty, with associated corrections.

If you are attending this course with a teacher (or if you are just curious), you can take a look at our schedule. In short, lessons 00 to 04 deal with general aspects of the Python language, while notebooks 05 to 08 present some of the most common modules used in data analysis and/or life sciences.

The notebooks/ folder contains each lesson:

00_jupyter_setup
01_python_basics
02_python_structures
03_reading_writing_files
04_modules
05_module_pandas : handle tabular data (data frames) with pandas
06_module_matplotlib : create nice graphics and plots with matplotlib
07_module_biopython : do all kinds of bioinformatics with [biopython](https://biopython.org/)
08_module_numpy_and_scipy : fast numerical computations with numpy + a bit of statistics with scipy.stats
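
Since lessons 05 to 08 rely on third-party libraries, it can help to check that they are importable before starting. The snippet below is only a minimal sketch and not part of the official installation instructions: the helper name `find_missing` and the list `COURSE_MODULES` are hypothetical, and the list is an assumption based on the lesson titles above (biopython is imported under the name `Bio`).

```python
import importlib.util

# Import names of the libraries covered in lessons 05 to 08.
# Assumption: this list is derived from the lesson titles, not from the
# official prerequisites. Note that biopython is imported as "Bio".
COURSE_MODULES = ["pandas", "matplotlib", "Bio", "numpy", "scipy"]


def find_missing(module_names):
    """Return the names in module_names that cannot be found in this environment."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


missing = find_missing(COURSE_MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All course modules are available.")
```

You can paste this into a notebook cell or a Python console. Any module reported as missing should be installed by following the course's prerequisite instructions.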

Exercise notebooks:

The data used in the practicals can be found in the notebooks/data/ folder, and the solution code can be found in the notebooks/solutions/ folder (NB: micro-exercises do not have a correction).

Releases

October2022 (Oct 12, 2022)

Course material for the October 2022 edition of the SIB course "First Steps with Python in Life Sciences"

May2022 (May 12, 2022)

Release for the May 2022 edition of the course in Basel

Owner

SIB Swiss Institute of Bioinformatics