Identify and annotate mutations from genome editing assays.

Overview

CRISPR-detector

Here we propose our CRISPR-detector to facilitate the CRISPR-edited amplicon and whole genome sequencing data analysis, with functions that existing tools are not able to provide.

CRISPR-detector brings the following four key innovations :

  1. optimized processing time allowing for hundreds of amplicons or whole genome sequencing data;
  2. integrated structural variation calling;
  3. edited and control sample co-analysis, to remove background variants not induced by gene-editing;
  4. functional and clinical consequences annotation of editing-induced mutations.

System requirements

Sentieon module

Download sentieon toolkit from https://s3.amazonaws.com/sentieon-release/software/sentieon-genomics-202010.03.tar.gz
You may request a license by sending emails to [email protected]

export SENTIEON_LICENSE=PATH_TO_SENTIEON/sentieon-genomics-202010.03/localhost_eval.lic
export PATH=PATH_TO_SENTIEON/sentieon-genomics-202010.03/bin:$PATH

Python packages

pip install biopython
pip install pyfaidx
pip install -U textwrap3
conda install blast
conda install samtools

ANNOVAR

Download ANNOVAR from https://www.openbioinformatics.org/annovar/annovar_download_form.php

perl annotate_variation.pl -downdb -webfrom annovar avdblist humandb/ -buildver hg38
perl annotate_variation.pl -buildver hg38 -downdb -webfrom annovar refGene humandb/
perl annotate_variation.pl -buildver hg38 -downdb -webfrom annovar clinvar_20210501 humandb/
export PATH=PATH_TO_ANNOVAR/annovar:$PATH

Organism Homo sapiens Experiment type sequencing data support variant annotations from refGene & ClinVar, other species may only support refGene annotations

Usage

1. Single amplicon & pooled amplicons sequencing data analysis

python CRISPRdetectorCORE.py
--sample, sample name & output dir
--e1, treated group fq1 path, required = True
--e2, treated group fq2 path, required = False
--c1, control group fq2 path, required = False
--c2, control group fq2 path, required = False
--ref_fasta, single or pooled amplicon(s) sequence(s) path in fasta format, required=True
--threads, number of threads to run sentieon minimap2 & driver module, default=1
--anno, annotate variants with ANNOVAR [1] or not run ANNOVAR [2], required=False
--assembly, path to assembly in fasta format : hg38.fa mm9.fa ... required=False
--species, species : Homo_sapiens,Mus_musculus... required=False
--db, ANNOVAR database path, required=False

2. Whole genome sequencing (WGS) data analysis

python CRISPRdetectorWGS.py
--e1, treated group fq1 path, required = True
--e2, treated group fq2 path, required = False
--c1, control group fq2 path, required = False
--c2, control group fq2 path, required = False
--sample, sample name & output dir
--threads, number of threads to run sentieon minimap2 & driver module, default=1
--bed, bed format file input to call variants of interest region, required=False
--anno, annotate variants with ANNOVAR [1] or not run ANNOVAR [2], required=False
--assembly, path to assembly in fasta format : hg38.fa mm9.fa ... required=False
--species, species : Homo_sapiens,Mus_musculus... required=False
--db, ANNOVAR database path, required=False

Owner
hlcas
hlcas
A beautiful and useful prompt for your shell

A Powerline style prompt for your shell A beautiful and useful prompt generator for Bash, ZSH, Fish, and tcsh: Shows some important details about the

Buck Ryan 6k Jan 08, 2023
VHDL to Discrete Logic on PCB Flow

PCBFlow Highly experimental set of scripts to transform a digital circuit described in a hardware description language (VHDL or Verilog) into a discre

Tim 77 Nov 04, 2022
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!

Streamify A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more! Description Objective The project will stre

Ankur Chavda 206 Dec 30, 2022
WinBoost: Boost your windows system.

Winboost runs a complete checkup of your entire system locating junk files, speed-reducing issues and causes of any system or application glitches or crashes. Through a lot of research and testing, w

Smit Parmar 4 Oct 01, 2021
Python bindings for Basler's VisualApplets TCL script generation

About visualapplets.py The Basler AG company provides a TCL scripting engine to automatize the creation of VisualApplets designs (a former Silicon Sof

Jürgen Hock 2 Dec 07, 2022
This directory gathers the tools developed by the Data Sourcing Working Group

BigScience Data Sourcing Code This directory gathers the tools developed by the Data Sourcing Working Group First Sourcing Sprint: October 2021 The co

BigScience Workshop 27 Nov 04, 2022
Listen Surah, prepare for next and Endless life...

Al-Quran In this repository, I have linked up all Surah with Arabic-Bangla Audio From Youtube. So, you just need to choose and listen. and the ( surah

SpiderX 1 Sep 30, 2022
Utility to play with ADCS, allows to request tickets and collect information about related objects

certi Utility to play with ADCS, allows to request tickets and collect information about related objects. Basically, it's the impacket copy of Certify

Eloy 185 Dec 29, 2022
Small projects for python beginners.

Python Mini Projects For Beginners I recently started doing the #100DaysOfCode Challenge in Python. I've used Python before, but I had switched to JS

Sreekesh Iyer 10 Dec 12, 2022
Installer, package manager, build wrapper and version manager for Piccolo

Piccl Installer, package manager, build wrapper and version manager for Piccolo

1 Dec 19, 2021
Script that creates graphical representations of Julia an Mandelbrot sets.

Julia and Mandelbrot Picture Maker This simple functions create simple plots of the Julia and Mandelbrot sets. The Julia set require the important par

Juan Riera Gomez 1 Jan 10, 2022
Python program to start your zoom meetings

zoomstarter Python programm to start your zoom meetings More about Initially this was a bash script for starting zoom meetings, but as i started devel

Viktor Cvetanovic 2 Nov 24, 2021
Arcpy Tool developed for ArcMap 10.x that checks DVOF points against TDS data and creates an output feature class as well as a check database.

DVOF_check_tool Arcpy Tool developed for ArcMap 10.x that checks DVOF points against TDS data and creates an output feature class as well as a check d

3 Apr 18, 2022
Make after-work Mending More flexible In Python

Mending Make after-work Mending More flexible In Python A Lite Package focuses on making project's after-post mending pythonic and flexible. Certainly

2 Jun 15, 2022
Mnemosyne: efficient learning with powerful digital flash-cards.

Mnemosyne: Optimized Flashcards and Research Project Mnemosyne is: a free, open-source, spaced-repetition flashcard program that helps you learn as ef

359 Dec 24, 2022
A gamey, snakey esoteric programming language

Snak Snak is an esolang based on the classic snake game. Installation You will need python3. To use the visualizer, you will need the curses module. T

David Rutter 3 Oct 10, 2022
End-to-End text sumarization, QAs generation using flask.

Help-Me-Read A web application created with Flask + BootStrap + HuggingFace 🤗 to generate summary and question-answer from given input text. It uses

Ankush Kuwar 12 Nov 13, 2022
Heisenbridge a bouncer-style Matrix IRC bridge

Heisenbridge brings IRC to Matrix by creating an environment where every user connects to each network individually like they would with a traditional IRC bouncer

Toni Spets 152 Dec 28, 2022
Tesla App Update Differences Extractor

Tesla App Update Differences Extractor Python program that finds the differences between two versions of the Tesla App. When Tesla updates the app a l

Adrian 5 Apr 11, 2022
A tool to help calculate how to split conveyors in Satisfactory into specific ratios.

Satisfactory Splitter Calculator A tool to help calculate how to split conveyors in Satisfactory into specific ratios. Dependencies Python 3.9 PyYAML

RobotiCat 5 Dec 22, 2022