This is a tool for inferring ancestral alleles, calculating the site frequency spectrum (SFS), and drawing it as a bar plot.

Last update: Dec 16, 2022

superSFS

superSFS infers ancestral alleles, calculates the site frequency spectrum (SFS), and draws it as a bar plot. It is easy to use and runs fast. You need to prepare three inputs: a phased VCF file containing the populations you are interested in and the outgroup, a file of outgroup names, and an annotation file. Enjoy it!
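As an illustration of the annotation input, the sketch below reads a file with sample names in the first column, group names in the second, and a header in the first row. It is not superSFS's own reader: the function name `read_annotation`, the whitespace-delimited layout, and the sample and group names are assumptions made for this example.

```python
import io
from collections import defaultdict

def read_annotation(handle):
    """Map each group name to the list of sample names assigned to it.

    Assumes whitespace-separated columns and one header row (hypothetical layout).
    """
    groups = defaultdict(list)
    next(handle, None)  # skip the header row
    for line in handle:
        fields = line.split()
        if len(fields) < 2:
            continue  # ignore blank or incomplete lines
        sample, group = fields[0], fields[1]
        groups[group].append(sample)
    return dict(groups)

# Made-up annotation content, used only to exercise the function.
example = io.StringIO("sample\tgroup\nS1\tpopA\nS2\tpopA\nS3\tpopB\n")
groups = read_annotation(example)
print(groups)
```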

It has four models (run modes):

0: Run the full pipeline, from the original VCF data to the SFS bar plot
1: Only infer the ancestral allele and write a new VCF file that uses the inferred allele as the reference
2: Only count the frequency of the derived allele at each SNP in each population
3: Only draw the SFS bar plot from the output of the SFS calculation
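To make the link between the per-SNP derived allele counts and the bar plot concrete, here is a minimal sketch of how an unfolded SFS can be tallied from such counts. The function name `site_frequency_spectrum` and the input values are assumptions for illustration. The sketch also drops sites where the derived allele is absent or fixed, which is a common convention but may differ from what superSFS does.

```python
from collections import Counter

def site_frequency_spectrum(derived_counts, n_chromosomes):
    """Return a list where entry i-1 is the number of SNPs whose derived
    allele is seen exactly i times, for i = 1 .. n_chromosomes - 1."""
    tally = Counter(derived_counts)
    return [tally.get(i, 0) for i in range(1, n_chromosomes)]

# Made-up derived allele counts for seven SNPs in a sample of 4 chromosomes.
counts = [1, 1, 2, 3, 1, 0, 4]
sfs = site_frequency_spectrum(counts, 4)
print(sfs)
```

Each entry of `sfs` corresponds to the height of one bar in the plot.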

Example:

Model 0: python superSFS 0 ogdir threshold vcfdir annodir modir coutdir plotdir group
Model 1: python superSFS 1 ogdir threshold vcfdir outdir
Model 2: python superSFS 2 annodir modir coutdir
Model 3: python superSFS 3 coutdir plotdir group
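The positional layout of the four commands above can be summarised as a lookup table. The sketch below is not the tool's actual argument parser: `MODEL_ARGS` and `parse_cli` are hypothetical names, and the file names in the usage line are made up. It only mirrors the argument order shown in the examples.

```python
# Argument names per model, in the order given in the examples above.
MODEL_ARGS = {
    "0": ["ogdir", "threshold", "vcfdir", "annodir", "modir",
          "coutdir", "plotdir", "group"],
    "1": ["ogdir", "threshold", "vcfdir", "outdir"],
    "2": ["annodir", "modir", "coutdir"],
    "3": ["coutdir", "plotdir", "group"],
}

def parse_cli(argv):
    """Pair positional values with their names for the chosen model."""
    if not argv or argv[0] not in MODEL_ARGS:
        raise ValueError("first argument must be a model number: 0, 1, 2 or 3")
    model, values = argv[0], argv[1:]
    names = MODEL_ARGS[model]
    if len(values) != len(names):
        raise ValueError(
            f"model {model} expects {len(names)} arguments: {' '.join(names)}"
        )
    return model, dict(zip(names, values))

# Hypothetical file names, used only to show the mapping.
model, params = parse_cli(["3", "counts.txt", "sfs.png", "popA"])
print(model, params)
```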

Explanation of each parameter:

ogdir: path to the outgroup names file
threshold: a number; if the count of the variant allele in the outgroup is greater than it, the variant allele is treated as the ancestral allele
vcfdir: path to the VCF data
annodir: path to the annotation file, with sample names in the first column and group names in the second column. This file should have a header in the first row
modir: output path for the generated VCF file that uses the inferred allele as the reference (written as outdir in the Model 1 example)
coutdir: output path for the derived allele counts of each SNP in each group
plotdir: output path for the SFS bar plot
group: the group that you want to analyze
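The threshold rule can be sketched for a single biallelic SNP as follows. This is an illustration under stated assumptions, not superSFS's implementation: the helper names are hypothetical, genotypes are assumed to be phased strings such as `0|1`, and alleles other than `0` and `1` (for example missing calls written as `.`) are simply ignored.

```python
def allele_counts(genotypes):
    """Count reference (0) and variant (1) alleles in phased genotypes."""
    ref = alt = 0
    for gt in genotypes:
        for allele in gt.split("|"):
            if allele == "0":
                ref += 1
            elif allele == "1":
                alt += 1
    return ref, alt

def variant_is_ancestral(outgroup_genotypes, threshold):
    """Apply the rule: the variant allele is ancestral if its count in the
    outgroup is greater than the threshold."""
    _, alt = allele_counts(outgroup_genotypes)
    return alt > threshold

def derived_allele_count(group_genotypes, variant_ancestral):
    """Count derived alleles in a group once the ancestral state is known."""
    ref, alt = allele_counts(group_genotypes)
    # If the variant allele is ancestral, the reference allele is the derived one.
    return ref if variant_ancestral else alt

# Made-up genotypes for one SNP.
outgroup = ["1|1", "1|0"]          # variant allele seen 3 times
group = ["0|1", "1|1", "1|1"]      # 1 reference allele, 5 variant alleles
flipped = variant_is_ancestral(outgroup, threshold=2)
derived = derived_allele_count(group, flipped)
print(flipped, derived)
```

With a threshold of 2, the variant allele is treated as ancestral here, so the single reference allele in the group is counted as derived.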
