COVID19_detection

背景

当前，世界正遭受全球COVID19大流行的困扰。数十亿人受到影响，数百万的人员伤亡已经发生。因此，鉴定受SARS-CoV-2病毒感染或已经受其污染的个人至关重要。这种识别有助于公共卫生组织和政府制定行动计划，以减少这种大流行的影响。从这种意义上讲，Hilab是一家远程实验室公司，它执行数十种类型的血液检查，包括针对COVID19的血清学检查，该公司已经在巴西进行了数百万次检查。为了改善对这种病毒的检测，可以使用机器学习方法来帮助实验室专家进行决策。因此，本项目将致力于解决构建用于检测COVID19的具有高置信度和准确性的机器学习模型的难题。

方法

决策树（Decision tree）
随机森林（Random forest）
支持向量机（SVN）
主成分分析（PCA）

数据集

数据集地址：https://drive.google.com/drive/folders/1FfIx5WmEc_C7d3Ai7ONIQE4s-o2xQZz5?usp=sharing

项目结构

/
-dataset/		#数据集存放目录
--test/			#测试集目录
---test.csv		#测试集文件
--train/  		#训练集目录
---train_1.csv	#训练集文件1（此文件与测试集相同，默认不使用）
---train_2.csv	#训练集文件2
.......
---train_7.csv	#训练集文件7

-data_preprocess.py	#数据集提取与预处理
-pca.py				#pca降维的相关实验
-decision_tree.py	#决策树
-random_forest.py	#随机森林
-SVM.py				#SVM
-README.md			#说明文件

A set of procedures that can realize covid19 virus detection based on blood.

Related tags

Overview

COVID19_detection

背景

方法

数据集

项目结构

Owner

Nuyoah-xlh

Port of dplyr and other related R packages in python, using pipda.

Airflow ETL With EKS EFS Sagemaker

In this project, ETL pipeline is build on data warehouse hosted on AWS Redshift.

Weather analysis with Python, SQLite, SQLAlchemy, and Flask

Implementation in Python of the reliability measures such as Omega.

PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)

Anomaly Detection with R

An easy-to-use feature store

Synthetic Data Generation for tabular, relational and time series data.

CPSPEC is an astrophysical data reduction software for timing

Bearsql allows you to query pandas dataframe with sql syntax.

Reading streams of Twitter data, save them to Kafka, then process with Kafka Stream API and Spark Streaming

My first Python project is a simple Mad Libs program.

A library to create multi-page Streamlit applications with ease.

Additional tools for particle accelerator data analysis and machine information

MotorcycleParts DataAnalysis python

A simple and efficient tool to parallelize Pandas operations on all available CPUs

A DSL for data-driven computational pipelines

COVID-19 deaths statistics around the world

Data Intelligence Applications - Online Product Advertising and Pricing with Context Generation