Web Scraping COVID 19 Meta Portal with Python

Last update: Jan 04, 2022

Overview

Web-Scraping-COVID-19-Meta-Portal-with-Python

This project uses the Requests library and Beautiful Soup to scrape real-time COVID-19 statistics from the Worldometer website, then cleans and visually analyzes the data in Jupyter notebooks.

Data Preparation Notebook

The first module uses the requests and beautifulsoup packages to collect and manipulate COVID-related data from the Worldometer website.
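
As a rough illustration of this step, the sketch below fetches a page with requests and pulls the headline counters out with Beautiful Soup. The URL, the `maincounter-wrap` id, the `maincounter-number` class, and the helper names `fetch_html` and `parse_counters` are assumptions made for this example, not details confirmed by the notebook, and the live page markup may differ. The sample HTML and its numbers are made up so the parser can be tried offline.

```python
from bs4 import BeautifulSoup

# Assumed URL; the notebook's actual target page is not shown in this README.
URL = "https://www.worldometers.info/coronavirus/"


def fetch_html(url=URL, timeout=10):
    """Download a page and return its HTML text (needs network access)."""
    import requests  # imported here so the parser below also works offline

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # fail loudly on HTTP errors
    return response.text


def parse_counters(html):
    """Return {label: integer count} for each headline counter block.

    Assumes each counter is a <div id="maincounter-wrap"> holding an <h1>
    label and a <div class="maincounter-number"> value (hypothetical markup).
    """
    soup = BeautifulSoup(html, "html.parser")
    counters = {}
    for block in soup.find_all("div", id="maincounter-wrap"):
        label = block.find("h1").get_text(strip=True).rstrip(":")
        number = block.find("div", class_="maincounter-number").get_text(strip=True)
        counters[label] = int(number.replace(",", ""))  # drop thousands separators
    return counters


# Made-up markup and numbers, used only to exercise the parser.
SAMPLE_HTML = """
<div id="maincounter-wrap"><h1>Coronavirus Cases:</h1>
  <div class="maincounter-number"><span>1,234,567</span></div></div>
<div id="maincounter-wrap"><h1>Deaths:</h1>
  <div class="maincounter-number"><span>12,345</span></div></div>
"""

print(parse_counters(SAMPLE_HTML))
```

Against the live site, `parse_counters(fetch_html())` would be the equivalent call, provided the assumed markup still matches.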

The notebook has a total of five code blocks.

The first four code blocks provide the following data:

Summary Data for ALL Global COVID Cases
Summary Data for ACTIVE Global COVID Cases
Summary Data for CLOSED Global COVID Cases
Tabular Data for COVID Cases by Country
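
The by-country table in particular could be turned into a pandas DataFrame along the following lines. This is a minimal sketch, not the notebook's code: the function name `table_to_dataframe` and the default table id `main_table_countries_today` are assumptions, and the sample rows use fictional countries with made-up values.

```python
import pandas as pd
from bs4 import BeautifulSoup


def table_to_dataframe(html, table_id="main_table_countries_today"):
    """Convert one HTML table into a DataFrame of raw strings.

    The default table_id is an assumption about the page markup.
    """
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", id=table_id)
    if table is None:
        raise ValueError(f"no table with id {table_id!r}")
    headers = [th.get_text(strip=True) for th in table.find_all("th")]
    rows = []
    for tr in table.find_all("tr"):
        cells = [td.get_text(strip=True) for td in tr.find_all("td")]
        if cells:  # header rows contain <th> only, so they are skipped
            rows.append(cells)
    return pd.DataFrame(rows, columns=headers)


# Made-up sample table with fictional countries.
SAMPLE_TABLE = """
<table id="main_table_countries_today">
  <thead><tr><th>Country</th><th>Total Cases</th><th>Total Deaths</th></tr></thead>
  <tbody>
    <tr><td>Aland</td><td>1,000</td><td>10</td></tr>
    <tr><td>Borduria</td><td>2,500</td><td></td></tr>
  </tbody>
</table>
"""

countries = table_to_dataframe(SAMPLE_TABLE)
print(countries)
```

Every cell is kept as a string at this stage; converting the numeric columns is left to the analysis step.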

The fifth and final code block provides an interactive interface for exporting each of these four tables.
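
One simple way such an export step might look is sketched below. The README does not show how the notebook's interface is built, so the `export_table` helper, the table keys, and the CSV file naming are all hypothetical, and the placeholder frames hold made-up values.

```python
from pathlib import Path

import pandas as pd


def export_table(tables, choice, out_dir="."):
    """Write the table registered under `choice` to <out_dir>/<choice>.csv."""
    if choice not in tables:
        raise KeyError(f"unknown table {choice!r}; options: {sorted(tables)}")
    path = Path(out_dir) / f"{choice}.csv"
    tables[choice].to_csv(path, index=False)  # index=False keeps the file clean
    return path


# Placeholder frames standing in for the four scraped tables (made-up values).
tables = {
    "all_cases": pd.DataFrame({"Metric": ["Cases"], "Value": [100]}),
    "active_cases": pd.DataFrame({"Metric": ["Active"], "Value": [40]}),
    "closed_cases": pd.DataFrame({"Metric": ["Closed"], "Value": [60]}),
    "by_country": pd.DataFrame({"Country": ["Aland"], "Total Cases": [100]}),
}

# In a notebook, the choice could come from input("Table to export: ").
```

Calling `export_table(tables, "by_country")` would then write `by_country.csv` to the current directory.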

Data Analysis Notebook

The second module uses the pandas, numpy, seaborn, and statsmodels packages to draw insights from the data and plot the necessary graphs. The raw CSV data is the country-level table of COVID cases collected from the Worldometer website in Part A of the project (the first module).

The notebook has a total of twelve code blocks:

Importing a CSV file, reading it and counting the number of rows and columns
Using the pandas to_numeric function to ensure all numerical columns are parsed as numeric
Using the describe function to display and analyze basic statistical data on the numerical columns of the imported data
Working with a smaller set of imported data - Top 20 countries with most cases
Horizontal bar chart to analyze total cases in the top 20 countries
Vertical bar chart to analyze total deaths in the top 20 countries with most cases
Distribution plot to analyze spread of data for Deaths/1M Population of the 20 countries
Using the describe function to display basic statistical data on the numerical columns of the REDUCED dataset
Comparing and analyzing mean and standard deviation between population of the Full dataset and the Reduced dataset
Using regression scatter plot to check for data independence between tests/million people and the size of the population
Finding and analyzing correlations between the variables in the dataset
Applying a statistical model to collect useful information about Total Cases and Total Deaths in the full data set
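
Several of these steps can be sketched with pandas alone. The example below is an illustration under stated assumptions rather than the notebook's code: the column names and the inline CSV rows are made up, and the reduced set takes the top 2 rows instead of the top 20 so the tiny sample stays readable. The plotting and statsmodels steps are left out.

```python
import io

import pandas as pd

# Made-up rows standing in for the exported country table.
RAW_CSV = """Country,Total Cases,Total Deaths,Population
Aland,"1,000",10,"50,000"
Borduria,"2,500",N/A,"80,000"
Costaguana,400,4,"20,000"
Dinotopia,"9,000",120,"300,000"
"""

df = pd.read_csv(io.StringIO(RAW_CSV))
print(df.shape)  # (rows, columns)

# Strip thousands separators, then coerce; unparseable cells become NaN.
numeric_cols = ["Total Cases", "Total Deaths", "Population"]
for col in numeric_cols:
    cleaned = df[col].astype(str).str.replace(",", "", regex=False)
    df[col] = pd.to_numeric(cleaned, errors="coerce")

# Basic statistics on the numeric columns of the full dataset.
print(df.describe())

# Reduced dataset: the N countries with the most cases (20 in the notebook).
top = df.nlargest(2, "Total Cases")

# Compare mean and standard deviation of population, full vs. reduced.
comparison = pd.DataFrame({
    "full": df["Population"].agg(["mean", "std"]),
    "reduced": top["Population"].agg(["mean", "std"]),
})
print(comparison)

# Pairwise Pearson correlations between the numeric columns.
print(df[numeric_cols].corr())
```

Using `errors="coerce"` means placeholders such as "N/A" end up as missing values instead of raising, which keeps `describe` and `corr` usable on partially filled columns.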

-- Aarif M Jahan -- May 08, 2021

Owner

Aarif Munwar Jahan

A python module to parse the Open Graph Protocol

Crawler for the Fundamentus.com site built with the Scrapy framework, covering both the detailed tab and the summary tab.

A simple reddit scraper to get memes (only images) from r/ProgrammerHumor.

crypto currency scraping

Scrapy uses Request and Response objects for crawling web sites.

Scrap-mtg-top-8 - A top 8 mtg scraper using python

NASA APOD Discord Bot - Fetches information from NASA APOD site.

This is a module that I had created along with my friend. It's a basic web scraping module

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster

A web crawler for recording posts in "sina weibo"

A high-level distributed crawling framework.

EBay-email-tracker - Scrapes an entire search page of a particular item on eBay and sends regular updates to an email address

Automatic off-campus check-in for the Henan University of Technology "Perfect Campus" (完美校园) app

HTML Content / Article Extractor, web scraping library in Python

New World Market Scraper

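feapder is a simple, fast, lightweight crawler framework. It aims for fast development, fast crawling, ease of use, and powerful features. It supports distributed crawlers, batch crawlers, multi-template crawlers, and a comprehensive crawler alerting mechanism.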

Crawler in Python 3.7, 3.8, 3.9, and PyPy3

A way to scrape sports streams for use with Jellyfin.

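Automatic quiz answering and score farming for the "Four Histories" quiz on China University Students Online (中国大学生在线); currently supports only the Heroes chapter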

This is a web crawler that works on email data from gmane.org and visualizes it in different ways.
