Making the DAEN information accessible.

Last update: May 10, 2022

Overview

AccessibleAdverseEventNotification

Making the DAEN information accessible.

The purpose of this repository is to make the information on Australian COVID-19 adverse events accessible. The Therapeutics Goods Administration (TGA) keeps a database of adverse reactions to medications including the COVID-19 vaccines. This Database of Adverse Event Notifications (DAEN) is available to the public via this awful web interface. The most recent two weeks is never available.

The DAEN website doesn't provide information in a format that might be useful for analysis. Instead you have to scrape the information by entering each individual day and collecting the results from two tables which might span multiple pages. I've already done that and the code is here (this code isn't great, but it is good enough to get the job done).

Please be aware that the numbers reported in DAEN are probably significantly less than the actual number of adverse events and deaths. As the DAEN website states:

Adverse event reports from consumers and health professionals to the TGA are voluntary, so there is under-reporting by these groups of adverse events related to therapeutic goods in Australia. This is the same around the world.

The scraped data is found in the data directory. These files are tab separated files which you can easily import in to a spreadsheet program. All of the files are only for COVID-19 vaccines.

DAEN_webscrape_simple.txt This file shows the date (twice for reasons that made sense at the time, but don't necessarily make sense anymore), the number of cases reported that day, the number of cases with a single suspected medicine for that day, and the number of deaths reported that day.
DAEN_webscrape_medsummary.txt This file gives a daily count of each adverse event category. Please note that if one patient had multiple adverse events, then each event would be counted in the appropriate category.
DAEN_webscrape_listofreports.txt This file provides the individual reports and includes sex and age (when recorded).

Figure 1 shows some of the basic information such as number of adverse events and deaths reported each day for the COVID-19 vaccines, myocarditis, pericarditis and the more general term cardiac disorder.

Figure 1.

Figure 2 shows a histogram of reported cases of myocarditis and pericarditis from the COVID-19 vaccine. Please note that the age group 10-19 is somewhat distorted as the age 10-11 should not receive the vaccine (although there are cases of 8 year olds getting the vaccine when that should not have occurred). This age group also has a significantly lower uptake than other age groups.

Figure 2.

Figures 3 and 4 plot the reports of myocarditis by age grouped by sex or manufacturer respectively. Figures 5 and 6 are the same for pericarditis. A '-' is used where an age was not given in the report.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Figure 7 shows how the histogram for myocarditis has progressed over time.

Figure 7.

Figure 8 shows the death rate of people in Australia who contracted COVID-19. Data taken from health.gov on 1/12/2021. Bottom graph is zoomed in to 1% to see what is happening with those under the age of 60.

Figure 8.

Making the DAEN information accessible.

Related tags

Overview

AccessibleAdverseEventNotification

Owner

Using Data Science with Machine Learning techniques (ETL pipeline and ML pipeline) to classify received messages after disasters.

Bearsql allows you to query pandas dataframe with sql syntax.

Fast, flexible and easy to use probabilistic modelling in Python.

Reading streams of Twitter data, save them to Kafka, then process with Kafka Stream API and Spark Streaming

A pipeline that creates consensus sequences from a Nanopore reads. I

Python Project on Pro Data Analysis Track

Uses MIT/MEDSL, New York Times, and US Census datasources to analyze per-county COVID-19 deaths.

Detecting Underwater Objects (DUO)

Python utility to extract differences between two pandas dataframes.

Aggregating gridded data (xarray) to polygons

Convert monolithic Jupyter notebooks into Ploomber pipelines.

Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.

Pandas and Dask test helper methods with beautiful error messages.

Projeto para realizar o RPA Challenge . Utilizando Python e as bibliotecas Selenium e Pandas.

Pipeline and Dataset helpers for complex algorithm evaluation.

ped-crash-techvol: Texas Ped Crash Tech Volume Pack

Tokyo 2020 Paralympics, Analytics

Data Scientist in Simple Stock Analysis of PT Bukalapak.com Tbk for Long Term Investment

Python implementation of Principal Component Analysis

Data collection, enhancement, and metrics calculation.