Exploratory Data Analysis of the 2019 Indian General Elections using a dataset from Kaggle.

Last update: Oct 10, 2022

Overview

2019-indian-election-eda

This project is part of the course "Data Analysis using Python: Zero to Pandas", offered by Jovian.ai.

We perform Exploratory Data Analysis on the 2019 Indian General Elections dataset, using various Python libraries for data cleaning and visualization. The dataset used in this project is from Kaggle and was authored by the user Prakrut Chauhan.

Link to the Dataset used - https://www.kaggle.com/prakrutchauhan/indian-candidates-for-general-election-2019

The dataset contains information on all the candidates who contested the elections from various Constituencies. The data includes personal information such as Assets, Education and Criminal Record, as well as electoral information such as Contesting Constituency, Political Party and Total Votes received.
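To give a feel for the kind of question this data can answer, here is a minimal pandas sketch. The column names (`constituency`, `party`, `total_votes`) and the rows are placeholders invented for illustration; they are not taken from the Kaggle file, whose actual headers may differ.

```python
import pandas as pd

# Placeholder rows, NOT real election data; column names are hypothetical.
candidates = pd.DataFrame({
    "constituency": ["A", "A", "B", "B", "B"],
    "party": ["P1", "P2", "P1", "P2", "P3"],
    "total_votes": [500, 300, 200, 450, 50],
})

# Total votes received by each party, largest first.
votes_by_party = (
    candidates.groupby("party")["total_votes"].sum().sort_values(ascending=False)
)

# The winner in each constituency is the row with the most votes there.
winner_rows = candidates.groupby("constituency")["total_votes"].idxmax()
winners = candidates.loc[winner_rows]

# Number of constituencies won by each party.
seats_by_party = winners["party"].value_counts()

print(votes_by_party)
print(seats_by_party)
```

With the real dataset, the same `groupby` pattern would be applied to the DataFrame loaded from the downloaded CSV, after renaming the columns to whatever the file actually uses.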

The Libraries used in the Project are:

Matplotlib (for visualization of data),
Seaborn (used alongside Matplotlib for visualization),
NumPy (used for operations on numeric data),
Pandas (used for utilising DataFrames and organising the data),
Jovian (used for downloading the dataset and for running, saving and uploading the Notebook).

Apart from the above-mentioned libraries, we use the opendatasets package to download the files directly from Kaggle and parse the data. Link to the package - https://github.com/JovianML/opendatasets
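The download step can be sketched roughly as follows. This is an illustration rather than the notebook's exact code: the helper names `expected_data_dir` and `fetch_dataset` are hypothetical, and the folder layout is an assumption (opendatasets is expected to place the files in a folder named after the last segment of the dataset URL).

```python
import os

DATASET_URL = (
    "https://www.kaggle.com/prakrutchauhan/"
    "indian-candidates-for-general-election-2019"
)

def expected_data_dir(url, data_dir="."):
    """Folder where the files are expected to land.

    Assumption: the folder is named after the last segment of the URL.
    """
    return os.path.join(data_dir, url.rstrip("/").split("/")[-1])

def fetch_dataset(url=DATASET_URL, data_dir="."):
    """Download the Kaggle dataset and return the folder holding its files."""
    # Imported here so the rest of the file works without the package installed.
    import opendatasets as od

    # Kaggle downloads require an account; opendatasets asks for the
    # username and API key if it cannot find them.
    od.download(url, data_dir=data_dir)
    return expected_data_dir(url, data_dir)

print(expected_data_dir(DATASET_URL))
```

Calling `fetch_dataset()` needs network access and Kaggle credentials. The returned folder can then be listed with `os.listdir` to find the CSV file to pass to `pandas.read_csv`.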

To view the Jupyter Notebook containing the EDA, click on the .ipynb file to open it and scroll down to see the analysis. Some content might not be visible in a dark theme, so I recommend viewing the notebook in a light theme.

The Notebook can also be viewed in Google Colab and Binder or can be downloaded and viewed locally.

Link to a Blog Post will be added soon.

Hope you like my work!


Owner

Souradeep Banerjee
