Mortgage-loan-prediction - Show how to perform advanced Analytics and Machine Learning in Python using a full complement of PyData utilities

Last update: Dec 26, 2021

Overview

MORTGAGE LOAN ACQUISITION REQUIREMENT

This entire project encompasses both Data Analysis and Machine Learning. It was carefully structured and compiled for easy understanding.

Installation:

To run this notebook, download Anaconda from the Anaconda site; it comes with almost all of the dependencies pre-installed. Feel free to use any other environment of your choice.

Dependencies:

Personal project | Mortgage loan eligibility prediction

The Home Mortgage Disclosure Act (HMDA) requires many financial institutions to maintain, report, and publicly disclose information about mortgages. These public data are important because:

- they help show whether lenders are serving the housing needs of their communities.
- they help authorities identify and root out predatory lending practices.
- they give public officials information that helps them make decisions and policies.
- they shed light on lending patterns that could be discriminatory, e.g. a reported increase in mortgage borrowing by Black and Hispanic borrowers as of 1993.

On my Kaggle site My Homepage.

Goal for this Notebook:

Show how to perform advanced Analytics and Machine Learning in Python using a full complement of PyData utilities. This is aimed at those looking to get into the field of Data Science, or those who are already in the field and want to work through a real-world project with Python.

This Notebook will teach the following:

Data Handling

Importing Data with Pandas
Cleaning Data
Exploring Data through Visualizations with Matplotlib
Doing predictive Analysis with various Machine Learning Algorithms
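
The data handling steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the column names (`loan_amount`, `applicant_income`, `accepted`), the inline sample data, and the `load_and_clean` helper are all hypothetical stand-ins for the real HMDA file and its schema.

```python
import io

import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical sample standing in for the real HMDA data file.
RAW_CSV = """loan_amount,applicant_income,accepted
120,55,1
300,,0
85,40,1
85,40,1
210,90,0
,70,1
"""


def load_and_clean(source):
    """Read a CSV, drop duplicate rows, and handle missing values."""
    df = pd.read_csv(source)
    df = df.drop_duplicates()
    # Rows without a loan amount are unusable for this sketch.
    df = df.dropna(subset=["loan_amount"])
    # Fill missing incomes with the median of the remaining rows.
    median_income = df["applicant_income"].median()
    df = df.assign(applicant_income=df["applicant_income"].fillna(median_income))
    return df.reset_index(drop=True)


df = load_and_clean(io.StringIO(RAW_CSV))

# Simple exploratory plot: mean loan amount per outcome.
fig, ax = plt.subplots()
df.groupby("accepted")["loan_amount"].mean().plot(kind="bar", ax=ax)
ax.set_xlabel("accepted")
ax.set_ylabel("mean loan_amount")
```

With a real file, `load_and_clean` would be given a path instead of the in-memory buffer, and the figure could be shown with `plt.show()` or written out with `fig.savefig(...)`.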

Data Analysis/Machine Learning

Supervised Machine Learning Techniques:

- RandomForestClassifier
- StratifiedKFold (5 folds)
- ETC

Evaluation of the Analysis

K-fold cross-validation to evaluate results locally
Output the results from the IPython Notebook to Kaggle
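
A minimal sketch of local evaluation with 5-fold stratified cross-validation and a random forest, using scikit-learn. The synthetic dataset from `make_classification`, the hyperparameters, and the random seeds are assumptions made for illustration; the notebook itself works on the HMDA data and may be configured differently.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Synthetic, mildly imbalanced binary data standing in for the loan dataset.
X, y = make_classification(
    n_samples=500,
    n_features=10,
    n_informative=5,
    weights=[0.7, 0.3],
    random_state=0,
)

# Hyperparameters here are illustrative, not the notebook's settings.
model = RandomForestClassifier(n_estimators=100, random_state=0)

# Stratified folds keep the class ratio roughly constant in every split.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

# One accuracy score per fold; each fold is held out once for validation.
scores = cross_val_score(model, X, y, cv=cv, scoring="accuracy")

print("accuracy per fold:", scores.round(3))
print("mean accuracy:", scores.mean().round(3))
```

Reporting the per-fold scores alongside the mean gives a sense of how much the estimate varies between splits, which a single train/test split would hide.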

Results obtained

Was able to derive expert insights to give professional recommendations to borrowers
Was able to predict applicant loan approval with 74% accuracy

Owner

Joachim
