Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

Last update: Jan 29, 2022

Overview

Mortgage-Application-Analysis

Create a machine learning model that predicts whether a mortgage will be approved based on 5 variables: age, income level, occupancy type, accepted, and debt-income ratio, eliminating all demographic bias except for age.

We picked 5 attributes from the provided Mortgage data set and saved them to a separate *.csv file, so that null values in the attributes we ignore in our model would not cause extra data loss. Using the pandas library, we preprocessed the data to drop applicants with null values, which might skew our datasets.

For processing, some of the data was categorical with controlled intervals. We used ordinal encoding to convert it into discrete numeric data for training and testing our model. We also had one unique string attribute, which we encoded with one-hot encoding to obtain numeric values for processing.

With this clean data, we divided the data into two groups, 80% and 20%, for training and validation, and trained our model to establish a correlation between these attributes and mortgage application acceptance.
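The preprocessing steps described above (dropping nulls, ordinal encoding, one-hot encoding, and an 80/20 split) could look roughly like the following pandas sketch. The column names, category labels, and sample rows are hypothetical stand-ins, not the project's actual schema, and treating occupancy type as the one-hot encoded attribute is an assumption.

```python
import pandas as pd

# Hypothetical sample standing in for the five-attribute CSV file; the
# column names and category labels are assumptions for illustration only.
raw = pd.DataFrame({
    "age": ["<25", "25-34", "35-44", "25-34", None,
            "45-54", "35-44", "<25", "45-54", "25-34"],
    "income_level": ["low", "mid", "high", "mid", "low",
                     None, "high", "low", "mid", "high"],
    "occupancy_type": ["primary", "second", "investment", "primary", "primary",
                       "second", "primary", "investment", "primary", "second"],
    "debt_income_ratio": ["<20%", "20-30%", "30-40%", "<20%", "20-30%",
                          "30-40%", "<20%", "30-40%", "20-30%", "<20%"],
    "accepted": [1, 1, 0, 1, 0, 0, 1, 0, 1, 1],
})

# Drop every applicant that has a missing value in any column.
clean = raw.dropna().reset_index(drop=True)

# Ordinal encoding: map ordered interval labels to integer codes (0, 1, 2, ...).
ordinal_levels = {
    "age": ["<25", "25-34", "35-44", "45-54"],
    "income_level": ["low", "mid", "high"],
    "debt_income_ratio": ["<20%", "20-30%", "30-40%"],
}
for column, levels in ordinal_levels.items():
    clean[column] = pd.Categorical(
        clean[column], categories=levels, ordered=True
    ).codes

# One-hot encoding: one 0/1 column per value of the string attribute.
encoded = pd.get_dummies(clean, columns=["occupancy_type"], dtype=int)

# Split the rows into an 80% group and a 20% group.
larger = encoded.sample(frac=0.8, random_state=0)
smaller = encoded.drop(larger.index)

print(encoded.head())
print(len(larger), "rows in the larger group,", len(smaller), "in the smaller")
```

Encoding the interval attributes as ordered integer codes keeps their ranking, while one-hot encoding avoids implying an order among the string values.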

Using Matplotlib, we visualized the data and found that, other than debt-to-income ratio, there isn't any significant correlation between acceptance and the other non-demographic factors.

After using this visualization to establish our hypothesis, we trained our model on the data set we created. To evaluate it, we applied 4 types of algorithms. We used a Logistic Regression model, which fits a line of best fit to the log-odds values, to calculate the acceptance rate for mortgage applications. The F1, precision, and recall scores for this test were very high, which suggested that the non-demographic factors we accounted for didn't play much of a role in whether an application was accepted or rejected. Similarly, we ran Random Forest, Decision Tree, and Support Vector Machine models, and each of those evaluations also had very high precision, recall, and F1 scores, supporting the evidence from the data visualization.
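As a rough illustration of the evaluation described above, the sketch below fits the four classifier types with scikit-learn and reports precision, recall, and F1 for each. The use of scikit-learn, the synthetic data from make_classification, the hyperparameters, and the choice to train on the 80% portion are all assumptions; the project's real data and settings are not shown here.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score, precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the encoded mortgage features (an assumption).
X, y = make_classification(
    n_samples=400, n_features=6, n_informative=3, random_state=0
)

# Assumed split: fit on 80% of the rows, score on the remaining 20%.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# The four model families named in the text, with illustrative settings.
models = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "decision_tree": DecisionTreeClassifier(random_state=0),
    "svm": SVC(),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    predicted = model.predict(X_test)
    scores[name] = {
        "precision": precision_score(y_test, predicted),
        "recall": recall_score(y_test, predicted),
        "f1": f1_score(y_test, predicted),
    }

for name, result in scores.items():
    print(name, {metric: round(value, 3) for metric, value in result.items()})
```

Scoring every model on the same held-out rows keeps the precision, recall, and F1 values comparable across the four algorithms.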

