My project contrasts K-Nearest Neighbors and Random Forest Regressors on real-world data

Last update: Oct 28, 2021

kNN-vs-RFR

In many areas, rental bikes have been introduced to make getting around easier. Bikes need to be available to the public at the right time, since this reduces how long people have to wait. Keeping a steady supply of rental bikes in an area therefore becomes a major concern, and the most important step is predicting the number of rental bikes required at each hour. In this project, we discuss ways to predict the number of bikes needed on a particular day from the provided data set. Bike-sharing systems of this type let users borrow a bike at one location and return it at a different one. Hence, we use machine learning to predict the number of rental bikes needed on a particular day.

Background:

In machine learning, there are many ways to predict the number of bikes that might be needed on a particular day. One approach examined models for predicting hourly rental bike demand and investigated a feature filtering method that excludes non-predictive parameters and rates features by their prediction efficiency. That project used repeated cross-validation to train five statistical regression models with their best hyperparameters and then evaluated their results. Another approach estimates only the cumulative number of rented bikes in the entire bike-sharing system; the various fields in the data collection were processed and used to forecast the final number of rental bikes. Methods such as Ridge Linear Regression, Support Vector Machine for Regression, Random Forest for Regression, and Gradient Boosted Regression Trees are used to predict rental bike demand.
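
As a rough illustration of the repeated cross-validation idea described above, the sketch below compares a K-Nearest Neighbors regressor with a Random Forest regressor using scikit-learn. It is not the code in main.py: the data are synthetic, the feature names (`hour`, `temperature`, `humidity`), the demand formula, and the hyperparameter values are all assumptions made up for the example, and no hyperparameter search is performed.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import RepeatedKFold, cross_val_score
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for hourly bike-rental data (NOT the project's data set).
rng = np.random.default_rng(0)
n = 400
hour = rng.integers(0, 24, size=n)
temperature = rng.uniform(-5, 35, size=n)
humidity = rng.uniform(20, 100, size=n)
X = np.column_stack([hour, temperature, humidity])

# Hypothetical demand: peaks near commuting hours, rises with temperature,
# falls with humidity, plus random noise.
y = (
    100
    + 60 * np.exp(-((hour - 8) ** 2) / 8)
    + 80 * np.exp(-((hour - 18) ** 2) / 8)
    + 3 * temperature
    - 0.5 * humidity
    + rng.normal(0, 10, size=n)
)

# kNN is distance-based, so features are standardized first;
# the Random Forest does not need scaling.
models = {
    "kNN": make_pipeline(StandardScaler(), KNeighborsRegressor(n_neighbors=5)),
    "Random Forest": RandomForestRegressor(n_estimators=50, random_state=0),
}

# 5-fold cross-validation repeated 3 times gives 15 scores per model.
cv = RepeatedKFold(n_splits=5, n_repeats=3, random_state=0)

rmse = {}
for name, model in models.items():
    scores = cross_val_score(
        model, X, y, cv=cv, scoring="neg_root_mean_squared_error"
    )
    rmse[name] = -scores.mean()  # scores are negated RMSE values
    print(f"{name}: mean RMSE = {rmse[name]:.2f}")
```

Averaging over repeated splits makes the comparison less sensitive to any single train/test partition. To pick hyperparameters such as `n_neighbors` or `n_estimators` instead of fixing them, the same `cv` object could be passed to scikit-learn's `GridSearchCV`.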

Additional Info:

Feel free to download my code, which is in main.py. I have also provided copies of the training and testing data sets used. Lastly, I have uploaded a copy of the short research paper that I wrote based on this project.
