Data Scientist in Simple Stock Analysis of PT Bukalapak.com Tbk for Long Term Investment

Overview

Data Scientist in Simple Stock Analysis of PT Bukalapak.com Tbk for Long Term Investment

Brief explanation of PT Bukalapak.com Tbk

Bukalapak was founded on January 10, 2010 by Achmad Zaky, Nugroho Herucahyono, and Fajrin Rasyid in a boarding house while studying at the Bandung Institute of Technology. Bukalapak is one of the e-commerce companies in Indonesia. From the owner of a local shopping brand through an ownership group founded by Achmad Zaky, Nugroho Herucahyono, and Muhamad Fajrin Rasyid in 2010. Bukalapak was originally an online store that allowed Small and Medium Enterprises (SMEs) to venture into cyberspace. The company has now expanded into various other business lines, including helping to increase sales of traditional warungs through the Bukalapak Partner service. In 2017, Bukalapak became one of the unicorn startups from Indonesia. Currently, Bukalapak's valuation has reached 7.6 billion US dollars or around Rp. 110.2 trillion.

Bukalapak conducts Initial Public Offering of Shares

Bukalapak is the largest Initial Public Offering (IPO) company in Indonesia. This is the latest achievement that the Southeast Asian startup community is starting to grow. Quoting CNN.com, Saturday (8/7), Bukalapak reaped fresh funds of US $ 1.5 billion or Rp. 21.4 trillion (exchange rate of Rp. 14,300 per US dollar) from the corporate action. When Bukalapak's shares were traded on the first day, the price immediately jumped almost 25% in the first session. This indicates that many investors are hunting for Bukalapak's shares.

Mandiri Sekuritas said Bukalapak's IPO was oversubscribed by 8.7 times with orders from nearly 100 thousand investors. Bukalapak itself has developed into an e-commerce player in Southeast Asia. Some of Bukalapak's competitors, including Shopee, Lazada, and Tokopedia.

The company is backed by major investors, including Microsoft (MSFT) and Standard Chartered (SCBFF). The company plans to use the funds from the IPO to roll out more features. This will allow Bukalapak to offer more services and add new revenue streams. Indonesia's first unicorn to be listed on the stock exchange has great potential. BUKA is part of a conglomerate that has penetrated into various potential business lines.

As a data analyst, we can do a simple analysis on Bukalapak shares to assess whether the benefits provided are tempting for investors? or is it detrimental? on a long term basis. Through this simple analysis, we will calculate the Return On Investment on the Bukalapak stock

Simple Stock Analysis for Long Term Investment

A. Questions and Goals

  • If we want to invest in long-term intervals, how can we reduce risk while maximizing return on investment?
  • What is the best way to select stocks based on the criteria from the previous question? The purpose of this analysis is to look at Bukalapak's stock in providing a return on investment considering the risks involved. The analysis must provide substantial data to confidently use the stock for future analysis or use.

B. Data Collection

There are many ways to aggregate historical stock prices with their fundamentals. For this particular analysis, Pandas provides a library for retrieving forum data from multiple sources. The library is called Pandas Data Reader. The library is a wrapper for retrieving data like historical stock prices, country GDP, World economic data, etc. This analysis in particular. Yahoo Finance data is used because it is free and has a very large stock database. By avoiding manual, tedious work like downloading CSV files, analysis can be used for as much stock as possible. By doing import pandas_datareader.data as web

Since the final analysis will involve a return on investment, the Closing price is Adjusted. Used This option was taken to simplify the analysis. also, based on the adjusted closing price period, the price should best reflect the BUKA price on a given day.

Plotting Price BUKA Stock

download (3)

The plot above shows the price of BUKA from the beginning of the IPO until November 2021. As expected, the data has shown a clear trend, namely a down trend. For example, let's take the return on investment if someone returns the money at the start of the IPO. Since plots of the same style will be widely used in this analysis, functions can be written to simplify future use.

Result Return On Investment

Screenshot 2021-11-13 134113

By using logarithmic returns using the `NumPy` library, the result is that the return on investment in BUKA.JK shares from the beginning of the IPO until November 2021 is -35.94%. so if we invest from the beginning of the IPO of Rp 10 million, our money now will be Rp 6.5 million. Of course we have suffered losses, but our analysis does not end here, we will conduct a deeper analysis to see how attractive BUKA shares are in the investor's portfolio.

C. Data Analysis

With a quick glance at the data, there is some work to be done before it can be used. One of them is looking for a return on investment from every day. This is a necessary step. In Investing, the stock price on a given day is not very relevant. Price difference 2 different days.

Visualization ROI of BUKA Stock everyday

download

By looking at the graph above, although it's a bit difficult to tell the difference, there are some clues that can be taken. For example, the worst ROI occurred around August and October 2021 where it was less than -6%. However, the best day comes also around November 2021 where ROI exceeds 8% in one day. Furthermore, at the outset, it is stated that the analysis will be used to invest in long-term intervals. This means the daily ROI will be less relevant because the interval is too short. Resampling is a good way to convert data from daily ROI to monthly ROI.

Visualization ROI of BUKA Stock every month

download (1)

Describe

Screenshot 2021-11-13 140503

From the visualization and summary above, the data now looks a bit easier to read. For example, although ROIs fluctuate, they are not far from a certain point. This is called oscillation. The mean of the oscillations is the average of the data, the form of this example is -8.98 This also means, on average, one will earn an estimated ROI of -8.98% every month over a 4 month period. Of course this is not desired by investors, because it will result in losses. Then, their oscillations also have bounded upper and lower bounds, and it is called standard deviation or std. with a slight modification to the code, everything can be visualized as follows.

Visualization Volatility ROI of BUKA Stock every month

download (2)

In stock prices, std is called volatility. This is an important metric because when large amounts of money are involved, less volatile stocks are more profitable. Less volatile stocks mean they are easier to predict because they are also less risky. Moreover, If the data is normally distributed, one is normally distributed. One that fits is the Q-Q plot.

Visualization Q-Q Plot

download (4)

Using the statsmodels library, the Q-Q plot should show if the dataset is normally distributed. If most of the points fall on the red line, then the distribution is normally distributed. Unfortunately, not this data. So in short, collecting stock data over a 4 month period can yield a rough estimate for generating expectations about what ROI and risk are used for. Mean and Standard Deviation.

Conclusion

We can conclude that during the past 4 months, BUKA shares have been less attractive in terms of ROI. However, this BUKA stock is attractive to be included in an investor's portfolio because the price is already relatively cheap. In terms of long-term investment, this stock is attractive to buy considering that Bukalapak is a startup company with a high valuation. It is possible that in the long term there will be an increase. However, to determine whether the risk and ROI is high or low, a comparison with other stocks is needed.

Owner
Najibulloh Asror
`Welcome to my world`
Najibulloh Asror
Incubator for useful bioinformatics code, primarily in Python and R

Collection of useful code related to biological analysis. Much of this is discussed with examples at Blue collar bioinformatics. All code, images and

Brad Chapman 560 Jan 03, 2023
ETL pipeline on movie data using Python and postgreSQL

Movies-ETL ETL pipeline on movie data using Python and postgreSQL Overview This project consisted on a automated Extraction, Transformation and Load p

Juan Nicolas Serrano 0 Jul 07, 2021
Fitting thermodynamic models with pycalphad

ESPEI ESPEI, or Extensible Self-optimizing Phase Equilibria Infrastructure, is a tool for thermodynamic database development within the CALPHAD method

Phases Research Lab 42 Sep 12, 2022
PipeChain is a utility library for creating functional pipelines.

PipeChain Motivation PipeChain is a utility library for creating functional pipelines. Let's start with a motivating example. We have a list of Austra

Michael Milton 2 Aug 07, 2022
Pandas and Spark DataFrame comparison for humans

DataComPy DataComPy is a package to compare two Pandas DataFrames. Originally started to be something of a replacement for SAS's PROC COMPARE for Pand

Capital One 259 Dec 24, 2022
Candlestick Pattern Recognition with Python and TA-Lib

Candlestick-Pattern-Recognition-with-Python-and-TA-Lib Goal Look at the S&P500 to try and get a better understanding of these candlestick patterns and

Ganesh Jainarain 11 Oct 07, 2022
Cleaning and analysing aggregated UK political polling data.

Analysing aggregated UK polling data The tweet collection & storage pipeline used in email-service is used to also collect tweets from @britainelects.

Ajay Pethani 0 Dec 22, 2021
Predictive Modeling & Analytics on Home Equity Line of Credit

Predictive Modeling & Analytics on Home Equity Line of Credit Data (Python) HMEQ Data Set In this assignment we will use Python to examine a data set

Dhaval Patel 1 Jan 09, 2022
Universal data analysis tools for atmospheric sciences

U_analysis Universal data analysis tools for atmospheric sciences Script written in python 3. This file defines multiple functions that can be used fo

Luis Ackermann 1 Oct 10, 2021
Flenser is a simple, minimal, automated exploratory data analysis tool.

Flenser Have you ever been handed a dataset you've never seen before? Flenser is a simple, minimal, automated exploratory data analysis tool. It runs

John McCambridge 79 Sep 20, 2022
Program that predicts the NBA mvp based on data from previous years.

NBA MVP Predictor A machine learning model using RandomForest Regression that predicts NBA MVP's using player data. Explore the docs » View Demo · Rep

Muhammad Rabee 1 Jan 21, 2022
Py-price-monitoring - A Python price monitor

A Python price monitor This project was focused on Brazil, so the monitoring is

Samuel 1 Jan 04, 2022
An implementation of the largeVis algorithm for visualizing large, high-dimensional datasets, for R

largeVis This is an implementation of the largeVis algorithm described in (https://arxiv.org/abs/1602.00370). It also incorporates: A very fast algori

336 May 25, 2022
Sensitivity Analysis Library in Python (Numpy). Contains Sobol, Morris, Fractional Factorial and FAST methods.

Sensitivity Analysis Library (SALib) Python implementations of commonly used sensitivity analysis methods. Useful in systems modeling to calculate the

SALib 663 Jan 05, 2023
MidTerm Project for the Data Analysis FT Bootcamp, Adam Tycner and Florent ZAHOUI

MidTerm Project for the Data Analysis FT Bootcamp, Adam Tycner and Florent ZAHOUI Hallo

Florent Zahoui 1 Feb 07, 2022
Stochastic Gradient Trees implementation in Python

Stochastic Gradient Trees - Python Stochastic Gradient Trees1 by Henry Gouk, Bernhard Pfahringer, and Eibe Frank implementation in Python. Based on th

John Koumentis 2 Nov 18, 2022
Intake is a lightweight package for finding, investigating, loading and disseminating data.

Intake: A general interface for loading data Intake is a lightweight set of tools for loading and sharing data in data science projects. Intake helps

Intake 851 Jan 01, 2023
My solution to the book A Collection of Data Science Take-Home Challenges

DS-Take-Home Solution to the book "A Collection of Data Science Take-Home Challenges". Note: Please don't contact me for the dataset. This repository

Jifu Zhao 1.5k Jan 03, 2023
Building house price data pipelines with Apache Beam and Spark on GCP

This project contains the process from building a web crawler to extract the raw data of house price to create ETL pipelines using Google Could Platform services.

1 Nov 22, 2021
Making the DAEN information accessible.

The purpose of this repository is to make the information on Australian COVID-19 adverse events accessible. The Therapeutics Goods Administration (TGA) keeps a database of adverse reactions to medica

10 May 10, 2022