Deep Reinforcement Learning based Trading Agent for Bitcoin

Last update: Dec 29, 2022

Overview

Deep Trading Agent

Deep Reinforcement Learning based Trading Agent for Bitcoin using DeepSense Network for Q function approximation.

For complete details of the dataset, preprocessing, network architecture and implementation, refer to the Wiki of this repository.

Requirements

Python 2.7
Tensorflow
Pandas (for pre-processing Bitcoin Price Series)
tqdm (for displaying progress of training)

To setup a ubuntu virtual machine with all the dependencies to run the code, refer to assets/vm.

Run with Docker

Pull the prebuilt docker image directly from docker hub and run it as

docker pull samre12/deep-trading-agent:latest
docker run -p 6006:6006 -it samre12/deep-trading-agent:latest

Build the docker image locally by executing the command and the run the image as

docker build -t deep-trading-agent .
docker run -p 6006:6006 -it deep-trading-agent

This will setup the repository for training the agent and

mount the current directory into /deep-trading-agent in the container
during image build, the latest transactions history from the exchange is pulled and sampled to create per-minute scale dataset of Bitcoin prices. This dataset is placed at /deep-trading-agent/data/btc.csv
to initiate training of the agent, specify suitable parameters in a config file (an example config file is provided at /deep-trading-agent/code/config/config.cfg) and run the code using /deep-trading-agent/code/main.py
training supports logging and monitoring through Tensorboard
vim and screen are installed in the container to edit the configuration files and run tensorboard
bind port 6006 of container to 6006 of host machine to monitor training using Tensorboard

Support

Please give a ⭐ to this repository to support the project 😄 .

ToDo

Docker Support

Add Docker support for a fast and easy start with the project

Improve Model performance

Extract highest and lowest prices and the volume of Bitcoin traded within a given time interval in the Preprocessor
Use closing, highest, lowest prices and the volume traded as input channels to the model (remove features calculated just using closing prices)
Normalize the price tensors using the price of the previous time step
For the complete state representation, input the remaining number of trades to the model
Use separate diff price blocks to calculate the unrealized PnL
Use exponentially decayed weighted unrealized PnL as a reward function to incorporate current state of investment and stabilize the learning of the agent

Trading Model

is inspired by Deep Q-Trading where they solve a simplified trading problem for a single asset.
For each trading unit, only one of the three actions: neutral(1), long(2) and short(3) are allowed and a reward is obtained depending upon the current position of agent. Deep Q-Learning agent is trained to maximize the total accumulated rewards.
Current Deep Q-Trading model is modified by using the Deep Sense architecture for Q function approximation.

Dataset

Per minute Bitcoin series is obtained by modifying the procedure mentioned in this repository. Transactions in the Coinbase exchange are sampled to generate the Bitcoin price series.
Refer to assets/dataset to download the dataset.

Preprocessing

Basic Preprocessing
Completely ignore missing values and remove them from the dataset and accumulate blocks of continuous values using the timestamps of the prices.
All the accumulated blocks with number of timestamps lesser than the combined history length of the state and horizon of the agent are then filtered out since they cannot be used for training of the agent.
In the current implementation, past 3 hours (180 minutes) of per minute Bitcoin prices are used to generate the representation of the current state of the agent.
With the existing dataset (at the time of writing), following are the logs generated while preprocessing the dataset:

INFO:root:Number of blocks of continuous prices found are 58863
INFO:root:Number of usable blocks obtained from the dataset are 887
INFO:root:Number of distinct episodes for the current configuration are 558471

Advanced Preprocessing
Process missing values and concatenate smaller blocks to increase the sizes of continuous price blocks.
Standard technique in literature to fill the missing values in a way that does not much affect the performance of the model is using exponential filling with no decay.
(To be implemented)

Implementation

Tensorflow "1.1.0" version is used for the implementation of the Deep Sense network.

Deep Sense

Implementation is adapted from this Github repository with a few simplifications in the network architecture to incorporate learning over a single time series of the Bitcoin data.

Deep Q Trading

Implementation and preprocessing is inspired from this Medium post. The actual implementation of the Deep Q Network is adapted from DQN-tensorflow.

Deep Reinforcement Learning based Trading Agent for Bitcoin

Related tags

Overview

Deep Trading Agent

Requirements

Run with Docker

Support

ToDo

Docker Support

Improve Model performance

Trading Model

Dataset

Preprocessing

Implementation

Deep Sense

Deep Q Trading

Owner

Kartikay Garg

Serve TensorFlow ML models with TF-Serving and then create a Streamlit UI to use them

ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels

Writeups for the challenges from DownUnderCTF 2021

Data-depth-inference - Data depth inference with python

This YoloV5 based model is fit to detect people and different types of land vehicles, and displaying their density on a fitted map, according to their coordinates and detected labels.

Satellite labelling tool for manual labelling of storm top features such as overshooting tops, above-anvil plumes, cold U/Vs, rings etc.

Web service for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation based on OpenFace 2.0

Code for the paper "Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness"

Pose estimation with MoveNet Lightning

Instant-nerf-pytorch - NeRF trained SUPER FAST in pytorch

Multi-Modal Machine Learning toolkit based on PaddlePaddle.

StyleGAN of All Trades: Image Manipulation withOnly Pretrained StyleGAN

Real-Time Social Distance Monitoring tool using Computer Vision

discovering subdomains, hidden paths, extracting unique links

Implementation of the ivis algorithm as described in the paper Structure-preserving visualisation of high dimensional single-cell datasets.

Neural HMMs are all you need (for high-quality attention-free TTS)

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Official PyTorch code for CVPR 2020 paper "Deep Active Learning for Biased Datasets via Fisher Kernel Self-Supervision"

Welcome to The Eigensolver Quantum School, a quantum computing crash course designed by students for students.

Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022