Learning to trade under the reinforcement learning framework

Last update: Nov 28, 2022

Overview

Trading Using Q-Learning

In this project, I will present an adaptive learning model to trade a single stock under the reinforcement learning framework. This area of machine learning consists in training an agent by reward and punishment without needing to specify the expected action. The agent learns from its experience and develops a strategy that maximizes its profits. This is my capstone project for the Machine Learning Engineer Nanodegree, from Udacity. You can check my report here and the notebook with the tests of the codes used in this project here. The TEX file was produced with help of Overleaf.

Install

This project requires Python 2.7 and the following Python libraries installed:

Run

In a terminal or command window, navigate to the top-level project directory QLearning_Trading/ (that contains this README) and run one of the following commands:

python qtrader/agent.py <OPTION>
python -m qtrader.agent <OPTION>

Where OPTION could be train_learner, test_learner, test_random, optimize_k or optimize_gamma. The simulation will generate log files to be analyzed later on. Be aware that any of those commands take several minutes to finish.

Reference

T.M. Mitchell. Machine Learning. McGraw-Hill International Editions, 1997. link
M. Mohri, A. Rostamizadeh, A. Talwalkar. Foundations of Machine Learning. 2012. link
N.T. Chan, C.R. Shelton. An Electronic Market-Maker. 2001 link
N.T. Chan. Artificial Markets and Intelligent Agents. 2001 link
R. Cont, k. Arseniy, and S. Sasha. The price impact of order book events. Journal of financial econometrics, 2014 link
Du, Xin, Jinjian Zhai, and Koupin Lv. Algorithm Trading using Q-Learning and Recurrent Reinforcement Learning. link

License

The contents of this repository are covered under the Apache 2.0 License.

Learning to trade under the reinforcement learning framework

Related tags

Overview

Trading Using Q-Learning

Install

Run

Reference

License

Owner

Uirá Caiado

SiamMOT is a region-based Siamese Multi-Object Tracking network that detects and associates object instances simultaneously.

Best Practices on Recommendation Systems

DP-CL(Continual Learning with Differential Privacy)

Custom Implementation of Non-Deep Networks

Loopy belief propagation for factor graphs on discrete variables, in JAX!

PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations

PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.

docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Server files for UltimateLabeling

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

A general framework for inferring CNNs efficiently. Reduce the inference latency of MobileNet-V3 by 1.3x on an iPhone XS Max without sacrificing accuracy.

Multispectral Object Detection with Yolov5

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

Pytorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.

TuckER: Tensor Factorization for Knowledge Graph Completion

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning

BirdCLEF 2021 - Birdcall Identification 4th place solution

Make a Turtlebot3 follow a figure 8 trajectory and create a robot arm and make it follow a trajectory