A Multifaceted Approach to Stock Market Trading Using Reinforcement Learning
Published in: | IEEE Access, 2024, Vol. 12, pp. 90041-90060 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Summary: | In recent years, algorithmic stock market trading has grown significantly and come to play a major role in investment decisions. Several methods have been proposed with the objective of designing optimum trading strategies that maximize profitability, economic utility, and risk-adjusted returns. Although traditional methods, including mean reversion, momentum, and trend-following approaches, show good results, they generalize poorly and often perform well only in specific time frames. Reinforcement Learning (RL) approaches are more adaptable and continually perceive the environment to make optimum trading decisions. However, it is still difficult to develop a lucrative trading approach in a complicated and dynamic stock market. The primary challenges in RL methods are an effective state representation that reflects current market conditions and a suitable trading reward that encourages agents to make more informed decisions. To address these challenges, this research presents a multifaceted strategy for multi-stock market trading using RL that incorporates an enhanced state representation based on daily historical data, technical indicators, and fundamental indicators from balance sheets, income statements, and cash flow statements. A novel reward function named PSR is also proposed to inform the agent, on a day-to-day basis, of the impact of its decisions while accounting for risk. The proposed RL agent is trained with the Advantage Actor-Critic (A2C) and Deep Deterministic Policy Gradient (DDPG) algorithms in a multi-stock environment in which investors hold shares in multiple stocks and each trading signal must specify the quantity of shares. The proposed multifaceted strategy is validated on 30 Dow Jones stocks, and the proposed model outperforms the benchmark Dow Jones Industrial Average index during backtesting. |
---|---|
ISSN: | 2169-3536 |
DOI: | 10.1109/ACCESS.2024.3418510 |