Research on Intelligent Trader Model Based on Deep Reinforcement Learning

doi:10.3778/j.issn.1002-8331.1908-0254

Abstract

The stock market changes rapidly, is subject to many interfering factors, and offers insufficient period data. Stock trading is a game played under incomplete information, and single-objective supervised learning models struggle with this kind of sequential decision problem. Reinforcement learning is one effective way to address such problems. This paper proposes ISTG (Intelligent Stock Trader and Gym), an intelligent stock trader model based on deep reinforcement learning. The model integrates multiple data types, including historical market data, technical indicators, and macroeconomic indicators; analyzes evaluation criteria and effective control strategies; processes long-period data; implements a replay model that can be incrementally extended with different types of data; automatically calculates return labels; and trains intelligent traders. The paper also proposes a method for calculating single-step deterministic action values directly from market data. In comparative experiments on more than 1,400 stocks from the Chinese stock market, each with more than 10 years of data, ISTG achieved an overall return of 13%, compared with −7% overall for the buy-and-hold strategy.

Key words: deep reinforcement learning, Deep Reinforcement Learning with Double Q-Learning (DDQN), single-step deterministic action value, quantitative strategy
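
The key words name DDQN, but this page does not describe how ISTG configures it. As a minimal sketch of the standard Double Q-Learning target only, and not the authors' implementation: the online network selects the next action and the target network evaluates it. The function name `double_dqn_targets`, the three-action layout (hold, buy, sell), the discount factor, and all numeric values below are illustrative assumptions.

```python
import numpy as np


def double_dqn_targets(rewards, dones, q_online_next, q_target_next, gamma=0.99):
    """Standard Double DQN targets for a batch of transitions.

    rewards:        shape (batch,), one-step rewards
    dones:          shape (batch,), 1.0 where the episode ended, else 0.0
    q_online_next:  shape (batch, n_actions), online-network Q(s', .)
    q_target_next:  shape (batch, n_actions), target-network Q(s', .)
    """
    rewards = np.asarray(rewards, dtype=float)
    dones = np.asarray(dones, dtype=float)
    q_online_next = np.asarray(q_online_next, dtype=float)
    q_target_next = np.asarray(q_target_next, dtype=float)

    # Action selection uses the online network...
    best_actions = np.argmax(q_online_next, axis=1)
    # ...while action evaluation uses the target network.
    next_values = q_target_next[np.arange(len(rewards)), best_actions]

    # Terminal transitions do not bootstrap from the next state.
    return rewards + gamma * (1.0 - dones) * next_values


# Hypothetical batch of two transitions; columns are (hold, buy, sell).
targets = double_dqn_targets(
    rewards=[0.01, -0.02],
    dones=[0.0, 1.0],
    q_online_next=[[0.1, 0.5, 0.2], [0.3, 0.1, 0.0]],
    q_target_next=[[0.2, 0.4, 0.9], [0.7, 0.1, 0.0]],
    gamma=0.9,
)
```

In the first transition the online network prefers "buy", so the target uses the target network's value for "buy" (0.4) rather than its own maximum (0.9). Decoupling selection from evaluation in this way is what reduces the overestimation bias of plain Q-learning.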

HAN Daoqi, ZHANG Junyao, ZHOU Yuhang, LIU Qing. Research on Intelligent Trader Model Based on Deep Reinforcement Learning[J]. Computer Engineering and Applications, 2020, 56(21): 145-153.
