Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2004.11B.2.207

Optimization of Stock Trading System based on Multi-Agent Q-Learning Framework  

Kim, Yu-Seop (한림대학교 정보통신공학부)
Lee, Jae-Won (성신여자대학교 컴퓨터정보공학부)
Lee, Jong-Woo (㈜아이닉스소프트)
Abstract
This paper presents a reinforcement learning framework for stock trading systems. Trading system parameters are optimized by Q-learning algorithm and neural networks are adopted for value approximation. In this framework, cooperative multiple agents are used to efficiently integrate global trend prediction and local trading strategy for obtaining better trading performance. Agents Communicate With Others Sharing training episodes and learned policies, while keeping the overall scheme of conventional Q-learning. Experimental results on KOSPI 200 show that a trading system based on the proposed framework outperforms the market average and makes appreciable profits. Furthermore, in view of risk management, the system is superior to a system trained by supervised learning.
Keywords
Q-learning; Stock Trading; Multi Agent; Buy Signal; Buy Order; Sell Signal; Sell Order;
Citations & Related Records
연도 인용수 순위
  • Reference
1 S. M. Kendall and K. Ord, 'Time Series,' Oxford, New York, 1997
2 R. Neuneier, 'Enhancing Q-Learning for Optimal Asset allocation,' Advanced in Neural Information Processing System, 10, MIT Press, Cambridge, pp.936-942, 1998
3 J. Lee, 'Stock Price Prediction using Reinforcement Learning,' Proc. of the 6th IEEE International Symposium on Industrial Electronics, 2001   DOI
4 R. S. Sutton and A. G. Barto, 'Reinforcement Learning : An Introduction,' MIT Press, Cambridge, 1998
5 M. Jakkola, M. Jordan and S. Signh, 'On the Convergence of Stochastic Iterative Dynamic Programming Algorithms,' Neural Computation, 6(6), pp.1185-2201, 1994   DOI   ScienceOn
6 J. Moody, Y. Wu, Y. Liao and M. Saffell, 'Performance Functions and Reinforcement Learning for Trading Systems and Portfolios,' Journal of Forecasting, 17(5-6), pp.441-470, 1998   DOI   ScienceOn
7 J. Moody and M. Saffell, 'Learning to Trade via Direct Reinforcement,' IEEE Transactions on Neural Networks, 12(4), pp.875-889, 2001   DOI   ScienceOn
8 G. Xiu, C. Laiwan, 'Algorithm for Trading and Portfolio Management Using Q-learning and Sharpe Ratio Maximization,' Proc. of ICONIP 2000, Korea, pp.832-837, 2000
9 R. Neuneier and O. Mihatsch, 'Risk Sensitive Reinforcement Learning,' Advances in Neural Information Processing Systems, 11, MIT Press, Cambridge, pp.1031-1037, 1999
10 L. C. Baird, 'Residual Algorithms : Reinforcement Learning with Function Approximation,' Proc. of Twelfth International Conference on Machine Learning, Morgan Kaufmann, San Francisco, pp.30-37, 1995