[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.3745/KIPSTB.2004.11B.2.207

Optimization of Stock Trading System based on Multi-Agent Q-Learning Framework

Kim, Yu-Seop (한림대학교 정보통신공학부)
Lee, Jae-Won (성신여자대학교 컴퓨터정보공학부)
Lee, Jong-Woo (㈜아이닉스소프트)

Publication Information

The KIPS Transactions:PartB / v.11B, no.2, 2004 , pp. 207-212 More about this Journal

Abstract

This paper presents a reinforcement learning framework for stock trading systems. Trading system parameters are optimized by Q-learning algorithm and neural networks are adopted for value approximation. In this framework, cooperative multiple agents are used to efficiently integrate global trend prediction and local trading strategy for obtaining better trading performance. Agents Communicate With Others Sharing training episodes and learned policies, while keeping the overall scheme of conventional Q-learning. Experimental results on KOSPI 200 show that a trading system based on the proposed framework outperforms the market average and makes appreciable profits. Furthermore, in view of risk management, the system is superior to a system trained by supervised learning.

Keywords

Q-learning; Stock Trading; Multi Agent; Buy Signal; Buy Order; Sell Signal; Sell Order;

Citations & Related Records

Reference

1	S. M. Kendall and K. Ord, 'Time Series,' Oxford, New York, 1997
2	R. Neuneier, 'Enhancing Q-Learning for Optimal Asset allocation,' Advanced in Neural Information Processing System, 10, MIT Press, Cambridge, pp.936-942, 1998
3	J. Lee, 'Stock Price Prediction using Reinforcement Learning,' Proc. of the 6th IEEE International Symposium on Industrial Electronics, 2001 DOI
4	R. S. Sutton and A. G. Barto, 'Reinforcement Learning : An Introduction,' MIT Press, Cambridge, 1998
5	M. Jakkola, M. Jordan and S. Signh, 'On the Convergence of Stochastic Iterative Dynamic Programming Algorithms,' Neural Computation, 6(6), pp.1185-2201, 1994 DOI ScienceOn
6	J. Moody, Y. Wu, Y. Liao and M. Saffell, 'Performance Functions and Reinforcement Learning for Trading Systems and Portfolios,' Journal of Forecasting, 17(5-6), pp.441-470, 1998 DOI ScienceOn
7	J. Moody and M. Saffell, 'Learning to Trade via Direct Reinforcement,' IEEE Transactions on Neural Networks, 12(4), pp.875-889, 2001 DOI ScienceOn
8	G. Xiu, C. Laiwan, 'Algorithm for Trading and Portfolio Management Using Q-learning and Sharpe Ratio Maximization,' Proc. of ICONIP 2000, Korea, pp.832-837, 2000
9	R. Neuneier and O. Mihatsch, 'Risk Sensitive Reinforcement Learning,' Advances in Neural Information Processing Systems, 11, MIT Press, Cambridge, pp.1031-1037, 1999
10	L. C. Baird, 'Residual Algorithms : Reinforcement Learning with Function Approximation,' Proc. of Twelfth International Conference on Machine Learning, Morgan Kaufmann, San Francisco, pp.30-37, 1995

KSCI

Optimization of Stock Trading System based on Multi-Agent Q-Learning Framework 다중 에이전트 Q-학습 구조에 기반한 주식 매매 시스템의 최적화

Optimization of Stock Trading System based on Multi-Agent Q-Learning Framework