Browse > Article

A Dynamic Asset Allocation Method based on Reinforcement learning Exploiting Local Traders  

O Jangmin (서울대학교 컴퓨터공학부)
Lee Jongwoo (숙명여자대학교 멀티미디어학과)
Zhang Byoung-Tak (서울대학교 컴퓨터공학부)
Abstract
Given the local traders with pattern-based multi-predictors of stock prices, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset allocated to each recommendation of the predictors, we design an asset allocation strategy called meta policy in the reinforcement teaming framework. We utilize both the information of each predictor's recommendations and the ratio of the stock fund over the total asset to efficiently describe the state space. The experimental results on Korean stock market show that the trading system with the proposed meta policy outperforms other systems with fixed asset allocation methods. This means that reinforcement learning can bring synergy effects to the decision making problem through exploiting supervised-learned predictors.
Keywords
stock trading; asset allocation; reinforcement learning;
Citations & Related Records
연도 인용수 순위
  • Reference
1 B. G. Malkiel, A Random Walk Down Wall Street, Norton, New York, 1996
2 M. A. H. Dempster, T. W. Payne, Y. Romahi, and G. W. P. Thompson, 'Computational Learning Techniques for Intraday FX Trading Using Popular Technical Indicators.' IEEE Transactions on Neural Networks, 12(4), pp. 744-754, 2001   DOI   ScienceOn
3 A. Fan and M. Palaniswami, 'Stock Selection Using Support Vector Machines,' In Proceedings of International Joint Conference on Neural Networks, pp. 1793-1798, 2001   DOI
4 S. M. Kendall and K. Ord, Time Series, Oxford, New York, 1997
5 E. F. Fama, 'Multiperiod Consumption Investment Decisions.' American Economic Review, 60, pp, 163-174, 1970
6 E. F. Fama and K. R. French, 'Dividend Yields and Expected Stock Returns,' Journal of Financial Economics, 22, pp. 3-26, 1988   DOI   ScienceOn
7 H. Li, C. H. Dagli and D. Enke, A Comparison Study of Reinforcement Schemes on a Series-based Stock Price Forecasting Task, IEEE transactions on Neural Networks, Submitted, 2005
8 R. S. Sutton and A. G. Barto, Reinforcement Learning : An Introduction. MIT Press, Cambridge, 1998
9 K. Hornik, M. Stinchcombe and H. White, 'Multilayer feedforward networks are universal approximators,' Neural Networks, vol. 2, pp. 359-366, 1989   DOI   ScienceOn
10 J. W. Lee and J. O, 'A Multi-agent Q-Iearning Framework for Optimizing Stock Trading Systems,' Proceedings of International Conference on Database and Expert Systems Applications, pp. 153-162, 2002
11 J. Moody and M. Saffell, 'Learning to Trade via Direct Reinforcement,' IEEE Transactions on Neural Networks, 12(4), pp. 875-889, 2001   DOI   ScienceOn
12 E. W. Saad, D. V. Prokhorov, D. C. Wunsch II, 'Comparative Study of Stock Trend Prediction Using Time Delay, Recurrent and Probabilistic Neural Networks,' IEEE Transactions on Neural Networks, 9(6), pp. 1456-1470, 1998   DOI   ScienceOn
13 R. Neuneier, 'Risk Sensitive Reinforcement Learning,' Advances in Neural Information Processing Systems, pp, 1031-1037, MIT Press, Cambridge, 1999
14 J. O. J. W. Lee, and B.-T. Zhang, 'Stock Trading System Using Reinforcement Learning with Cooperative Agents,' In Proceedings of International Conference on Machine Learning, pp. 451-458, Morgan Kaufmann, 2002
15 S. D. Kim, J. W. Lee, J. Lee, and J.-S. Chae, 'A Two-Phase Stock Trading System Using Distributional Differences,' Proceedings of International Conference on Database and Expert Systems Applications, pp. 143-152, 2002