[KSCI] Korea Science Citation Index Service

A Dynamic Asset Allocation Method based on Reinforcement learning Exploiting Local Traders

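O Jangmin (School of Computer Science and Engineering, Seoul National University)
Lee Jongwoo (Department of Multimedia, Sookmyung Women's University)
Zhang Byoung-Tak (School of Computer Science and Engineering, Seoul National University)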

Publication Information

Journal of KIISE: Software and Applications / v.32, no.8, 2005, pp. 693-703

Abstract

Given local traders built on pattern-based multi-predictors of stock prices, we study a dynamic asset allocation method that maximizes trading performance. To optimize the proportion of assets allocated to each recommendation of the predictors, we design an asset allocation strategy, called the meta policy, in the reinforcement learning framework. To describe the state space efficiently, we use both the information in each predictor's recommendations and the ratio of the stock fund to the total asset. Experimental results on the Korean stock market show that a trading system with the proposed meta policy outperforms systems with fixed asset allocation methods. This suggests that reinforcement learning can create synergy in decision-making problems by exploiting predictors trained with supervised learning.
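The state description and meta policy outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the learning algorithm, the discretization, or the reward, so the sketch substitutes plain tabular Q-learning, and every name and constant in it (`ACTIONS`, `RATIO_BINS`, `encode_state`, `MetaPolicy`, the learning rate and discount) is a hypothetical choice made for illustration only.

```python
import random
from collections import defaultdict

# Hypothetical discretization; none of these values come from the paper.
ACTIONS = [0.0, 0.25, 0.5, 0.75, 1.0]  # fraction of the asset committed to the current recommendations
RATIO_BINS = 5                         # number of buckets for stock_fund / total_asset


def encode_state(num_recommendations, stock_fund, total_asset, max_recs=3):
    """Build a state from the two ingredients named in the abstract:
    the predictors' recommendations (here reduced to a capped count)
    and the ratio of the stock fund over the total asset (bucketed)."""
    ratio = stock_fund / total_asset if total_asset > 0 else 0.0
    bucket = min(int(ratio * RATIO_BINS), RATIO_BINS - 1)
    return (min(num_recommendations, max_recs), bucket)


class MetaPolicy:
    """Tabular Q-learning agent that picks an allocation fraction per state."""

    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.q = defaultdict(lambda: [0.0] * len(ACTIONS))
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor
        self.epsilon = epsilon  # exploration probability
        self.rng = random.Random(seed)

    def act(self, state, greedy=False):
        """Return the index of an action in ACTIONS (epsilon-greedy unless greedy=True)."""
        if not greedy and self.rng.random() < self.epsilon:
            return self.rng.randrange(len(ACTIONS))
        values = self.q[state]
        return max(range(len(ACTIONS)), key=values.__getitem__)

    def update(self, state, action, reward, next_state):
        """One-step Q-learning update toward reward + gamma * max_a Q(next_state, a)."""
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])
```

In use, a trading loop would call `encode_state` each day, ask `act` for an allocation fraction, and feed the observed profit or loss back through `update`. A fixed asset allocation baseline of the kind the abstract compares against would correspond to always choosing the same entry of `ACTIONS`.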

Keywords

stock trading; asset allocation; reinforcement learning

