Browse > Article

Rule Discovery and Matching for Forecasting Stock Prices  

Ha, You-Min (연세대학교 컴퓨터과학과)
Kim, Sang-Wook (한양대학교 정보통신학부)
Won, Jung-Im (한양대학교 정보통신학부)
Park, Sang-Hyun (연세대학교 컴퓨터과학과)
Yoon, Jee-Hee (한림대학교 정보통신학부)
Abstract
This paper addresses an approach that recommends investment types for stock investors by discovering useful rules from past changing patterns of stock prices in databases. First, we define a new rule model for recommending stock investment types. For a frequent pattern of stock prices, if its subsequent stock prices are matched to a condition of an investor, the model recommends a corresponding investment type for this stock. The frequent pattern is regarded as a rule head, and the subsequent part a rule body. We observed that the conditions on rule bodies are quite different depending on dispositions of investors while rule heads are independent of characteristics of investors in most cases. With this observation, we propose a new method that discovers and stores only the rule heads rather than the whole rules in a rule discovery process. This allows investors to define various conditions on rule bodies flexibly, and also improves the performance of a rule discovery process by reducing the number of rules. For efficient discovery and matching of rules, we propose methods for discovering frequent patterns, constructing a frequent pattern base, and indexing them. We also suggest a method that finds the rules matched to a query issued by an investor from a frequent pattern base, and a method that recommends an investment type using the rules. Finally, we verify the superiority of our approach via various experiments using real-life stock data.
Keywords
stock databases; rule discovery; rule matching;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J. Han, M. Kamber, 'Data Mining: Concepts and Techniques,' Morgan Kaufmann, 2001
2 R. Agrawal, R. Srikant, 'Fast Algorithms for mining Association Rules,' In Proc. Int'l. Conf. on VLDB, pp. 487-499, 1994
3 R. Agrawal, R. Srikant, 'Mining Sequential Patterns,' In Proc. Int'l. Conf. on Data Engineering, pp. 3-14, 1995
4 Koscom Data Mall, http://datamall.koscom.co.kr, 2005
5 G. Das, K.-I. Lin, H. Mannila, Gopal Renganathan, and Padhraic Smyth, 'Rule Discovery from Time Series,' In Proc. Int'l. Conf. on Knowledge Discovery and Datamining, pp. 16-22, 1998
6 S. Park and W. W. Chu, 'Discovering and Matching Elastic Rules From Sequence Databases', in Fundamenta Informaticae, Vol, 47, No. 1-2, pp. 75-90, Aug-Sept, 2001
7 S. Guha, R. Rastogi, and K. Shim, 'CURE: An Efficient Clustering Algorithm for Large Databases,' Information Systems Vol. 26, No. 1, pp. 35-58, 2001   DOI   ScienceOn
8 W. W. Chu and K. Chiang, 'Abstraction of High Level Concepts from Numerical Values in Databases,' In Proc. AAAI Workshop on Knowledge Discovery in Databases, pp. 133-144, 1994
9 B. Wah and M. Qian, 'Constrained Formulations and Algorithms for Stock-Price Predictions Using Recurrent FIR Neural Networks,' AAAI/IAAI pp. 211-216, 2002
10 E. Saad, D. Prokhorov, and D. Wunsch II, 'Comparative Study of Stock Trend Prediction Using Time Delay, Recurrent and Probabilistic Neural Networks,' IEEE Trans. on Neural Networks, pp. 1456-1470, 1998   DOI   ScienceOn
11 C. Faloutsos, M. Ranganathan, and Y. Manolopoulos, 'Fast Subsequence Matching in Time-series Databases,' In Proc. Int'l. Conf. on Management of Data, ACM SIGMOD, pp. 419-429, May 1994   DOI
12 T. Anderson, 'The Statistical Analysis of Time Series,' Wiley, 1971
13 H. White, 'Economic Prediction Using Neural Networks: The Case of IBM Daily Stock Returns,' In Proc. IEEE Int'l. Conf. on Neural Networks, pp. II451-II458, 1988   DOI
14 S. H. Park et al., 'Efficient Searches for Similar Subsequences of Difference Lengths in Sequence Databases,' In Proc. Int'l. Conf. on Data Engineering, IEEE ICDE, pp. 23-32, 2000   DOI
15 P. Bloomfield, 'Fourier Analysis of Time Series,' Wiley, 2000
16 R. Agrawal et al., 'Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases,' In Proc. Int'l. Conf. on VLDB, pp. 490-501, Sept. 1995
17 R. Agrawal, C. Faloutsos, and A. Swami, 'Efficient Similarity Search in Sequence Databases,' In Proc. Int'l. Conf. on Foundations of Data Organization and Algorithms, FODO, pp. 69-84, Oct. 1993
18 W. K. Loh, S. W. Kim, and K. Y. Whang, 'A Subsequence Matching Algorithm that Supports Normalization Transform in Time-Series Databases,' Data Mining and Knowledge Discovery Journal, Vol. 9, No. 1, pp. 5-28, Jul. 2004   DOI
19 S. W. Kim, S. H. Park, and W. W. Chu, 'An Index-Based Approach for Similarity Search Supporting Time Warping in Large Sequence Databases,' In Proc. Int'l. Conf. on Data Engineering, IEEE ICDE, pp. 607-614, 2001   DOI