DOI QR코드

DOI QR Code

Effectiveness Evaluations of Subsequence Matching Methods Using KOSPI Data

한국 주식 데이터를 이용한 서브시퀀스 매칭 방법의 효과성 평가

  • 유승근 (숭실대학교 대학원 컴퓨터학과) ;
  • 이상호 (숭실대학교 컴퓨터학부)
  • Published : 2005.06.01

Abstract

Previous researches on subsequence matching have been focused on how to make indexes in order to speed up the matching time, and do not take into account the effectiveness issues of subsequence matching methods. This paper considers the effectiveness of subsequence matching methods and proposes two metrics for effectiveness evaluations of subsequence matching algorithms. We have applied the proposed metrics to Korean stock data and five known matching algorithms. The analysis on the empirical data shows that two methods (i.e., the method supporting normalization, and the method supporting scaling and shifting) outperform the others in terms of the effectiveness of subsequence matching.

기존의 서브시퀀스 매칭 방법은 검색을 효율적으로 수행하기 위한 인덱스 구성 방법에 대하여 연구하였으며, 서브시퀀스 매칭 방법의 효과성 평가를 고려하지 않았다. 본 논문은 서브시퀀스 매칭 방법의 효과성에 대하여 고려하였으며, 서브시퀀스 매칭 방법의 효과성을 평가 할 수 있는 2가지 척도를 제안한다. 한국 주식 데이터와 5가지 서브시퀀스 매칭 방법에 대하여 제안된 효과성 측정 방안을 적용하였으며, 그 결과를 분석하였다. 실험 결과, 정규화를 지원하는 서브시퀀스 매칭 방법과 스케일링과 쉬프팅 변환을 지원하는 서브시퀀스 매칭 방법이 상대적으로 효과적인 서브시퀀스를 검색하였다.

Keywords

References

  1. R. Agrawal, C. Faloutsos and A. Swami, 'Efficient Similarity Search in Sequence Databases,' In Proceedings of the International Conference on Foundations of Data Organization and Algorithms, pp.69-84, 1993
  2. D. J. Bernidt and J. Clifford, 'Finding Patterns in Time Series : A Dynamic Programming Approach,' Advances in Knowledge Discovery and Data mining, AAA/MIT Press, pp.229-248, 1996
  3. K. K. W. Chu and M. H. Wong, 'Fast Time-Series Searching with Scaling and Shifting,' In Proceeding of the International Symposium on Principles of Databases Systems, pp.237-248, 1999 https://doi.org/10.1145/303976.304000
  4. C. Faloutsos, M. Ranganathan and Y. Manolopoulos, 'Fast Subsequence Matching in Time-Series Databases,' In Proceedings of the International Conference on Management of Data, pp.419-429, 1994 https://doi.org/10.1145/191843.191925
  5. D. Q. Goldin and P. C. Kanellakis, 'On Similarity Queries for Time-Series Data: Constraint Specification and Implementation,' In Proceedings of the International Conference on Principles of Data Mining and Knowledge Discovery, pp. 88-100, 1997
  6. E. Keogh, J. Lin and W. Tuppel, 'Clustering of Time Series Subsequence is Meaningless : Implication for Previous and Future Research,' In Proceedings of the third IEEE International Conference on Data Mining, pp.115-125, 2003
  7. R. R. Korfhage, 'Information Storage and Retrieval,' Wiley Press, 1997
  8. W. K. Loh, S. W. Kim and K. Y. Whang, 'Index Interpolation : An Approach for Subsequence Matching Supporting Normalization Transform in Time-Series Databases,' In Proceedings of the International Conference on Information and Knowledge Management, pp.480-487, 2000
  9. Y. S. Moon, K. Y. Whang and W. K. Loh, 'Duality-Based Subsequence Matching in Time-Series Databases,' In Proceedings of the International Conference on Data Engineering, pp.263-272, 2001
  10. S. H. Park, W. W. Chu, J. H. Yoon and C. Hsu, 'Efficient Searches for Similar Subsequence of Different Lengths in Sequence Databases,' In Proceedings of the International Conference on Data Engineering, pp.23-32, 2000
  11. D. Rafiei and A. Mendelzon, 'Similarity-Based Queries for Time Series Data,' In Proceedings of the International Conference on Management of Data, pp.13-24, 1997 https://doi.org/10.1145/253260.253264
  12. 김상욱, 박상현, '시퀀스 데이터베이스에서 타임 워핑을 지원하는 효과적인 유사 검색 기법', 정보과학회 논문지, 제28권 제4호, pp.643-654, 2001
  13. 노웅기, 김상욱, 황규영, '시계열 데이터베이스에서 인덱스 보간법을 기반으로 정규화변환을 지원하는 서브시퀀스 매칭 알고리즘', 정보과학회 논문지, 제28권 제2호, pp.217-232, 2001