Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2003.10D.6.907

A Data Generator for Database Benchmarks and its Performance Evaluation  

Ok, Eun-Taek (숭실대학교 대학원 컴퓨터학과)
Jeong, Hoe-Jin (숭실대학교 대학원 컴퓨터학과)
Lee, Sang-Ho (숭실대학교 컴퓨터학부)
Abstract
Database benchmarks require efficient of large-scale data. This presents the system architecture, control flows, and characteristics of the data generator we have developed. The data generator features generation of large-scale data, column-by-column data generation, a number of data distributions and verification, and real data generation. An extensive conparison with other data generators in terms of function is also presented. Finally, empirical performance experiments between RAID systems and non-RAID one have been conducted to alleviate I/O bottleneck. The test results can serve as guidelines to help confifure system architecture.
Keywords
Data Generator; Database Benchmarks; Performance Evaluation; RAID;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 D. DeWitt, 'The Wisconsin Benchmark: Past, Present, and Future,' The Benchmark Handbook, 2nd Ed., J. Gray Ed., Morgan Kaufmann, pp.269-316, 1993
2 P. O'Neil, 'The Set Query Benchmark,' The Benchmark Handbook, 2nd Ed., J. Gray Ed., Morgan Kaufmann, pp. 359- 396, 1993
3 C. Turbyfill, C. Orji, and D. Bitton, '$AS^{3}AP$ : An ANSI SQL Standard Scaleable and Portable Benchmark for Relational Database Systems,' The Benchmark Handbook, 2nd Ed., J. Gray Ed., Morgan Kaufmann, pp.317-358, 1993
4 강근석, 김성철, 김지현, 이윤오, 이정진, 이창수, '디스켓이 들어 있는 PC 통계학', 자유 아카데미, 1993
5 H.J. Jeong and S. H. Lee, 'An Integrated Benchmark Suite for Database Systems,' Proceedings of the IASTED International Conference on Information Systems and Databases, pp.74-79, 2002
6 D. Knuth, 'The Art of Computer Programming,' 2nd Ed., Addison Wesley, 1981
7 Datatect, Banner Software Inc, http://www.datatect.com/
8 DataFactory, Quest Software Inc, http://www.quest.com/datafactory/
9 TurboData, Canam Software Inc, http://www.turbodata.ca/
10 DatGen, http://www.datasetgeneratorcom/
11 M. Y. Kim, 'Synchronized Disk Interleaving,' IEEE Transactions on Computers, Vol.3, No.11, pp.978-988, 1986
12 G. Weikum and P. Zabback, 'Tuning of Striping Units in Disk-Array-Based File Systems,' Proceedings of the 2nd International Workshop on Research Issues on Data Engineering : Transaction and Query Processing, pp.80-87, 1992
13 D. A. Patterson, G. Gibson and R. H. Katz, 'A case for redundant arrays of inexpensive disks (RAID),' Proceedings of the 1988 ACM SIGMOD International Conference on Management of Data, pp.l09-116, 1988   DOI
14 P. Chen and D. Patterson, 'Maximizing Performance in a Striped Disk Array,' Proceedings of the 1990 ACM SIGARCH International Conference on Computer Architecture, pp.322-331, 1990   DOI
15 P. M. Chen and E. K. Lee, 'Striping in a RAID Level 5 Disk Array,' Proceedings of the 1995 ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, pp.l36-145, 1995   DOI
16 전상훈, 안병철, '실시간 멀티미디어 데이터를 위한 RAID 구조의 실측 분석'정보처리학회논문지, 제9권 제2호, pp.191-199, 2002   과학기술학회마을
17 TPC Home Page, http://www.tpc.org
18 J. Gray, P. Sundaresan, S. Englert, K. Baclawski and P. Weinberger, 'Quickly Generating Billion-Record Synthetic Databases,' Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, pp.233-242, 1994   DOI