Browse > Article
http://dx.doi.org/10.9708/jksci.2011.16.8.029

Performance Comparison of Column-Oriented and Row-Oriented Database Systems for Star Schema Join Processing  

Oh, Byung-Jung (Dept. of Computer Engineering, Hong-Ik University)
Ahn, Soo-Min (Dept. of Computer Engineering, Hong-Ik University)
Kim, Kyung-Chang (Dept. of Computer Engineering, Hong-Ik University)
Abstract
Unlike in traditional row-oriented database systems, a column-oriented database system stores data in column-oriented and not row-oriented order. Recently, research results revealed the effectiveness of column-oriented databases for applications such as data warehouse and decision support systems that access large volumes of data in a read only manner. In this paper, we investigate the join strategies for column-oriented databases and prove the effectiveness of column-oriented databases in data warehouse systems. For unbiased comparison, the two database systems are analyzed using the star schema benchmark and the performance analysis of a star schema join query is carried out. We experimented with well-known join algorithms and considered early materialization and late materialization join strategies for column-oriented databases. The performance results confirm that star schema join queries perform better in terms of disk I/O cost in column-oriented databases than in row-oriented databases. In addition, the late materialization strategy showed more performance gain than the early materialization strategy in column-oriented databases.
Keywords
column-oriented database system; row-oriented database system; data warehouse; early materialization; late materialization; star schema benchmark; star schema join;
Citations & Related Records
연도 인용수 순위
  • Reference
1 M. Stonebraker, et. al. "C-Store: AColumn-Oriented DBMS", Proc. of VLDB, 553-564, 2005.
2 Daniel J. Abadi, Daniel S. Myers, David J. DeWitt, Samuel R. Madden. "Materialization Strategies in a Column-Oriented DBMS". IEEE. 2007.
3 Daniel J. Abadi, Peter A. Boncz, and Stavros Hariz opoulos. "Column-oriented Database Systems". VLDB 2009 Tutorial.
4 P. A. Boncz. "Monet: A Next-Generation DBMS Kernel For Query-Intensive Applications". Ph.d. thesis, Universiteit van Amsterdam, May 2002.
5 Patrick E. O'Neil, Elizabeth J. O'Neil, and Xuedong Chen. "The Star Schema Benchmark (SSB)". Revision 3, June 5, 2009.
6 Raghu Ramakrishnan, Johannes Gehrke, "Database Managem ent Systems 2nd Edition", McGrawHill, pp. 333-348, 2000.
7 Abraham Silberschatz, Henry F. Korth, S. Sud arshan, "Da tabase System Concepts Fifth Edition", McGrawHill, pp. 481-502, 2006.
8 Alan Halverson, Jennifer L. Beckmann, Jeffrey F. Naughton, David J. DeWittP, "A Comparison of C-Store and Row-Store in a Common Framework", Proc. of the 32nd VLDB Conference, 2006.
9 Daniel J. Abadi. "Query Execution in Colum n-Orie nted Database Systems", Massachusetts Institute of Technology, pp. 85-93, 2008
10 P. Boncz, M. Zukowski, and N. Nes. "Mone tDB/X100: Hyper-pipelining query execution", Proc. of Intl' Conf. on Innovative Data System Research (CIDR), 2005.
11 P.A. Boncz and M.L. Kersten. "MIL primitives for querying a fragmented world", VLDB Journal, 8(2): 101-119, 1999.   DOI   ScienceOn