http://dx.doi.org/10.5392/JKCA.2021.21.09.198

Design and Implementation of a Benchmarking System Based on ArangoDB  

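Choi, Do-Jin (Department of Information and Communication Engineering, Chungbuk National University)
Baek, Yeon-Hee (Department of Big Data, Chungbuk National University)
Lee, So-Min (Department of Information and Communication Engineering, Chungbuk National University)
Kim, Yun-A (Department of Big Data, Chungbuk National University)
Kim, Nam-Young (Department of Big Data, Chungbuk National University)
Choi, Jae-Young (Department of Information and Communication Engineering, Chungbuk National University)
Lee, Hyeon-Byeong (Department of Information and Communication Engineering, Chungbuk National University)
Lim, Jong-Tae (Department of Information and Communication Engineering, Chungbuk National University)
Bok, Kyoung-Soo (Department of SW Convergence, Wonkwang University)
Song, Seok-Il (Department of Computer Engineering, Korea National University of Transportation)
Yoo, Jae-Soo (Department of Information and Communication Engineering, Chungbuk National University)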
Abstract
ArangoDB is a NoSQL database system that is widely used in many applications for storing large amounts of data. To apply a new NoSQL database system such as ArangoDB to real work environments, we need a benchmarking system that can evaluate its performance. In this paper, we design and implement an ArangoDB-based benchmarking system that measures kernel-level performance as well as application-level performance. We partially modify YCSB to measure the performance of a NoSQL database system in a cluster environment. We also define three real-world workload types by analyzing existing materials. We demonstrate the feasibility of the proposed system by benchmarking the three workload types. We derive the workloads available in ArangoDB and show that performance at the kernel layer as well as the application layer can be visualized through benchmarking of the three workload types. We expect that benchmarking with this system will support applicability and risk reviews in environments that need to migrate data from an existing database engine to ArangoDB.
Keywords
ArangoDB; Database; Benchmarking; Workload; NoSQL