http://dx.doi.org/10.9708/jksci.2014.19.7.113

BLOCS: Block Correlation Aware Sequential Pattern Mining based Caching Algorithm for Hybrid Storages  

Lee, Seongjin (Department of Electronics and Computer Engineering, Hanyang University)
Won, Youjip (Department of Computer and Software, Hanyang University)
Abstract
In this paper, we propose the BLOCS algorithm, which finds the sequences of data that should be stored in the cache device of a hybrid storage system that uses an SSD as its cache. BLOCS uses a sequential pattern mining scheme to create a set of frequently requested sectors that takes into account the order in which the sectors are requested. To evaluate the proposed scheme, we compare it with three others: a Distance (DIST) based scheme, a Request Frequency (FREQ) based scheme, and a Frequency times Size (F-S) based scheme. We measure the hit ratio and I/O latency of each caching scheme using a hybrid storage caching simulator. We collected a booting workload along with ten application launch scenarios and used these workloads as input to the cache simulator. In the experiment with the booting workload, the BLOCS scheme achieves a hit ratio of 61%, which is about 15% higher than that of DIST, the worst performing scheme.
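The idea of selecting cache contents from ordered request patterns and then measuring a hit ratio can be illustrated with a minimal Python sketch. It is not the BLOCS algorithm itself, whose details the abstract does not give. It assumes that a pattern is a contiguous run of sector numbers, that support is the number of traces containing the run, and that the cache has room for every selected sector. The function names, the traces, and the thresholds are all hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Sequence, Set, Tuple


def frequent_sequences(
    traces: Iterable[Sequence[int]], length: int, min_support: int
) -> Dict[Tuple[int, ...], int]:
    """Return contiguous sector runs of `length` found in at least
    `min_support` traces, mapped to the number of traces containing them."""
    support: Counter = Counter()
    for trace in traces:
        # Collect each run once per trace so support counts traces, not repeats.
        runs = {tuple(trace[i:i + length]) for i in range(len(trace) - length + 1)}
        support.update(runs)
    return {run: n for run, n in support.items() if n >= min_support}


def sectors_to_cache(patterns: Iterable[Tuple[int, ...]]) -> Set[int]:
    """Flatten the selected patterns into the set of sectors to place in the cache."""
    return {sector for run in patterns for sector in run}


def hit_ratio(trace: Sequence[int], cached: Set[int]) -> float:
    """Fraction of requests in `trace` that are served from `cached`."""
    if not trace:
        return 0.0
    return sum(1 for sector in trace if sector in cached) / len(trace)


# Hypothetical sector traces standing in for three recorded runs of one workload.
traces = [
    [10, 11, 12, 50, 13],
    [10, 11, 12, 70, 13],
    [90, 10, 11, 12, 13],
]
patterns = frequent_sequences(traces, length=3, min_support=3)
cached = sectors_to_cache(patterns)
ratio = hit_ratio([10, 11, 12, 99], cached)
print(patterns, cached, ratio)
```

Only the run (10, 11, 12) appears in all three traces, so sectors 10, 11, and 12 are cached and three of the four replayed requests hit. A frequency-only baseline would instead count individual sectors and ignore their order, which is the difference the comparison with FREQ is meant to expose.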
Keywords
Sequence Pattern Mining; Caching Algorithm; Hybrid Storage; Cache Simulator