Browse > Article
http://dx.doi.org/10.5370/JEET.2011.6.4.573

High-Performance Korean Morphological Analyzer Using the MapReduce Framework on the GPU  

Cho, Shi-Won (Division of Electronics and Electrical Engineering, Dongguk University)
Lee, Dong-Wook (Division of Electronics and Electrical Engineering, Dongguk University)
Publication Information
Journal of Electrical Engineering and Technology / v.6, no.4, 2011 , pp. 573-579 More about this Journal
Abstract
To meet the scalability and performance requirements of data analyses, which often involve voluminous data, efficient parallel or concurrent algorithms and frameworks are essential. We present a high-performance Korean morphological analyzer which employs the MapReduce framework on the graphics processing unit (GPU). MapReduce is a programming framework introduced by Google to aid the development of web search applications on a large number of central processing units (CPUs). GPUs are designed as a special-purpose co-processor. Their programming interfaces are typically formulated for graphics applications. Compared to CPUs, GPUs have greater computation power and memory bandwidth; however, GPUs are more difficult to program because of the design of their architectures. The performance of the Korean morphological analyzer using the MapReduce framework on the GPU is evaluated in comparison with the CPU-based model. The proposed Korean Morphological analyzer shows promising scalable performance on distributed computing with the GPU.
Keywords
GPGPU; MapReduce; Korean morphological analyzer; Natural language processing; Distributed processing;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
Times Cited By Web Of Science : 0
Times Cited By SCOPUS : 0
연도 인용수 순위
1 Seung-Shick Kang and Yung Taek Kim, Syllablebased Model for the Korean Morphology. The 15th International Conference on Computational Linguistics, pp. 211-226, 1995
2 Seung Hyun Yang and Young-Sum Kim, A High Speed Korean Morphological Analysis Method based on Pre-Analyzed Partial Words, Journal of the Korean Information Science Society: Software and Application, Vol. 27, No. 3., pp. 290-301, 2000   과학기술학회마을
3 Kwangseob Shim and Jaehyung Yang, MACH : A Supersonic Korean Morphological Analyzer, Proceedings of the 19th International Conference on Computational Linguistics (COLING-2002), pp. 939-945, 2002
4 OpenMPI, http://www.open-mpi.org/
5 MPICH2, http://www.mcs.anl.gov/research/projects/mpich2/
6 Parallel Virtual File System (PVFS), http://pvfs.org/
7 nVIDIA CUDA, http://developer.nvidia.com/object/cuda.html
8 MapReduce: Simplified Data Processing on Large Clusters, http://labs.google.com/papers/mapreduce.html
9 GPGPU.org, http://gpgpu.org/
10 OpenCL, http://www.khronos.org/opencl/
11 CUDA, Wikipedia, http://en.wikipedia.org/wiki/CUDA