Browse > Article
http://dx.doi.org/10.17661/jkiiect.2018.11.4.319

Design and Implementation of Conversion System Between ISO/IEC 10646 and Multi-Byte Code Set  

Kim, Chul (Department of Computer Science, Yongin University)
Publication Information
The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.11, no.4, 2018 , pp. 319-324 More about this Journal
Abstract
In this paper, we designed and implemented a code conversion method between ISO/IEC 10646 and the multi-byte code set. The Universal Multiple-Octet Coded Character Set(UCS) provides codes for more than 65,000 characters, huge increase over ASCII's code capacity of 128 characters. It is applicable to the representation, transmission, interchange, processing, storage, input and presentation of the written form of the language throughout the world. Therefore, it is so important to guide on code conversion methods to their customers during customer systems are migrated to the environment which the UCS code system is used and/or the current code systems, i.e., ASCII PC code and EBCDIC host code, are used with the UCS together. Code conversion utility including the mapping table between the UCS and IBM new host code is shown for the purpose of the explanation of code conversion algorithm and its implementation in the system. The programs are successfully executed in the real system environments and so can be delivered to the customer during its migration stage from the UCS to the current IBM code system and vice versa.
Keywords
Basic Multilingual Plane; Double Byte Character Set; ISO/IEC 10646; UCS; Universal Multiple-Octet Coded Character Set; UNICODE;
Citations & Related Records
연도 인용수 순위
  • Reference
1 ISO, ISO/IEC 10646, Information technology - Universal Coded Character Set(UCS) - part 1 : Architecture and Basic Multilingual Plane, 2017.
2 The Unicode Consortium, The Unicode Standard, Version 11.0, 2018.
3 IBM, National Language Support Reference Manual Vol. 2, 1992.
4 IBM Korea, IBM Code User Manual, 1992.
5 ISO, ISO 2022, Information processing - ISO 7-bit and 8-bit coded character set - Code extension techniques, 1986.
6 ISO, ISO/IEC 6429, Information technology-Control functions for coded character sets, 1992.
7 Korean Standards Association, Universal Coded Character Set : KS C 5700, 1995.