• Title/Summary/Keyword: EUC-KR

Search Result 8, Processing Time 0.026 seconds

String Matching Algorithm on Multi-byte Character Set Texts (다중바이트 문자집합 텍스트에서의 문자열 검색 알고리즘)

  • Kim, Eun-Sang;Kim, Jin-Wook;Park, Kun-Soo
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.10
    • /
    • pp.1015-1019
    • /
    • 2010
  • An extensive research on exact string matching has been done, but there have been few researches on the matching in multi-byte character set texts such as EUC~KR. This paper shows that false matches may occur in multi-byte character set texts such as EUC-KR when using KMP algorithm, and presents a refined KMP algorithm without false matches applying a character-based prefix function. And also, Experimental results show that our algorithm is faster than string matching algorithms of widely used editors, Vim and Emacs, and the existing automata-based algorithm.

Algorithm of XML and SOAP Messages Canonicalization (XML 및 SOAP 메시지 정규화 알고리즘)

  • Jeong, Hoe-Gyeong
    • The Journal of Engineering Research
    • /
    • v.6 no.1
    • /
    • pp.125-137
    • /
    • 2004
  • This paper implemented system that run Canonical XML and SOAP messages algorithm. So, can change to more elaborate regular document. Thus, interoperable with other application that takes W3C recommendation. Also, use in several system that physical identify is required when it exchange XML and SOAP messages for web service interoperability. Moreover, Adding the transformation ability between universal encoding scheme and EUC-KR that is internal encoding scheme should be Canonical XML and SOAP messages algorithm that is suited to internal circumstances, and this should be a foundation technique of international interoperability confirmedness.

  • PDF

Development of EUC-KR based Locale and Application Program Supporting North Korean Collating Sequence (북한 한글 순서를 지원하는 EUC-KR 기반의 로캘과 응용 프로그램 개발)

  • Jung Il-dong;Lee Jung-hwa;Kim Yong-ho;Kim Kyongsok
    • The KIPS Transactions:PartB
    • /
    • v.11B no.7 s.96
    • /
    • pp.875-884
    • /
    • 2004
  • UCS (=ISO/IEC 10646, =Unicode) will be used widely as globalization. If UCS is used for official purpose in Koreas, UCS solves a Problem in different hangeul code between South and North Korea. But, UCS is not a solution for problems in unequal order with the same character. IS0/1EC 146sl : 2000 (International String Ordering), which is a international standard for string ordering, defines a framework sorting all char-acter strings consisting multi-national scripts. Because the Common Template Table in ISO/IEC 14651 defines orders of characters, we can change orders of characters without changes of characters sequences in programs. Therefore, we can solve a ordering problem without unifying order of hangeul in South and North Korea. Functions related ISO/IEC 14651 are contained by system librarys in unix-based operating system such as Linux, Solaris and FreeBSD. We implement EUC-KR-based North Korean locale, which includes North Korean hangeul order, in Linux in order to use North Korean locale in South Korea. And we develop a program ordering strings with South and North Korean hangout order.

A Representation of Korean Human Names and Their Written in Chinese Characters Pronunciation Processing Using UTF-8 (UTF-8을 이용한 인명한자의 표현과 한글 독음 처리)

  • 조영철;유정원;변정용
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2004.05a
    • /
    • pp.575-578
    • /
    • 2004
  • 인물 정보 검색 시스템은 사용자에게 종합적이고 정확한 정보와 편리한 사용자 인터페이스를 제공해야 한다. 하지만 현재 웹 상에서 이 시스템의 방대한 화장 한자 자료를 표현하는데 EUC-KR은 많은 어려움을 가지고 있다. 그리고 어려운 인명용 한자와 전문 한자 용어로 인해 일반인들의 사용이 어려웠다. 이를 해결하기 위해 본 논문에서는 확장된 한자를 표기하기 위하여 UTF-8 인코딩 방식을 사용한다. 그리고 사용자가 알기 원하는 한자의 한글 독음 변환 처리를 통해 데이터베이스의 효율성과 사용자가 쓰기 편한 인터페이스를 제공한다.

  • PDF

Design and Implement of Canonical XML Algorithm for Digital Signature System (전자서명 시스템을 위한 XML 정규화 알고리즘 설계 및 구현)

  • 유윤식;이강찬;전종홍;이원석;정회경
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2003.10a
    • /
    • pp.504-507
    • /
    • 2003
  • These days, XML is accepted and used to e-commerce market broadly. But by reason of XML document has autonomy of expression that can exist same form logically but several other forms physically, several problems ran happen in application that judge effectiveness as physical form such as XML digital signature. Therefore, it is recommending to propose and use Canonical XML algorithm to change identical XML document physically equally logically in W3C to solve this problems. We implemented system that run Canonical XML algorithm that suggested in W3C that can change to more elaborate regular document. Thus, interpretable with other application that takes W3C recommendation. Also, as well as use in digital signature system for web service is useful, use in several system that physical identify is required when it exchanges XML document for web service interoperability are considered to be valuable. Moreover, Adding the transformation ability between universal encoding scheme and EUC-KR that is internal encoding scheme should be Canonical XML Algorithm that is suited to internal circumstances, and this should be a foundation technique of international interoperability confirmedness.

  • PDF

An Implementation of the Canonical XML Algorithm (XML 정규화 알고리즘 구현)

  • 박기식;조인준;정회경
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.8
    • /
    • pp.1698-1707
    • /
    • 2003
  • These days, XML is accepted and used to e commerce market broadly. But by reason of XML document can exist same form logically but several other forms physically, several problems can happen in application that judge effectiveness as physical form such as XML digital signature. Therefore, it is recommending to propose and use canonical XML algorithm to change identical XML document physically equally logically in W3C to solve this problems. We implemented system that nm Canonical XML algorithm that suggested in W3C that can change to mon elaborate regular document. Thus, interoperable with other application that takes W3C recommendation Also, as well as use in digital signature system for web service is useful, use in several system that physical identify is required when it exchanges na document for web service interoperability are considered to be valuable. Moreover, Adding the transformation ability between universal encoding scheme and EUC­KR that is internal encoding scheme should be Canonical XML Algorithm that is suited to internal circumstances, and this should be a foundation technique of international interoperability confirmedness.

A Study on the Improvement of the Code System in Public Information Systems (공공정보시스템 부호체계 개선방안 연구)

  • Kim, Ji-Yong;Lee, Song-Hee;Choi, Jin-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.303-306
    • /
    • 2010
  • 공공정보시스템에서 외래어 표기법에 어긋나는 귀화자 성명이나 브랜드명(법인명)을 사용할 경우에, 비표준 확장한글을 인식하지 못하여 성명이나 주소를 포함하는 글자가 "?"로 표시되는 깨짐현상이 발생하여 공공서비스 이용에 많은 불편함을 초래하고 있다. 그 원인으로는 첫째로, 외래어 표기법을 준수하지 않았다는 것, 두 번째로는 기존에 구축된 공공정보시스템 대부분이 'EUC-KR' 인코딩 방식을 사용하고 있으므로 비표준 확장한글을 표현하지 못하고 있기 때문이다. 이에 본 논문에서는 기존의 운영환경을 그대로 유지하면서, 비표준 확장한글을 지원할 수 있는 시스템 운영방안을 제안하였다. 연구결과는 실제 공공정보시스템 운영시에 적용할 수 있으며, 사용자에게 보다 나은 서비스를 제공할 수 있다.