Search | Korea Science

Streamlined GoogLeNet Algorithm Based on CNN for Korean Character Recognition (한글 인식을 위한 CNN 기반의 간소화된 GoogLeNet 알고리즘 연구)

Kim, Yeon-gyu;Cha, Eui-young
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.20 no.9
- /
- pp.1657-1665
- /
- 2016
Various fields are being researched through Deep Learning using CNN(Convolutional Neural Network) and these researches show excellent performance in the image recognition. In this paper, we provide streamlined GoogLeNet of CNN architecture that is capable of learning a large-scale Korean character database. The experimental data used in this paper is PHD08 that is the large-scale of Korean character database. PHD08 has 2,187 samples for each character and there are 2,350 Korean characters that make total 5,139,450 sample data. As a training result, streamlined GoogLeNet showed over 99% of test accuracy at PHD08. Also, we made additional Korean character data that have fonts that are not in the PHD08 in order to ensure objectivity and we compared the performance of classification between streamlined GoogLeNet and other OCR programs. While other OCR programs showed a classification success rate of 66.95% to 83.16%, streamlined GoogLeNet showed 89.14% of the classification success rate that is higher than other OCR program's rate.
https://doi.org/10.6109/jkiice.2016.20.9.1657 인용 PDF KSCI

An Implementation of Hangul Handwriting Correction Application Based on Deep Learning (딥러닝에 의한 한글 필기체 교정 어플 구현)

Jae-Hyeong Lee;Min-Young Cho;Jin-soo Kim
- Journal of Korea Society of Industrial Information Systems
- /
- v.29 no.3
- /
- pp.13-22
- /
- 2024
Currently, with the proliferation of digital devices, the significance of handwritten texts in daily lives is gradually diminishing. As the use of keyboards and touch screens increase, a decline in Korean handwriting quality is being observed across a broad spectrum of Korean documents, from young students to adults. However, Korean handwriting still remains necessary for many documentations, as it retains individual unique features while ensuring readability. To this end, this paper aims to implement an application designed to improve and correct the quality of handwritten Korean script The implemented application utilizes the CRAFT (Character-Region Awareness For Text Detection) model for handwriting area detection and employs the VGG-Feature-Extraction as a deep learning model for learning features of the handwritten script. Simultaneously, the application presents the user's handwritten Korean script's reliability on a syllable-by-syllable basis as a recognition rate and also suggests the most similar fonts among candidate fonts. Furthermore, through various experiments, it can be confirmed that the proposed application provides an excellent recognition rate comparable to conventional commercial character recognition OCR systems.
https://doi.org/10.9723/jksiis.2024.29.3.013 인용 PDF

The design and implementation of automatic translation system for hangul's romanization ( A study on mechanic conversion using transcription ) (한글 로마자 자동 표기 시스템 설계 및 구현 ( 전사법(轉寫法) 기계적 변환에 관한 연구 ))

Kim, Hong-Sop;Pak, Jong-Sop;I, Hyon-Kol
- Annual Conference on Human and Language Technology
- /
- 1993.10a
- /
- pp.437-447
- /
- 1993
국제 협약에 따라 한글에 관한 정보는 로마자로 표기해야 한다. 우리나라는 1959년 2월 로마자 표기법'을 제정, 수차례 개정을 통해 1983년 6월 문교부(Ministry Of Education)안을 발표했으나, 표지판, 역명, 교과서, 공공문서들에서 활용되었지만, 영자신문, 외국 학술지등은 M-R( Mccune-Reishauer ) 표기법을, 인명, 신문 및 방송매체 등은 혼합표기방식을 사용함으로써 인(人). 지명(地名), ID, 대표어등 정보 검색시 혼란을 야기시켰고, 개정안에 따를 표기 정정으로 수백억원에 가까운 예산을 낭비하였으며, 최근 ISO(International Standard Organization)에서는 남북한 단일화 및 기계적 변환을 요구하고 있으나, 반달표 표기곤란, 편리성 결여, 북한의 주장등의 사유로 제정등을 거론하고 있는 실정에 있다. 잘 쓰지 않는 ASCII 코드 중에서 반달점 폰트를 제작하고 단어, 문장, 문서를 STRING으로 받아 알고리즘화된 음운법칙을 적용하여 소리글자로 변환하고 MOE테이블에서 대응글자를 참조하여 기계적 변환이 가능하도록 하였으며, 세련된 디자인, 풀다운 팝업방식을 채택, 2HD 1장으로 국내최초로 개발하였다.
PDF

A study of improve vectorising technique on the internet (인터넷에서의 개선된 벡터라이징 기법에 관한 연구)

김용호;이윤배
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.6 no.2
- /
- pp.271-281
- /
- 2002
Currently, most web designers guarante high quality using bitmap graphics as fixed font size, but that has defects about file size and flexibility. Especially, to provide high quality of banner and advertise characters, after you should use a bitmap edit program, and then we should follow the method we add that program to HTML documents as bitmap data. In this study, as I show a couple of new tags in front of HTML documents, I show methods which can be presented diverse effects. When text information are stored, because we print out a screen with simple control points and outside information, it can be possible for us to express the same quality of Hangul characters like printed documents in a web browser. Regardless of the second class of platform, we can make it possible the character expression with exact character expressions and diverse effects.
PDF KSCI

Character Segmentation on Printed Korean Document Images Using a Simplification of Projection Profiles (투영 프로파일의 간략화 방법을 이용한 인쇄체 한글 문서 영상에서의 문자 분할)

Park Sang-Cheol;Kim Soo-Hyung
- The KIPS Transactions:PartB
- /
- v.13B no.2 s.105
- /
- pp.89-96
- /
- 2006
In this paper, we propose two approaches for the character segmentation on Korean document images. One is an improved version of a projection profile-based algorithm. It involves estimating the number of characters, obtaining the split points and then searching for each character's boundary, and selecting the best segmentation result. The other is developed for low quality document images where adjacent characters are connected. In this case, parts of the projection profile are cut to resolve the connection between the characters. This is called ${\alpha}$-cut. Afterwards, the revised former segmentation procedure is conducted. The two approaches have been tested with 43,572 low-quality Korean word images punted in various font styles. The segmentation accuracies of the former and the latter are 91.81% and 99.57%, respectively. This result shows that the proposed algorithm using a ${\alpha}$-cut is effective for low-quality Korean document images.
https://doi.org/10.3745/KIPSTB.2006.13B.2.089 인용 PDF KSCI

Studies about Changes in Modern Korean Font and Effect of Digital Font to Caligraphy Design Thinking of Korea (근현대 한글 컴퓨터서체의 변화와 디지털폰트가 Caligraphy에 준 영향에 관한 연구)

Lee, Sung-Soo;Choi, Byoung-Mook
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.8 no.6
- /
- pp.159-163
- /
- 2008
Recently, typography in Korea has been varied. Among them, after 2005 a new trend caligraphy has appeared and in reality people discussed that this has been occurred as a coincidence. However, as a view of one who is interested in Hangeul typography, born of Hangeul caligraphy is not a coincidence. It is because computerizing of Hangeul has been later than U.S. or Japan. Korea was colonized by Japan and experienced the Korean War and economical independence got late and cultural independence got late as well. Our language, Hangeul was independenced in the beginning of 1990's after computer was introduce and original letter of Hangeul was scanned and basic standard was built. From the end of 80's there were many efforts for independence of Hangeul but there were problems on hardware side than software side. In the beginning of 90's basic fonts such as Myungjo or Gothic of Choi Jung-ho's fonts were set for computer hardware, but later 90's can be called as a time for distributing designed fonts for that new fonts that were planned and applying new ideas on fonts were done. In 2000 the two major font company in Korea Sandoll Communication and Yoon Design Institute made a fonts such as When branding had to be done by designer, typo must be written so adjusting size and moving baseline made old style and caligraphy fonts to be born. These cycling process has been natural motive for nowadays' caligraphy and these two major companies' role has made caligraphy to be popular.
PDF

CKFont2: An Improved Few-Shot Hangul Font Generation Model Based on Hangul Composability (CKFont2: 한글 구성요소를 이용한 개선된 퓨샷 한글 폰트 생성 모델)

Jangkyoung, Park;Ammar, Ul Hassan;Jaeyoung, Choi
- KIPS Transactions on Software and Data Engineering
- /
- v.11 no.12
- /
- pp.499-508
- /
- 2022
A lot of research has been carried out on the Hangeul generation model using deep learning, and recently, research is being carried out how to minimize the number of characters input to generate one set of Hangul (Few-Shot Learning). In this paper, we propose a CKFont2 model using only 14 letters by analyzing and improving the CKFont (hereafter CKFont1) model using 28 letters. The CKFont2 model improves the performance of the CKFont1 model as a model that generates all Hangul using only 14 characters including 24 components (14 consonants and 10 vowels), where the CKFont1 model generates all Hangul by extracting 51 Hangul components from 28 characters. It uses the minimum number of characters for currently known models. From the basic consonants/vowels of Hangul, 27 components such as 5 double consonants, 11/11 compound consonants/vowels respectively are learned by deep learning and generated, and the generated 27 components are combined with 24 basic consonants/vowels. All Hangul characters are automatically generated from the combined 51 components. The superiority of the performance was verified by comparative analysis with results of the zi2zi, CKFont1, and MX-Font model. It is an efficient and effective model that has a simple structure and saves time and resources, and can be extended to Chinese, Thai, and Japanese.
https://doi.org/10.3745/KTSDE.2022.11.12.499 인용 PDF KSCI

A study on compression and decompression of hanguel and chinese character bit map font (한글 한자 비트 맵 폰트의 압축과 복원에 관한연구)

조경윤
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.33B no.4
- /
- pp.63-71
- /
- 1996
In this paper, a variable length block code for real time compression and decompression of hanguel and chinese character bit map font is proposed. The proposed code shows a good compression ratio in complete form of hangeul myoungjo and godik style and chinese batang and doddum style bit map font. Besides, a compression and decompression ASIC is designed and simulated on CAD. The 0.8 micron CMOS sea of gate is used to implement the ASIC in amount of 5,200 gates, and it runs at simple hardware and compress and decompress at 33M bit/sec at maximum, which is ideal for real time applications.
PDF

Detailed Recognition of Similar Characters Based on Optimum Linear Transform (최적선형변환에 의한 유사문자의 상세분류인식)

김형원;김성원;양윤모
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.04b
- /
- pp.493-495
- /
- 2001
본 논문에서는 문자 인식에서 두 단계의 식별과정을 통하여 인식률을 향상시키는 방법에 대하여 연구하였다. 한글 문자인식에서의 어려움은 인식대상 클래스가 많고 유사문자가 많은 반면, 여러 폰트의 글자를 하나의 글자를 하나의 클래스로 할 경우에는 그 문자의 분산이 더욱 커지게 되는 점이다. 따라서 본 연구에서는 문자의 분포를 고려하여 거리를 계산하는 Bayes에 의한 식별 함수를 1단계 인식과정에서 사용하여 1위 후보문자를 인식하였다. 2단계에서는 미리 준비된 1위 후부문자의 유사문자세트의 최적선형변환 공간에서 상세분류를 행하였다. 결과적으로 1단계의 Bayes거리반에 의한 인식률(91.1%)보다도, 또한 처음부터 모든 클래스에 대하여 최적선형변환에 의한 인식률(87.9%)보다 좋은 결과(92.9%)를 얻게되었다. 이로서 1단계의 대규모 문자세트에 대한 대분류에서는 문자의 분포를 고려하는 Bayes에 의한 인식이 유효하고, 2단계의 최적선형변환에 의한 인식은 소수의 유사문자들에 대한 변별력을 높이는데 유효함을 입증하였다.
PDF

Hangul Font Outline Vector Modification Algorithm According to Weather Information (날씨에 따른 한글 폰트 윤곽선 벡터 변형 알고리즘)

Park, Dong-Yeon;Jo, Se-Ran;Kim, Nam-Hee;Lim, Soon-Bum
- Journal of Korea Multimedia Society
- /
- v.25 no.9
- /
- pp.1328-1337
- /
- 2022
Recently, research on various font designs has been actively conducted to deliver effective emotional information in a digital environment. In this study, we propose a Hangul font outline vector modification algorithm that effectively conveys sensitivity according to weather information and can be transformed immediately. The algorithm performs a series of transformations: sets outlines according to design pattern templates, calculates the glyph's position to reflect physical rules, splits outline segments into smaller sizes and deforms the outlines. Through this, we could create several vector font designs such as humidity, cloud, wind, and snow. The usability evaluation was close to good, so it can be used in diverse ways if we improve readability and effective design expression.
https://doi.org/10.9717/kmms.2022.25.9.1328 인용 PDF KSCI HTML

Search Result 67, Processing Time 0.018 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)