• Title/Summary/Keyword: character Frequency

Analysis of Korean Language to Optimize the Hangul Character Coding for Information Processing and Communication (한글의 정보처리 및 통신용 부호 최적화를 위한 한국어 분석)

  • Hong, Wan-Pyo
    • The Journal of the Korea institute of electronic communication sciences / v.10 no.3 / pp.375-380 / 2015
  • This paper studies the Korean language in order to optimize Hangul character coding for information processing in terminal devices and for transmission over networks. It analyzes the Hangul characters that occur in Korean and the usage frequency of each, and compares the result with the Hangul characters encoded in the Korean national standard and in Unicode. The study draws on the "Modern Korean Use Frequency Rate Survey Result" issued by The National Institute of the Korean Language, which contains 58,437 Korean words in total. These 58,437 words are composed of 1,540 distinct Hangul characters. The most frequently used character is "다", which accounts for 15% of total usage; the least used is "휫", at 0.00003%. The 1,540 characters found are about 1.5 times and 7.2 times fewer than those in the Korean standard and in Unicode, respectively.
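
The kind of count described in this abstract can be sketched as follows. This is a minimal illustration, not the paper's procedure: the word list is a hypothetical stand-in for the survey data, the function name is invented here, and the count is unweighted (the abstract does not say how per-word usage counts were applied).

```python
from collections import Counter

# Range of precomposed syllables in the Unicode Hangul Syllables block.
HANGUL_FIRST, HANGUL_LAST = 0xAC00, 0xD7A3


def syllable_frequencies(words):
    """Map each Hangul syllable to (count, share of all syllable occurrences)."""
    counts = Counter(
        ch for word in words for ch in word
        if HANGUL_FIRST <= ord(ch) <= HANGUL_LAST
    )
    total = sum(counts.values())
    # most_common() orders the result from most to least frequent.
    return {ch: (n, n / total) for ch, n in counts.most_common()}


# Hypothetical sample; the real input would be the 58,437-word survey list.
sample_words = ["하다", "가다", "먹다", "학교"]
freqs = syllable_frequencies(sample_words)
print(len(freqs), "distinct syllables")
for ch, (n, share) in freqs.items():
    print(ch, n, f"{share:.1%}")
```

Weighting each word by its usage count from the survey would only require multiplying the per-word contribution before summing.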

On the Filtering of Hangul character Element with the Spatial Positioning Modulation (공간 위치 변조에 의한 한글자소의 필터링)

  • 강대수;진용옥
    • The Journal of Korean Institute of Communications and Information Sciences / v.17 no.9 / pp.1029-1039 / 1992
  • This paper presents a filtering method, operating in the frequency domain, for Hangul character recognition. The Hangul character pattern is processed with spatial positioning modulation, and the Hangul character elements, whose features vary with spatial position, are mapped onto the frequency domain; in doing so, the spatial position is normalized, and the character size is normalized in the frequency domain as well. The character elements are grouped according to the position at which they occur, and a standard pattern is set for each group. Each standard character-element pattern is then used as a character-element filter to filter the Hangul character pattern. From the filtering results, the normalized cross-correlation function and the coherence function are derived, and a classification threshold is calculated.

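
The normalized cross-correlation mentioned in the abstract above can be illustrated with a small sketch. Note the substitutions: the paper works in the frequency domain with spatial positioning modulation, whereas this example computes only a zero-lag, zero-mean normalized correlation in the spatial domain. The patterns, the function name, and the threshold value are all hypothetical.

```python
import math


def normalized_cross_correlation(a, b):
    """Zero-lag, zero-mean normalized correlation of two equal-length patterns."""
    if len(a) != len(b) or not a:
        raise ValueError("patterns must be non-empty and of equal length")
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(
        sum((x - mean_a) ** 2 for x in a) * sum((y - mean_b) ** 2 for y in b)
    )
    # A flat pattern has no variance; treat it as uncorrelated.
    return num / den if den else 0.0


# Hypothetical 3x3 binary patterns, flattened row by row.
template = [0, 1, 0, 1, 1, 1, 0, 1, 0]
candidate = [0, 1, 0, 1, 1, 0, 0, 1, 0]

THRESHOLD = 0.8  # assumed value, not taken from the paper
score = normalized_cross_correlation(template, candidate)
print("match" if score >= THRESHOLD else "no match", round(score, 3))
```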

Character based Hangeul search using Location-specific Character Frequency

  • Lee, Jung-Hwa;Lee, Jong-Min;Kim, Seong-Woo
    • Journal of information and communication convergence engineering / v.7 no.3 / pp.345-350 / 2009
  • Hangeul search functionality, including dictionary search, is used in many Hangeul applications. Existing research on Hangeul search uses the Hangeul syllable as the basic unit. However, given the characteristics of Hangeul, research that uses the Hangeul character as the basic unit is also needed. In this paper, we propose a character-based Hangeul search method that uses location-specific frequency information, and we verify the effectiveness of the proposed method through experiments.
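
To make the syllable-versus-character distinction concrete, the sketch below splits precomposed syllables into their component characters (jamo) using the standard Unicode arithmetic and counts each one by position. It assumes that "location" means the initial, medial, or final slot within a syllable, which the abstract does not confirm; the function names and sample word are hypothetical.

```python
from collections import Counter

S_BASE, S_COUNT = 0xAC00, 11172            # Hangul Syllables block
L_BASE, V_BASE, T_BASE = 0x1100, 0x1161, 0x11A7  # conjoining jamo bases
N_COUNT, T_COUNT = 588, 28                 # 21 * 28 and number of final slots


def decompose(syllable):
    """Split one precomposed syllable into (initial, medial[, final]) jamo."""
    index = ord(syllable) - S_BASE
    if not 0 <= index < S_COUNT:
        raise ValueError(f"not a precomposed Hangul syllable: {syllable!r}")
    initial, rest = divmod(index, N_COUNT)
    medial, final = divmod(rest, T_COUNT)
    jamo = [chr(L_BASE + initial), chr(V_BASE + medial)]
    if final:  # final index 0 means the syllable has no final consonant
        jamo.append(chr(T_BASE + final))
    return tuple(jamo)


def positional_frequencies(words):
    """Count jamo separately for each position within the syllable."""
    counts = Counter()
    for word in words:
        for ch in word:
            if S_BASE <= ord(ch) < S_BASE + S_COUNT:
                for pos, jamo in zip(("initial", "medial", "final"), decompose(ch)):
                    counts[(pos, jamo)] += 1
    return counts


counts = positional_frequencies(["한글"])  # hypothetical input
for (pos, jamo), n in sorted(counts.items()):
    print(pos, jamo, n)
```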

Underwater Acoustic Characteristics and Application to Seabed Survey (해저탐사에 적용되는 음파특성)

  • Kim Seong-Ryul;Lee Yong-Kuk;Jung Baek-Hun
    • Korean Society of Petroleum Geology: Conference Proceedings (한국석유지질학회:학술대회논문집) / 2005.09a / pp.9-16 / 2005
  • Electromagnetic (light) waves penetrate the media, i.e., water and sea-bottom layers, only to a limited extent because of high energy attenuation, whereas acoustic (sound) waves serve as a good messenger for gathering information on underwater targets. Acoustic methods are therefore applied in almost all ocean equipment and technology for in-water and sub-bottom surveys. In general, the character of sound is controlled by its frequency: when the sound source is low-frequency, penetration is high and resolution is low, and at high frequency the behavior is reversed. A property common to both light and sound is that energy is damped as travel distance increases.

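
The frequency and resolution trade-off described above follows from the relation wavelength = sound speed / frequency: a lower frequency means a longer wavelength, and therefore coarser resolvable detail. The sketch below only evaluates that relation. The nominal sound speed of 1500 m/s in seawater and the example frequencies are assumptions, not values from the paper.

```python
def wavelength_m(frequency_hz, sound_speed_m_s=1500.0):
    """Acoustic wavelength in metres; 1500 m/s is an assumed nominal seawater value."""
    if frequency_hz <= 0:
        raise ValueError("frequency must be positive")
    return sound_speed_m_s / frequency_hz


# Hypothetical source frequencies, from low to high.
for f in (3_500, 38_000, 200_000):
    print(f"{f:>7} Hz -> wavelength {wavelength_m(f) * 100:.2f} cm")
```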

Character Extraction Using Wavelet Transform and Fuzzy Clustering (웨이브렛 변환과 퍼지 군집화를 활용한 문자추출)

  • Hwang, Jung-Won;Hwang, Jae-Ho
    • Journal of the Institute of Electronics Engineers of Korea SP / v.44 no.4 s.316 / pp.93-100 / 2007
  • In this paper, a novel approach based on the wavelet transform is proposed for processing scraped characters represented in a digital image. The basic idea is that a scraped character is described by its textured neighborhood and is decomposed, together with its background region, into multiresolution features at different levels. The image is first decomposed into sub-bands by applying Daubechies wavelets. Character features are extracted from the low-frequency sub-bands by partitioning, FCM clustering, and area-based region processing. The high-frequency sub-bands are activated by applying local energy density over a moving mask. The features are then synthesized to reconstruct the original image state through the inverse wavelet transform; the background region is eliminated and the character is extracted. The experimental results demonstrate the effectiveness of the proposed method.

Acoustic Characteristics Applied to Seafloor and Sub-bottom Survey (해저지형·지층탐사에 적용되는 음파특성)

  • Kim, Seong-Ryul;Lee, Yong-Kuk;Jeong, Baek-Hun
    • Korean Earth Science Society: Conference Proceedings (한국지구과학회:학술대회논문집) / 2005.09a / pp.42-50 / 2005
  • Electromagnetic (light) waves penetrate the media, i.e., water and sea-bottom layers, only to a limited extent because of high energy attenuation, whereas acoustic (sound) waves serve as a good messenger for gathering information on underwater targets. Acoustic methods are therefore applied in almost all ocean equipment and technology for in-water and sub-bottom surveys. In general, the character of sound is controlled by its frequency: when the sound source is low-frequency, penetration is high and resolution is low, and at high frequency the behavior is reversed. A property common to both light and sound is that energy is damped as travel distance increases.


Thinning algorithm of hand-printed korean character using wavelet transform (웨이브렛 변환을 이용한 필기체 한글 문자의 세선화 알고리즘)

  • 길문호;유기형;박정호;최재호;곽훈성
    • Proceedings of the IEEK Conference / 1998.06a / pp.745-748 / 1998
  • Recently, the wavelet transform has come into use in image and voice processing. We propose a thinning algorithm that uses the wavelet transform. The wavelet transform separates a signal into low-frequency and high-frequency components in the spatial and frequency domains. After wavelet decomposition, more than 90 percent of the energy is contained in the lowest frequency band. Therefore, for images with a large difference in gray value between foreground and background, such as character images, we can work more accurately in the lowest frequency band. The lowest frequency band holds the wavelet transform significant coefficients (WTS) required by the proposed thinning algorithm. The methods of papers [3][5][7][8] cannot separate the consonants and vowels of Korean characters, because Korean characters have a structural feature. The method in this paper can separate consonants and vowels. Simulations were run on the low-frequency image; the data can be reduced to $1/4^{n}$ at level n, and the time complexity can also be reduced (the factor given is 3/8).

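
The $1/4^{n}$ data reduction and the concentration of energy in the lowest band can be illustrated with a small decomposition. This sketch uses the orthonormal Haar wavelet because it is the simplest to write out by hand; the abstract does not say which wavelet the authors used. The test image and all function names are hypothetical.

```python
def haar_step(image):
    """One level of a 2-D orthonormal Haar transform on an even-sized image.

    Returns (approx, col_detail, row_detail, diag_detail); each band has
    half the width and half the height of the input.
    """
    approx, col_detail, row_detail, diag_detail = [], [], [], []
    bands = (approx, col_detail, row_detail, diag_detail)
    for r in range(0, len(image), 2):
        rows = ([], [], [], [])
        for c in range(0, len(image[0]), 2):
            a, b = image[r][c], image[r][c + 1]
            d, e = image[r + 1][c], image[r + 1][c + 1]
            rows[0].append((a + b + d + e) / 2)  # scaled local average (lowest band)
            rows[1].append((a - b + d - e) / 2)  # change across columns
            rows[2].append((a + b - d - e) / 2)  # change across rows
            rows[3].append((a - b - d + e) / 2)  # diagonal change
        for band, row in zip(bands, rows):
            band.append(row)
    return bands


def energy(band):
    """Sum of squared coefficients."""
    return sum(v * v for row in band for v in row)


# Hypothetical 8x8 "character image": a bright stroke on a dark background.
image = [[220 if 2 <= c <= 4 else 20 for c in range(8)] for _ in range(8)]

approx = image
for level in (1, 2):
    approx = haar_step(approx)[0]
    size_ratio = len(approx) * len(approx[0]) / 64
    share = energy(approx) / energy(image)
    print(f"level {level}: size ratio {size_ratio}, energy share {share:.3f}")
```

Because the transform is orthonormal, the energies of the four bands at each level add up to the energy of the input, so the share held by the lowest band can be read off directly.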

Consideration of CJK Joint Hanja Unicode when is used in AMI/HDB-3 Line Coding (AMI/HDB-3 회선부호화와 한·중·일 한자 유니코드 체계 고찰)

  • Tai, Dong-Zhen;Hong, Wan Pyo
    • The Journal of the Korea institute of electronic communication sciences / v.8 no.7 / pp.1011-1015 / 2013
  • This paper analyzes the rate at which CJK joint Chinese character Unicode codes violate the source coding rule. The 150 Chinese characters in Unicode with relatively high usage frequency were chosen for study; together they account for about 50% of the total usage frequency of Chinese characters. AMI/HDB-3 line coding/scrambling and the HDLC protocol were applied in the study. According to the analysis, 77 of the 150 characters violate the rule, representing 29% of usage frequency. Therefore, if the codes of these 77 characters are replaced with codes that conform to the source coding rule, the processing rate of the line coder can be improved by about 37%.
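
For readers unfamiliar with the line codes named above, the following sketch shows textbook AMI and HDB-3 encoding of a bit sequence. It does not reproduce the paper's "source coding rule" or its violation analysis, which the abstract does not define; the starting polarity and the initial even pulse count are conventions chosen here, and the function names are hypothetical.

```python
def ami_encode(bits):
    """AMI: 0 -> no pulse; 1 -> pulse with alternating polarity (+1, -1, ...)."""
    out, last = [], -1
    for b in bits:
        if b:
            last = -last
            out.append(last)
        else:
            out.append(0)
    return out


def hdb3_encode(bits):
    """HDB-3: AMI, except that every run of four zeros is replaced.

    Odd number of pulses since the last substitution  -> 000V
    Even number of pulses since the last substitution -> B00V
    V repeats the polarity of the preceding pulse (a deliberate violation).
    """
    out, last, pulses, zeros = [], -1, 0, 0
    for b in bits:
        if b:
            last = -last
            out.append(last)
            pulses += 1
            zeros = 0
            continue
        out.append(0)
        zeros += 1
        if zeros == 4:
            if pulses % 2 == 0:
                last = -last      # B obeys the normal alternation
                out[-4] = last
            out[-1] = last        # V has the same polarity as the last pulse
            pulses, zeros = 0, 0
    return out


bits = [1, 0, 0, 0, 0, 1, 1, 0, 0, 0, 0]  # hypothetical input
print(ami_encode(bits))
print(hdb3_encode(bits))
```

Plain AMI leaves long zero runs without any pulses, which is what HDB-3 substitution avoids; bit patterns that never contain four consecutive zeros pass through HDB-3 unchanged.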

An On-line Speech and Character Combined Recognition System for Multimodal Interfaces (멀티모달 인터페이스를 위한 음성 및 문자 공용 인식시스템의 구현)

  • 석수영;김민정;김광수;정호열;정현열
    • Journal of Korea Multimedia Society / v.6 no.2 / pp.216-223 / 2003
  • In this paper, we present SCCRS (Speech and Character Combined Recognition System) for speaker- and writer-independent on-line multimodal interfaces. The CHMM (Continuous Hidden Markov Model) is generally known to be a very useful method for both speech recognition and on-line character recognition. In the proposed method, the same CHMM is applied to both speech and character recognition so as to construct a combined system. For this purpose, 115 CHMMs with 3 states and 9 transitions are constructed using the MLE (Maximum Likelihood Estimation) algorithm. Different features are extracted for speech and for character recognition: MFCC (Mel Frequency Cepstrum Coefficient) is used for speech in preprocessing, while a position parameter is used for cursive characters. At the recognition step, the proposed SCCRS employs OPDP (One Pass Dynamic Programming) so as to be a practical combined recognition system. Experimental results show recognition rates of 51.65%, 88.6%, 85.3%, and 85.6% for voice phonemes, voice words, cursive character graphemes, and cursive character words, respectively, without using any language model. This demonstrates the efficiency of the proposed system.


Analysis of Young Adult Information Concentrating on the Significance of Application of Cartoon Characters on Garments (캐릭터를 의류상품에 응용하기 위한 신세대 소비자 정보 분석)

  • 김칠순;조예진
    • Journal of the Korean Society of Costume / v.51 no.4 / pp.31-42 / 2001
  • The purpose of this study was to develop a large, representative database for character goods marketing strategy. The study determined character name awareness in relation to segmented distribution regions and demographic variables such as sex and age. The authors also analyzed preferred designs. A total of 360 questionnaires were distributed, and 359 reliable ones were used for statistical analysis. The SAS statistical package was used, including frequency tables, the chi-square test, factor analysis, and Kendall's correlation analysis. The results are as follows. Character name awareness involves "character name recognition", measured by asking subjects to identify character names from 50 given names. In the recognition test, "Tele-tubbies" was the dominant commercial character name and "Sailer-moon" was the dominant animation character. Character recognition differed significantly across the segmented distribution regions, the three age groups, and the sexes. People considered design first and color second when purchasing character goods. The most favored place for a character to be stitched on a garment was the center front of a T-shirt/dress shirt, the center back of a jumper/jacket, and the back pocket of trousers/skirts. A chi-square test showed that the preferred placement on trousers/skirts was related to age.
