• Title/Summary/Keyword: numerals


Feature Extraction and Statistical Pattern Recognition for Image Data using Wavelet Decomposition

  • Kim, Min-Soo;Baek, Jang-Sun
    • Communications for Statistical Applications and Methods / v.6 no.3 / pp.831-842 / 1999
  • We propose a wavelet-decomposition feature extraction method for handwritten character recognition. We study the characteristics of the proposed method by comparing the recognition rates obtained with the original image features against those obtained with features selected by wavelet decomposition. LDA (Linear Discriminant Analysis), QDA (Quadratic Discriminant Analysis), RDA (Regularized Discriminant Analysis), and NN (neural network) classifiers are used to compute the recognition rates. The experiment uses 6,000 handwritten numerals from CENPARMI at Concordia University. We found that the set of significantly selected wavelet-decomposed features yields a higher recognition rate than the original image features.

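The wavelet-decomposition idea in the abstract above can be illustrated with a minimal sketch. It assumes a Haar wavelet and keeps only the low-frequency (approximation) subband as the feature vector; the abstract names neither the wavelet family nor the feature selection rule, so both choices, and the function names, are hypothetical.

```python
def haar_step_1d(values):
    """One level of an averaging Haar transform on an even-length list."""
    half = len(values) // 2
    approx = [(values[2 * i] + values[2 * i + 1]) / 2 for i in range(half)]
    detail = [(values[2 * i] - values[2 * i + 1]) / 2 for i in range(half)]
    return approx, detail


def haar_2d_approximation(image):
    """Return the approximation (LL) subband of a one-level 2D Haar decomposition.

    `image` is a list of equal-length rows with even width and even height.
    """
    # Transform each row and keep only the approximation half.
    row_approx = [haar_step_1d(row)[0] for row in image]
    # Transform each column of the row-transformed image.
    columns = [list(col) for col in zip(*row_approx)]
    col_approx = [haar_step_1d(col)[0] for col in columns]
    # Transpose back to row-major order.
    return [list(row) for row in zip(*col_approx)]


def wavelet_features(image, levels=1):
    """Flatten the LL subband after `levels` decompositions into a feature list."""
    band = image
    for _ in range(levels):
        band = haar_2d_approximation(band)
    return [value for row in band for value in row]


if __name__ == "__main__":
    toy_image = [
        [1, 1, 0, 0],
        [1, 1, 0, 0],
        [0, 0, 2, 2],
        [0, 0, 2, 2],
    ]
    print(wavelet_features(toy_image))
```

Each decomposition level reduces the number of features by a factor of four, which is one reason a reduced wavelet feature set can be cheaper to classify than the raw pixels.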

Arabic-Numerals to Korean Transliteration Disambiguation using BERT (BERT를 이용한 숫자-한국어 음역 모호성 해소)

  • Park, Jeong Yeon;Yuk, Dae Bum;Lee, Jae Sung
    • Annual Conference on Human and Language Technology / 2020.10a / pp.42-44 / 2020
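  • A TTS (Text-to-Speech) system needs to convert non-Hangul strings into Hangul. Such strings include numerals, special characters, and similar symbols. Numerals in particular are problematic because their pronunciation changes with the context in which they are used. This paper introduces a method that resolves the ambiguity of numeral transliteration, where pronunciation depends on context, by using deep learning instead of existing rule-based methods and methods that can exploit only limited context information.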


The Efficient Feature Extraction of Handwritten Numerals in GLVQ Clustering Network (GLVQ클러스터링을 위한 필기체 숫자의 효율적인 특징 추출 방법)

  • Jeon, Jong-Won;Min, Jun-Yeong
    • The Transactions of the Korea Information Processing Society / v.2 no.6 / pp.995-1001 / 1995
  • A typical pattern recognition system consists of pre-processing, feature extraction (algorithm), and classification or recognition. In classification, when widely varying patterns exist in the same category, we need clustering, which groups similar patterns. There are two approaches to clustering. The first is statistical, such as the k-means and ISODATA algorithms. The second is the neural network approach, such as T. Kohonen's LVQ (Learning Vector Quantization). Nikhil R. Pal et al. proposed GLVQ (Generalized LVQ, 1993). This paper suggests efficient feature extraction methods for handwritten numerals in the GLVQ clustering network. We use handwritten numeral data from 21 writers (i.e., 200 patterns) and compare the proportion of misclassified patterns for each feature extraction method. When the projection combination method is used, the classification rate is 98.5%.

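The abstract above does not define its "projection combination method". As a rough illustration of projection-based features in general, the sketch below concatenates the row and column pixel sums of a binary image; treating this concatenation as the "combination" is an assumption, and the function name is hypothetical.

```python
def projection_features(image):
    """Concatenate horizontal and vertical projection profiles of a binary image.

    `image` is a list of equal-length rows of 0/1 pixel values.
    Assumption: the profiles are simply joined end to end.
    """
    row_profile = [sum(row) for row in image]           # ink per row
    column_profile = [sum(col) for col in zip(*image)]  # ink per column
    return row_profile + column_profile


if __name__ == "__main__":
    toy_image = [
        [0, 1, 1],
        [0, 1, 0],
    ]
    print(projection_features(toy_image))
```

A feature vector like this could then be fed to any clustering network, GLVQ included; the clustering step itself is not shown here.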

A pilot implementation of Korean in Database Semantics: focusing on numeral-classifier construction (데이터베이스 의미론을 이용한 한국어 구현 시론: 수사-분류사 구조를 중심으로)

  • Choe, Jae-Woong
    • Korean Journal of Cognitive Science / v.18 no.4 / pp.457-483 / 2007
  • Database Semantics (DBS) attempts to provide a comprehensive and integrated approach to human communication which seeks theory-implementation transparency. Two key components of DBS are the Word Bank as a data structure and Left-Associative Grammar (LAG) as an algorithm. This study aims to provide a pilot implementation of Korean in DBS. First, using a simple Korean sentence as an example, it is shown how the three separate modules of grammar in DBS, namely Hear, Think, and Speak, combine to form an integrated system that simulates a cognitive agent. Second, we provide a detailed analysis of a structure characteristic of Korean that involves numerals, classifiers, and nouns, thereby illustrating how DBS can be applied to Korean. We also discuss an issue raised in the literature concerning a problem that arises when the LAG algorithm is applied to the analysis of a head-final language like Korean, and then discuss a possible solution to the problem.


Recognition of Handwritten Numerals using Hybrid Features And Combined Classifier (복합 특징과 결합 인식기에 의한 필기체 숫자인식)

  • 박중조;송영기;김경민
    • Journal of the Korea Institute of Information and Communication Engineering / v.5 no.1 / pp.14-22 / 2001
  • Off-line handwritten numeral recognition is a very difficult task, and it is hard to achieve high recognition rates using a single feature and a single classifier, since handwritten numerals contain many pattern variations that mostly depend on individual writing styles. In this paper, we propose a handwritten numeral recognition system using hybrid features and a combined classifier. To improve the recognition rate, we select mutually helpful features (directional features, a crossing-point feature, and mesh features) and build three new hybrid feature sets from them. These hybrid feature sets capture both the local and global characteristics of the input numeral images. We implement the combined classifier by combining three neural network classifiers, using the fuzzy integral for multiple-network fusion. To verify the performance of the proposed system, we performed experiments with the unconstrained handwritten numeral database of Concordia University, Canada. Our method achieved a recognition rate of 97.85%.


Design of a Fuzzy Classifier by Repetitive Analyses of Multifeatures (다중 특징의 반복적 분석에 의한 퍼지 분류기의 설계)

  • 신대정;나승유
    • Journal of the Korean Institute of Intelligent Systems / v.6 no.3 / pp.14-24 / 1996
  • A fuzzy classifier which needs various analyses of features using genetic algorithms is proposed. The fuzzy classifier has a simple structure, which contains a classification part based on fuzzy logic theory and a rule generation part using genetic algorithms. The rule generation part determines the optimal fuzzy membership functions and the inclusion or exclusion of each feature in the fuzzy classification rules. We analyze the recognition rate of each object and, where necessary, repeatedly add finer features for objects that have a large misclassification rate. We also introduce a repetitive analysis method to minimize the string and population sizes and to improve recognition rates. This classifier is applied to three examples: the classification of iris data, the discrimination of thyroid gland cancer cells, and the recognition of confusing handwritten and printed numerals. In the recognition of confusing handwritten and printed numerals, each sample numeral is classified into one of the groups, which are divided according to the sample structure. The fuzzy classifier proposed in this paper has recognition rates of 98.67% for iris data, 98.25% for thyroid gland cancer cells, and 96.3% for confusing handwritten and printed numerals.


FUSION BASED RECOGNITION METHOD FOR HANDWRITTEN NUMERALS ON BANK SHEETS (은행 수납장표 자동인식을 위한 융합기반 필기 숫자 인식방법)

  • 전효세;소영성
    • Proceedings of the Korean Information Science Society Conference / 1999.10b / pp.449-451 / 1999
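  • Many handwritten numeral recognition methods have been proposed so far, but few published methods are suitable for recognizing numerals on bank receipt forms, which demands a high degree of reliability. In this study we propose a handwritten numeral recognition system that fuses the results of three classifiers to reach a reliability close to 100%. Features are extracted with the Karhunen-Loeve Transform (KLT), and an error back-propagation neural network (BP), an SOFM with LVQ applied (SOFM-LVQ), and Weighted Several Nearest Neighbor (WSNN) are used as classifiers. For fusion, unanimous voting rather than majority voting is applied to raise reliability. Experiments on the ETL-6 database recorded a high reliability of 99.95%.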

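The difference between unanimous and majority voting in the abstract above can be sketched as follows. The sketch assumes that a sample is rejected when the classifiers do not all agree, and that "reliability" means the fraction of accepted samples that are correct; the abstract does not define either, and the function names are hypothetical.

```python
from collections import Counter


def unanimous_vote(predictions):
    """Return the common label if every classifier agrees, else None (reject)."""
    first = predictions[0]
    return first if all(label == first for label in predictions) else None


def majority_vote(predictions):
    """Return the label chosen by more than half of the classifiers, else None."""
    label, count = Counter(predictions).most_common(1)[0]
    return label if count > len(predictions) / 2 else None


def reliability(decisions, truths):
    """Fraction of accepted (non-None) decisions that match the true label.

    Assumption: rejected samples are excluded from the denominator.
    """
    accepted = [(d, t) for d, t in zip(decisions, truths) if d is not None]
    if not accepted:
        raise ValueError("no accepted decisions")
    return sum(d == t for d, t in accepted) / len(accepted)


if __name__ == "__main__":
    # Hypothetical outputs of three classifiers on two samples.
    print(unanimous_vote(["3", "3", "3"]))  # all agree, so the label is accepted
    print(unanimous_vote(["3", "3", "8"]))  # disagreement, so the sample is rejected
    print(majority_vote(["3", "3", "8"]))   # two of three agree
```

Under these assumptions, unanimous voting trades a higher rejection rate for fewer accepted errors, which matches the abstract's emphasis on reliability for bank forms.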

Combining Different Distance Measurements Methods with Dempster-Shafer-Theory for Recognition of Urdu Character Script

  • Khan, Yunus;Nagar, Chetan;Kaushal, Devendra S.
    • International Journal of Ocean System Engineering / v.2 no.1 / pp.16-23 / 2012
  • In this paper we discuss a new methodology for an Urdu character recognition system using Dempster-Shafer theory, which can effectively estimate the similarity ratings between a character to be recognized and the sample characters in the character database. Characters are recognized with five probability calculation methods (similarity, Hamming, linear correlation, cross-correlation, and nearest neighbor) together with the Dempster-Shafer theory of belief functions. The main objective of this paper is to recognize Urdu letters and numerals by using these five similarity and dissimilarity algorithms to measure the similarity between the given image and the standard template in the character recognition system. We develop a method that combines the results of the different distance measurement methods using Dempster-Shafer theory. This idea enables us to obtain a single precise result. We observed that combining these results ultimately enhanced the success rate.
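Dempster's rule of combination, the standard way to fuse two belief assignments in Dempster-Shafer theory, can be sketched as below. How the paper converts its five distance measures into mass functions is not stated in the abstract, so the example masses and the function name are hypothetical.

```python
def combine_masses(m1, m2):
    """Combine two mass functions with Dempster's rule.

    Each mass function maps a frozenset of candidate labels to a mass,
    with the masses summing to 1.
    """
    combined = {}
    conflict = 0.0
    for set_a, mass_a in m1.items():
        for set_b, mass_b in m2.items():
            intersection = set_a & set_b
            if intersection:
                combined[intersection] = combined.get(intersection, 0.0) + mass_a * mass_b
            else:
                # Mass assigned to contradictory hypotheses.
                conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict")
    # Renormalize so the remaining masses sum to 1.
    return {key: value / (1.0 - conflict) for key, value in combined.items()}


if __name__ == "__main__":
    A, B = frozenset({"alif"}), frozenset({"bay"})
    either = A | B  # ignorance: the character could be either one
    # Hypothetical evidence from two distance measures.
    measure_1 = {A: 0.6, either: 0.4}
    measure_2 = {B: 0.5, either: 0.5}
    for labels, mass in combine_masses(measure_1, measure_2).items():
        print(sorted(labels), round(mass, 4))
```

Because the rule is associative, evidence from more than two measures can be folded in one source at a time.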

A Study on Performance Evaluation of Clustering Algorithms using Neural and Statistical Method (신경망 및 통계적 방법에 의한 클러스터링 성능평가)

  • 윤석환;민준영;신용백
    • Journal of Korean Society of Industrial and Systems Engineering / v.19 no.37 / pp.41-51 / 1996
  • This paper evaluates the clustering performance of a neural network method and a statistical method. The algorithms used in this paper are GLVQ (Generalized Learning Vector Quantization) as the neural method and the k-means algorithm as the statistical clustering method. To compare the two methods, we calculate Rand's c statistic. The mean c value obtained with GLVQ is higher than that obtained with the k-means algorithm, while the standard deviation of the c value is lower. The experimental data sets were Fisher's IRIS data and patterns extracted from handwritten numerals.

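Assuming "Rand's c statistic" in the abstract above refers to the Rand index (the fraction of sample pairs on which two partitions agree), it can be computed as in this minimal sketch. The function name is hypothetical.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Fraction of sample pairs that two labelings treat consistently.

    A pair counts as an agreement when both labelings put the two samples
    in the same cluster, or both put them in different clusters.
    """
    if len(labels_a) != len(labels_b) or len(labels_a) < 2:
        raise ValueError("need two labelings of the same length (at least 2)")
    agreements = 0
    pairs = 0
    for i, j in combinations(range(len(labels_a)), 2):
        same_in_a = labels_a[i] == labels_a[j]
        same_in_b = labels_b[i] == labels_b[j]
        agreements += same_in_a == same_in_b
        pairs += 1
    return agreements / pairs


if __name__ == "__main__":
    true_classes = [0, 0, 1, 1]
    clustering = [1, 1, 0, 0]  # same partition, different cluster names
    print(rand_index(true_classes, clustering))
```

The index depends only on how samples are grouped, not on the cluster names, which makes it suitable for comparing clustering output against known classes.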

Recognition of Handwritten Numerals using Eigenvectors (고유벡터를 이용한 필기체 숫자인식)

  • 박중조;김경민;송명현
    • Journal of the Korea Institute of Information and Communication Engineering / v.6 no.6 / pp.986-991 / 2002
  • This paper presents an off-line handwritten numeral recognition method using eigenvectors. In this method, numeral features are extracted statistically with eigenvectors obtained through the KL transform, and the input numeral is recognized in the feature space by a nearest-neighbor classifier. In our feature extraction method, the basis vectors that best express the properties of each numeral type within an extensive database of sample numeral images are calculated, and the numeral features are obtained using these basis vectors. In experiments with the unconstrained handwritten numeral database of Concordia University, we achieved a recognition rate of 96.2%.