• Title/Summary/Keyword: recognition distance

Search Result 1,007, Processing Time 0.026 seconds

Recognition Performance Improvement of Unsupervised Limabeam Algorithm using Post Filtering Technique

  • Nguyen, Dinh Cuong;Choi, Suk-Nam;Chung, Hyun-Yeol
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.8 no.4
    • /
    • pp.185-194
    • /
    • 2013
  • Abstract- In distant-talking environments, speech recognition performance degrades significantly due to noise and reverberation. Recent work of Michael L. Selzer shows that in microphone array speech recognition, the word error rate can be significantly reduced by adapting the beamformer weights to generate a sequence of features which maximizes the likelihood of the correct hypothesis. In this approach, called Likelihood Maximizing Beamforming algorithm (Limabeam), one of the method to implement this Limabeam is an UnSupervised Limabeam(USL) that can improve recognition performance in any situation of environment. From our investigation for this USL, we could see that because the performance of optimization depends strongly on the transcription output of the first recognition step, the output become unstable and this may lead lower performance. In order to improve recognition performance of USL, some post-filter techniques can be employed to obtain more correct transcription output of the first step. In this work, as a post-filtering technique for first recognition step of USL, we propose to add a Wiener-Filter combined with Feature Weighted Malahanobis Distance to improve recognition performance. We also suggest an alternative way to implement Limabeam algorithm for Hidden Markov Network (HM-Net) speech recognizer for efficient implementation. Speech recognition experiments performed in real distant-talking environment confirm the efficacy of Limabeam algorithm in HM-Net speech recognition system and also confirm the improved performance by the proposed method.

A Study on Processing of Speech Recognition Korean Words (한글 단어의 음성 인식 처리에 관한 연구)

  • Nam, Kihun
    • The Journal of the Convergence on Culture Technology
    • /
    • v.5 no.4
    • /
    • pp.407-412
    • /
    • 2019
  • In this paper, we propose a technique for processing of speech recognition in korean words. Speech recognition is a technology that converts acoustic signals from sensors such as microphones into words or sentences. Most foreign languages have less difficulty in speech recognition. On the other hand, korean consists of vowels and bottom consonants, so it is inappropriate to use the letters obtained from the voice synthesis system. That improving the conventional structure speech recognition can the correct words recognition. In order to solve this problem, a new algorithm was added to the existing speech recognition structure to increase the speech recognition rate. Perform the preprocessing process of the word and then token the results. After combining the result processed in the Levenshtein distance algorithm and the hashing algorithm, the normalized words is output through the consonant comparison algorithm. The final result word is compared with the standardized table and output if it exists, registered in the table dose not exists. The experimental environment was developed by using a smartphone application. The proposed structure shows that the recognition rate is improved by 2% in standard language and 7% in dialect.

Distance Measurement Using the Kinect Sensor with Neuro-image Processing

  • Sharma, Kajal
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.4 no.6
    • /
    • pp.379-383
    • /
    • 2015
  • This paper presents an approach to detect object distance with the use of the recently developed low-cost Kinect sensor. The technique is based on Kinect color depth-image processing and can be used to design various computer-vision applications, such as object recognition, video surveillance, and autonomous path finding. The proposed technique uses keypoint feature detection in the Kinect depth image and advantages of depth pixels to directly obtain the feature distance in the depth images. This highly reduces the computational overhead and obtains the pixel distance in the Kinect captured images.

Surface Curvature Based 3D Pace Image Recognition Using Depth Weighted Hausdorff Distance (표면 곡률을 이용하여 깊이 가중치 Hausdorff 거리를 적용한 3차원 얼굴 영상 인식)

  • Lee Yeung hak;Shim Jae chang
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.1
    • /
    • pp.34-45
    • /
    • 2005
  • In this paper, a novel implementation of a person verification system based on depth-weighted Hausdorff distance (DWHD) using the surface curvature of the face is proposed. The definition of Hausdorff distance is a measure of the correspondence of two point sets. The approach works by finding the nose tip that has a protrusion shape on the face. In feature recognition of 3D face image, one has to take into consideration the orientated frontal posture to normalize after extracting face area from original image. The binary images are extracted by using the threshold values for the curvature value of surface for the person which has differential depth and surface characteristic information. The proposed DWHD measure for comparing two pixel sets were used, because it is simple and robust. In the experimental results, the minimum curvature which has low pixel distribution achieves recognition rate of 98% among the proposed methods.

  • PDF

The Recognition of Grapheme 'ㅁ', 'ㅇ' Using Neighbor Angle Histogram and Modified Hausdorff Distance (이웃 각도 히스토그램 및 변형된 하우스도르프 거리를 이용한 'ㅁ', 'ㅇ' 자소 인식)

  • Chang Won-Du;Kim Ha-Young;Cha Eui-Young;Kim Do-Hyeon
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.2
    • /
    • pp.181-191
    • /
    • 2005
  • The classification error of 'ㅁ', 'ㅇ' is one of the main causes of incorrect recognition in Korean characters, but there haven't been enough researches to solve this problem. In this paper, a new feature extraction method from Korean grapheme is proposed to recognize 'ㅁ', 'ㅇ'effectively. First, we defined an optimal neighbor-distance selection measure using modified Hausdorff distance, which we determined the optimal neighbor-distance by. And we extracted neighbor-angle feature which was used as the effective feature to classify the two graphemes 'ㅁ', 'ㅇ'. Experimental results show that the proposed feature extraction method worked efficiently with the small number of features and could recognize the untrained patterns better than the conventional methods. It proves that the proposed method has a generality and stability for pattern recognition.

  • PDF

Image Recognition by Using Hybrid Coefficient Measure of Correlation and Distance (상관계수과 거리계수의 조합형 척도를 이용한 영상인식)

  • Hong, Seong-Jun;Cho, Yong-Hyun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.20 no.3
    • /
    • pp.343-347
    • /
    • 2010
  • This paper presents an efficient image recognition method using the hybrid coefficient measure of correlation and distance. The correlation coefficient is applied to measure the statistical similarity by using Pearson coefficient, and distance coefficient is also applied to measure the spacial similarity by using city-block. The total similarity among images is calculated by extending the similarity between the feature vectors, then the feature vectors can be extracted by PCA and ICA, respectively. The proposed method has been applied to the problem for recognizing the 960(30 persons * 4 expressions * 2 lights * 4 poses) facial images of 40*50 pixels. The experimental results show that the proposed method of ICA has a superior recognition performances than the method using PCA, and is affected less by the environmental influences so as lighting.

Development of Augmented Reality Based Electronic Circuit Education System (증강현실 기반 전자회로 교육 시스템 개발)

  • Oh, DoBong;Shim, SeungHwan;Choi, HanGo
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.9 no.12
    • /
    • pp.333-338
    • /
    • 2020
  • This paper proposes an augmented reality-based electronic circuit education system as a way for electronic circuit education, which is the basis of ICT convergence technology field. It consists of a hardware module that can identify the actual circuit and a mobile educational content that can check the current flow, input, output, and measured value by applying augmented reality technology. An experiment was conducted on image recognition, which is the main performance, for the purpose of stable operation of the system, and as the experimental method the recognition rate was measured by changing the distance between the hardware module and the mobile device to a certain interval. As a result of the experiment, the recognition rate was 100 percent at a distance of 25[Cm] or higher, and it was confirmed that the recognition rate decreased by 12% at a distance below 25[Cm], which can be said to be the effect of an error that results in image loss taken due to close distance. In the future, we plan to apply the education system presented in this paper to classes, which increases the efficiency of classes and improve students' interest and understanding of the subject.

A New 3D Active Camera System for Robust Face Recognition by Correcting Pose Variation

  • Kim, Young-Ouk;Jang, Sung-Ho;Park, Chang-Woo;Sung, Ha-Gyeong;Kwon, Oh-Yun;Paik, Joon-Ki
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1485-1490
    • /
    • 2004
  • Recently, we have remarkable developments in intelligent robot systems. The remarkable features of intelligent robot are that it can track user, does face recognition and vital for many surveillance based systems. Advantage of face recognition when compared with other biometrics recognition is that coerciveness and contact that usually exist when we acquire characteristics do not exist in face recognition. However, the accuracy of face recognition is lower than other biometric recognition due to decrease in dimension from of image acquisition step and various changes associated with face pose and background. Factors that deteriorate performance of face recognition are many such as distance from camera to face, lighting change, pose change, and change of facial expression. In this paper, we implement a new 3D active camera system to prevent various pose variation that influence face recognition performance and propose face recognition algorithm for intelligent surveillance system and mobile robot system.

  • PDF

Wine Label Character Recognition in Mobile Phone Images using a Lexicon-Driven Post-Processing (사전기반 후처리를 이용한 모바일 폰 영상에서 와인 라벨 문자 인식)

  • Lim, Jun-Sik;Kim, Soo-Hyung;Lee, Chil-Woo;Lee, Guee-Sang;Yang, Hyung-Jung;Lee, Myung-Eun
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.5
    • /
    • pp.546-550
    • /
    • 2010
  • In this paper, we propose a method for the postprocessing of cursive script recognition in Wine Label Images. The proposed method mainly consists of three steps: combination matrix generation, character combination filtering, string matching. Firstly, the combination matrix generation step detects all possible combinations from a recognition result for each of the pieces. Secondly, the unnecessary information in the combination matrix is removed by comparing with bigram of word in the lexicon. Finally, string matching step decides the identity of result as a best matched word in the lexicon based on the levenshtein distance. An experimental result shows that the recognition accuracy is 85.8%.

An Object Recognition Method Based on Depth Information for an Indoor Mobile Robot (실내 이동로봇을 위한 거리 정보 기반 물체 인식 방법)

  • Park, Jungkil;Park, Jaebyung
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.21 no.10
    • /
    • pp.958-964
    • /
    • 2015
  • In this paper, an object recognition method based on the depth information from the RGB-D camera, Xtion, is proposed for an indoor mobile robot. First, the RANdom SAmple Consensus (RANSAC) algorithm is applied to the point cloud obtained from the RGB-D camera to detect and remove the floor points. Next, the removed point cloud is classified by the k-means clustering method as each object's point cloud, and the normal vector of each point is obtained by using the k-d tree search. The obtained normal vectors are classified by the trained multi-layer perceptron as 18 classes and used as features for object recognition. To distinguish an object from another object, the similarity between them is measured by using Levenshtein distance. To verify the effectiveness and feasibility of the proposed object recognition method, the experiments are carried out with several similar boxes.