• 제목/요약/키워드: Voice pattern recognition

검색결과 50건 처리시간 0.021초

자동차 ECU제어를 위한 음성인식 패턴매칭레벨에 관한 연구 (A Study on Voice Recognition Pattern matching level for Vehicle ECU control)

  • 안종영;김영섭;김수훈;허강인
    • 한국인터넷방송통신학회논문지
    • /
    • 제10권1호
    • /
    • pp.75-80
    • /
    • 2010
  • 자동차 환경에서의 음성인식은 잡음처리가 매우 중요한 요소이다. 하드웨어 및 소프트웨어로 적인 접근방법으로 많은 연구가 되어 지고 있다. 하드웨어적인 방법으로는 Low-pass filter를 기본으로한 잡음처리 필터가 많이 연구되어 가시적인 성과를 보이고 있고, 소프트웨어적으로는 Noise canceler, 신경망 등 패턴인식 알고리듬의 연구가 이루어지고 있다. 본 논문에서는 시계열 패턴인식에 적용 가능한 알고리듬인 DTW(Dynamic Time Warping)를 자동차 잡음환경에 적용하여 그 음성인식을 위한 파라미터 패턴에 대한 매칭 레벨을 분류하여 잡음환경 적합한 패턴 매칭 레벨을 분석 하였다.

인간-로봇 상호협력작업을 위한 모바일로봇의 지능제어에 관한 연구 (A Study on Intelligent Control of Mobile Robot for Human-Robot Cooperative Operation in Manufacturing Process)

  • 김두범;배호영;김상현;임오득;백영태;한성현
    • 한국산업융합학회 논문집
    • /
    • 제22권2호
    • /
    • pp.137-146
    • /
    • 2019
  • This study proposed a new technique to control of mobile robot based on voice command for (Human-Robot Cooperative operation in manufacturing precess). High performance voice recognition and control system was designed In this paper for smart factory. robust voice recognition is essential for a robot to communicate with people. One of the main problems with voice recognition robots is that robots inevitably effects real environment including with noises. The noise is captured with strong power by the microphones, because the noise sources are closed to the microphones. The signal-to-noise ratio of input voice becomes quite low. However, it is possible to estimate the noise by using information on the robot's own motions and postures, because a type of motion/gesture produces almost the same pattern of noise every time it is performed. In this paper, we describe an robust voice recognition system which can robustly recognize voice by adults and students in noisy environments. It is illustrated by experiments the voice recognition performance of mobile robot placed in a real noisy environment.

신경망을 이용한 단어에서 모음추출에 관한 연구 (A study on the vowel extraction from the word using the neural network)

  • 이택준;김윤중
    • 한국산업정보학회:학술대회논문집
    • /
    • 한국산업정보학회 2003년도 추계공동학술대회
    • /
    • pp.721-727
    • /
    • 2003
  • This study designed and implemented a system to extract of vowel from a word. The system is comprised of a voice feature extraction module and a neutral network module. The voice feature extraction module use a LPC(Linear Prediction Coefficient) model to extract a voice feature from a word. The neutral network module is comprised of a learning module and voice recognition module. The learning module sets up a learning pattern and builds up a neutral network to learn. Using the information of a learned neutral network, a voice recognition module extracts a vowel from a word. A neutral network was made to learn selected vowels(a, eo, o, e, i) to test the performance of a implemented vowel extraction recognition machine. Through this experiment, could confirm that speech recognition module extract of vowel from 4 words.

  • PDF

히어 캠 임베디드 플랫폼 설계 (HearCAM Embedded Platform Design)

  • 홍선학;조경순
    • 디지털산업정보학회논문지
    • /
    • 제10권4호
    • /
    • pp.79-87
    • /
    • 2014
  • In this paper, we implemented the HearCAM platform with Raspberry PI B+ model which is an open source platform. Raspberry PI B+ model consists of dual step-down (buck) power supply with polarity protection circuit and hot-swap protection, Broadcom SoC BCM2835 running at 700MHz, 512MB RAM solered on top of the Broadcom chip, and PI camera serial connector. In this paper, we used the Google speech recognition engine for recognizing the voice characteristics, and implemented the pattern matching with OpenCV software, and extended the functionality of speech ability with SVOX TTS(Text-to-speech) as the matching result talking to the microphone of users. And therefore we implemented the functions of the HearCAM for identifying the voice and pattern characteristics of target image scanning with PI camera with gathering the temperature sensor data under IoT environment. we implemented the speech recognition, pattern matching, and temperature sensor data logging with Wi-Fi wireless communication. And then we directly designed and made the shape of HearCAM with 3D printing technology.

다기능 전동휠체어의 음성인식 모듈에 관한 연구 (Voice Recognition Module for Multi-functional Electric Wheelchair)

  • 류홍석;김정훈;강성인;강재명;이상배
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(3)
    • /
    • pp.83-86
    • /
    • 2002
  • This paper intends to provide convenience to the disabled, losing the use of their limbs, through voice recognition technology. The voice recognition part of this system recognizes voice by DTW (Dynamic Time Warping) Which is most Widely used in Speaker dependent system. Specially, S/N rate was improved through Wiener filter in the pre-treatment phase while considering real environmental conditions; the result values of 12th order feature pattern per frame are extracted by DTW algorithm using LPC and Cepsturm in feature extraction process. Furthermore, miniaturization is pursued using TMS320C32, 71's the floating-point DSP, for the hardware part. Currently, 90% of hardware porting has been completed, but we can confirm that the recognition rate was 96% as a result of performing the DTW algorithm in PC.

  • PDF

A Study of the Pattern Kernels for a Lip Print Recognition

  • Paik, Kyoung-Seok;Chung, Chin-Hyun
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1998년도 제13차 학술회의논문집
    • /
    • pp.64-69
    • /
    • 1998
  • This paper presents a lip print recognition by the pattern kernels for a personal identification. A lip print recognition is developed less than the other physical attributes of a fingerprint, a voice pattern, a retinal blood/vessel pattern, or a facial recognition. A new method is proposed to recognize a lip print bi the pattern kernels. The pattern kernels are a function consisted of some local lip print pattern masks. This function converts the information on a lip print into the digital data. The recognition in the multi-resolution system is more reliable than recognition in the single-resolution system. The results show that the proposed algorithm by the multi-resolution architecture can be efficiently realized.

  • PDF

Pattern kernels에 의한 Lip Print인식 연구 (A Study of a Lip Print Recognition by the Pattern Kernels)

  • 백경석;정진현
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1998년도 하계학술대회 논문집 G
    • /
    • pp.2249-2251
    • /
    • 1998
  • This paper presents a lip print recognition by the pattern kernels for a personal identification. A lip print recognition is developed less than the other physical attribute that is a fingerprint, a voice pattern, a retinal blood-vessel pattern, or a facial recognition. A new method by the pattern kernels is pro for a lip print recognition. The pattern kerne function consisted of some local lip print p masks. This function identifies the lip print known person or an unknown person. The results show that the proposed algorithm the pattern kernels can the efficiently realized.

  • PDF

소아애성에 영향을 주는 환경에 대한 연구 (Environments of Hoarseness in Children)

  • 안철민;박상준;이건영
    • 대한후두음성언어의학회지
    • /
    • 제8권2호
    • /
    • pp.173-177
    • /
    • 1997
  • The speech movements are acquired activity, not determined by instincts or by biologic inheritance either. The child listens to the sound from the surrounding persons, observes the speech movement of the people and tried to imitate them. Then the child acquires their specific phonation pattern. We guessed that the parents influences to the child are very important in the developing of the speech movements. Because the parents are first contact person to the baby. The recognition of parents about the voice changes in the child will be important too. And social environments such as kindergarden, school, friends contact with, can influence to the voice of the child. We investigated the state of the voice, parents influence and social environmental factor. In the bases of this study, we knew that the parents recognition about the voice changes of child, faulty vocal habits of child, social environmental factors influenced to the voice of child. And we thought we have to do our best for the early detection of voice changes and proper treatment.

  • PDF

VHDL을 이용한 구순문 인식 시스템의 구현 연구 (An Implementation of Lip Print Recognition system using VHDL)

  • 최우진;정진현
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1999년도 하계학술대회 논문집 G
    • /
    • pp.2935-2937
    • /
    • 1999
  • The human has recognizable part of body such as a fingerprint, a crimson, a blood vessel. This part has been investigated constantly, its confidence for personal recognition is high. In spite of specialized part of human body, a lip print recognition is developed less than the other physical attribute that is a fingerprint. a voice pattern, a retinal blood-vessel pattern, or a facial recognition. This paper is to implement hardware for lip print recognition system using VHDL.

  • PDF

음성패턴인식 인터랙티브 콘텐츠 개발 (Interactive content development of voice pattern recognition)

  • 나종원
    • 한국항행학회논문지
    • /
    • 제16권5호
    • /
    • pp.864-870
    • /
    • 2012
  • 언어 학습 콘텐츠에서 공통적으로 가질 수 있는 문제점들을 분석하고 문제점에 대하여 음성 패턴인식기술을 적용하여 기존의 문제점을 해결하였다. 언어 학습 콘텐츠의 첫 번째 문제점은 온라인 학습 자세이다. 수업 진행은 되었지만 다른 웹 페이지를 열어 게임을 하는 등 학생들의 집중력은 떨어졌다. 두 번 째 문제점은 Speaking 학습 과정을 만들었지만 실제로 따라 읽는지 판단할 수가 없었다. 세 번 째 문제점은 학습 관리 시스템에 의한 기계적 진행이 아니라 선생님들의 평가에 의해 잘하는 학생들과 못하는 학생간의 학습 진행에 차이를 둘 필요가 생겼다. 마지막으로 가장 큰 문제는 기존에 만들어 놓은 콘텐츠들은 그대로 유지되면서 위의 문제들을 해결할 수 있어야 했다. 이러한 배경 하에 음성 패턴인식기술은 말하기 학습 전용 학습 프로그램으로 학습 진행을 위한 음성인식은 물론 학습 자체를 위한 음성인식 기능들을 모두 가지고 있으며 인식 절차에 사용된 학습자의 발화 데이터를 원하는 형태의 오디오 파일로 변경하여 서버의 특정 위치로 전송하거나 SQL서버에 등록할 수도 있으며, 또한 컴포넌트이기 때문에 그 어떠한 시스템이나 프로그램이라도 모두 적용 가능하고 이미 만들어진 콘텐츠 전체를 손상시키지 않고 쉽게 삽입하여 새로운 기능들을 사용할 수 있었다. 본 논문으로 교육 방식을 보다 인터렉티브하게 바꾸어 적극적인 수업참여가 되도록 기여하였다.