Integrated Search | Korea Science

Voice Command-based Prediction and Follow of Human Path of Mobile Robots in AI Space

Tae-Seok Jin
- Journal of the Korean Society of Industry Convergence / Vol. 26, No. 2_1 / pp. 225-230 / 2023
This research addresses the problem of sound-command-based human tracking for an autonomous cleaning mobile robot in a networked AI space. To solve the problem, the differences among the travel times of the sound command to each of three microphones are used to calculate the distance and orientation of the sound source relative to the cleaning mobile robot, which carries the microphone array. Cross-correlation between two signals is applied to detect the time difference between them, which provides a more reliable and precise value of the time difference than conventional methods. To generate the tracking direction toward the sound command, fuzzy rules are applied and the results are used to control the cleaning mobile robot in real time. Finally, the experimental results show that the proposed algorithm works well even though the mobile robot knows little about the environment.
https://doi.org/10.21289/KSIC.2023.26.2.225
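
The cross-correlation step described in the abstract above can be illustrated with a minimal sketch. It is not the author's implementation: it uses a single microphone pair instead of three microphones, assumes a far-field source, and omits the fuzzy rules. The function names, the 16 kHz sample rate, the 0.1 m microphone spacing, and the 343 m/s speed of sound are all assumptions chosen for illustration.

```python
import math


def cross_correlation_lag(x, y, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation of x and y.

    A positive result means y is a delayed copy of x.
    """
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(len(x)):
            j = i + lag
            if 0 <= j < len(y):
                score += x[i] * y[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def bearing_from_delay(lag, sample_rate, mic_spacing, speed_of_sound=343.0):
    """Far-field bearing (radians, measured from broadside) for one microphone pair."""
    delay = lag / sample_rate
    # Clamp to the physically valid range before taking arcsin.
    ratio = max(-1.0, min(1.0, delay * speed_of_sound / mic_spacing))
    return math.asin(ratio)


# Synthetic check: a short pulse reaches the second microphone 3 samples later.
pulse = [0.0, 1.0, 0.5, -0.3, 0.0]
mic_a = pulse + [0.0] * 11
mic_b = [0.0] * 3 + pulse + [0.0] * 8
lag = cross_correlation_lag(mic_a, mic_b, max_lag=8)
angle = bearing_from_delay(lag, sample_rate=16000, mic_spacing=0.1)
print(lag, math.degrees(angle))
```

With three microphones, the same lag estimate would be computed for each pair, and the resulting delays combined to recover both distance and orientation.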

Development of Stereo Sound Authoring Tool to Modify and Edit 2-Channel Stereo Sound Source Using HRTF

김영식;김용일;배명수;전수민;이대호
- Proceedings of the Korea Information Processing Society Conference / Korea Information Processing Society 2017 Fall Conference / pp. 909-912 / 2017
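In implementing computer-based virtual training systems, the auditory element is responsible for the most important human cognitive ability after the visual element. In particular, improved auditory capability is closely related to training performance and contributes substantially to training effectiveness. This paper focuses on whether the sound system that is essential to building such a virtual training system can go beyond simple playback and let users or developers directly author the sound sources they need, and it develops a test system for modifying, editing, and using sound sources with the head-related transfer function (HRTF). Functional and listening tests were conducted to evaluate system performance.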
https://doi.org/10.3745/PKIPS.y2017m11a.909

The Study of Sound Effect Improved Simulation through Wavelet Analysis and Fourier Transform

김영식;김용일;배명수
- Proceedings of the Korea Information Processing Society Conference / Korea Information Processing Society 2017 Spring Conference / pp. 960-962 / 2017
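This paper proposes a method of separating sound source files, usable in simulations for military training and education, by frequency and filtering each band before use. For the separation, wavelet analysis splits the signal into frequency bands stage by stage, transforms them, and removes noise from each separated band. An authoring tool that can carry out this work is implemented.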
https://doi.org/10.3745/PKIPS.y2017m04a.960
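
The band-splitting and per-band noise removal described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the wavelet or the thresholding rule, so the Haar wavelet, the single decomposition level, the soft-threshold rule, and the threshold value 0.2 are all hypothetical choices.

```python
import math


def haar_step(signal):
    """One level of the Haar transform: split an even-length signal into
    a low-frequency (approximation) band and a high-frequency (detail) band."""
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def inverse_haar_step(approx, detail):
    """Rebuild the signal from its approximation and detail bands."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / s, (a - d) / s])
    return out


def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t; small ones become zero."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


noisy = [1.0, 1.1, 0.9, 1.0, 3.0, 3.1, 2.9, 3.0]
approx, detail = haar_step(noisy)
denoised = inverse_haar_step(approx, soft_threshold(detail, 0.2))
roundtrip = inverse_haar_step(approx, detail)
print(denoised)
```

Applying `haar_step` again to `approx` would give the stage-by-stage separation the abstract mentions, with a separate threshold per band.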

A HMM-based Method of Reducing the Time for Processing Sound Commands in Computer Games

박도생;김상철
- Journal of Korea Game Society / Vol. 16, No. 2 / pp. 119-128 / 2016
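In computer games, most user interface methods are the keyboard, mouse, and touch screen. The total processing time of a sound command consists largely of the command input time and the recognition time. This paper proposes a method that shortens the total processing time by reducing the input time: instead of receiving the entire command signal, it takes only the leading part of the signal. Our method recognizes command signals using hidden Markov models (HMMs), building separate HMMs for the full signal and for the partial signals. We tested the method on representative platform-game commands expressed as voice and hand-clap sounds. The experiments showed that command processing time was reduced without a large drop in recognition rate. This work will help diversify user interface methods for games.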
https://doi.org/10.7583/JKGS.2016.16.2.119

Speech Recognition Interface in the Communication Environment

한태근;김종근;이동욱
- Proceedings of the KIEE Conference / Proceedings of the KIEE 2001 Summer Conference D / pp. 2610-2612 / 2001
This study examines the recognition of a user's sound commands based on speech recognition and natural language processing, and develops a natural language interface agent that can analyze the recognized command. The natural language interface agent consists of a speech recognizer and a semantic interpreter. The speech recognizer understands a spoken command and transforms it into character strings. The semantic interpreter analyzes the character strings and creates the commands and questions to be passed to the application program. We also consider problems related to the speech recognizer and the semantic interpreter, such as the ambiguity of natural language and the ambiguity and errors introduced by the speech recognizer. This kind of natural language interface agent can be applied to telephony environments involving all kinds of communication media, such as telephone, fax, and e-mail.

Sound Improvement of Violin Playing Robot Applying Auditory Feedback

Jo, Wonse;Yura, Jargalbaatar;Kim, Donghan
- Journal of Electrical Engineering and Technology / Vol. 12, No. 6 / pp. 2378-2387 / 2017
Violinists learn to make better sounds by hearing and evaluating their own playing through extensive practice. This study proposes a new method of auditory feedback that mimics this process and verifies its effectiveness through experiments. Producing the desired sound quality on a violin is difficult without auditory feedback, even for an expert violinist. An algorithm for controlling the arm of a violin-playing robot is determined based on correlations with bowing speed, bowing force, and sound point, which together determine the sound quality of a violin. The bowing speed is estimated from the control command of the robot arm, whereas the bowing force and the sound point are measured with a two-axis load cell and a photo interrupter, respectively. To improve the sound quality of the violin-playing robot, sound information is obtained by an auditory feedback system that applies the Short-Time Fourier Transform (STFT) to the sounds from the violin. This study proposes Gaussian-Harmonic-Quality (GHQ), which uses the clarity, accuracy, and harmonic structure of the sound to evaluate sound quality objectively. In the experiments, the auditory feedback system improved the robot's performance quality by adjusting the bowing speed, bowing force, and sound point while evaluating the quality of the robot's sound with the GHQ sound quality evaluation system.
https://doi.org/10.5370/JEET.2017.12.6.2378

Implementation of Stereophonic Sound System Using Multiple Smartphones

김기준;명창호;박호종
- Journal of Broadcast Engineering / Vol. 19, No. 6 / pp. 810-818 / 2014
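This paper proposes a system that reproduces stereophonic sound using multiple smartphones. Existing smartphone-based sound systems play the same signal on every device, which makes it hard to convey a sense of space. To address this, we propose a system that plays a different signal on each device according to its position and uses amplitude panning to create a virtual sound source at an arbitrary position. The proposed method provides a better spatial impression than existing methods, and the stereophonic effect can be adjusted freely through user settings. We implemented the proposed system on commercial smartphones and confirmed that it delivers the intended stereophonic effect.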
https://doi.org/10.5909/JBE.2014.19.6.810
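
Amplitude panning, mentioned in the abstract above, places a virtual source between two playback devices by scaling the same signal with different gains. The sketch below uses the common constant-power (sine/cosine) panning law for two devices; the abstract does not say which panning law or how many devices the authors used, so the law, the two-device setup, and the function names are assumptions.

```python
import math


def constant_power_gains(pan):
    """Gains for two devices; pan in [0, 1], 0 = fully at device A, 1 = fully at device B."""
    if not 0.0 <= pan <= 1.0:
        raise ValueError("pan must be in [0, 1]")
    theta = pan * math.pi / 2
    # cos^2 + sin^2 = 1, so total power stays constant as the source moves.
    return math.cos(theta), math.sin(theta)


def pan_signal(samples, pan):
    """Return the per-device signals for a mono input at the given pan position."""
    gain_a, gain_b = constant_power_gains(pan)
    return [gain_a * s for s in samples], [gain_b * s for s in samples]


gain_a, gain_b = constant_power_gains(0.5)   # virtual source midway between devices
device_a, device_b = pan_signal([1.0, -1.0], 0.0)  # source fully at device A
print(gain_a, gain_b)
```

Keeping the summed power constant avoids the perceived loudness dip at the midpoint that a plain linear crossfade would produce.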

A Study on Stable Motion Control of Humanoid Robot with 24 Joints Based on Voice Command

Lee, Woo-Song;Kim, Min-Seong;Bae, Ho-Young;Jung, Yang-Keun;Jung, Young-Hwa;Shin, Gi-Soo;Park, In-Man;Han, Sung-Hyun
- Journal of the Korean Society of Industry Convergence / Vol. 21, No. 1 / pp. 17-27 / 2018
We propose a new approach to controlling biped robot motion based on iterative learning of voice commands for the implementation of a smart factory. Real-time processing of the speech signal is very important for high-speed, precise automatic voice recognition. Recently, voice recognition has been used for intelligent robot control, artificial life, wireless communication, and IoT applications. To extract valuable information from the speech signal, make decisions on the process, and obtain results, the data must be manipulated and analyzed. The basic method for extracting features of the voice signal is to compute the mel-frequency cepstral coefficients, which collectively represent the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. The reliability of voice commands for controlling the biped robot's motion is illustrated by computer simulation and by experiments with a biped walking robot with 24 joints.
https://doi.org/10.21289/KSIC.2018.21.1.017

A study on the voice command recognition at the motion control in the industrial robot

이순요;권규식;김홍태
- Journal of the Ergonomics Society of Korea / Vol. 10, No. 1 / pp. 3-10 / 1991
The teach pendant and keyboard have been used as input devices for control commands in human-robot systems, but many problems occur when the user is a novice. A speech recognition system is therefore required for communication between a human and the robot. In this study, Korean voice commands, consisting of eight robot commands and ten digits, are described on the basis of broad phonetic analysis. Applying broad phonetic analysis, the phonemes of the voice commands are divided into phoneme groups with similar features, such as plosive, fricative, affricate, nasal, and glide sounds. The feature parameters and their ranges for detecting the phoneme groups are then found by the minimax method. The classification rules consist of combinations of the feature parameters, such as zero crossing rate (ZCR), log energy (LE), up and down (UD), and formant frequency, together with their ranges. Voice commands were recognized by the classification rules. The recognition rate was over 90 percent in this experiment. The experiment also showed that the recognition rate for digits was better than that for robot commands.
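
Two of the feature parameters named in the abstract above, zero crossing rate and log energy, can be computed per frame as in the following sketch. The exact definitions, frame lengths, and parameter ranges used in the paper are not given, so the formulas below are common textbook forms and the two test frames are synthetic stand-ins rather than real speech.

```python
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def log_energy(frame, floor=1e-12):
    """Frame energy in decibels; the floor avoids log10(0) on silent frames."""
    energy = sum(s * s for s in frame)
    return 10.0 * math.log10(max(energy, floor))


# Synthetic frames: a slow, strong sinusoid (vowel-like) and a weak,
# rapidly alternating signal (fricative-like).
voiced_like = [math.sin(2 * math.pi * 2 * n / 64) for n in range(64)]
fricative_like = [0.1 * (1 if n % 2 == 0 else -1) for n in range(64)]

for name, frame in (("voiced-like", voiced_like), ("fricative-like", fricative_like)):
    print(name, zero_crossing_rate(frame), log_energy(frame))
```

A rule-based classifier of the kind the abstract describes would compare such values against per-group ranges, for example high ZCR with low energy suggesting a fricative.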

Speaker Tracking System for Autonomous Mobile Robot

이창훈;김용호
- Proceedings of the KIEE Conference / Proceedings of the KIEE 2002 Joint Fall Conference, Information and Control Section / pp. 142-145 / 2002
This paper describes an omnidirectional speaker tracking system for a mobile robot interface in a real environment. Its purpose is to robustly detect a sound source over 360 degrees and to recognize voice commands at long distance (60-300 cm). We consider spatial features, namely the relation between position and interaural time differences, and realize the speaker tracking system using a fuzzy inference process based on inference rules generated from these spatial features.

Search results: 25 items, processing time 0.041 s
