통합 검색 | Korea Science

A.I.에이전트와의 보이스 인터랙션 : 국내외 IT회사 사례연구 (Voice Interactions with A. I. Agent : Analysis of Domestic and Overseas IT Companies)

이서영
- 한국엔터테인먼트산업학회논문지
- /
- 제15권4호
- /
- pp.15-29
- /
- 2021
인공지능 에이전트는 4차 산업혁명의 핵심 기술이고, 현재 많은 기업들이 AI 음성 인식 비서를 탑재 출시함으로써 산업 내 치열한 경쟁을 벌이고 있다. 애플, 마이크로소프트, 구글, 아마존, 삼성 등 고객 충성도를 확보하고 있으며 자사 하드웨어 제품을 내놓고 있는 기업의 경우, AI 비서 서비스를 자사 제품에 적용함으로써 고객 충성도를 높이고, 시장 점유율 역시 극대화뿐 아니라 향후 음성 인터페이스 플랫폼 시장 장악력을 확대하고 있다. 본 연구는 인공지능분야의 해외 및 국내 주요 기업들의 현황을 분석하고 보이스 UI 개발과 혁신 수용 관점에서 사용자 만족을 위한 기술 발전 방향에 초점을 맞추어 미래 전략 방향을 제언했다. B2B 기술적인 측면에서는 음성 인식률을 높이고 하드웨어향상, 자연언어 처리기술 및 빅데이터 및 인공지능 접목한 혁신 기술의 데이터가 쌓인 클라우드 컴퓨팅 활용뿐 아니라 및 Open A.I.언어 인공지능인 GPT-3의 활용 및 사용성, 유용성, 감성 측면에서 사용자 만족을 높일 필요가 있다. 본 연구는 산업계와 학계에 실무적, 이론적 함의를 준다.
https://doi.org/10.21184/jkeia.2021.6.15.4.15 인용

Real-Time Implementation of Acoustic Echo Canceller Using TMS320C6711 DSK

Heo, Won-Chul;Bae, Keun-Sung
- 음성과학
- /
- 제15권1호
- /
- pp.75-83
- /
- 2008
The interior of an automobile is a very noisy environment with both stationary cruising noise and the reverberated music or speech coming out from the audio system. For robust speech recognition in a car environment, it is necessary to extract a driver's voice command well by removing those background noises. Since we can handle the music and speech signals from an audio system in a car, the reverberated music and speech sounds can be removed using an acoustic echo canceller. In this paper, we implement an acoustic echo canceller with robust double-talk detection algorithm using TMS-320C6711 DSK. First we developed the echo canceller on the PC for verifying the performance of echo cancellation, then implemented it on the TMS320C6711 DSK. For processing of one speech sample with 8kHz sampling rate and 256 filter taps of the echo canceller, the implemented system used only 0.035ms and achieved the ERLE of 20.73dB.
PDF

Autonomous Aero-Robot and Disaster Response

Inoue, Koichi;Nakanishi, Hiroaki
- 한국산업안전학회:학술대회논문집
- /
- 한국안전학회 2003년도 추계 학술논문발표회 논문집
- /
- pp.3-16
- /
- 2003
After a not-widely-known fact is revealed that Japan is a leading country in production and use of industrial unmanned helicopters, a kind of UAV. The voice command system and the autonomous flight control system with a variety of control algorithms including neural network, robust and adaptive control that have been developed in collaboration between Kyoto University and Yamaha Motor Co., and funded by the Ministry of Education and Science of Japan are described in some detail. Both already-proven and promising future applications of the autonomous unmanned helicopters are given.
PDF

PDA 상에서 음성명령어를 구현하기 위한 음성인식기의 설계 (The design of Speech Recognizer to Implement the Voice Command on the PDA)

곽상훈;김철;최승호
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 2001년도 추계학술발표대회 논문집 제20권 2호
- /
- pp.37-40
- /
- 2001
본 논문에서는 PDA상에서 음성으로 명령어를 제어하기 위해 Window CE 3.0 환경에서 음성인식기를 설계하였다. 전처리과정에서 26차 특징파라미터를 추출하고, HTK를 통해 학습하였다. 트라이폰 기반의 가변어휘 음성인식기를 설계하였으며, PDA의 응용프로그램은 Embedded Visual C++언어를 사용하여 22개의 음성명령어를 제어하도록 하였다. 그 결과 PDA상에서 $92\%의 인식률이 나타났으며 이것은 음성인식이 모바일 환경에서도 접근이 가능함을 알 수 있었다.
PDF

Spatio-Temporal Pattern Recognition Neural Network를 이용한 전동 휠체어의 음성 제어에 관한 연구 (A Study on the Voice-Controlled Wheelchair using Spatio-Temporal Pattern Recognition Neural Network)

백승우;김승범;권장우;이응혁;홍승홍
- 대한의용생체공학회:학술대회논문집
- /
- 대한의용생체공학회 1993년도 춘계학술대회
- /
- pp.90-93
- /
- 1993
In this study, Korean speech was recognized by using spatio-temporal recognition neural network. The subjects of speech are numeric speech from zero to nine and basic command which might be used for motorized wheelchair developed it own Lab. Rabiner and Sambur's method of speech detection was used in determining end-point of speech, speech parameter was extracted by using LPC 16 order. The recognition rate was over 90%.
PDF

DTW방식을 이용한 음성 명령에 의한 커서 조작 (Cursor Moving by Voice Command using DTW method)

추명경;손영선
- 한국지능시스템학회논문지
- /
- 제11권1호
- /
- pp.82-87
- /
- 2001
본 논문에서는 마우스 대신에 음성으로 명령을 입력하여 퍼지 추론을 통해 위도우 화면상의 커서를 이동시키는 인터페이스를 구현하였다. 입력된 음성이 대체로 짧은 언어이기에 이를 인식하기 위하여 고립단어 인식에 강한 DTW방식을 사용하였다. DTW방식의 단점중인 하나가 음성길이가 비슷한 명령을 입력하였을 때 표준패턴 중 오차 값이 가장 작은 패턴으로 인식하는 것이다. 예를 들면 \"아주 많이 이동해\"하는 음성이 입력되었을 때 비슷한 음성길이를 가진 \"아주 많이 오른쪽\"으로 인식하는 경우가 있다. 이런 오류를 해결하고자 각 패턴의 DTW오차 거리 값과 표준 패턴의 음성길이를 기준으로 임계값을 퍼지 추론하여 명령으로서의 수락 여부를 결정하였다. 판단이 애매한 부분은 사용자에게 질의를 하여 응답에 따라 수락 여부를 결정하였다.
PDF

구글 홈을 활용한 응용프로그램 제어 시스템의 설계 (Design of Application Control System Using Google Home)

김동현;김휘민
- 한국컴퓨터정보학회:학술대회논문집
- /
- 한국컴퓨터정보학회 2019년도 제60차 하계학술대회논문집 27권2호
- /
- pp.135-136
- /
- 2019
일반적으로 컴퓨터에서 문서 작업을 하기 위해서는 사용자는 컴퓨터 화면을 볼 수 있는 시각과 키보드와 마우스를 조작하기 위하여 손을 사용해야 한다. 시각과 손이 불편한 대부분 장애우는 컴퓨터를 조작하기 어렵다. 장애우들을 보조해주는 정보통신 보조기기의 가격은 비싸며 기기 보급을 지원해주는 사업이 있지만, 사업에 선정되기 어렵다는 문제가 있다. 이 논문에서는 구글 홈을 이용하여 텍스트, 워드, 엑셀, 한글 등 다양한 응용프로그램을 음성을 이용하여 제어하기 위한 시스템을 제안한다. 제안한 시스템은 구글 어시스턴트가 다이어로그플로우로 설계한 인텐트를 웹 훅을 이용해 서버에서 컴퓨터로 접근하여 응용프로그램을 제어한다.
PDF

에지 컴퓨팅 기반 음성 명령 스마트홈 제어 시스템 구축 (Edge Computing-Based Voice Command Smart Home Control System)

김소철;윤서정;고현규
- 한국정보처리학회:학술대회논문집
- /
- 한국정보처리학회 2022년도 추계학술발표대회
- /
- pp.764-766
- /
- 2022
본 시스템은 스마트폰에서 사용자의 음성을 이용해 집 안이나 밖에서 IoT 단말을 효율적으로 제어할 수 있는 시스템으로, 인식된 음성에 맞춰 가전제품 기동, 조명 조절 등 IoT 단말을 컨트롤한다. 사용자의 음성은 Json 형태의 명령으로 변환되어 에지 컴퓨팅 기술을 통해 저사양 단말이 고사양 단말의 유휴자원을 활용하며 명령에 따른 IoT 단말 컨트롤이 진행된다. 이러한 아키텍처는 IoT 단말 데이터를 외부에 노출하지 않고 컴퓨팅 자원을 효율적으로 운용할 수 있는 시스템을 제공한다.
https://doi.org/10.3745/PKIPS.y2022m11a.764 인용 PDF

A Real-Time Embedded Speech Recognition System

Nam, Sang-Yep;Lee, Chun-Woo;Lee, Sang-Won;Park, In-Jung
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2002년도 ITC-CSCC -1
- /
- pp.690-693
- /
- 2002
According to the growth of communication biz, embedded market rapidly developing in domestic and overseas. Embedded system can be used in various way such as wire and wireless communication equipment or information products. There are lots of developing performance applying speech recognition to embedded system, for instance, PDA, PCS, CDMA-2000 or IMT-2000. This study implement minimum memory of speech recognition engine and DB for apply real time embedded system. The implement measure of speech recognition equipment to fit on embedded system is like following. At first, DC element is removed from Input voice and then a compensation of high frequency was achieved by pre-emphasis with coefficients value, 0.97 and constitute division data as same size as 256 sample by lapped shift method. Through by Levinson - Durbin Algorithm, these data can get linear predictive coefficient and again, using Cepstrum - Transformer attain feature vectors. During HMM training, We used Baum-Welch reestimation Algorithm for each words training and can get the recognition result from executed likelihood method on each words. The used speech data is using 40 speech command data and 10 digits extracted form each 15 of male and female speaker spoken menu control command of Embedded system. Since, in many times, ARM CPU is adopted in embedded system, it's peformed porting the speech recognition engine on ARM core evaluation board. And do the recognition test with select set 1 and set 3 parameter that has good recognition rate on commander and no digit after the several tests using by 5 proposal recognition parameter sets. The recognition engine of recognition rate shows 95%, speech commander recognizer shows 96% and digits recognizer shows 94%.
PDF

전장가시화를 위한 한국형 지상전술데이터링크 구축 연구 (Study on Korean Variable Message Format Construction for Battlefield Visualization)

김승춘;이형근
- 전기전자학회논문지
- /
- 제15권1호
- /
- pp.104-112
- /
- 2011
지상군은 감시정찰, 지휘통제 및 정밀타격과 관련된 정보를 교환하기 위한 수단으로 음성위주로 사용하고 있다. 하지만 지상군 작전에 참가하는 전력들간에 전장 가시화를 위해서는 자동화된 상황인식과 지휘통제를 제공할 수 있는 지상전술데이터링크가 필요하다. 이런 필요한 핵심기술 확보를 위해 응용연구를 통하여 메지지 표준과 메시지처리기가 완성되었다. 또한 각 무기체계의 장착을 위한 지상용 데이터링크가 시험개발이 진행 중이다. 본 논문에서는 합동작전, 지상작전 및 연합작전을 수행으로 통합 전장관리체계의 연동성 확보가 가능하여 근실시간으로 상황인식과 타격체계가 자동화된 한국형 지상전술데이터링크의 구축 연구를 제시한다. 메시지 처리기의 M&S 실험결과, 단일 소대망의 노드수와 메시지 길이 및 메시지 발생주기에 따라 지연 시간이 변화하는 것을 확인할 수 있었다. 따라서 각 상황에 네트워크 프로토콜을 변경하여 성능을 최적화 할 수 있음을 확인하였다.
https://doi.org/10.7471/ikeee.2011.15.1.104 인용 PDF KSCI

검색결과 96건 처리시간 0.026초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)