Search | Korea Science

Implementation of Paper Keyboard Piano with a Kinect (키넥트를 이용한 종이건반 피아노 구현 연구)

Lee, Jung-Chul;Kim, Min-Seong
- Journal of the Korea Society of Computer and Information / v.17 no.12 / pp.219-228 / 2012
In this paper, we propose a paper keyboard piano that detects finger movement using 3D image data from a Kinect. The keyboard pattern and keyboard depth information are extracted from the color image and the depth image to detect touch events on the paper keyboard and to identify the touched key. Hand region detection errors are unavoidable when the input depth image is simply compared with the background depth image, and these errors are critical for key touch detection, so skin color is used to minimize them. Fingertips are detected using contour detection with an area limit and a convex hull. Finally, key touch is decided using the keyboard pattern information at the fingertip position. Experimental results show that the proposed method detects key touches with high accuracy. The paper keyboard piano can serve as an easy and convenient interface for beginners learning to play the piano with PC-based learning software.
https://doi.org/10.9708/jksci/2012.17.12.219
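The final step of the abstract above, deciding a key touch from the fingertip position, could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `touched_key`, the millimetre depth units, the `touch_tol_mm` tolerance, and the representation of the keyboard pattern as sorted x-coordinates of key edges are all hypothetical and are not taken from the paper.

```python
from bisect import bisect_right


def touched_key(tip_x, tip_depth_mm, surface_depth_mm, key_edges, touch_tol_mm=10.0):
    """Return the index of the touched key, or None if no key is touched.

    Assumptions (not from the paper): key_edges is a sorted list of x pixel
    coordinates bounding each key, and a touch is declared when the fingertip
    depth is within touch_tol_mm of the paper surface depth at that position.
    """
    # The fingertip must be close enough to the keyboard surface.
    if abs(surface_depth_mm - tip_depth_mm) > touch_tol_mm:
        return None
    # The fingertip must lie inside the horizontal extent of the keyboard.
    if tip_x < key_edges[0] or tip_x >= key_edges[-1]:
        return None
    # Key i spans [key_edges[i], key_edges[i + 1]).
    return bisect_right(key_edges, tip_x) - 1
```

For example, with edges `[0, 20, 40, 60]` a fingertip at x = 25 that is 5 mm above the surface maps to key index 1, while the same fingertip 40 mm above the surface is not treated as a touch.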

Face Detection Algorithm and Hardware Implementation for Auto Focusing Using Face Features in Skin Regions (AF를 위한 피부색 영역의 얼굴 특징을 이용한 Face Detection 알고리즘 및 하드웨어 구현)

Jeong, Hyo-Won;Kwak, Boo-Dong;Ha, Joo-Young;Han, Hag-Yong;Kang, Bong-Soon
- Journal of the Korea Institute of Information and Communication Engineering / v.13 no.12 / pp.2547-2554 / 2009
In this paper, we propose a face detection algorithm and a hardware implementation method for the ROI (Region Of Interest) of AF (Auto Focusing). We use face features in skin regions of the YCbCr color space for face detection. The face features are the number of skin pixels in face regions, edge pixels in eye regions, and shadow pixels in lip regions. Each feature was statistically selected from 2,000 sample face pictures. To use hardware resources efficiently, the proposed algorithm detects the two faces closest to the center of the image. The detected faces are displayed as rectangles for the ROI of AF, and each rectangle is represented by the positions of its starting point and ending point in the image. The proposed face detection method was verified using FPGA boards and a mobile phone camera sensor.
https://doi.org/10.6109/JKIICE.2009.13.12.2547
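As a rough illustration of selecting skin pixels in the YCbCr color space, the sketch below converts an RGB pixel with the full-range BT.601 (JFIF) equations and applies a fixed chrominance box. The thresholds (Cb in 77..127, Cr in 133..173) are a commonly used range assumed here for illustration; the abstract does not state which thresholds the authors used, and the function names are hypothetical.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert 8-bit RGB to full-range YCbCr (BT.601 / JFIF equations)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def is_skin(r, g, b, cb_range=(77, 127), cr_range=(133, 173)):
    """Classify a pixel as skin using an assumed chrominance box (not from the paper)."""
    _, cb, cr = rgb_to_ycbcr(r, g, b)
    return cb_range[0] <= cb <= cb_range[1] and cr_range[0] <= cr <= cr_range[1]
```

Counting the pixels for which `is_skin` is true inside a candidate region would give a skin-pixel feature of the kind the abstract mentions.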

Improving the Performance of a Speech Recognition System in a Vehicle by Distinguishing Male/Female Voice (성별 구별방법에 의한 자동차 내 음성 인식 성능 향상)

Yang, Jin-Woo;Kim, Sun-Hyeop
- Journal of KIISE:Software and Applications / v.27 no.12 / pp.1174-1182 / 2000
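This paper proposes a system that allows always-on speech input and output without any auxiliary switch operation, in order to secure both the safety and the convenience of the driver in a moving vehicle. To obtain noise-robust threshold values, the reference energy and zero-crossing rate are updated every 1.5 seconds, and endpoint detection is carried out automatically and accurately in real time in two stages (first and second) using a band-pass filter. In addition, male and female speakers are distinguished by pitch detection in order to select the model, and, to use the model best suited to the speed of the moving vehicle, separate male and female models are provided for the Idle-40km, 40-80km, and 80-100km ranges. The speech feature vector is 13th-order PLP, and the recognition algorithm is OSDP (One-Stage Dynamic Programming). Speaker-independent recognition experiments were conducted at each speed range on roads in downtown Seoul and on the inner ring road, giving recognition rates of 96.8% for men and 95.1% for women at 40-80km, and 91.6% for men and 90.6% for women at 80-100km. Speaker-dependent experiments gave high recognition rates of 98% for men and 96% for women at 40-80km, and 96% for men and 94% for women at 80-100km, demonstrating the effectiveness of the system.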
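The energy and zero-crossing-rate thresholding described in this abstract can be sketched in a simplified form. The sketch below estimates the reference energy and zero-crossing rate once, from leading frames assumed to contain only noise, whereas the paper updates them every 1.5 seconds; the band-pass filtering and two-stage processing are omitted. All names, the frame length, and the scaling factors are assumptions for illustration.

```python
def frame_features(samples, frame_len):
    """Split samples into non-overlapping frames; return per-frame energy and zero-crossing rate."""
    energy, zcr = [], []
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        energy.append(sum(s * s for s in frame) / frame_len)
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
        zcr.append(crossings / (frame_len - 1))
    return energy, zcr


def detect_endpoints(samples, frame_len=160, noise_frames=5,
                     energy_factor=4.0, zcr_factor=2.0):
    """Return (first_frame, last_frame) of detected speech, or None.

    Assumption: the first noise_frames frames contain only background noise
    and serve as the reference for both thresholds.
    """
    energy, zcr = frame_features(samples, frame_len)
    if len(energy) <= noise_frames:
        return None
    ref_energy = sum(energy[:noise_frames]) / noise_frames
    ref_zcr = sum(zcr[:noise_frames]) / noise_frames
    # A frame is active if its energy clearly exceeds the reference, or if its
    # zero-crossing rate does while its energy is at least above the reference.
    active = [
        i for i, (e, z) in enumerate(zip(energy, zcr))
        if e > energy_factor * ref_energy
        or (z > zcr_factor * ref_zcr and e > ref_energy)
    ]
    if not active:
        return None
    return active[0], active[-1]
```

A periodic update in the spirit of the abstract could recompute `ref_energy` and `ref_zcr` from the most recent non-speech frames every 1.5 seconds instead of once at the start.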

Boundary Extraction of Moving Objects using Moving Edge and Heuristic Search (이동에지와 휴리스틱 탐색을 이용한 움직이는 물체의 경계추출)

김종대;김성대;김재균
- The Journal of Korean Institute of Communications and Information Sciences / v.14 no.3 / pp.249-262 / 1989
We present a method for extracting the boundaries of moving objects. We propose four methods for detecting moving edge pixels, which can be located on the boundaries of moving objects, and select the best one after testing all four on real image sequences. The portions of the boundaries that are marked as moving edge pixels are traced along those pixels with simple heuristics. The end points of the resulting line segments are then used as the start points of a second-stage heuristic search, which recovers the boundary portions that were not marked as moving edge pixels for various reasons. We test our algorithm on two real sequences and find that this simple algorithm performs well.

Tracking of eyes based on the iterated spatial moment using weighted gray level (명암 가중치를 이용한 반복 수렴 공간 모멘트기반 눈동자의 시선 추적)

Choi, Woo-Sung;Lee, Kyu-Won
- Journal of the Korea Institute of Information and Communication Engineering / v.14 no.5 / pp.1240-1250 / 2010
In this paper, we present an eye tracking method based on an iterated spatial moment with weighted gray levels that can accurately detect and track a user's eyes against a complicated background. To minimize the region of interest in the input picture from a CCD camera, the face region is detected using Haar-like features before the eye region is extracted. The eye region is then detected using eigeneyes, based on the eigenface approach of principal component analysis. Feature points of the eyes are detected from the darkest part of the eye region. The eyes are tracked correctly using the iterated spatial moment with weighted gray levels.
https://doi.org/10.6109/jkiice.2010.14.5.1240
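One plausible reading of an "iterated spatial moment with weighted gray levels" is to compute the darkness-weighted centroid (first-order moments divided by the zeroth-order moment) inside a window and re-center the window until it stops moving. The sketch below follows that reading; the weight `255 - gray`, the window size, and the function names are assumptions, not details confirmed by the abstract.

```python
def weighted_centroid(img, cx, cy, half):
    """Darkness-weighted centroid of a (2*half+1)-wide square window centered at (cx, cy).

    img is a list of rows of 8-bit gray values; darker pixels get larger weights.
    """
    h, w = len(img), len(img[0])
    m00 = m10 = m01 = 0.0
    for y in range(max(0, cy - half), min(h, cy + half + 1)):
        for x in range(max(0, cx - half), min(w, cx + half + 1)):
            weight = 255 - img[y][x]
            m00 += weight        # zeroth-order moment
            m10 += weight * x    # first-order moment in x
            m01 += weight * y    # first-order moment in y
    if m00 == 0:
        return cx, cy            # no dark mass in the window; stay in place
    return round(m10 / m00), round(m01 / m00)


def track_dark_blob(img, start, half=3, max_iter=20):
    """Re-center the window on its weighted centroid until it converges."""
    cx, cy = start
    for _ in range(max_iter):
        nx, ny = weighted_centroid(img, cx, cy, half)
        if (nx, ny) == (cx, cy):
            break
        cx, cy = nx, ny
    return cx, cy
```

Starting the iteration in each new frame from the position found in the previous frame would turn this into a simple tracker.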

Development of a Stock Information Retrieval System using Speech Recognition (음성 인식을 이용한 증권 정보 검색 시스템의 개발)

Park, Sung-Joon;Koo, Myoung-Wan;Jhon, Chu-Shik
- Journal of KIISE:Computing Practices and Letters / v.6 no.4 / pp.403-410 / 2000
This paper describes the development of a stock information retrieval system using speech recognition and its features. The system is based on a DHMM (discrete hidden Markov model), and PLUs (phone-like units) are used as the basic unit of recognition. End-point detection and echo cancellation are included to facilitate speech input. A continuous speech recognizer is implemented to allow multi-word speech. Data collected over several months are analyzed.

The Environmental Control System using Speech Recognition (음성인식을 이용한 생활환경 제어장치)

정혁준;임재용;이행세;오문식
- Proceedings of the IEEK Conference / 2000.09a / pp.141-144 / 2000
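While the general public has little need for living aids based on speech recognition, people with disabilities and the elderly find it hard to operate home appliances or make phone calls on their own without help from family or others around them. For these users, speech was applied to the control of living-aid devices using the PC that is widely available in homes, so that the devices can be used easily without help from others. The speech recognizer consists of speech endpoint detection, feature coefficient extraction, vector quantization training and recognition, HMM training, and HMM recognition. The living-aid devices are controlled according to the recognition result. The recognizer is intended to bring convenience to the elderly and people with disabilities in tasks they cannot do alone, and to bring convenience to the general public as well. However, because the elderly and patients have difficulty with the training process, achieving recognition with only a small amount of training remains an open task.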

Spectral Pattern Based Robust Speech Endpoint Detection in Noisy Environments (스펙트럼 패턴 기반의 잡음 환경에 강인한 음성의 끝점 검출 기법)

Park, Jin-Soo;Lee, Yoon-Jae;Lee, In-Ho;Ko, Han-Seok
- Phonetics and Speech Sciences / v.1 no.4 / pp.111-117 / 2009
In this paper, a new speech endpoint detector for noisy environments is proposed. According to previous research, the energy feature in speech regions is easily distinguished from that in speech-absent regions. In the conventional method, the endpoint is found by applying an edge detection filter that finds abrupt change points in the feature domain. However, because the frame energy feature is unstable in noisy environments, accurate edge detection is not possible. Therefore, this paper proposes a novel feature extraction method based on the spectrum envelope pattern. The edge detection filter is then applied to the proposed feature to detect the endpoint. Experiments performed in a car noise environment show a substantial improvement over the conventional method.
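The idea of applying an edge detection filter to a per-frame feature can be illustrated with a very simple step-edge filter: the mean of the next few frames minus the mean of the previous few. The abstract does not specify the filter or the spectral feature, so the filter shape, the `width` parameter, and the function names below are assumptions; any per-frame feature sequence can be passed in.

```python
def edge_filter(feature, width=3):
    """Step-edge response: mean of the next `width` frames minus mean of the previous `width`."""
    n = len(feature)
    response = [0.0] * n
    for t in range(width, n - width + 1):
        after = sum(feature[t:t + width]) / width
        before = sum(feature[t - width:t]) / width
        response[t] = after - before
    return response


def find_endpoints(feature, width=3):
    """Return (start, end): frame of the strongest rising edge and of the strongest falling edge."""
    response = edge_filter(feature, width)
    start = max(range(len(response)), key=response.__getitem__)
    end = min(range(len(response)), key=response.__getitem__)
    return start, end
```

With this convention `start` is the first frame of the speech region and `end` is the first frame after it. A noisy feature makes the maximum and minimum of the response unreliable, which is the weakness of frame energy that the abstract points out.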

An Implementation of Multimedia Game using Speech Recognition for Windows (Windows환경에서 음성인식을 이용한 멀티미디어 게임의 구현)

윤재선
- Proceedings of the Acoustical Society of Korea Conference / 1998.06e / pp.335-338 / 1998
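This paper introduces the development of "Voice Illust Magic", a speech recognition game that can be used online in the Windows environment, built on the HMM speech recognition algorithm. Because the commands needed in the game are executed through speech recognition, in addition to the mouse and keyboard, as a medium of interaction between the user and the computer, information is conveyed very effectively, the game becomes easier and more convenient for users to access, and communication efficiency improves. In the recognition process, the endpoints of speech arriving online through the microphone are detected automatically, the Mel-cepstrum is extracted and compared with word-level reference HMMs, and once the best model is selected a message is sent to Windows so that the command is executed just as if the mouse or keyboard had been operated. In addition, rather than comparing the input speech with all reference patterns, the standard patterns are restricted to those applicable to the current situation, which reduced the search time and yielded a high recognition rate.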

Text-dependent Speaker Verification System in SVAPI 1.0 Environment (SVAPI 1.0 환경에서의 어구 종속 화자 확인 시스템)

김유진
- Proceedings of the Acoustical Society of Korea Conference / 1998.08a / pp.401-405 / 1998
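This paper describes a text-dependent speaker verification system in the SVAPI 1.0 environment. The implemented system was developed with the ultimate goal of a practical system applicable to the public telephone network, and to that end SVAPI 1.0, proposed by the SVAPI committee, was used as the development environment. SVAPI features an object-oriented structure and support for client-server and telephony environments, and it offers the advantage that applications and engines can be developed independently. The implemented demo system is an IBM-compatible PC with a Pentium processor, the Windows95/NT 4.0 operating system, and a speech-input device that can be controlled through the Win16/Win32 API. A speaker's voiceprint is enrolled by the speaker uttering the same phrase three times, and the response time for both enrollment and verification is within 1 second. The software consists broadly of the application and the text-dependent speaker verification engine; the engine comprises an endpoint detection algorithm, a speech feature extraction algorithm, and a verification algorithm that includes continuous-HMM-based enrollment of the speaker voiceprint model and similarity computation. Voiceprints were enrolled and tested with words of about three or more syllables, such as names. For an objective evaluation of the engine, a speech database was used that was built from 6 male and 3 female speakers each uttering their own name 40 times over a telephone line; the experiments yielded an EER of 2.85% for male speakers and 2.44% for female speakers.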
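The EER (equal error rate) reported above is the operating point at which the false acceptance rate and the false rejection rate are equal. A minimal sketch of how it can be estimated from verification scores follows; the function name and the example scores are hypothetical and unrelated to the paper's data, and higher scores are assumed to mean a better match.

```python
def equal_error_rate(genuine, impostor):
    """Estimate the EER by sweeping the decision threshold over all observed scores.

    genuine: scores of true-speaker trials; impostor: scores of impostor trials.
    A trial is accepted when its score is >= the threshold.
    """
    best = None
    for thr in sorted(set(genuine) | set(impostor)):
        frr = sum(s < thr for s in genuine) / len(genuine)     # false rejection rate
        far = sum(s >= thr for s in impostor) / len(impostor)  # false acceptance rate
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2)
    return best[1]
```

With only a finite set of scores the two error rates rarely cross exactly, so the sketch reports their average at the threshold where they are closest.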

Search Result 74, Processing Time 0.026 seconds
