• Title/Summary/Keyword: Sound recognition

Search Result 311, Processing Time 0.028 seconds

On the Classification of Voice Sound and the Recognition of Vowels for Korean Continuous Speech (한국어 연속음인식에 관한 연구(유성음 분류 및 단모음 인식 ))

  • 하판봉;이철희;방승찬;안수길
    • The Journal of the Acoustical Society of Korea
    • /
    • v.5 no.3
    • /
    • pp.28-35
    • /
    • 1986
  • 우리나라 음성의 유성음을 모음, 비음 및 유성화 자음으로 분류하는 알고리즘을 기술하였다. 먼 저 기존의 PITCH 검출 알고리즘에 의하여 음성을 유성음과 무성음으로 나눈 뒤, 단지 정규화된 1차 상 관계수, 영교차율, LOG 에너지 및 LPG 에너지의 골짜기 검출만을 이용하여, 유성음은 모음, 비음 및 유 성화자음으로 분류하고 무성음은 실제의 무성음과 묵음으로 분류하였다. 그리고 이렇게 분류된 모음에 대하여 단모음 인식을 행하였다. 단지 한 FRAME으로 모음을 대표하였기 때문에 메모리 크기와 인식 시간을 줄였다. 여기서 UP & DOWN 및 수정된 영교차율을 새로이 정의하여 적용한 결과 만족한 결과 를 얻을 수 있었다. LPC 매개변수 및 전력 스펙트럼도 단모음 인식의 FEATURE로 사용하였다. 그리고 각 FEATURE 의 성능을 비교하였다. 이들 FEATURE을 잘 조합하여 2단계 인식을 행한 결과 92%의 높은 인식율을 얻을 수 있었다.

  • PDF

Development of Piano Playing Robot (피아노 연주 로봇의 개발)

  • Park, Kwang-Hyun;Jung, Seong-Hoon;Pelczar, Christopher;Hoang, Thai V.;Bien, Zeung-Nam
    • Proceedings of the KIEE Conference
    • /
    • 2007.04a
    • /
    • pp.334-336
    • /
    • 2007
  • This paper presents a beat gesture recognition method to synchronize the tempo of a robot playing a piano with the desired tempo of the user. To detect an unstructured beat gesture expressed by any part of a body, we apply an optical flow method, and obtain the trajectories of the center of gravity and normalized central moments of moving objects in images. The period of a beat gesture is estimated from the results of the fast Fourier transform. In addition, we also apply a motion control method by which robotic fingers are trained to follow a set of trajectories, Since the ability to track the trajectories influences the sound a piano generates, we adopt an iterative learning control method to reduce the tracking error.

  • PDF

Learning Framework based on Public Open Data for Workplace Etiquette Education (직장예절교육용 공공개방데이터를 활용한 학습 프레임워크)

  • Kim, Yuri
    • Knowledge Management Research
    • /
    • v.19 no.1
    • /
    • pp.133-146
    • /
    • 2018
  • This study develops an Education framework for users who need public open data for workplace etiquette education in a timely manner by mobile application. It facilitates utilizing efficiently Workplace etiquette contents that scattered in various platforms such as blogs, Youtube and web-sites run by private education agencies. Furthermore, it makes Public open data for workplace etiquette through gathering 'metadata', which is a comprehensive source of workplace etiquette. Accordingly, framework changes recognition about necessity of workplace etiquette education positively and suggests method that can promote effective workplace etiquette education. If the system in the study can provide public open data of workplace etiquette education, many young job applicants and workers will have a proper perception on it and sound workplace etiquette culture will be settled in the companies. Public data has been rising as a vital national strategic asset these days. Hopefully the public data will pave a way to discover the blue ocean in the market and open up a new type of businesses.

A Study on the Importance of Uninsured (Indirect) Cost Item of Workplace Accidents

  • Jung, Cecil;Baek, Jong-Bae
    • Korean Chemical Engineering Research
    • /
    • v.55 no.4
    • /
    • pp.497-502
    • /
    • 2017
  • Estimation of accident cost is a sound and great safety indicator on determining accurate occupational safety and health prevention. Just like in Korea, Heinrich ratio analysis of (1:4) between direct and indirect costs has been become widely used in safety management because of its simplicity. In this study four major categories of uninsured (indirect) cost items and 18 sub-categories of uninsured (indirect) cost items were identified. To determine and validate the importance and necessity of the results of a literature review an expert or professional surveyed had been analyses using the SPSS 18.0, where in the participants whose expertize is in the field of compensation and safety. Based on the results of survey all participants all uninsured (indirect) cost items classified was important and necessary when accidents occurred. Despite recognition of expert on the classification of uninsured (indirect) cost items, it is quite difficult to make generalization for all kind of costs in occupational accident case due to different nature of business for each industry.

A Study on the Human Auditory Scaling (인간의 청각 척도에 관한 고찰)

  • Yang, Byung-Gon
    • Speech Sciences
    • /
    • v.2
    • /
    • pp.125-134
    • /
    • 1997
  • Human beings can perceive various aspects of sound including loudness, pitch, length, and timber. Recently many studies were conducted to clarify complex auditory scales of the human ear. This study critically reviews some of these scales (decibel, sone, phon for loudness perception; mel and bark for pitch) and proposes to apply the scales to normalize acoustic correlates of human speech. One of the most important aspects of human auditory perception is the nonlinearity which should be incorporated into the linear speech analysis and synthesis system. Further studies using more sophisticated equipment are desirable to refine these scales, through the analysis of human auditory perception of complex tones or speech. This will lead scientists to develop better speech recognition and synthesis devices.

  • PDF

A Study of Comparison Between Green Building Certification Criteria and Ecological Area Rate System in Apartment Housing (공동주택을 중심으로 친환경 건축물 인증제도와 생태면적율 제도에 대한 비교연구)

  • Kim, Chul;Lim, Tae-Sub;Kim, Byung-Seon
    • Proceedings of the SAREK Conference
    • /
    • 2008.06a
    • /
    • pp.1291-1296
    • /
    • 2008
  • Recently, ecological area rate system become effective due to enlargement of recognition and high intellectual standard and for ecological circulation in urban areas. Ecological area rate system is to control environmental quality of life what has grown worse in urban districts and corresponds to purposes of green building certification criteria for environmentally sound and sustainable development. Therefore, purposes of this study are to present suggestions through research of theory and comparison between ecological area rate system and green building certification criteria.

  • PDF

Wearable system for sound visualization and disaster alarm for the Hearing-Impaired (청각장애인을 위한 사운드-시각화 및 재난 경보 웨어러블 시스템)

  • Lee, Se-Hoon;Kong, Jin-yong;Yeom, Dae-hoon;Kang, Eun-ho;Baek, Yong-Tae
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2017.07a
    • /
    • pp.257-258
    • /
    • 2017
  • 본 논문에서는 청각 장애인들은 시각에 의존하지 않고는 소리를 인지할 수 없다는 문제를 해결하기 위해 사운드를 시각화하는 웨어러블 시스템을 구현하였다. 시스템의 음성 인식 센서가 음성을 인식해 웨어러블 디스플레이에 전송된 메시지를 확인하고, 기상 재난 메시지를 웨어러블에서 실시간으로 확인하여 안전사고를 예방할 수 있게 하여 청각장애인의 어려움을 해결하였다.

  • PDF

Deep Learning based Music Classification System (딥러닝 기반의 음원검색 및 분류 시스템)

  • Lee, Sei-Hoon;Jeong, Ui-Jung
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2018.07a
    • /
    • pp.119-120
    • /
    • 2018
  • 본 논문에서는 음악을 듣고 어떤 음악인지 인식하고 판별하는 음원분류 시스템과 해당 기술 구현을 딥러닝을 통해 적용하도록 제안하였다. 제안한 시스템은 인공심층신경망을 통해 음원파일을 여러 음원 특징 추출 모델에 따라 검출된 특징들을 학습하여 해당 음원의 고유한 보컬이나 반주의 특색 등을 찾아내어 이를 인식할 수 있도록 구현하였다. 이를 통해, 기존의 Fingerprint 방식의 데이터베이스 검색 시스템과는 다른 접근방식으로 보다 사람이 음악을 기억하는 방법에 가깝도록 구현하여 능동성과 유연성을 개선하고 다양한 응용분야로 활용할 수 있는 시스템을 제안하였다.

  • PDF

Design of direction control system for camera, Using sound source recognition and delay time. (음원인식 및 지연시간을 이용한 카메라의 방향제어 시스템 설계)

  • Lee, Hui-Tae;Kim, Young-Sub
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.11a
    • /
    • pp.1076-1078
    • /
    • 2017
  • 본 연구는 이상음원(비명, 유리창 파손음, 경적소리 등) 발생 시, 2개의 마이크로폰에 입력되는 사운드에 대하여 음원 방향추적 장치와 연결된 카메라에 음원의 방향 정보를 전송함으로써, 카메라의 View Point를 음원 발생방향으로 이동시켜 사고현장을 더욱 신속하게 대처할 수 있는 시스템에 대한 연구이다. 일반적인 음성을 이용한 감시카메라는 단순히 소리 발생 여부만 감지하지만, 본 시스템은 이상음원 발생 지점으로 카메라의 방향 제어를 가능하게 한다. 이상음원의 검출은 기존에 수집한 DB를 기반으로 비교, 분석 과정을 통하여 이상음원을 분류한다. 음원 발생 방향은 음원 발생 시, 마이크로폰에 도달하는 음원의 시간차에 따른 음파의 위상차를 계산하여 음원 발생 방향을 판단하게 된다.

Dysarthric speaker identification with different degrees of dysarthria severity using deep belief networks

  • Farhadipour, Aref;Veisi, Hadi;Asgari, Mohammad;Keyvanrad, Mohammad Ali
    • ETRI Journal
    • /
    • v.40 no.5
    • /
    • pp.643-652
    • /
    • 2018
  • Dysarthria is a degenerative disorder of the central nervous system that affects the control of articulation and pitch; therefore, it affects the uniqueness of sound produced by the speaker. Hence, dysarthric speaker recognition is a challenging task. In this paper, a feature-extraction method based on deep belief networks is presented for the task of identifying a speaker suffering from dysarthria. The effectiveness of the proposed method is demonstrated and compared with well-known Mel-frequency cepstral coefficient features. For classification purposes, the use of a multi-layer perceptron neural network is proposed with two structures. Our evaluations using the universal access speech database produced promising results and outperformed other baseline methods. In addition, speaker identification under both text-dependent and text-independent conditions are explored. The highest accuracy achieved using the proposed system is 97.3%.