• Title/Summary/Keyword: 청취 모델

Search Result 51, Processing Time 0.023 seconds

Objectively Quantified Consonance of Complex Sounds (객관적으로 정량화된 복합 신호음의 조화도)

  • Chon, Sang-Bae;Choi, In-Yong;Lee, Min-Gu;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.7
    • /
    • pp.323-327
    • /
    • 2007
  • In this paper, objectively quantified consonance of complex sound is proposed as a new psychoacoustical parameter. Proposing algorithm quantifies consonance of complex sound after applying psycho acoustical models which are parts of human perception such as masking effect, equal loudness contour, and critical band. To verify proposing algorithm, experiments with 10 car horn signals which have different complex sound were performed. The experiments show cross correlation of 0.95 between objectively quantified consonance by proposing algorithm and subjectively assessed consonance by listening tests. Considering the fact that there are few psychoacoustical parameter except Zwicker parameter, proposing algorithm will help to quantify psychoacoustical effect of complex sounds objectively.

A Korean Multi-speaker Text-to-Speech System Using d-vector (d-vector를 이용한 한국어 다화자 TTS 시스템)

  • Kim, Kwang Hyeon;Kwon, Chul Hong
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.3
    • /
    • pp.469-475
    • /
    • 2022
  • To train the model of the deep learning-based single-speaker TTS system, a speech DB of tens of hours and a lot of training time are required. This is an inefficient method in terms of time and cost to train multi-speaker or personalized TTS models. The voice cloning method uses a speaker encoder model to make the TTS model of a new speaker. Through the trained speaker encoder model, a speaker embedding vector representing the timbre of the new speaker is created from the small speech data of the new speaker that is not used for training. In this paper, we propose a multi-speaker TTS system to which voice cloning is applied. The proposed TTS system consists of a speaker encoder, synthesizer and vocoder. The speaker encoder applies the d-vector technique used in the speaker recognition field. The timbre of the new speaker is expressed by adding the d-vector derived from the trained speaker encoder as an input to the synthesizer. It can be seen that the performance of the proposed TTS system is excellent from the experimental results derived by the MOS and timbre similarity listening tests.

A Development of Telephone for the Hearing Impaired to Improve Listening Ability of Telephone Speech (난청인의 통화 청취도 향상을 위한 전화기 개발)

  • 이상민;송철규;이영묵;김원기
    • Journal of Biomedical Engineering Research
    • /
    • v.18 no.4
    • /
    • pp.457-466
    • /
    • 1997
  • We developed a new hearing aid telephone which helps the hearing impaired person to improve the listening ability of telephone speech. Recently, the hearing impaired person and the elderly who has hearing loss have been continuously increased and their desire for participating society as a producer has been increased also. So they strong1y want the hearing aid devices which make compensation fortheir handicap. The hearing aid telephone is one of the basic aid devices that helps the hearing impaired to communicate well with other poeple and to acquire easily useful information through the phone. We analyze the hearing ability of the hearing impaired, design the new model of the hearing aid telephone and test the telephone in three fields-electrical, word perception, user test. Our new tolephone has lour band pass filter channels and the center frequencies of these filters are 500, 1000, 2000, 3000Hz which are considered psychoacoustic factors and telephone line characteristics. The hearing impaired can adjust the total gain characteristics of receiving sound to his hearing ability by setting four volumes in the telelphone. This procedure is called fitting which is a very important factor for the hearing impaired to take meaning of speech. The total gain of this telephone is over 20dB from 250Hz to 3200Hz range. From the results of the tests we certify that our new model is better for the hearing impaired to understand the meaning or telephone speech than the old general models. The next step of developing the hearing aid telephone is to study about compressing sidetone and noise, dividing frequency bands, selecting hearing aid pattern and compensating psychoacoustic loudness. we expect that the advanced hearing aid telephone can be developed by the research about speech perception characteristics of the hearing impaired in engineering and clinical side.

  • PDF

A Clinical Study on Binaural Hearing Aid (양이 보청효과에 관한 연구)

  • 김기령;김영명;심윤주
    • Proceedings of the KOR-BRONCHOESO Conference
    • /
    • 1978.06a
    • /
    • pp.9.2-9
    • /
    • 1978
  • Monaural and binaural hearing aid performance under quiet and noisy conditions were compared in regard to (1) the degree of hearing impairment, (2) the symmetry of pure tone audiogram, (3) the automatic gain control of the hearing aid. (4) hearing impairement with recruitment and, word discrimination ability. Performance using binaural hearing aids was consistently superior to that using monaural hearing aids. The results were as follows. 1. Speech detection thresholds were enhanced by a mean of 4.25dB when tested with danavox 747 PP stereo type hearing aid and by a mean of 4.12 dB when tested hearing aids connected seperately to the right and left ears. 2. Binaurally tested speech reception thresholds were superior to monaurally tested thresholds by a mean of 3.56dB when tested in quiet and by a mean of 5.56dB when tested in noise. 3. Binaurally tested word discrimination scores were also superior by a mean of 17.09% in quiet and by a mean 19.63% in noise. 4. Both SRT and word discrimination scores were performed best by subjects with moderately-severe impairement. The performance by one mildly impaired subject was the poorest of all performances. The levels of performance order were; moderately-severe loss, severe loss. moderate loss and mild loss. 5. The data obtained using AGC aids when compaired with that of linear amplification show that when AGC aids were worn in both ears. the results were very poor but when one AGC aid was worn in one ear and linear amplification in the other. the results were good. 6. The advantages of binaural hearing aids were obvious even in cases 1) with great diferences in hearing thresholds between right and left ears, 2) when the subject was unable to discriminate words without vision and. 3) when the subject had extreme recruitme t phenomenon.

  • PDF

A Basic study on Development of VTS support system by Risk of Collision Model (충돌위험도 모델을 이용한 관제 지원 시스템 개발에 관한 기초 연구)

  • Park, Sangwon;Park, Youngsoo
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2015.07a
    • /
    • pp.18-20
    • /
    • 2015
  • In ports of Korea, the marine traffic flow is congested due to a large number of vessels coming in and going out. In order to improve the safety and efficiency of these vessels, South Korea is operating with a Vessel Traffic System, which is monitoring its waters 24-7. However despite these efforts of the VTS (Vessel Traffic System) officers, marine accidents are occurring in their assigned districts and it is made a danger situation every 20minute. On this paper, we listened to Busan VHF channel for 3days and applied to collision risk model. With collision risk model, We deducted a moment which advise or recommend to vessel. We suggested a collision risk model as VTSO support system.

  • PDF

A study on broadcasting service model of medium wave digital radio (중파 디지털라디오 방송서비스 모델에 관한 연구)

  • Han, Hak-Soo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.4
    • /
    • pp.149-158
    • /
    • 2007
  • A In the rapidly changing media environment, radio needs to be digitized in order to provide regular compatible service. Especially, most medium high frequency Medias have program contents thar are similar to the standard FM bandwidth, so digitizing radio will bring less effect than it is expected. However, the characteristics of radio medias are mobility, individuality, site-to-site, wide service coverage, immediate delivery and publicity, so it is necessary to study the future of radio media to provide continuous services, moreover, the main discussion regarding balanced development between medias will be the trend of the digitization of radio. Also it is easy and will cost less to change compared to other medias. AM was the first media to broadcast in Korea, and its network is spread all over Korea, also the receivers are the most widely distributed which means the signal reaches everywhere in Korea. In this study, the proper service model for AM digital radio is provided in this environment in which all Medias are rapidly digitizing due to the advancement of digital technology. The results of experimental are based on library study and KBS data of sound wave.

  • PDF

A Basic Study on Development of VTS Control Guideline by Collision Risk model based on Ship's Operator's Consciousness (선박운항자 의식 기반 충돌 위험도 모델을 이용한 관제 가이드 라인 개발에 관한 기초 연구)

  • Park, Sang-Won;Park, Young-Soo
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2015.10a
    • /
    • pp.20-23
    • /
    • 2015
  • In ports of Korea, the marine traffic flow is congested due to a large number of vessels coming in and going out. In order to improve the safety and efficiency of these vessels, South Korea is operating with a Vessel Traffic System, which is monitoring its waters 24-7. However despite these efforts of the VTS (Vessel Traffic System) officers, marine accidents are occurring in their assigned districts. VTS Officers are controlling subjectively based on their experience due to no VTS guideline. On this paper, we listened to Busan VHF channel for 3days and applied to collision risk model. With collision risk model, We deducted a moment which advise or recommend to vessel in encounter situation, VTSO's career, day&night.. We suggested a collision risk value as guide line of VTSO's control time.

  • PDF

Development of Text-to-Speech System for PC (PC용 Text-to-Speech 시스템 개발)

  • Choi Muyeol;Hwang Cholgyu;Kim Soontae;Kim Junggon;Yi Sopae;Jang Seokbok;Pyo Kyungnan;Ahn Hyesun;Kim Hyung Soon
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.41-44
    • /
    • 1999
  • 본 논문에서는 PC 응용을 위한 고음질의 한국어 text-to-speech(TTS) 합성 시스템을 개발하였다. 개발된 시스템의 합성방식으로는 음의 고저 조절, 인접음 사이의 연결 처리 및 음색제어 등에서 기존의 PSOLA 방식에 비해 장점을 가지는 정현파 모델 기반의 방식을 채택하였고, 자연스러운 운율 모델링을 위하여 통계적 기법중의 하나인 Classification and regression tree(CART) 방법을 사용하였다. 또한 음소 경계의 불연속성 문제를 줄이기 위한 합성단위로 초성-중성 및 종성 단위를 사용하였고, 다양한 음색표현이 가능하도록 음색제어 기능을 갖추었다. 그리고, 표준 Speech Application Program Interface(SAPI)를 준용한 TTS engine 형태로 구현함으로써 PC 상에서의 응용 프로그램 개발 편의성을 높였다. 합성음의 청취평가 결과 음질의 우수성 및 음색제어 기능의 유효성을 확인할 수 있었다.

  • PDF

Voice/Tone Warning System Design for Military Aircraft (군용 항공기를 위한 음성/톤 경고 시스템 설계)

  • Na, Hana;Kim, Do Gyun
    • Journal of Platform Technology
    • /
    • v.9 no.3
    • /
    • pp.24-35
    • /
    • 2021
  • High-speed military aircraft shall be able to identify and resolve enemy threats or internal component defects with survival equipment and warning systems to minimize casualties. Warning system is divided into visual method with symbolic display and auditory method with communication equipment, which is superior in that they it has a short response time and does not cause pilot confusion by listening to simple messages. Thus, this paper suggested and evaluated effective design methods of voice/tone warning systems for military aircraft based on a life cycle perspective. Since military aircraft is safety-sensitive, priorities and three properties(Inhibitible, Interruptible, and Deactivatable) were applied to each warning to reflect criticality and urgency. As a result, we confirmed that it took 40ms to play the voice warnings, satisfying all requirements through V model-based development and testing, and improving product reliability.

Predicting Dangerous Traffic Intervals between Ships in Vessel Traffic Service Areas Using a Poisson Distribution (푸아송 분포를 이용한 해상교통관제 구역 내 선박 상호간 교통위험 상황의 발생 간격 분석에 관한 연구)

  • Park, Sang-Won;Park, Young-Soo
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.22 no.5
    • /
    • pp.402-409
    • /
    • 2016
  • Vessel traffic servies (VTS) control movements in ports and coastal areas 24 hours a day using VHF. Thus, we were able to check ship movements and the patterns followed by VTS officers in VTS areas using VHF communication analysis. This study is intended to identify control intervals for dangerous situations and provide VTS officers with basic data and guidelines to prevent these occurrences in advance. We listened to Busan port's VHF communication for seven days and obtained risk values using the Park model with reference to controlled ships. The probability of a dangerous situation arising under a controller's watch per unit of time was confirmed to follow a Poisson distribution. As a result, for each 3.50 hours that VTS directly controls an area, (and in daytime for each 2.85 hours) a ship communicates in a VTS area every 3.84 hours, and some of there communications exceed certain risk values in VTS areas.