• Title/Summary/Keyword: Listener


Indoor Location System based on TDOA between RF and Ultrasonic Signal (RF와 초음파 사이의 TDOA에 기반한 실내 측위시스템)

  • Seo, Young-Dong;Song, Moon-Kyou
    • The Journal of Korean Institute of Communications and Information Sciences / v.34 no.6C / pp.611-618 / 2009
  • Indoor location-aware technology has recently drawn attention as a key technology for context awareness in ubiquitous computing environments. The conventional Cricket system was designed with a non-centralized architecture, which has advantages in terms of user privacy, deployment, scalability, decentralized administration, network heterogeneity, and low cost. This paper designs an indoor location system based on the TDOA between RF and ultrasonic signals that improves on the Cricket system. RF messages are transmitted at 2.4 GHz, which lies in an ISM band. The beaconing frequency is doubled to raise the channel utilization rate. The ultrasonic pulse duration is optimized to extend the coverage of the ultrasonic signals. The calculation of location coordinates is embedded in the listener. The location-update rate and location accuracy are also improved.
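
As a rough illustration of the TDOA idea in the abstract above, a listener can turn the gap between the arrival of an RF message and the arrival of the matching ultrasonic pulse into a distance. This is a minimal sketch and not the paper's implementation: the function name, the speed-of-sound constant, and the assumption that the RF signal arrives effectively instantaneously are all illustrative.

```python
# Assumed speed of sound in dry air at about 20 degrees C (not from the paper).
SPEED_OF_SOUND_M_PER_S = 343.0


def tdoa_distance(t_rf_s: float, t_ultrasound_s: float,
                  speed_of_sound: float = SPEED_OF_SOUND_M_PER_S) -> float:
    """Estimate the beacon-to-listener distance in meters.

    The RF message is treated as arriving instantly, so the delay of the
    ultrasonic pulse relative to the RF message approximates the acoustic
    time of flight.
    """
    delta = t_ultrasound_s - t_rf_s
    if delta < 0:
        raise ValueError("ultrasonic pulse cannot arrive before the RF message")
    return speed_of_sound * delta


if __name__ == "__main__":
    # A pulse arriving 10 ms after the RF message.
    print(tdoa_distance(0.0, 0.010))
```

With distances to three or more beacons at known positions, the listener could then solve for its own coordinates, which is the step the abstract says is embedded in the listener.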

Perceptual, Acoustical, and Physiological Tools in Ataxic Dysarthria Management: A Case Report (운동실조형 마비성구음장애에 적용되는 지각적, 음향학적, 생리학적 도구에 관하여 - 환자사례를 중심으로 -)

  • Kim Hyang Hui
    • Proceedings of the KSPS conference / 1996.02a / pp.9-22 / 1996
  • Among the various dysarthric subtypes, a diagnosis of ataxic dysarthria is made when the speech characteristics include imprecise and irregular articulatory breakdowns, a marked degree of speech rate impairment, overall monopitch and monoloudness, and respiratory-articulatory incoordination. Traditionally, speech pathologists have relied only upon their 'ears' to describe and evaluate dysarthric speech. A statement of the percentage of correct words identified by a listener does not provide much more than an index of severity. Within the same perceptual dimension, a carefully constructed speech intelligibility test can specify patterns of errors. These patterns can have diagnostic value as well as provide strategies for remediation. Phonetically transcribed texts of single words and a standard passage, 'kail', produced by a speaker with ataxic dysarthria are presented in this report, with an emphasis on articulatory error analysis. Furthermore, acoustic tools [e.g., spectrography to measure formant transitions, segment durations, consonant spectra, etc.] are used as basic measures that objectively document the patient's speech intelligibility. Finally, treatment methods [e.g., spectrography as visual feedback, gestural reorganization using a pacing method, DAF (Delayed Auditory Feedback)] to modify the dysarthric behaviors are presented.


Perceptual Characteristics of Korean Consonants Distorted by the Frequency Band Limitation (주파수 대역 제한에 의한 한국어 자음의 지각 특성 분석)

  • Kim, YeonWhoa;Choi, DaeLim;Lee, Sook-Hyang;Lee, YongJu
    • Phonetics and Speech Sciences / v.6 no.1 / pp.95-101 / 2014
  • This paper investigated the effects of frequency band limitation on the perceptual characteristics of Korean consonants. Monosyllabic speech (144 syllables of CV type, 56 syllables of VC type, 8 syllables of V type) produced by two announcers was low- and high-pass filtered with cutoff frequencies ranging from 300 to 5000 Hz. Six listeners with normal hearing performed perception tests for each filter type and cutoff frequency. We report phoneme recognition rates and types of perception error for band-limited Korean consonants to examine how frequency distortion during speech transmission affects listeners' perception. The results showed that recognition rates varied with the following factors: position in a syllable, manner of articulation, place of articulation, and phonation type. Consonants in the final position were more robust to frequency band limitation than those in the initial position. Fricatives and affricates were more robust than stops. Fortis consonants were less robust than their lenis or aspirated counterparts. Types of perception error also varied depending on factors such as the consonant's place of articulation: bilabial stops were perceived as alveolar stops, while alveolar and velar stops changed in phonation type without any change in place of articulation.

Improvement of front/back Sound Localization Characteristics using Psychoacoustics of Head Related Transfer Function (머리전달함수의 심리음향적 특성을 이용한 전/후 음상정위 특성 개선)

  • Koo, Kyo-Sik;Cha, Hyung-Tai
    • Journal of Broadcast Engineering / v.11 no.4 s.33 / pp.448-457 / 2006
  • An HRTF database, which captures information about the sounds arriving at our ears, is generally used to create 3D sound. However, because non-individualized HRTFs do not match each listener, confusion between the front and back directions can degrade the three-dimensional effect. In this paper, we propose a new method that uses psychoacoustic theory to reduce this confusion in sound image localization. The method makes use of the excitation energy perceived by the sense of hearing: it emphasizes the spectral characteristics of the HRTF by extracting the energy ratio in each Bark band. Informal listening tests show that the proposed method improves front/back sound localization considerably more than conventional methods.

A Spatial Audio System Using Multiple Microphones on a Rigid Sphere

  • Lee, Tae-Jin;Jang, Dae-Young;Kang, Kyeong-Ok;Kim, Jin-Woong;Jeong, Dae-Gwon;Hamada, Hareo
    • ETRI Journal / v.27 no.2 / pp.153-165 / 2005
  • The main purpose of a spatial audio system is to give a listener the same impression as if he/she were present in a recorded environment. A dummy head microphone is generally used for such purposes. Because of its human-like shape, we can obtain good spatial sound images. However, its shape is a restriction on its public use and it is difficult to convert a 2-channel recording into multi-channel signals for an efficient rendering over a multi-speaker arrangement. In order to solve the problems mentioned above, a spatial audio system is proposed that uses multiple microphones on a rigid sphere. The system has five microphones placed on special points of the rigid sphere, and it generates audio signals for headphone, stereo, stereo dipole, 4-channel, and 5-channel reproduction environments. Subjective localization experiments show that front/back confusion, which is a common limitation of spatial audio systems using the dummy head microphone, can be reduced dramatically in 4-channel and 5-channel reproduction environments and can be reduced slightly in a headphone reproduction.


Perceptual Vowel Space and Mental Representation of Korean Monophthongs (한국어 단모음의 지각적 모음공간과 심적 표상)

  • Choi, Yang-Gyu
    • Speech Sciences / v.10 no.2 / pp.287-301 / 2003
  • The purpose of this study was to examine whether the same vowel sounds are perceived differently by speakers of two regional dialects, Seoul dialect speakers (SDS) and Kyungnam dialect speakers (KDS), whose vowel systems differ from each other. In the first experiment, SDS and KDS heard vowels synthesized in an F1-by-F2 vowel space and categorized each as one of 10 Korean monophthongs. The results showed that SDS and KDS perceived the synthesized vowels differently. For example, the /ɛ/ versus /e/ contrast, /ø/, and /y/ are differentiated by SDS, whereas they are perceptually confused by KDS. We also observed that /i/ could not be perceived unless the vowel synthesis included F3 or higher formants. In the second experiment, SDS and KDS performed a similarity rating task on 10 synthesized Korean monophthongs. A two-dimensional MDS solution based on the similarity rating scores was obtained for each dialect group. The first dimension can be named 'vowel advancement' and the second 'vowel height'. A comparison of the two MDS solutions showed that the overall psychological distances among the vowels are shorter in KDS than in SDS and that, in particular, the distance between /ʌ/ and /i/ is shorter in KDS than in SDS. The results suggest that the perception or mental representation of vowels depends on the vowel system of the listener's dialect or language. Problems for further research are discussed in the final section.


Indoor Passive Location Tracking and Activity Monitoring using WSN for Ubiquitous Healthcare

  • Singh, Vinay Kumar;Lee, Seung-Chul;Lim, Hyo-Taek;Myllyla, Risto;Chung, Wan-Young
    • Journal of information and communication convergence engineering / v.5 no.4 / pp.382-388 / 2007
  • An indoor location system using wireless sensor network technology was applied to the status evaluation and activity monitoring of elderly persons or chronically ill patients at home. The location awareness application is transparent to daily activities while providing the embedded computing infrastructure with an awareness of what is happening in the space. To locate an object, active ceiling-mounted reference beacons were placed throughout the building. The reference beacons periodically publish location information on RF and ultrasonic signals to allow applications running on mobile or static nodes to learn and determine their physical location. Once the passive listener carried by the object receives the information, it determines its location relative to the reference beacons. By using only the sensor nodes without any external network infrastructure, the cost of the system was reduced, while the accuracy in our experiments was fairly good and fine-grained, between 7 and 15 cm, for location awareness in indoor environments. The passive architecture used here protects user privacy, while at the server privacy was secured by providing authentication using the Geopriv approach. The information from the sensor nodes is forwarded to a base station, where further computation is performed to determine the current position of the object, and several applications are enabled for context awareness.

Design of the Specific IP Access Deny for the Database (데이터베이스에서 지정된 IP 주소 접근 금지를 위한 기능 설계)

  • Jang, Seung-Ju;Kim, Sung-Jin
    • The Journal of Korean Institute of Communications and Information Sciences / v.39C no.8 / pp.716-721 / 2014
  • This paper proposes a way to protect an Oracle DB against attacks from specific IP addresses using a C program environment. An attacker may try to log in to the DB by connecting from a remote IP address. Recently, attackers have used foreign IP addresses and tried to connect to the DB using known DB accounts. As a result, DB data is frequently disclosed. I propose a new idea: developing a module, written in C, that blocks specific IP addresses for the Oracle DB. With this module, the Oracle DB can be used safely.
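
The core check behind such a blocking module can be sketched as a deny-list lookup. The abstract describes a C module for the Oracle DB; the Python sketch below only illustrates the idea and is not the paper's design. The function name, the deny list, and the sample addresses (taken from documentation-reserved ranges) are hypothetical.

```python
import ipaddress

# Hypothetical deny list; these are documentation-reserved ranges, not real attackers.
DENIED_NETWORKS = [
    ipaddress.ip_network("203.0.113.0/24"),
    ipaddress.ip_network("198.51.100.7/32"),
]


def is_connection_allowed(client_ip: str, denied=DENIED_NETWORKS) -> bool:
    """Return False when the client address falls inside any denied network."""
    address = ipaddress.ip_address(client_ip)
    return not any(address in network for network in denied)


if __name__ == "__main__":
    for ip in ("203.0.113.45", "192.0.2.10"):
        print(ip, "allowed" if is_connection_allowed(ip) else "denied")
```

A real deployment would run this check before the DB login is attempted, so that a known account name alone is not enough to connect from a blocked address.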

Speech Quality Measure in a Mobile Communication System Using PLP Cepstral Distance with CMS (심리 음향 켑스트럼 평균 차감법을 이용한 이동 전화망에서의 음질 평가)

  • Yun, J.J.;Park, S.W.;Park, Y.C.;Youn, D.H.;Cha, I.H.
    • Speech Sciences / v.6 / pp.163-179 / 1999
  • For the setup, management, and repair of a mobile communication system, continuous estimation of speech quality is required. Speech quality can be measured by listeners' judgments in a subjective test such as the MOS (Mean Opinion Score) test. However, because this method is laborious, expensive, and time-consuming, it is advisable to predict subjective speech quality via objective measures. This paper presents a robust objective speech quality measure, PLP-CMS (Perceptual Linear Predictive-Cepstral Mean Subtraction), which can predict subjective speech quality in mobile communication systems. PLP-CMS has a high correlation with subjective quality owing to PLP (Perceptual Linear Predictive) analysis, and it is robust to PSTN (Public Switched Telephone Network) channel effects owing to CMS (Cepstral Mean Subtraction). To verify the performance of the proposed algorithm, we carried out subjective and objective quality estimation on speech samples distorted in various ways in a real mobile communication system. The results demonstrate that PLP-CMS has a higher correlation with subjective quality than PSQM (Perceptual Speech Quality Measure) and PLP-CD (Perceptual Linear Predictive-Cepstral Distance).
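
The channel robustness attributed to CMS above comes from a general property: a fixed linear channel adds a roughly constant offset to every cepstral frame, and subtracting the per-coefficient mean over time removes it. The sketch below shows only this generic cepstral mean subtraction step, assuming the cepstral vectors have already been computed. It is not the PLP-CMS measure itself, and the function name and toy values are illustrative.

```python
def cepstral_mean_subtraction(frames):
    """Subtract the per-coefficient mean over all frames from each frame.

    `frames` is a list of equal-length cepstral vectors (lists of floats).
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    means = [sum(frame[k] for frame in frames) / n_frames for k in range(n_coeffs)]
    return [[frame[k] - means[k] for k in range(n_coeffs)] for frame in frames]


if __name__ == "__main__":
    clean = [[1.0, 2.0], [3.0, -2.0]]
    channel_offset = [0.5, -0.25]  # constant offset standing in for a channel effect
    shifted = [[c + o for c, o in zip(frame, channel_offset)] for frame in clean]
    # After CMS the clean and channel-shifted sequences coincide.
    print(cepstral_mean_subtraction(clean))
    print(cepstral_mean_subtraction(shifted))
```

A distance computed between mean-normalized cepstra of the reference and the degraded speech is therefore insensitive to such a constant offset.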


A Range Dependent Structural HRTF Model for 3-D Sound Generation in Virtual Environments (가상현실 환경에서의 3차원 사운드 생성을 위한 거리 변화에 따른 구조적 머리전달함수 모델)

  • Lee, Young-Han;Kim, Hong-Kook
    • MALSORI / no.59 / pp.89-99 / 2006
  • This paper proposes a new structural head-related transfer function (HRTF) model to produce sounds in a virtual environment. The proposed HRTF model generates 3-D sounds by using a head model, a pinna model, and the proposed distance model for azimuth, elevation, and distance, respectively, which are the three aspects of 3-D sound. In particular, the proposed distance model consists of a level normalization block, a distal region model, and a proximal region model. To evaluate the performance of the proposed model, we set up an experimental procedure in which each listener identifies the distance of 3-D sound sources generated by the proposed method at a predefined distance. The tests show that the proposed model yields an average distance error of 0.13 to 0.31 m when the sound source is generated as if it were 0.5 to 2 m away from the listener. This result is comparable to the average distance error of human listening for an actual sound source.
