• Title/Summary/Keyword: Listener


The Continuous Speech Recognition with Prosodic Phrase Unit (운율구 단위의 연속음 인식)

  • 강지영;엄기완;김진영;최승호
    • The Journal of the Acoustical Society of Korea / v.18 no.8 / pp.9-16 / 1999
  • Generally, a speaker structures utterances clearly by grouping words into phrases. This helps the listener recover the meaning of the utterance and the speaker's intention. For this purpose, a speaker uses, among other things, prosodic information such as intonation, pause, duration, and intensity. The research described here concerns the relationship between prosodic information and the strength of prosodic boundaries in spoken utterances as perceived by untrained listeners (perceptual boundary strength, PBS). In this paper, perceptual boundary strength is used synonymously with prosodic boundary strength. We devised a rule for determining prosodic boundaries and verified the usefulness of the prosodic phrase as a recognition unit. Experimental results showed that speech recognition (SR) performance improved in both recognition rate and recognition time compared with using sentences as the recognition unit. In future work, we will propose methods that estimate more appropriate boundaries and study a wider variety of prosody-assisted SR methods.


Acoustic Variation Conditioned by Prosody in English Motherese

  • Choi, Han-Sook
    • Phonetics and Speech Sciences / v.2 no.1 / pp.41-50 / 2010
  • The current study explores acoustic variation induced by prosodic contexts in different speech styles, with a focus on motherese, or child-directed speech (CDS). The patterns of variation in the acoustic expression of the voicing contrast in English stops, and the role of prosodic factors in governing such variation, are investigated in CDS. Prosody-induced acoustic strengthening reported for adult-directed speech (ADS) is examined in speech directed to infants at the one-word stage. The target consonants are collected from utterance-initial and utterance-medial positions, with or without focal accent. Overall, CDS shows that the prosodic prominence of constituents under focal accent varies in the acoustic correlates of the stop laryngeal contrasts. Enhanced acoustic values are not found in initial position in the current study, which is similar to the findings from ADS (Choi, 2006; Cole et al., 2007). Individualized statistical results, however, indicate that the effect of accent on the acoustic measures is less robust than the effect of accent in ADS. Enhanced distinctiveness under focal accent is observed in the acoustic measures of only a limited number of subjects in CDS. The results indicate dissimilar strategies for marking prosodic structure in different speech styles, as well as a consistent prosodic effect across speech styles. The stylistic variation is discussed in relation to the listener undergoing linguistic development in CDS.


The effects of Sijo, Korean short lyric song on calm impatience is on YouTube (https://youtu.be/__Ua6p9S0o8) sung by Wol-ha Kim

  • Ko, Kyung Ja
    • CELLMED / v.7 no.3 / pp.11.1-11.3 / 2017
  • The aim of this article is to argue that listening to Sijo is a valuable tool, and in the author's view the best way, to calm impatience. Sijo refers to slow and mellow music in the family of Han Ak (Korean music, 韓樂). The term slow is a revered keyword in our culture. "Slow" is a blank word in Han Ak (Korean music, 韓樂). The soul of Wol-ha Kim's Sijo is a beauty of space and easiness. Therefore, her voice will help relax the muscles of the listener and calm the soul. It is akin to the struggles of modern people who compete excessively for something but end up with nothing. We often find that gentle jogging is better than sprinting. Slow music is thus good for one's health. For example, we know that our skin can become beautiful, and that real beauty can be obtained, only when the body and mind are at rest and in comfort. Physical appearance depends on a healthy mind and body. The author believes that Sijo used in music therapy is good for our mental health. If its effectiveness is confirmed by tests on animals and humans in an experimental study, this type of music could be used to treat patients with psychological illnesses.

Service Platform of Grid Systems for Ubiquitous Multimedia Applications (유비쿼터스 멀티미디어 응용을 위한 그리드 시스템의 서비스 플랫폼)

  • Park Eun-jeong;Shin Heon-shik
    • The Journal of Korean Institute of Communications and Information Sciences / v.31 no.1B / pp.9-18 / 2006
  • Advances in wireless networks are enabling the development of ubiquitous multimedia services. These multimedia services need efficient platforms to meet the requirements of mobile computing. We introduce an adaptive service platform based on mobile agents and grid systems, specifying the challenges of ubiquitous multimedia services and focusing on frequent disconnections and scarce resources. We applied our platform to the RtoA (Ready-to-Attend) framework, which supports mobile users in accessing compute-intensive multimedia services, specifically mobile education and video conferencing. RtoA includes hand-off, speaker, and listener services, which enable people to attend a conference or a class with satisfactory multimedia quality. An ns-2-based simulation verifies that our scheme is an efficient way to reduce the energy consumption of mobile devices and to improve the response time of mobile applications.

A wireless sensor network approach to enable location awareness in ubiquitous healthcare applications

  • Singh, Vinay Kumar;Lim, Hyo-Taek;Chung, Wan-Young
    • Journal of Sensor Science and Technology / v.16 no.4 / pp.277-285 / 2007
  • In this paper, we outline the research issues that we are pursuing toward building location-aware environments, mainly for ubiquitous healthcare applications. Such location-aware applications can report what is happening in a given space. To locate an object, such as a patient or an elderly person, active ceiling-mounted reference beacons were placed throughout the building. The reference beacons periodically publish location information on RF and ultrasonic signals so that applications running on mobile or static nodes can determine their physical location. Once the passive listener carried by the object receives this information, it determines its location relative to the reference beacons. The cost of the system was reduced, while the accuracy in our experiments was fairly good and fine-grained, between 7 and 12 cm, for location awareness in indoor environments using only sensor nodes and wireless sensor network technology. The passive architecture used here protects user privacy, while at the server privacy is secured by authentication using the Geopriv approach. The information from the sensor nodes is forwarded to a base station, where further computation is performed to determine the current position of the object.
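
The abstract does not say how the listener turns beacon signals into a position. A minimal sketch follows, assuming the common approach in which the delay between the RF and ultrasonic arrivals gives a range to each beacon, and three ranges are combined by trilateration in the horizontal plane. The function names, the speed-of-sound constant, and the assumption that ranges are already projected onto the floor plane are illustrative and not taken from the paper.

```python
SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at roughly 20 degrees C


def distance_from_arrival_gap(gap_s):
    """Range to a beacon from the delay between RF and ultrasonic arrival.

    The RF travel time is treated as negligible (an assumption).
    """
    return SPEED_OF_SOUND_M_S * gap_s


def locate_2d(beacons, distances):
    """Estimate (x, y) from three beacon positions and horizontal ranges."""
    (x1, y1), (x2, y2), (x3, y3) = beacons
    d1, d2, d3 = distances
    # Subtracting the first circle equation from the other two
    # leaves a 2x2 linear system in x and y.
    a11, a12 = 2 * (x2 - x1), 2 * (y2 - y1)
    a21, a22 = 2 * (x3 - x1), 2 * (y3 - y1)
    b1 = d1**2 - d2**2 + x2**2 - x1**2 + y2**2 - y1**2
    b2 = d1**2 - d3**2 + x3**2 - x1**2 + y3**2 - y1**2
    det = a11 * a22 - a12 * a21
    if abs(det) < 1e-12:
        raise ValueError("beacons are collinear; position is ambiguous")
    # Cramer's rule for the 2x2 system.
    x = (b1 * a22 - a12 * b2) / det
    y = (a11 * b2 - a21 * b1) / det
    return x, y


if __name__ == "__main__":
    beacons = [(0.0, 0.0), (4.0, 0.0), (0.0, 3.0)]
    ranges = [2 ** 0.5, 10 ** 0.5, 5 ** 0.5]  # ranges to the point (1, 1)
    print(locate_2d(beacons, ranges))
```

With noisy ranges or more than three beacons, a least-squares fit would be the usual replacement for the exact 2x2 solve.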

The effect of head movement on HRTF in 3D sound system: Sensitivity analysis on Sphere HRTF (머리움직임이 입체음향 시스템의 머리전달함수에 미치는 영향: 구 머리전달함수의 민감도해석)

  • 김선민;박영진
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2002.05a / pp.353-358 / 2002
  • Human vision is mostly confined to the area in front, and we depend heavily on hearing to gather information from areas out of sight. Thus, a virtual reality system with 3D sound effects gives the user a much better sense of reality than a system without them. Virtual 3D sound technology has mainly been researched with binaural systems. Conventional binaural sound systems reproduce the desired sound at two arbitrary points in 3-D space using two channels. Head movement of the listener may change the nominal acoustic transfer function and degrade the performance of a loudspeaker-based 3D sound system, which needs a crosstalk canceller. In this paper, two kinds of sensitivity functions of the sphere HRTF are derived to investigate the effect of head movement on the HRTF in a 3D sound system. Changes in the HRTF caused by rotational and translational motion of the head are obtained by calculating the derivatives of the HRTF with respect to angle and distance.
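
As a rough illustration of what derivatives with respect to angle and distance provide, the first-order change in the HRTF for a small head movement can be written as below. The notation is assumed here, not taken from the paper: $H(\omega, \theta, r)$ is the sphere HRTF at frequency $\omega$, source angle $\theta$, and source distance $r$.

```latex
\Delta H(\omega) \approx
  \frac{\partial H(\omega,\theta,r)}{\partial \theta}\,\Delta\theta
  + \frac{\partial H(\omega,\theta,r)}{\partial r}\,\Delta r
```

Under this reading, the two partial derivatives play the role of the rotational and translational sensitivity functions.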


A Case Study on Pedagogical Tasks in Mathematics Curriculum Integrating Dynamic Manipulation Environments and the Role of a Teacher (동적조작 환경이 융합된 수학교과과정에서의 교수-학습 과제 사례 분석과 교사의 역할)

  • Hong, Seong-Kowan
    • School Mathematics / v.11 no.2 / pp.281-299 / 2009
  • In this paper, we show how dynamic manipulation environments can be integrated into the mathematics curriculum by presenting some pedagogical tasks built with dynamic manipulation. These examples are designed to produce meaningful definitions through inductive experiments, to strengthen thinking about continuity through visualization, to do mathematics through investigation and discovery, and to strengthen the ability to pose and generalize problems. Through these examples, students can observe how mathematics is invented and can experience solving mathematical problems using physical experiments in dynamic manipulation environments. When dynamic manipulation is integrated into the teaching and learning of mathematics, some difficulties can arise. To resolve such difficulties, a teacher must play the role of a co-worker of the students in addition to the role of a scaffolder, coach, or close listener.


Effects of base token for stimuli manipulation on the perception of Korean stops among native and non-native listeners

  • Oh, Eunjin
    • Phonetics and Speech Sciences / v.12 no.1 / pp.43-50 / 2020
  • This study investigated whether listeners' perceptual patterns varied according to the base token selected for stimulus manipulation. Voice onset time (VOT) and fundamental frequency (F0) values were orthogonally manipulated, each in seven steps, using naturally produced words that contained a lenis (/kan/) and an aspirated (/kʰan/) stop in Seoul Korean. Both native and non-native groups gave significantly more aspirated responses to the stimuli constructed from /kʰan/, evidencing the use of minor cues left in the stimuli after manipulation. For the native group, the use of the VOT and F0 cues in stop categorization did not differ depending on whether the base token contained the lenis or the aspirated stop. This indicates that the results remain tenable for previous studies that investigated the relative importance of the acoustic cues in native listeners' perception of the Korean stop contrasts using a single base token for stimulus manipulation. For the non-native group, the pattern of use of the F0 cue differed as a function of the base token selected. Some findings indicated that listeners used alternative cues to identify the stop contrast when the major cues sounded ambiguous. The use of the manipulated VOT and F0 cues by the non-native group was not native-like, suggesting that non-native listeners may have perceived the minor cues as stable in the context of the manipulated cue combinations.
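
To make the orthogonal design concrete, the sketch below builds a seven-by-seven grid in which every VOT step is paired with every F0 step, giving 49 stimuli per base token. The value ranges and the linear spacing are hypothetical placeholders; the abstract reports only that each cue had seven steps.

```python
from itertools import product


def linear_steps(start, stop, n):
    """Return n evenly spaced values from start to stop inclusive."""
    return [start + i * (stop - start) / (n - 1) for i in range(n)]


def build_stimulus_grid(vot_range_ms, f0_range_hz, steps=7):
    """Cross every VOT step with every F0 step (an orthogonal design)."""
    vot_values = linear_steps(vot_range_ms[0], vot_range_ms[1], steps)
    f0_values = linear_steps(f0_range_hz[0], f0_range_hz[1], steps)
    return list(product(vot_values, f0_values))


if __name__ == "__main__":
    # Hypothetical ranges, chosen only for illustration.
    grid = build_stimulus_grid((20.0, 80.0), (180.0, 240.0))
    print(len(grid), grid[0], grid[-1])
```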

An Efficient Crosstalk Cancellation Algorithm Using Pole-zero Dewarping (Pole-zero Dewarping을 이용한 효율적인 Crosstalk 제거 알고리듬)

  • Lee Junho;Park Young-cheol;Youn Dae-hee;Jeong Jae-woong
    • The Journal of the Acoustical Society of Korea / v.24 no.3 / pp.133-140 / 2005
  • The purpose of a crosstalk canceller in a stereo audio reproduction system is to deliver the desired signals exactly at the listener's ears. Generally, it performs poorly in low frequency bands. Frequency-warped filters are used to improve crosstalk canceller performance in these bands. However, such filters are more complex to implement than conventional filters. This paper presents an efficient method for low-order IIR approximation of frequency-warped crosstalk cancellation filters using pole-zero dewarping. The method preserves the advantages of frequency warping but has a computational complexity similar to that of the conventional method. This paper also presents a series of experiments that validate the proposed crosstalk cancellation method.
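
The core idea of pole-zero dewarping can be sketched as a change of variable applied to each pole and zero. The sketch assumes the common convention in which a warped filter replaces each unit delay with the first-order all-pass section (z^-1 - lam) / (1 - lam z^-1), so a root p in the warped domain maps to (p + lam) / (1 + lam p) in the ordinary z-plane. The sign convention, the warping parameter name lam, and the function names are assumptions; the abstract does not give the paper's formulation. Gain correction and any mismatch between the numbers of poles and zeros are not handled here.

```python
def dewarp_root(root, lam):
    """Map one pole or zero from the warped domain to the ordinary z-plane.

    Assumes the warped filter uses (z^-1 - lam) / (1 - lam z^-1) as its delay.
    """
    return (root + lam) / (1 + lam * root)


def dewarp_roots(roots, lam):
    """Apply the dewarping map to a list of poles or zeros."""
    return [dewarp_root(r, lam) for r in roots]


if __name__ == "__main__":
    lam = 0.5  # hypothetical warping parameter, |lam| < 1
    warped_poles = [0.5 + 0.3j, 0.5 - 0.3j]
    for pole in dewarp_roots(warped_poles, lam):
        # A stable warped pole should remain inside the unit circle.
        print(pole, abs(pole) < 1.0)
```

Because the map sends the unit circle to itself when lam is real with magnitude below one, stable poles stay stable, and the resulting filter runs as an ordinary IIR structure.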

Environment Adaptive Sound Localization for Multi-Channel Surround Sound System

  • Lee, Yoon Bae;Mariappan, Vinayagam;Cho, Juphil;Lee, Seon Hee
    • International journal of advanced smart convergence / v.5 no.4 / pp.21-25 / 2016
  • Recent developments in multi-channel surround sound are emerging in various formats to provide better stereoscopic and sound effects to consumers in broadcasting. The ability to localize sound sources in space is the most important design factor in a multi-channel surround system based on a model of human hearing perception. This paper proposes changing the sound localization according to the spacing of the speakers, which is not covered by existing research on sound system design. Present sound systems use the position and number of the speakers to localize sound. In the multi-channel surround environment, the proposed design accounts for the sound localization that results from the directional characteristics of the speaker, the distance between the speakers, and the distance between the listener and the speaker according to the directivity. The proposed design is simulated using virtual measurement in a MATLAB simulation environment, and its performance is measured.