• Title/Summary/Keyword: cue

Search Result 549, Processing Time 0.025 seconds

Correlation of Acoustic Cues in Stop Productions of Korean and English Adults and Children

  • Kong, Eun-Jong;Weismer, Gary
    • Phonetics and Speech Sciences
    • /
    • v.2 no.4
    • /
    • pp.29-37
    • /
    • 2010
  • Previous studies have investigated a between-category relationship of multiple acoustic cues for a laryngeal contrast by examining the distributions of VOT, f0 and H1-H2. The current study examined within-category correlations between cues comprising stops by Korean- and English-speaking adults and children to understand how children master the internal structure of stop phonation types in two languages. Word-initial stops were collected from about 70 children and 15 adults speaking English and Korean, and were analyzed in terms of VOT, f0 and H1-H2 to compute correlation coefficients. Findings in adults' productions included a gender-differentiated cue-correlation pattern associated with H1-H2 in Korean tense stops and a trading relationship between f0 and VOT in Korean lax and aspirated stops and English voiced and voiceless stops. Children did not necessarily have adult-like cue-correlation patterns even in early-acquired categories, suggesting that the mastery of intra-category structure of phonation type might occur later than inter-category structure.

  • PDF

Images Automatic Annotation: Multi-cues Integration (영상의 자동 주석: 멀티 큐 통합)

  • Shin, Seong-Yoon;Ahn, Eun-Mi;Rhee, Yang-Won
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2010.05a
    • /
    • pp.589-590
    • /
    • 2010
  • All these images consist a considerable database. What's more, the semantic meanings of images are well presented by the surrounding text and links. But only a small minority of these images have precise assigned keyphrases, and manually assigning keyphrases to existing images is very laborious. Therefore it is highly desirable to automate the keyphrases extraction process. In this paper, we first introduce WWW image annotation methods, based on low level features, page tags, overall word frequency and local word frequency. Then we put forward our method of multi-cues integration image annotation. Also, show multi-cue image annotation method is more superior than other method through an experiment.

  • PDF

A Study On the Disguised Voice - From a prosodic point of view - (위장발화에 대한 연구 - 운율적 특성을 중심으로 -)

  • Cho Minha;Nho Seogeun;Song Minkyu;Shin Jiyoung;Kang Sunmee
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.191-195
    • /
    • 2003
  • The aim of this paper is to analyze the phonetic features for disguised voice. In this paper we examined the features such as phonation types, pitch range, speech rate, intonation type and boundary tones etc. So the result of the analysis is as follows. : $\circled1$ Phonation types are very important manner of disguised voice for male subjects. $\circled2$ Pitch range and average of pitch value is very important cue for speaker verification. $\circled3$ pitch contour, speech rate and boundary tones can be a secondary cue for speaker verification.

  • PDF

The Effect of Focus Representation and Intonational Manipulation in Phoneme Detecting (초점 실현과 운율 조작에 대한 음소지각)

  • Kim, Hee-Seung;Shin, Ji-Young;Kim, Kee-Ho
    • MALSORI
    • /
    • no.60
    • /
    • pp.97-108
    • /
    • 2006
  • The purpose of this study is to observe how Korean listeners detect a target phoneme with 'Focus' represented by prosodic prominence and question-induced semantic emphasis, and with intonational manipulation. According to the automated phoneme detection task using E-Prime, the Korean listeners detected phoneme targets more rapidly when the target-bearing words were in prominence position and in question-induced position. However, the presence of question-induced semantic emphasis reduced the prominence effect, so two effects interacted: when question-induced emphasis were primarily given as a cue, prominence which was given as secondary cue affected less to fine the new information. Besides, the intonation with manipulation was responded to faster than without manipulation.

  • PDF

Depth Perception using A Parallel-Axis Stereoscopic Camera Rig

  • Ramesh, Rohit;Shin, Heung-Sub;Jeong, Shin-Il;Chung, Wan-Young
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2010.10a
    • /
    • pp.147-148
    • /
    • 2010
  • Recently, advancement in the visual technology has lead to the further development of the three dimensional (3D) imaging systems. The visual perception to view a pair of images simultaneously, is a crucial factor to build a stereoscopic 3D image. In this paper, we present the depth cues between the intensities of the two images when viewing with both eyes. Due to this stereoscopic effect, objects at different distances from the eyes differ in their horizontal positions, giving the depth cue of horizontal disparity. By simple image processing technique, we also present the binocular disparity map between the two images. A median filter has been used to filter out all the noises occurring in the disparity map image.

  • PDF

Multiple Cues Based Particle Filter for Robust Tracking (다중 특징 기반 입자필터를 이용한 강건한 영상객체 추적)

  • Hossain, Kabir;Lee, Chi-Woo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2012.11a
    • /
    • pp.552-555
    • /
    • 2012
  • The main goal of this paper is to develop a robust visual tracking algorithm with particle filtering. Visual Tracking with particle filter technique is not easy task due to cluttered environment, illumination changes. To deal with these problems, we develop an efficient observation model for target tracking with particle filter. We develop a robust phase correlation combined with motion information based observation model for particle filter framework. Phase correlation provides straight-forward estimation of rigid translational motion between two images, which is based on the well-known Fourier shift property. Phase correlation has the advantage that it is not affected by any intensity or contrast differences between two images. On the other hand, motion cue is also very well known technique and widely used due to its simplicity. Therefore, we apply the phase correlation integrated with motion information in particle filter framework for robust tracking. In experimental results, we show that tracking with multiple cues based model provides more reliable performance than single cue.

Perceptual weighting on English lexical stress by Korean learners of English

  • Goun Lee
    • Phonetics and Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.19-24
    • /
    • 2022
  • This study examined which acoustic cue(s) that Korean learners of English give weight to in perceiving English lexical stress. We manipulated segmental and suprasegmental cues in 5 steps in the first and second syllables of an English stress minimal pair "object". A total of 27 subjects (14 native speakers of English and 13 Korean L2 learners) participated in the English stress judgment task. The results revealed that native Korean listeners used the F0 and intensity cues in identifying English stress and weighted vowel quality most strongly, as native English listeners did. These results indicate that Korean learners' experience with these cues in L1 prosody can help them attend to these cues in their L2 perception. However, L2 learners' perceptual attention is not entirely predicted by their linguistic experience with specific acoustic cues in their native language.

Attentional modulation on multiple acoustic cues in phonological processing of L2 sounds

  • Hyunjung Lee;Eun Jong Kong
    • Phonetics and Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.11-16
    • /
    • 2023
  • The present study examines how a cognitive attention affects Korean learners of English (L2) in perceiving the English stop voicing distinction (/d/-/t/). This study tested the effect of attentional distractor on primary and non-primary acoustic cues, focusing on the role of Voice Onset Time (VOT) and fundamental frequency (F0). Using the dual-task paradigm, 28 Korean adult learners of English participated in the stop identification task carried with (distractor) and without (no-distractor) arithmetic calculation. Results showed that when distracted, Korean learners' sensitivity to VOT decreased as priorly reported with native English speakers. Furthermore, as F0 is a primary cue for a L1 Korean stop laryngeal contrast, its role in L2 English voicing distinction was also affected by a distractor, without compensating for the reduced VOT sensitivity. These findings suggest that flexible use of multiple cues in L1 is not necessarily beneficial for L2 phonological processing when coping with a adverse listening condition.

2D/3D conversion method using depth map based on haze and relative height cue (실안개와 상대적 높이 단서 기반의 깊이 지도를 이용한 2D/3D 변환 기법)

  • Han, Sung-Ho;Kim, Yo-Sup;Lee, Jong-Yong;Lee, Sang-Hun
    • Journal of Digital Convergence
    • /
    • v.10 no.9
    • /
    • pp.351-356
    • /
    • 2012
  • This paper presents the 2D/3D conversion technique using depth map which is generated based on the haze and relative height cue. In cases that only the conventional haze information is used, errors in image without haze could be generated. To reduce this kind of errors, a new approach is proposed combining the haze information with depth map which is constructed based on the relative height cue. Also the gray scale image from Mean Shift Segmentation is combined with depth map of haze information to sharpen the object's contour lines, upgrading the quality of 3D image. Left and right view images are generated by DIBR(Depth Image Based Rendering) using input image and final depth map. The left and right images are used to generate red-cyan 3D image and the result is verified by measuring PSNR between the depth maps.