• 제목/요약/키워드: 말소리

검색결과 1,337건 처리시간 0.021초

황순원 「소나기」의 문화융합 콘텐츠 사례를 통해 본 인문학적 상상력 (Humanistic Imagination through the Case of Cultural Convergence Contents of Hwang Soon-won 「Sonagi」)

  • 이내관
    • 한국융합학회논문지
    • /
    • 제9권10호
    • /
    • pp.199-208
    • /
    • 2018
  • 본 논문은 1953년 5월 "신문학"지에 발표된 황순원의 "소나기"가 텔레비전 단막극, 애니메이션, 영화, 뮤지컬, CF, 문학콘서트 등과 같은 다양한 문화융합 콘텐츠로 재생산 되면서 원작과 달라진 부분을 인문학적 상상력 측면에서 고찰한 것이다. TV문학관 <소나기>에서는 원작에 존재하지 않은 인물인 소녀의 '엄마'와 소년의 동생인 '석이'라는 인물을 새롭게 창조하여 스토리를 보다 치밀하게 전개하였고, 애니메이션에서는 주인공의 대화가 말소리로 제시되어 작품의 주제를 효과적으로 드러내었다는 점이 특징적이다. 영화에서 여주인공은 "소나기"의 결말 부분이 마음에 들지 않는다면서 "죽거든 저를 업어준 사내애를 산 채로 같이 묻어 달라"는 독특한 상상력으로 소설을 차용한다. 어린이의 손목시계 겸 휴대폰 기능이 있는 키즈워치 CF에서도 "소나기"가 변용되었다. 또한 뮤지컬 <소나기>에서는 약 2톤 정도의 물을 사용하여 관객들에게 보다 생생함과 현장감을 주었다. 이처럼 소설을 원작으로 한 다양한 문화융합 콘텐츠에서는 원작의 내용이 매체의 특성에 따라 변용되면서 연출가의 독특한 상상력의 세계가 관객과 시청자에게 전달되었다.

<한국어 립씽크를 위한 3D 디자인 시스템 연구> ()

  • 신동선;정진오
    • 한국HCI학회:학술대회논문집
    • /
    • 한국HCI학회 2006년도 학술대회 2부
    • /
    • pp.362-369
    • /
    • 2006
  • 3 차원 그래픽스에 적용하는 한국어 립씽크 합성 체계를 연구하여, 말소리에 대응하는 자연스러운 립씽크를 자동적으로 생성하도록 하는 디자인 시스템을 연구 개발하였다. 페이셜애니메이션은 크게 나누어 감정 표현, 즉 표정의 애니메이션과 대화 시 입술 모양의 변화를 중심으로 하는 대화 애니메이션 부분으로 구분할 수 있다. 표정 애니메이션의 경우 약간의 문화적 차이를 제외한다면 거의 세계 공통의 보편적인 요소들로 이루어지는 반면 대화 애니메이션의 경우는 언어에 따른 차이를 고려해야 한다. 이와 같은 문제로 인해 영어권 및 일본어 권에서 제안되는 음성에 따른 립싱크 합성방법을 한국어에 그대로 적용하면 청각 정보와 시각 정보의 부조화로 인해 지각의 왜곡을 일으킬 수 있다. 본 연구에서는 이와 같은 문제점을 해결하기 위해 표기된 텍스트를 한국어 발음열로 변환, HMM 알고리듬을 이용한 입력 음성의 시분할, 한국어 음소에 따른 얼굴특징점의 3 차원 움직임을 정의하는 과정을 거쳐 텍스트와 음성를 통해 3 차원 대화 애니메이션을 생성하는 한국어 립싱크합성 시스템을 개발 실제 캐릭터 디자인과정에 적용하도록 하였다. 또한 본 연구는 즉시 적용이 가능한 3 차원 캐릭터 애니메이션뿐만 아니라 아바타를 활용한 동적 인터페이스의 요소기술로서 사용될 수 있는 선행연구이기도 하다. 즉 3 차원 그래픽스 기술을 활용하는 영상디자인 분야와 HCI 에 적용할 수 있는 양면적 특성을 지니고 있다. 휴먼 커뮤니케이션은 언어적 대화 커뮤니케이션과 시각적 표정 커뮤니케이션으로 이루어진다. 즉 페이셜애니메이션의 적용은 보다 인간적인 휴먼 커뮤니케이션의 양상을 지니고 있다. 결국 인간적인 상호작용성이 강조되고, 보다 편한 인간적 대화 방식의 휴먼 인터페이스로 그 미래적 양상이 변화할 것으로 예측되는 아바타를 활용한 인터페이스 디자인과 가상현실 분야에 보다 폭넓게 활용될 수 있다.

  • PDF

음성인식 기반 응급상황관제 (Emergency dispatching based on automatic speech recognition)

  • 이규환;정지오;신대진;정민화;강경희;장윤희;장경호
    • 말소리와 음성과학
    • /
    • 제8권2호
    • /
    • pp.31-39
    • /
    • 2016
  • In emergency dispatching at 119 Command & Dispatch Center, some inconsistencies between the 'standard emergency aid system' and 'dispatch protocol,' which are both mandatory to follow, cause inefficiency in the dispatcher's performance. If an emergency dispatch system uses automatic speech recognition (ASR) to process the dispatcher's protocol speech during the case registration, it instantly extracts and provides the required information specified in the 'standard emergency aid system,' making the rescue command more efficient. For this purpose, we have developed a Korean large vocabulary continuous speech recognition system for 400,000 words to be used for the emergency dispatch system. The 400,000 words include vocabulary from news, SNS, blogs and emergency rescue domains. Acoustic model is constructed by using 1,300 hours of telephone call (8 kHz) speech, whereas language model is constructed by using 13 GB text corpus. From the transcribed corpus of 6,600 real telephone calls, call logs with emergency rescue command class and identified major symptom are extracted in connection with the rescue activity log and National Emergency Department Information System (NEDIS). ASR is applied to emergency dispatcher's repetition utterances about the patient information. Based on the Levenshtein distance between the ASR result and the template information, the emergency patient information is extracted. Experimental results show that 9.15% Word Error Rate of the speech recognition performance and 95.8% of emergency response detection performance are obtained for the emergency dispatch system.

An acoustic and perceptual investigation of the vowel length contrast in Korean

  • Lee, Goun;Shin, Dong-Jin
    • 말소리와 음성과학
    • /
    • 제8권1호
    • /
    • pp.37-44
    • /
    • 2016
  • The goal of the current study is to investigate how the sound change is reflected in production or in perception, and what the effect of lexical frequency is on the loss of sound contrasts. Specifically, the current study examined whether the vowel length contrasts are retained in Korean speakers' productions, and whether Korean listeners can distinguish vowel length minimal pairs in their perception. Two production experiments and two perception experiments investigated this. For production tests, twelve Korean native speakers in their 20s and 40s completed a read-aloud task as well as a map-task. The results showed that, regardless of their age group, all Korean speakers produced vowel length contrasts with a small but significant differences in the read-aloud test. Interestingly, the difference between long and short vowels has disappeared in the map task, indicating that the speech mode affects producing vowel length contrasts. For perception tests, thirty-three Korean listeners completed a discrimination and a forced-choice identification test. The results showed that Korean listeners still have a perceptual sensitivity to distinguish lexical meaning of the vowel length minimal pair. We also found that the identification accuracy was affected by the word frequency, showing a higher identification accuracy in high- and mid- frequency words than low frequency words. Taken together, the current study demonstrated that the speech mode (read-aloud vs. spontaneous) affects the production of the sound undergoing a language change; and word frequency affects the sound change in speech perception.

The Interlanguage Speech Intelligibility Benefit (ISIB) of English Prosody: The Case of Focal Prominence for Korean Learners of English and Natives

  • Lee, Joo-Kyeong;Han, Jeong-Im;Choi, Tae-Hwan;Lim, Injae
    • 말소리와 음성과학
    • /
    • 제4권4호
    • /
    • pp.53-68
    • /
    • 2012
  • This study investigated the speech intelligibility of Korean-accented and native English focus speech for Korean and native English listeners. Three different types of focus in English, broad, narrow and contrastive, were naturally induced in semantically optimal dialogues. Seven high and seven low proficiency Korean speakers and seven native speakers participated in recording the stimuli with another native speaker. Fifteen listeners from each of Korean high & low proficiency and native groups judged audio signals of focus sentences. Results showed that Korean listeners were more accurate at identifying the focal prominence for Korean speakers' narrow focus speech than that of native speakers, and this suggests that the interlanguage speech intelligibility benefit-talker (ISIB-T) held true for narrow focus regardless of Korean speakers' and listeners' proficiency. However, Korean listeners did not outperform native listeners for Korean speakers' production of narrow focus, which did not support for the ISIB-listener (L). Broad and contrastive focus speech did not provide evidence for either the ISIB-T or ISIB-L. These findings are explained by the interlanguage shared by Korean speakers and listeners where they have established more L1-like common phonetic features and phonological representations. Once semantically and syntactically interpreted in a higher level processing in Korean narrow focus speech, the narrow focus was phonetically realized in a more intelligible way to Korean listeners due to the interlanguage. This may elicit ISIB. However, Korean speakers did not appear to make complete semantic/syntactic access to either broad or contrastive focus, which might lead to detrimental effects on lower level phonetic outputs in top-down processing. This is, therefore, attributed to the fact that Korean listeners did not take advantage over native listeners for Korean talkers and vice versa.

The Patterns of Vowel Insertion in Korean Speakers' Production of English C+/l/ and C+/r/ Clusters

  • Kang, Seo-Yoon
    • 말소리와 음성과학
    • /
    • 제4권4호
    • /
    • pp.3-17
    • /
    • 2012
  • This study examines Korean speakers' production of English consonant clusters, focusing on vowel insertion. An acoustic analysis along with a statistical test was carried out to see what factors are involved in this production. The following factors were considered in the present study: phonetic properties, L1 transfer, and cluster types. Specifically, liquid types were considered to see if they cause any difference depending on C+/l/ or C+/r/ clusters in the onset in terms of vowel insertion patterns. That is, it was examined which Korean speakers produce better, C+/l/ or C+/r/ clusters. Interestingly, the result of the present experiment shows that the correct answer percent was higher in the C+/r/ onset clusters than C+/l/ onset clusters unlike Eckman's (1977) Marked Differential Hypothesis. In other words, the occurrence of the vowel insertion in C+/l/ clusters is higher than C+/r/ onset clusters. This may be attributed to L1 transfer. Furthermore, in the present study, three patterns of vowel insertion in the C+/l/ clusters were identified by implementing an acoustic analysis based on vowel duration and formant: a) vowel insertion with gemination, b) phonological epenthesis, and c) phonetic intrusion. However, phonetic intrusion mainly occurred in the C+/r/ clusters. Data were collected from 54 Korean speakers to see what factors are involved in vowel insertion patterns in the production of English consonant clusters. This study provides evidence for L1 transfer, the duration effect of /l/ in a different context, and three kinds of vowel insertion patterns in conjunction with gestural coordination by age groups.

성대마비와 성대구증의 강도 변화에 따른 최대발성지속시간 비교 (Comparison of Maximum Phonation Time Associated with the Changes in Vocal Intensity in Patients with Unilateral Vocal Fold Palsy and Sulcus Vocalis)

  • 최세진;최홍식;김재옥;최예린
    • 말소리와 음성과학
    • /
    • 제4권1호
    • /
    • pp.125-131
    • /
    • 2012
  • The patients with incomplete glottic closure have an important feature decreasing the maximum phonation time (MPT) because airflow rate or air leakage is greater than people without voice disorders. Also they can appear a problem in the intensity regulation. This study analyzed MPT difference based on the comfortable intensity and louder intensity and the correlation between MPT and respiration volume of unilateral vocal fold palsy (UVFP) and sulcus vocalis (SV) group. The twenty with UVFP, the 21 with SV, the 21 normal subjects measured MPT in /a/ vowel prolongation task with comfortable intensity and louder intensity and compared analysis by measuring FVC, $FEV_1$, $FEV_1/FVC$ to analyze the correlation between MPT and respiration volume. First, a comparison of MPT according to the intensity between groups is that MPT of the normal group was statistically significant long compared to the patient group in comfortable intensity, but MPT between groups was not statistically significant difference in the louder intensity. Second, an analysis of the correlation between MPT and respiration volume is that this was statistically significant correlation between MPT in comfortable intensity and MPT in louder intensity. But this did not show statistically significant correlation between intensity and respiration volume. This study can be supported the preceding study results deduced that shorting MPT of the patient group compared to the normal group was originated in the problem of laryngeal valving mechanism at the level of vocal folds rather than a problem of respiratory function. Also at the phonation by varying the intensity, the result can deduce that in the case of patient group, the length of MPT had been improved by increasing the glottal closure ratio in the louder intensity. These results can support the theoretical basis that should be applied to the clinicians by varying the intensity at the voice evaluation and voice therapy for the patients with the glottis incompetence.

성별에 따른 주관적 음성문제 인지와 관련 위험 요인 (Gender Differences in Risk Factors of Self-reported Voice Problems)

  • 변해원;황영진
    • 말소리와 음성과학
    • /
    • 제4권1호
    • /
    • pp.99-108
    • /
    • 2012
  • Recent research has identified that self-reported voice problems are a risk indicator for voice disorders. However, previous studies concerning the general population did not take into account the influence of gender on self-reported voice problems. The purpose of the present cross-sectional study was to determine the gender differences in risk factors of self-reported voice problems in the Korean adult population using national survey data. This study utilized data from the Korea National Health and Nutritional Examination Survey 2008. Subjects inclued 3,622 people (1,508 male and 2,114 female) aged 19 years and older living in the community. Data were analyzed using t-test, one-way ANOVA, and multiple logistic regression. The prevalence of self-reported voice problems was 5.9% in males, and 8.1% in females Females had higher incidents of self-reported voice problems than males. Adjusting for covariates, in males, age (OR=2.47, 95% CI: 1.07-5.70), pain and discomfort during the last two weeks (OR=3.64, 95% CI: 2.20-6.01) were independently associated with self-reported voice problems (p<0.05). In women, age (OR=1.96, 95% CI: 1.18-3.26), education (OR=2.09, 95% CI: 1.06-4.12), smoking (OR=2.70, 95% CI: 1.48-4.93), thyroid disorders (OR=2.58, 95% CI: 1.47-4.53), pain and discomfort during the last two weeks (OR=1.75, 95% CI: 1.21-2.54) were independently associated with self-reported voice problem (p<0.05). Self-reported voice problems related risk factors differed according to gender. These findings suggest that there needs to be different program strategies that reflect gender differences in self-reported voice problems.

An Experimental Study of Comfortable Pitch and Loudness with Target Matching: Effects on Electroglottographic and Acoustic Measures

  • 최성희
    • 말소리와 음성과학
    • /
    • 제4권4호
    • /
    • pp.139-146
    • /
    • 2012
  • This study was designed to examine comfort levels of pitch and loudness with target matching and their effects on electroglottographic (EGG) and acoustic measures. Twelve speakers, six males and six females, were instructed to produce /a/ sustained vowel for three seconds at a comfortable pitch and loudness level without any instruction and with a target matching procedure of either a certain f0 or SPL separately with visual and auditory feedback. The range of pitch for females and males were presented by progressing up and down randomly at intervals of 5Hz from 150 Hz to 310 Hz (total 33 frequency targets) and from 85 Hz to 190 Hz (total 22 frequency targets), respectively. The loudness levels were 65, 75, 85, 95 dB (total of four intensity targets) for both males and females. Subjective estimations of comfortable levels were obtained using a 10-point equal-appearing interval rating scale following each phonation. The results showed that males and females demonstrated similar trends in loudness levels with greatest comfort at 75 dB, whereas pitch comfort ratings showed a greater variability with females having a wider range with target matching. In the comfort levels of individuals, most male and female speakers rated higher comfort at soft, rather than loud phonations. On the other hand, most male speakers perceived highest comfort levels below the comfort pitch levels they phonated under natural conditions. Higher frequency ranges, however, were perceived to be more comfortable than those of natural condition in most female speakers, although the comfortable pitch levels in spontaneous phonations were within the comfort level ranges determined by targeted phonations. When comparing acoustic (%jitter, %shimmer, SNR) and EGG measures (CQ%) between spontaneous comfortable phonations and targeted phonations produced by the same subject at similar f0 and intensity, no significant differences were observed (p>0.05). Thus, target matching procedures may be considered a compatible and alternative method to reduce the variability of comfortable pitch and loudness levels by eliciting consistent comfortable phonations.

음성진전 유무에 따른 내전형 연축성 발성장애의 보툴리눔 독소-A 주입 후 음성 특성 변화 양상 (The Aspect of Voice Characteristics Change after Botulinum Toxin-A Injection in Patients with Adductor Spasmodic Dysphonia according to Vocal Tremor)

  • 고혜주;최홍식;임성은;최예린
    • 말소리와 음성과학
    • /
    • 제4권4호
    • /
    • pp.95-107
    • /
    • 2012
  • As BTX-A, which has been known to be the most effective treatment for ADSD, is not effective in treating vocal tremors, voice assessment must be employed to perform differential diagnosis of SD and vocal tremor in an accurate fashion. In this study, the characteristics of vocal changes after botulinum toxin injection were compared by analyzing the voice characteristics resulting from the presence of vocal tremors using objective analysis devices, with the aim of helping to provide prognoses and to determine remedial effects in clinical cases comprising patients with adductor spasmodic dysphonia accompanied by voice tremors. Respiratory function tests, aerodynamic analysis, electroglottography (EGG), acoustic analysis, auditory perception tests, and K-VHI had been conducted at intervals of four, eight, and twelve weeks before and after injection, targeting a group of 17 ADSD female patients (a ADSD group of four with vocal tremor and a ADSD group of 13 without voice tremor). For average FVC and FEV1, the T group showed statistically significant low averages compared with the NT group, whereas the T group showed statistically significant high average ATRI compared with the NT group. In addition, the T group showed a statistically significant Fatr, lower than that of the NT group. For the ADSD group of patients with voice tremor, their vocal tremor remained unchanged despite noticeable decrease in wringing voices. In other words, as the vocal tremor and wringing voices are two distinctive features, there is a need for the two features to be targeted separately for differential diagnosis.