• Title/Summary/Keyword: 음역

Search Result 67, Processing Time 0.028 seconds

A Retrieval System Using the Automatic Transition of the English-Adopted Words into Transliterations (영어외래어의 음역어 자동변환을 이용한 검색 시스템)

  • Lee, Mi-Ran;Kim, Yang-Taek;Jeun, Hong-Tee;Youn, Sung-Dae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.04b
    • /
    • pp.1073-1076
    • /
    • 2002
  • 정보 검색시 질의어가 외래어일 경우에 검색의 재현율은 급격하게 감소된다. 이는 외래어에서 나오는 음역어의 불일치와 영어외래어, 한글음역어는 같은 색인으로 처리가 되지 않기 때문이다. 따라서 본 논문에서는 영어외래어를 한글음역어로 자동 변환시키고, 자동 변환시에는 영어음소에 해당하는 발음값을 한글음소로 모두 변환시킨 다음 조합하였다. 조합된 음역어들은 다시 동치부류 DB에 저장되어, 질의어 검색시 검색어가 동치부류 색인어로 확장되어 검색된다. 제안한 검색시스템의 성능을 평가하기 위해서 재현율을 측정하였다.

  • PDF

Affective responses to singing voice in different vocal registers and modes (보컬 음역대와 음악 조성에 따른 감상자의 정서반응)

  • Wu, Yingyi;Hyun-Ju Chong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.1
    • /
    • pp.75-82
    • /
    • 2023
  • The purpose of this study was to investigate listener's affective responses to different vocal registers and modes in terms of valence (i.e., negative to positive affect) and arousal (i.e., low to high energy level). The data were collected from four different conditions (i.e., higher and lower registers paired with major and minor modes). A total of 188 female college students participated in the survey online and rated their perceived valence and arousal levels on a visual analogue scale after listening to each excerpt. The two-way analysis of variance (ANOVA) was administered for data analysis. The results revealed that there were significant differences in the affective responses to the two vocal registers, showing that the arousal was more affected by the register than the valence. Secondly, mode had statistically significant impact on both valence and arousal while weighing more on valence. Further, there was significant interaction effect of vocal register and mode on valence, but not on arousal. Results also displayed that listeners had the most negative valence when listening to the excerpt of minor mode in higher register, while having the lowest arousal when listening to the excerpt of minor mode in lower register. These findings imply that it is important to consider the vocal range as well as the musical mode when selecting music for appreciation.

Frequency Range Enhancement for Faster Convergence of Neural Music Source Separation Systems (신경망 기반 음원 분리 시스템의 학습 속도 향상을 위한 음역대 강조 기법)

  • Kim, Min-Seok;Choi, Woo-Sung;Jung, Soon-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.05a
    • /
    • pp.567-569
    • /
    • 2020
  • 여러 악기가 섞여 있는 음원으로부터 원하는 악기 소리를 추출하는 음원 분리 기법 중 최근 신경망 기반 시스템이 활발히 연구되고 있다. 악기마다 고유의 음역대를 가진다는 사실에 감안하여, 연구진은 기존 음원 분리 신경망에 적은 수의 학습 파라미터를 추가하여 학습 속도를 대폭 향상시킬 수 있는 음역대 강조 기법을 제안한다.

Arabic-Numerals to Korean Transliteration Disambiguation using BERT (BERT를 이용한 숫자-한국어 음역 모호성 해소)

  • Park, Jeong Yeon;Yuk, Dae Bum;Lee, Jae Sung
    • Annual Conference on Human and Language Technology
    • /
    • 2020.10a
    • /
    • pp.42-44
    • /
    • 2020
  • TTS(Text-to-Speech) 시스템을 위해서는 한글 이외의 문자열을 한글로 변환해줄 필요가 있다. 이러한 문자열에는 숫자, 특수문자 등의 문자열이 포함되어 있다. 특히 숫자의 경우, 숫자가 사용되는 문맥에 따라 그 발음방법이 달라지는 문제점이 있다. 본 논문에서는 기존의 규칙기반과 한정된 문맥 정보만을 활용할 수 있는 방법이 아닌, 딥러닝을 이용한 방법으로 문맥에 따라 발음방법이 달라지는 숫자 음역의 모호성을 해소하는 방법을 소개한다.

  • PDF

Text Transliteration System and Number Transliteration Disambiguation for TTS (음성합성을 위한 텍스트 음역 시스템과 숫자 음역 모호성 처리)

  • Park, Jeong Yeon;Shin, Hyeong Jin;Yuk, Dae Bum;Lee, Jae Sung
    • Annual Conference on Human and Language Technology
    • /
    • 2018.10a
    • /
    • pp.449-452
    • /
    • 2018
  • TTS(Text-to-Speech)는 문자열을 입력받아 그 문자열을 음성으로 변환하는 음성합성 기술이다. 그러나 실제 입력되는 문장에는 한글뿐만 아니라 영단어 및 숫자 등이 혼합되어 있다. 영단어는 대소문자에 따라 다르게 읽을 수 있으며, 단위로 사용될 때는 약어로 사용되는 것이므로, 알파벳 단위로 읽어서는 안 된다. 숫자 또한 함께 사용되는 단어에 따라 읽는 방식이 달라진다. 본 논문에서는 한글과 숫자 및 단위, 영단어가 혼합된 문장을 분류하고 이를 음역하는 시스템을 구성하며 word vector를 이용한 숫자 및 단위의 모호성 해소방법을 소개한다.

  • PDF

A Study on User Satisfaction with CJK Romanization in the OCLC WorldCat System (도서관 서지정보의 한중일 로마자표기법에 대한 이용자 만족도 연구)

  • Ha, Yoo-Jin
    • Journal of the Korean Society for information Management
    • /
    • v.27 no.2
    • /
    • pp.95-115
    • /
    • 2010
  • The purpose of this study is to investigate how individuals assess Chinese, Japanese, and Korean (CJK) transliterated bibliographic information on current library catalogs. Two separate studies, a survey and an experiment, were conducted using the WorldCat system. Users noted that Romanization has many issues which can inhibit user‘s ability to understand the transliterated bibliographic information even when it is in the person’s own native language and even when the individual had extensive experience with transliteration systems. The experimental results also supported these findings: participants had better results and satisfaction when looking for information written in English than when searching for transliterated information written in their native language. Implications for future research suggests a need to investigate user preferences for translation vs. transliteration of bibliographic information. This study proposes consideration of using English translation as a parallel link with CJK Romanization for bibliographic information.

Interaction of native language interference and universal language interference on L2 intonation acquisition: Focusing on the pitch range variation (L2 억양에서 나타나는 모국어 간섭과 언어 보편적 간섭현상의 상호작용: 피치대역을 중심으로)

  • Yune, Youngsook
    • Phonetics and Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.35-46
    • /
    • 2021
  • In this study, we examined the interactive aspects between pitch reduction phenomena considered a universal language phenomenon and native language interference in the production of L2 intonation performed by Chinese learners of Korean. To investigate their interaction, we conducted an acoustic analysis using acoustic measures such as pitch span, pitch level, pitch dynamic quotient, skewness, and kurtosis. In addition, the correlation between text comprehension and pitch was examined. The analyzed material consisted of four Korean discourses containing five and seven sentences of varying difficulty. Seven Korean native speakers and thirty Chinese learners who differed in their Korean proficiency participated in the production test. The results, for differences by language, showed that Chinese had a more expanded pitch span, and a higher pitch level than Korean. The analysis between groups showed that at the beginner and intermediate levels, pitch reduction was prominent, i.e., their Korean was characterized by a compressed pitch span, low pitch level, and less sentence internal pitch variation. Contrariwise, the pitch use of advanced speakers was most similar to Korean native speakers. There was no significant correlation between text difficulty and pitch use. Through this study, we observed that pitch reduction was more pronounced than native language interference in the phonetic layer.

Vocal Range of Older Adults in Comparison with Young Adults Depending on the Presence of Pitch Cues (음정제시 유무에 따른 노인과 성인의 음역대 비교)

  • Hong, Hyejin;Kim, Soo Ji
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.4
    • /
    • pp.377-386
    • /
    • 2016
  • The purpose of this study was to compare the vocal range between the older and young adults depending on the absence and presence of pitch cues. Participants were 44 older adults aged from 60 to 85 years and 59 college students aged from 19 to 25 years. Each participant was instructed to vocalize for examining the possible highest pitch to the lowest pitch in two conditions; with and without pitch cues. Without pitch cues, the maximum and minimum pitch of male participants was higher in the older adults than in the young adults group, while the minimum pitch of females was lower in the older adults group than in the young adults group. When presented with pitch cues, young adults could expand their vocal range, but the older adults showed no significant changes in produced vocal ranges. The results indicate that the range of voice in older adults may be affected by aging and these results can be beneficial when selecting the appropriate range of song for singing activities with older adults.

Method for 3D Visualization of Sound Data (사운드 데이터의 3D 시각화 방법)

  • Ko, Jae-Hyuk
    • Journal of Digital Convergence
    • /
    • v.14 no.7
    • /
    • pp.331-337
    • /
    • 2016
  • The purpose of this study is to provide a method to visualize the sound data to the three-dimensional image. The visualization of the sound data is performed according to the algorithm set after production of the text-based script that form the channel range of the sound data. The algorithm consists of a total of five levels, including setting sound channel range, setting picture frame for sound visualization, setting 3D image unit's property, extracting channel range of sound data and sound visualization, 3D visualization is performed with at least an operation signal input by the input device such as a mouse. With the sound files with the amount an animator can not finish in the normal way, 3D visualization method proposed in this study was highlighted that the low-cost, highly efficient way to produce creative artistic image by comparing the working time the animator with a study presented method and time for work. Future research will be the real-time visualization method of the sound data in a way that is going through a rendering process in the game engine.