• Title/Summary/Keyword: Arabic speech

Search Result 14, Processing Time 0.019 seconds

Arabic-Numerals to Korean Transliteration Disambiguation using BERT (BERT를 이용한 숫자-한국어 음역 모호성 해소)

  • Park, Jeong Yeon;Yuk, Dae Bum;Lee, Jae Sung
    • Annual Conference on Human and Language Technology
    • /
    • 2020.10a
    • /
    • pp.42-44
    • /
    • 2020
  • TTS(Text-to-Speech) 시스템을 위해서는 한글 이외의 문자열을 한글로 변환해줄 필요가 있다. 이러한 문자열에는 숫자, 특수문자 등의 문자열이 포함되어 있다. 특히 숫자의 경우, 숫자가 사용되는 문맥에 따라 그 발음방법이 달라지는 문제점이 있다. 본 논문에서는 기존의 규칙기반과 한정된 문맥 정보만을 활용할 수 있는 방법이 아닌, 딥러닝을 이용한 방법으로 문맥에 따라 발음방법이 달라지는 숫자 음역의 모호성을 해소하는 방법을 소개한다.

  • PDF

Compensation in VC and Word

  • Yun, Il-Sung
    • Phonetics and Speech Sciences
    • /
    • v.2 no.3
    • /
    • pp.81-89
    • /
    • 2010
  • Korean and three other languages (English, Arabic, and Japanese) were compared with regard to the compensatory movements in a VC (Vowel and Consonant) sequence and word. For this, Korean data were collected from an experiment and the other languages' data from literature. All the test words of the languages had the same syllabic contexture, i.e., /CVCV(r)/, where C was an oral stop and intervocalic consonants were either bilabial or alveolar stops. The present study found that (1) Korean is most striking in the durational variations of segments (vowel and the following hetero-syllabic consonant); (2) unlike the three languages that show a constant sum of VC, Korean yields a three-way distinction in the length of VC according the type (lax unaspirated vs. tense unaspirated vs. tense aspirated) of the following stop consonant; (3) a durational constancy is maintained up to the word level in the three languages, but Korean word duration varies as a function of the feature tenseness of the intervocalic consonants; (4) consonant duration is proven to differentiate Korean the most from the other languages. It is suggested that the durational difference between a lax consonant and its tense cognate(s) and the degree of compensation between V and C are determined by the phonology in each language.

  • PDF

The perception and production of Korean vowels by Egyptian learners (이집트인 학습자의 한국어 모음 지각과 산출)

  • Benjamin, Sarah;Lee, Ho-Young
    • Phonetics and Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.23-34
    • /
    • 2021
  • This study aims to discuss how Egyptian learners of Korean perceive and categorize Korean vowels, how Koreans perceive Korean vowels they pronounce, and how Egyptian learners' Korean vowel categorization affects their perception and production of Korean vowels. In Experiment 1, 53 Egyptian learners were asked to listen to Korean test words pronounced by Koreans and choose the words they had listened to among 4 confusable words. In Experiment 2, 117 sound files (13 test words×9 Egyptian learners) recorded by Egyptian learners were given to Koreans and asked to select the words they had heard among 4 confusable words. The results of the experiments show that "new" Korean vowels that do not have categorizable ones in Egyptian Arabic easily formed new categories and were therefore well identified in perception and relatively well pronounced, but some of them were poorly produced. However, Egyptian learners poorly distinguished "similar" Korean vowels in perception, but their pronunciation was relatively well identified by native Koreans. Based on the results of this study, we argued that the Speech Learning Model (SLM) and Perceptual Assimilation Model (PAM) explain the L2 speech perception well, but they are insufficient to explain L2 speech production and therefore need to be revised and extended to L2 speech production.

Words for Numbers and Transcoding Processes Reflected by ERPs during Mental Arithmetic (수 연산과정에서 ERP로 확인된 숫자어휘와 부호변환 과정)

  • Kim, Choong-Myung;Kim, Dong-Hwee
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.11 no.2
    • /
    • pp.689-695
    • /
    • 2010
  • The effect of the code conversion process of Korean script (Hangul), also known as words for numbers, was investigated using event-related potentials (ERPs) during mental arithmetic operations. Study subjects were asked to determine whether the arithmetic results of a given target stimuli were correctly matched. Visual inspection and statistics of mean ERPs showed stimulus type-dependent processing rather than task-dependent processing. Results of addition and multiplication tasks revealed that the overall temporal profiles of the Arabic numerals were similar to the Hangul words for numbers. The only exception to this observation was a delayed positive-slope peak occurring around 300 ms, which was likely related to the encoding process of Hangul words for numbers to Arabic-digits, defined as a 'transcoding-related potential.' Source analysis confirmed that the topography of different waveforms for the two conditions was attributed to a single dipole located in the left temporo-parietal area; this area is known to be involved in Hangul words for number processing. These results suggest that the initial processing for encoding words for numbers was followed by arithmetic operations without direct access of internal number representation. Korea Academia-Industrial cooperation Society. The Korea Academia-Industrial cooperation Society. The Korea Academia-Industrial cooperation Society. The Korea Academia-Industrial cooperation Society.