• Title/Abstract/Keywords: phonetic data

Search results: 200 items (processing time: 0.018 seconds)

Secure Blocking + Secure Matching = Secure Record Linkage

  • Karakasidis, Alexandros;Verykios, Vassilios S.
    • Journal of Computing Science and Engineering / Vol. 5, No. 3 / pp.223-235 / 2011
  • Performing approximate data matching has always been an intriguing problem for both industry and academia. This task becomes even more challenging when the requirement of data privacy arises. In this paper, we propose a novel technique to address the problem of efficient privacy-preserving approximate record linkage. The secure framework we propose consists of two basic components. First, we utilize a secure blocking component based on phonetic algorithms that are statistically enhanced to improve security. Second, we use a secure matching component in which the actual approximate matching is performed using a novel private approach to the Levenshtein Distance algorithm. Our goal is to combine the speed of private blocking with the increased accuracy of approximate secure matching.
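
The two-stage idea in this abstract (phonetic blocking first, edit-distance matching second) can be illustrated with a minimal sketch. It is not the authors' protocol: it leaves out every privacy-preserving step and the statistical enhancement, and it assumes classic Soundex as the phonetic algorithm. The function names and the `max_distance` threshold are hypothetical.

```python
from collections import defaultdict


def soundex(name: str) -> str:
    """Classic Soundex code: first letter plus three digits."""
    codes = {}
    for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                           ("L", "4"), ("MN", "5"), ("R", "6")):
        for ch in letters:
            codes[ch] = digit
    name = "".join(ch for ch in name.upper() if ch.isalpha())
    if not name:
        return ""
    result = name[0]
    prev = codes.get(name[0], "")
    for ch in name[1:]:
        digit = codes.get(ch, "")
        if digit and digit != prev:
            result += digit
        if ch not in "HW":  # H and W do not separate letters with equal codes
            prev = digit
    return (result + "000")[:4]


def levenshtein(a: str, b: str) -> int:
    """Edit distance computed row by row with dynamic programming."""
    prev_row = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        row = [i]
        for j, cb in enumerate(b, start=1):
            row.append(min(prev_row[j] + 1,                # deletion
                           row[j - 1] + 1,                 # insertion
                           prev_row[j - 1] + (ca != cb)))  # substitution
        prev_row = row
    return prev_row[-1]


def link(records_a, records_b, max_distance=2):
    """Blocking: compare only names that share a Soundex code.
    Matching: keep pairs whose edit distance is within max_distance."""
    blocks = defaultdict(list)
    for name in records_b:
        blocks[soundex(name)].append(name)
    matches = []
    for name in records_a:
        for candidate in blocks.get(soundex(name), []):
            if levenshtein(name.lower(), candidate.lower()) <= max_distance:
                matches.append((name, candidate))
    return matches


pairs = link(["Jonson", "Smith"], ["Johnson", "Smyth", "Brown"])
```

Blocking keeps the number of edit-distance comparisons small, because a name is only compared against candidates in its own phonetic block. A private variant would have to protect both the block keys and the distance computation, which this sketch does not attempt.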

Government and Derivation in Korean Phonology

  • Park, Hee-Heon;David Michaels
    • Phonetic Society of Korea: Conference Proceedings / Proceedings of the October 1996 Conference of the Phonetic Society of Korea / pp.117-122 / 1996
  • This paper proposes a derivational account of tensing and neutralization of obstruents in Korean within the theory of Government Phonology (GP) (Kaye, Lowenstamm and Vergnaud 1990, henceforth KLV; Park 1996). We begin by outlining the relevant tensing and neutralization data in Korean. We point out several problems that need to be addressed in any account of these data. We then set out the central notions of GP, pointing out how adherence to the requirement that government relations remain constant throughout a derivation under the Projection Principle prevents a GP account of tensing and neutralization in Korean, which requires government relations to switch between lexical and phonetic representations. To address this problem, we propose abandoning the Projection Principle, extending lexical representations in GP along the lines of the Markedness Theory approach (Michaels 1989), and adopting the economy principles for derivation of the Minimalist approach (Chomsky 1993; Chomsky & Lasnik 1991). Finally, we summarize the analysis of obstruent phenomena in Korean within GP extended in these ways.

An Experimental Phonetic Study of Rhythm in Standard Korean

  • 이현복
    • Malsori (Journal of the Phonetic Society of Korea) / No. 25-26 / pp.52-64 / 1993
  • This paper aims to explore the rhythmic phenomena of standard Korean by an experimental phonetic method. A total of 16 informants took part in the experiment, divided into four groups: old males (OM) and old females (OF) in their fifties, and young males (YM) and young females (YF) in their twenties. The informants were asked to read speech data consisting of two rhythmic units, each of which began with a stressed syllable with a long vowel. Starting with the frame /'ma:l 'ma:nta/, the first rhythmic unit was expanded up to five syllables in all while the second rhythmic unit was kept constant, with a view to investigating the pattern of increase in the interstress time interval. The results of this study are as follows: 1. There is a considerable difference between the young and old generations with respect to the duration of the interstress interval. The young generation tends to speak faster than the old generation. This observation is supported by the difference in the interstress intervals exhibited by OM (389.66), OF (473), YM (275.55), and YF (285.83) in the test frame '말 많다' ['ma:l 'ma:nta]. 2. The young and old generations showed different tendencies in the rate of increase in duration between monosyllables and polysyllables. In other words, the rhythm of the young generation tends towards that of a syllable-timed language, whereas that of the old generation clearly leans towards that of a stress-timed language.

ToBI and beyond: Phonetic intonation of Seoul Korean ani in Korean Intonation Corpus (KICo)

  • Ji-eun Kim
    • Phonetics and Speech Sciences / Vol. 16, No. 1 / pp.1-9 / 2024
  • This study investigated the variation in the intonation of the Seoul Korean interjection ani across different meanings ("no" and "really?") and speech levels (Intimate and Polite) using data from the Korean Intonation Corpus (KICo). The investigation was conducted in two stages. First, IP-final tones in the dataset were categorized according to the K-ToBI convention (Jun, 2000). While significant relationships were observed between the meaning of ani and its IP-final tones, substantial overlap between groups was notable. Second, the F0 characteristics of the final syllable of ani were analyzed to elucidate the apparent many-to-many relationships between intonation and meaning/speech level. Results indicated that these seemingly overlapping relationships could be significantly distinguished. Overall, this study advocates for a deeper analysis of phonetic intonation beyond ToBI-based categorical labels. By examining the F0 characteristics of the IP-final syllable, previously unclear connections between meaning/speech level and intonation become more comprehensible. Although ToBI remains a valuable tool and framework for studying intonation, it is imperative to explore beyond these categories to grasp the "distinctiveness" of intonation, thereby enriching our understanding of prosody.

The Patterns of Vowel Insertion in Korean Speakers' Production of English C+/l/ and C+/r/ Clusters

  • Kang, Seo-Yoon
    • Phonetics and Speech Sciences / Vol. 4, No. 4 / pp.3-17 / 2012
  • This study examines Korean speakers' production of English consonant clusters, focusing on vowel insertion. An acoustic analysis along with a statistical test was carried out to see what factors are involved in this production. The following factors were considered in the present study: phonetic properties, L1 transfer, and cluster types. Specifically, liquid types were considered to see whether vowel insertion patterns differ between C+/l/ and C+/r/ clusters in the onset. That is, the study examined which cluster type Korean speakers produce better, C+/l/ or C+/r/. Interestingly, the result of the present experiment shows that the percentage of correct productions was higher for C+/r/ onset clusters than for C+/l/ onset clusters, contrary to Eckman's (1977) Markedness Differential Hypothesis. In other words, vowel insertion occurs more often in C+/l/ onset clusters than in C+/r/ onset clusters. This may be attributed to L1 transfer. Furthermore, in the present study, three patterns of vowel insertion in the C+/l/ clusters were identified by an acoustic analysis based on vowel duration and formants: a) vowel insertion with gemination, b) phonological epenthesis, and c) phonetic intrusion. However, phonetic intrusion mainly occurred in the C+/r/ clusters. Data were collected from 54 Korean speakers to see what factors are involved in vowel insertion patterns in the production of English consonant clusters. This study provides evidence for L1 transfer, the duration effect of /l/ in a different context, and three kinds of vowel insertion patterns in conjunction with gestural coordination by age groups.

Improvement of an Automatic Segmentation for TTS Using Voiced/Unvoiced/Silence Information

  • 김민제;이정철;김종진
    • Malsori (Journal of the Phonetic Society of Korea) / No. 58 / pp.67-81 / 2006
  • For a large corpus of time-aligned data, HMM-based approaches are most widely used for automatic segmentation, providing a consistent and accurate phone labeling scheme. There are two methods for training an HMM. The flat-start method minimizes human intervention, but it has low accuracy. The bootstrap method has high accuracy, but it has the drawback that manual segmentation is required. In this paper, a new algorithm is proposed to minimize manual work and to improve the performance of automatic segmentation. In the first phase, voiced/unvoiced/silence classification is performed for each frame of speech data. In the second phase, the phoneme sequence is aligned dynamically to the voiced/unvoiced/silence sequence according to acoustic-phonetic rules. Finally, using these segmented speech data as a bootstrap, the HMM-based phoneme model parameters are trained. For the performance test, the hand-labeled ETRI speech DB was used. The experimental results showed that our algorithm achieved a 10% improvement in segmentation accuracy within a tolerable error range of 20 ms. For unvoiced consonants in particular, it showed a 30% improvement.
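
The first phase described above labels each frame as voiced, unvoiced, or silence. The abstract does not say which features are used, so the sketch below substitutes a common textbook heuristic (short-time energy plus zero-crossing rate) as an assumption. The function name, frame length, and both thresholds are hypothetical and would need tuning on real data.

```python
import math


def classify_frames(samples, frame_len=160, energy_floor=1e-4, zcr_threshold=0.25):
    """Label each non-overlapping frame as 'S' (silence), 'U' (unvoiced),
    or 'V' (voiced) from its mean energy and zero-crossing rate."""
    labels = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
        zcr = crossings / (frame_len - 1)
        if energy < energy_floor:
            labels.append("S")   # too quiet to be speech
        elif zcr > zcr_threshold:
            labels.append("U")   # noise-like: many sign changes
        else:
            labels.append("V")   # periodic: few sign changes
    return labels


# Synthetic test signal, assuming 8 kHz sampling (160 samples = 20 ms).
silence = [0.0] * 160
voiced = [0.5 * math.sin(2 * math.pi * 100 * n / 8000) for n in range(160)]
unvoiced = [0.1 * (-1) ** n for n in range(160)]  # crude stand-in for frication noise
labels = classify_frames(silence + voiced + unvoiced)
```

In a pipeline like the one in the abstract, a label sequence of this kind would then be aligned with the expected phoneme sequence before HMM training. That alignment step is not shown here.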

Implementation of HMM-Based Speech Recognizer Using TMS320C6711 DSP

  • Bae Hyojoon;Jung Sungyun;Bae Keunsung
    • Malsori (Journal of the Phonetic Society of Korea) / No. 52 / pp.111-120 / 2004
  • This paper focuses on the DSP implementation of an HMM-based speech recognizer that can handle a vocabulary of several hundred words as well as speaker independence. First, we develop an HMM-based speech recognition system on the PC that operates on a frame-by-frame basis, with parallel processing of feature extraction and Viterbi decoding to make the processing delay as small as possible. Many techniques, such as linear discriminant analysis, state-based Gaussian selection, and the phonetic tied mixture model, are employed to reduce the computational burden and memory size. The system is then optimized and compiled on the TMS320C6711 DSP for real-time operation. The implemented system uses 486 kbytes of memory for data and acoustic models, and 24.5 kbytes for program code. A maximum required time of 29.2 ms for processing a 32 ms frame of speech validates real-time operation of the implemented system.
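
The real-time claim can be checked with simple arithmetic on the reported figures. The sketch below assumes that a new 32 ms frame arrives every 32 ms (no frame overlap), so processing keeps up whenever the ratio of processing time to frame duration is below 1. The function name is hypothetical.

```python
def real_time_factor(processing_ms: float, frame_ms: float) -> float:
    """Ratio of worst-case processing time to the audio duration it covers."""
    return processing_ms / frame_ms


rtf = real_time_factor(29.2, 32.0)   # about 0.91, below 1, so real time holds
headroom_ms = 32.0 - 29.2            # about 2.8 ms of slack per frame
total_kbytes = 486 + 24.5            # data and models plus program code: 510.5 kbytes
```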

A Fundamental Phonetic Investigation of Korean Monophthongs

  • 문승재
    • Malsori (Journal of the Phonetic Society of Korea) / No. 62 / pp.1-17 / 2007
  • The purpose of this study was to investigate and quantitatively describe the acoustic characteristics of current Korean monophthongs. Recordings were made of 33 men and 27 women producing the vowels /i, e, ɛ, a, ə, o, u, i/ in a carrier phrase "This character is ___." A listening test was conducted in which 19 participants judged each vowel. F1, F2, and F3 were measured from the vowels judged as the intended vowels by more than 17 people in the listening test. Analysis of the formant data shows some interesting results, including the undeniable confirmation of the 7-vowel system in modern Korean. It turns out that quite different-sounding Korean and English vowels happen to have very similar formant measurements. The difference between "citation-form reading" and "natural utterance reading" is also discussed.

Development of Realtime Phonetic Typewriter

  • 조우연;최두일
    • Korean Institute of Electrical Engineers (KIEE): Conference Proceedings / Proceedings of the KIEE 1999 Autumn Conference, Society Headquarters B / pp.727-729 / 1999
  • We have developed a real-time phonetic typewriter implemented on an IBM PC with a sound card, running Windows 95. In this system, analysis of the speech signal, training of the neural network, labeling of the output neurons, and visualization of the recognition results are performed in real time. The development environment for speech processing is established by adding various functions, such as editing, saving, and loading of speech data and 3-D or gray-level display of spectrograms. Recognition experiments using Korean phones yielded accuracies of 71.42% for 13 basic consonants and 90.01% for 7 basic vowels.

Duration of bodies and rhymes in Korean and English syllables

  • 백은아;노동우;정옥란;강수균
    • Phonetic Society of Korea: Conference Proceedings / Proceedings of the October 2003 Conference of the Phonetic Society of Korea / pp.169-172 / 2003
  • The purpose of this study was to provide preliminary data on the acoustical differences of one-syllable words spoken by speakers with different language backgrounds. Twenty native speakers of Korean and English were asked to read 7 one-syllable words written in their native language. The phonetic and phonemic characteristics of the 7 words were similar between the two languages. The ratios of the duration of the body (onset+nucleus) and of the rhyme (nucleus+coda) to the duration of each syllable were calculated using CSL (Computerized Speech Laboratory). The results correspond to the body-coda structure of the Korean syllable, which is supported by recent experimental psychological studies. More acoustic studies on Korean syllable structure are required to establish a clinical foundation for phonological awareness and reading intervention programs.
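
The two measures named in this abstract are simple proportions of the syllable duration. The sketch below shows the calculation only. The segment durations are made-up illustration values, not data from the study, and the function name is hypothetical.

```python
def body_rhyme_ratios(onset_ms: float, nucleus_ms: float, coda_ms: float):
    """Return (body ratio, rhyme ratio) relative to the whole syllable.
    Body = onset + nucleus; rhyme = nucleus + coda."""
    syllable_ms = onset_ms + nucleus_ms + coda_ms
    body = (onset_ms + nucleus_ms) / syllable_ms
    rhyme = (nucleus_ms + coda_ms) / syllable_ms
    return body, rhyme


# Hypothetical CVC syllable: 80 ms onset, 150 ms nucleus, 70 ms coda.
body_ratio, rhyme_ratio = body_rhyme_ratios(80.0, 150.0, 70.0)
```

Because the nucleus is counted in both units, the two ratios sum to more than 1. Comparing them across languages only makes sense when the word sets are matched, as the study describes.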
