• Title/Summary/Keyword: speech technology


A new feature specification for vowel height (모음 높이의 새로운 표기법에 대하여)

  • Park Cheon-Bae
    • MALSORI / no.27_28 / pp.27-56 / 1994
  • Processes involving changes in vowel height are natural enough to be found in many languages. A better feature specification for vowel height is essential to capture these processes properly. Standard Phonology adopts a binary feature system in which vowel height is represented by two features, [±high] and [±low]. This has its own merits, but it is defective because it is misleading when we count the number of features used in a rule to compare the naturalness of rules. This feature system also cannot represent more than three degrees of height. We will discard the binary features for vowel height. We consider adopting the multivalued feature [n high] for the property of height. However, this feature cannot avoid the arbitrariness resulting from the numeric values denoting vowel height. It is not easy to tell whether the number in question is the largest or not. It is also impossible to decide whether a larger number denotes a higher vowel or a lower vowel. Furthermore, this feature specification requires an ad hoc condition such as n > 3 or n ≥ 2 whenever we want to refer to a natural class including more than one degree of height. The alternative might be Particle Phonology or Dependency Phonology. These might be apt for multivalued vowel height systems, as their supporters argue. However, the feature specification of Particle Phonology will be discarded because it does not strictly observe the assumption that the number of instances of the particle a is decisive in representing height. One a in a representation can denote various degrees of height such as [e], [I], [a], [a ] and [e ]. This also means that we cannot represent natural classes in terms of the number of instances of the particle a. Dependency Phonology also has problems in specifying a degree of vowel height by the dependency relations between the elements. 
There is no unique element to represent vowel height, since every property has to be defined in terms of the dependency relations between two or more elements. As a result, it is difficult to formulate a rule for vowel height change, especially when the phenomenon involves a chain of vowel shifts. Therefore, we suggest a new feature specification for vowel height (see Chapter 3). This specification resorts to a single feature H and a few >'s (angled brackets), which refer exclusively to the degree of tongue height when a vowel is pronounced. It can cope with more than three degrees of height because it is fundamentally a multivalued scalar feature. This feature also obviates the ad hoc condition for a natural class from which the [n high] type of multivalued feature suffers. This feature specification also conforms to our expectation that the notation should become simpler as the generality of the class increases, in that the fewer angled brackets are used, the more vowels are included. Incidentally, it should also be noted that, by adopting a single feature for vowel height, it is possible to formulate simpler versions of rules involving changes of vowel height, especially when they involve the vowel shifts found in many languages.

  • PDF

Place Assimilation in OT

  • Lee, Sechang
    • Proceedings of the KSPS conference / 1996.10a / pp.109-116 / 1996
  • In this paper, I explore the possibility that the nature of place assimilation can be captured in terms of the OCP within Optimality Theory (McCarthy & Prince 1999, 1995; Prince & Smolensky 1993). In derivational models, each assimilatory process would be expressed through a different autosegmental rule. However, what any such model misses is a clear generalization: all of those processes have the effect of avoiding a configuration in which two consonantal place nodes are adjacent across a syllable boundary, as illustrated in (1): (equation omitted). In a derivational model, it is a coincidence that across languages there are changes that modify a structure of the form (1a) into another structure that does not have adjacent consonantal place nodes (1b). OT allows us to express this effect through the constraint given in (2), which forbids adjacent place nodes: (2) OCP(PL): Adjacent place nodes are prohibited. At this point, a question arises as to how consonantal and vocalic place nodes are formally distinguished in the output for the purpose of applying the OCP(PL). Besides, the OCP(PL) would equally affect complex onsets and codas as well as coda-onset clusters in languages that have them, such as English. To remedy this problem, following McCarthy (1994), I assume that the canonical markedness constraint is a prohibition defined over no more than two segments, $\alpha$ and $\beta$: that is, $^{*}\{\alpha, \beta\}$ with appropriate conditions imposed on $\alpha$ and $\beta$. I propose the OCP(PL) again in the following format: (3) OCP(PL) (table omitted). $\alpha$ and $\beta$ are the target and the trigger of place assimilation, respectively. The '*' is a reminder that, in this format, constraints specify negative targets or prohibited configurations. Any structure matching the specifications is in violation of this constraint. 
Now, in correspondence terms, the meaning of the OCP(PL) is this: the constraint is violated if a consonantal place $\alpha$ is immediately followed by a consonantal place $\beta$ on the surface. One advantage of this format is that the OCP(PL) can also be invoked in dealing with place assimilation within a complex coda (e.g., sink [siŋk]): we can make the constraint scan the consonant clusters only, excluding any intervening vowels. Finally, onset clusters typically do not undergo place assimilation. I propose that onsets be protected by a constraint which ensures that the coda, not the onset, loses the place feature.

  • PDF
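The violation condition described in the abstract above (a consonantal place immediately followed by another consonantal place on the surface) can be sketched as a small counting function. This is only an illustrative sketch, not the author's formalization: the `PLACE` table, the function name `ocp_pl_violations`, and the simplification that two adjacent consonants with the same place share a single place node (and so do not violate the constraint) are all assumptions made here.

```python
# Hypothetical toy inventory: place of articulation for a few consonants.
# Vowels are absent from the table, so they carry no consonantal place.
PLACE = {
    "p": "labial", "b": "labial", "m": "labial",
    "t": "coronal", "d": "coronal", "n": "coronal", "s": "coronal",
    "k": "dorsal", "g": "dorsal", "ŋ": "dorsal",
}

def ocp_pl_violations(segments):
    """Count adjacent consonant pairs (alpha, beta) with distinct places.

    Assumption: adjacent consonants with the same place are treated as
    sharing one place node (the output of assimilation), so only pairs
    with different places count as violations of OCP(PL).
    """
    violations = 0
    for alpha, beta in zip(segments, segments[1:]):
        if alpha in PLACE and beta in PLACE and PLACE[alpha] != PLACE[beta]:
            violations += 1
    return violations

# Unassimilated vs. assimilated candidates for "sink".
print(ocp_pl_violations(list("sink")))  # n (coronal) + k (dorsal): 1
print(ocp_pl_violations(list("siŋk")))  # ŋ and k are both dorsal: 0
```

Under these assumptions the assimilated candidate incurs no violation, which is the effect the constraint is meant to capture; the proposed protection of onsets would need an additional, higher-ranked constraint that this sketch does not model.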

INTONATION OF TAIWANESE: A COMPARATIVE OF THE INTONATION PATTERNS IN L1, IL, AND L2

  • Chin Chin Tseng
    • Proceedings of the KSPS conference / 1996.10a / pp.574-575 / 1996
  • The aim of the current study is to examine the intonation of Taiwanese (Tw.) by comparing the intonation patterns in the native language (L1), target language (L2), and interlanguage (IL). Studies on interlanguage have dealt primarily with segments. Although some studies have addressed interlanguage intonation, more often than not they offered no evidence for their claims, and their hypotheses were based mainly on impression. Therefore, a formal description of interlanguage intonation is necessary for further development in this field. The basic assumption of this study is that native speakers of one language perceive and produce a second language in ways closely related to the patterns of their first language. Several studies on interlanguage prosody have suggested that prosodic structure and rules are more subject to transfer than certain other phonological phenomena, given their abstract structural nature and generality (Vogel 1991). Broselow (1988) also shows that interlanguage may provide evidence for particular analyses of the native language grammar, which may not be available from the study of the native language alone. Several research questions are addressed in the current study: A. How does duration vary among native and non-native utterances? The results show that there is a significant difference in duration between the beginning English learners and the native speakers of American English for all eleven English sentences. The mean duration shows that the beginning English learners take almost twice as much time (1.70 sec.) as Americans (0.97 sec.) to produce English sentences. The results also show that American speakers take significantly longer to speak all ten Taiwanese utterances. The mean duration shows that Americans take almost twice as much time (2.24 sec.) as adult Taiwanese (1.14 sec.) to produce Taiwanese sentences. B. 
Does proficiency level influence the performance of interlanguage intonation? Can native intonation patterns be achieved by a non-native speaker? Wenk (1986) considers that proficiency level might be a variable related to the extent of L1 influence. His study showed that beginners do transfer rhythmic features of the L1 and that advanced learners can and do succeed in overcoming mother-tongue influence. The current study shows that proficiency level does play a role in the acquisition of English intonation by Taiwanese speakers. The duration and pitch range of the advanced learners are much closer to those of the native American English speakers than those of the beginners are, but even advanced learners still cannot achieve native-like intonation patterns. C. Do Taiwanese have a narrower pitch range than American English speakers? Ross et al. (1986) suggest that the presence of tone in a language significantly inhibits the unrestricted manipulation of three acoustical measures of prosody which are involved in producing local pitch changes in the fundamental frequency contour during affective signaling. Will the presence of tone in a language inhibit the ability of speakers to modulate intonation? The results do show that Taiwanese have a narrower pitch range than American English speakers. Both advanced (84 Hz) and beginning (58 Hz) learners of English show a significantly narrower F0 range than that of Americans (112 Hz), and the difference is greater between the beginning learners' group and native American English speakers.

  • PDF

The Effects of Physical Function Level and Intensity of Treatment for Rehabilitation on Improvement of Physical Function in Children with Cerebral Palsy: Follow-up Study for 6 Months (뇌성마비 아동의 신체 기능수준과 재활 목적 치료 강도가 신체 기능향상에 미치는 영향: 6개월간 추적연구)

  • Kim, Bu-Young;Yun, Young-Ju;Shin, Yong-Beom;Kim, Soo-Yeon;Oh, Tae-Young
    • Journal of the Korean Society of Physical Medicine / v.13 no.1 / pp.27-38 / 2018
  • PURPOSE: The purpose of this study was to identify the treatment patterns of children with cerebral palsy and to analyze the effect of physical function level and treatment intensity on improvement of physical function in children with cerebral palsy over six months. METHODS: Participants were 126 children (83 boys, 43 girls) diagnosed with cerebral palsy, with a mean age of 33 months (range 8 to 77 months). Over six months, we collected data from caregivers on demographic and disability characteristics and on treatment patterns, using a questionnaire we constructed ourselves. The treatment pattern includes the type, frequency, and institution of treatment. We administered the Gross Motor Function Measure (GMFM) and the Pediatric Evaluation of Disability Inventory (PEDI) before and after the six-month period to assess improvement of physical function. We analyzed the effect of physical function level (measured by the Gross Motor Function Classification System), age, and treatment intensity on physical function using repeated measures ANOVA in SPSS PC ver. 22.0. RESULTS: The average treatment frequency was 5.74 times per week for physical therapy, 3.96 times for occupational therapy, 2.96 times for speech therapy, and 3.12 times for treatment of accompanying disabilities. Physical function level and age were significant factors affecting improvement of physical function; there was no significant difference according to treatment intensity. CONCLUSION: We suggest that physical function level and age might be important factors in the improvement of physical function, and that the professional rehabilitation team must consider the appropriate treatment type customized to each child.

Perception of native Korean Speakers on English and German

  • Kang, Hyun-Sook;Koo, So-Ryeong;Lee, Sook-hyang
    • Proceedings of the KSPS conference / 2000.07a / pp.86-87 / 2000
  • In this paper, we discuss why two different surface forms appear in loanwords for English and German /${\int}$/. In Korean, a vowel is inserted into loanwords if a consonant cannot be properly syllabified. Therefore, /${\int}$/ in some positions of loanwords triggers vowel insertion. Interestingly, /${\int}$/s in the onset cluster of English and German words were borrowed into Korean as /${\int}u$/ with the inserted vowel [u], whereas /${\int}$/s in the coda position of English and German words were borrowed as /${\int}i$/ with the inserted vowel [i]. For example, 'shrimp' is adopted as [${\int}urimphi$] whereas 'rush' is adopted as [$ra{\int}i$]. In this paper, we attempt to find the phonetic reason for the distribution of the surface forms of /${\int}$/. We assume that since the formant frequency of [i] is higher than that of [u], the peak frequency of /${\int}$/ with the surface form [${\int}i$] in loanwords may be higher than that of /${\int}$/ with the surface form [${\int}u$]. We also assume that duration may be another factor in the distribution of [${\int}i$] and [${\int}u$]. Since /${\int}$/ and /u/ use lip rounding whereas /i/ does not, the duration for [${\int}i$] might be longer than that of [${\int}u$]. German supports our assumption. /${\int}$/ in the onset cluster is longer than /${\int}$/ in the coda position. It also has a higher peak frequency than /${\int}$/ in the coda position. In loanwords, /${\int}$/ in the onset cluster is borrowed as [${\int}u$] as in Spiegel, whereas /${\int}$/ in the coda position is borrowed as [${\int}i$] as in Bosch. English, however, does not support our assumption. The peak frequency of [${\int}$] depends on the preceding vowel, not on its position in the syllable structure. If the preceding vowel is front, then the peak frequency of the following /${\int}$/ is high, but if the preceding vowel is back, then the peak frequency of the following /${\int}$/ is low. 
The peak frequency of /${\int}$/ in the onset cluster seems to be in between. As we assumed, however, the duration of /${\int}$/ in the coda position is longer than that of /${\int}$/ in the onset cluster. Given the mixed results, we question whether Koreans really hear two different sounds for /${\int}$/ in English words. As a future experiment, we would like to run a perception test for /${\int}$/ in English words.

  • PDF

Some notes on the French "e muet" (불어의 "묵음 e (e muet)"에 관한 연구)

  • Lee Jeong-Won
    • MALSORI / no.31_32 / pp.173-193 / 1996
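  • It is very difficult to define the French "mute e" (e muet). In French, "e" is called "mute e" (e muet) because it is frequently dropped. At present, "e muet" appears only in open syllables, as can be seen in the following utterance: "Je/le/re/de/man/de/ce/re/por/ta/ge/." [omitted] (I ask for that report again; in this case the schwa deletion rule applies in actual speech.) Second, the "e muet" that appears in prefixes is used before a double s to prevent the s from being pronounced as voiced [z], as in "ressembler [omitted] (to resemble); ressentir [omitted] (to feel)". Third, in some words an archaic spelling has weakened and is pronounced as "e muet": "monsieur [$m{\partial}sj{\emptyset}$] (mister), faisan [$f{\partial}z{\tilde{a}}$] (pheasant), faisait [$f{\partial}z{\varepsilon}$] (third-person singular imperfect of the verb "to do")", and so on. Earlier grammarians also called it the "feminine E", partly because it is used morphologically to distinguish the feminine form of a word from the masculine form, as in "$aim{\acute{e}}-aim{\acute{e}}e$" (both pronounced identically as [${\varepsilon}me$]: loved). In colloquial Modern French, "e muet" is used so that a word-final consonant is pronounced, for example "pote [pot] (pal) - pot [po] (pot)". In pronunciation, the timbre of this "e muet" is itself highly unstable, varying with regional, individual, and contextual factors, and it surfaces with several values (open ${\ae}$ or closed ${\O}$). For example, it is pronounced as in "seul [$s{\ae}l$] (alone), ceux [$s{\O}$] (these)". Moreover, "Je [$\Im\partial$]" and "le [$l{\partial}$]", which in principle are pronounced with schwa [${\partial}$], are pronounced in the Paris region as "Je sais [${\Im}{\ae}{\;}s{\Im}$] (I know); Prends-le [$pr{\tilde{a}}{\;}l{\ae}$] (take it)", whereas in northern France the same utterances are pronounced with [${\o}$] instead of [${\ae}$]. The "e muet" actually considered from a linguistic standpoint is the case of "Je [$\Im\partial$]" and "le [$l{\partial}$]", where it surfaces as schwa; because French phonology has no word pairs contrasted by schwa, it is disputed whether schwa should be recognized as a phoneme. However, the fact that schwa plays a phonological role in French can be seen in the following examples. First, in pronunciation, the verb forms "porte [$p{\jmath}rte$] (to carry: present), porté [$p{\jmath}rte$] (past participle), porta [$p{\jmath}rte$] (simple past)" contrast with one another, and they also contrast with the lexical item "Porto [$p{\jmath}rte$] (Porto)". Second, in lexical contrasts such as "le haut [$l{\partial}o$] (the top) / l'eau [lo] (water)" and morphological contrasts such as "le [$l{\partial}$] (definite article, masculine singular) / les [le] (definite article, plural)", the "mute e" clearly plays a phonological role. In this paper, we analyze the problem of "e muet", whose timbre is realized in such complex ways, from several angles (rhythmic unit, contextual distribution, and syllable model; that is, from both phonetic and phonological perspectives) in order to clarify its nature, and we offer a new interpretation of "e muet" deletion within TCG (Theorie de Charme et de Gouvernement).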

  • PDF

The Experimental Phonetic Study of Word Accent in Standard Korean (표준한국어 악센트의 실험음성학적 연구 -청취 테스트 및 음향분석-)

  • Seong Cheol-jae
    • MALSORI / no.21_24 / pp.43-89 / 1992
  • In this thesis, the prominence of word accent in standard Korean is studied through an auditory test and acoustic analysis. Following Hoyoung Lee's discussion (1990), 'accent' is defined as 'the means whereby a focused part of an utterance is made to stand out in order to concentrate the hearer's attention on it.' That is to say, the term 'accent' may be described as a phonological phenomenon, and the accented syllable can be phonetically prominent as the result of that phonological process. Prosodic features may have different characteristics in different languages depending on whether they carry linguistically important functions. Thus the characteristics of word accent in standard Korean are determined by the content and traits of its prosodic features. From this viewpoint, the present study examined, through a systematic experimental procedure, the prosodic features which may affect the characteristics of word accent in standard Korean. The results of the experiment were verified statistically with the t-test in order to identify the relatedness among prosodic features (parameters). This thesis, therefore, aimed to investigate the intrinsic acoustic and physical qualities of word accent in standard Korean. Nonsense words composed of 'mal' and 'ma', which can be classed as a 'heavy syllable' and a 'light syllable' (following Hyman 1975), were classified into 28 types by number of syllables (2 syl., 3 syl., 4 syl.), and these words were the targets of the auditory test and the acoustic experiment. As a result of these experimental procedures, word accent in standard Korean can be said to tend to be fixed within the first two syllables regardless of the number of syllables. The syllable types HH, HL, and LL in the first two syllables may be prominent on the first syllable, and the type LH on the second syllable. 
Various prosodic features (parameters), including duration, intensity, and F0 (purely phonetic terms), were also strengthened in those positions. The results of this experiment can be summarized as follows: 1. The most important feature proved to be duration; intensity turned out to be more subsidiary than duration. 2. F0 (fundamental frequency) showed a coherent contour across almost all syllable types (99%): a rising contour in 2-syllable types, a rising-falling contour in 3-syllable types, and a rising-falling-rising contour in 4-syllable types. The results of the auditory test differed from the F0 contour forms observed. In view of these results, F0 was excluded from the discussion in comparison with the other features. 3. Finally, this thesis concludes that word accent in standard Korean may be a fixed (somewhat weak) accent, fixed within the first two syllables in almost all words. 4. The various syllable types of 2-, 3-, and 4-syllable words can therefore be reclassified into the 4 types HH, HL, LH, and LL according to the accent placement (i.e., the first two syllables). Among these 4 types, HH, HL, and LL were prominent on the first syllable, whereas LH was prominent on the second syllable.

  • PDF

Design of a Low Power Digital Filter Using Variable Canonic Signed Digit Coefficients (가변 CSD 계수를 이용한 저전력 디지털 필터의 설계)

  • Kim, Yeong-U;Yu, Jae-Taek;Kim, Su-Won
    • Journal of the Institute of Electronics Engineers of Korea SD / v.38 no.7 / pp.455-463 / 2001
  • In this paper, an approximate processing method is proposed and tested. The proposed method uses variable CSD (VCSD) coefficients, which approximate the filter stopband attenuation by controlling the precision of the CSD coefficient sets. A decimation filter for the Audio Codec '97 specifications has been designed with a processor architecture consisting of program/data memory, arithmetic unit, energy/level decision, and sinc filter blocks, and fabricated in 0.6 $\mu$m CMOS sea-of-gate technology. For the two combined halfband FIR filters in the decimation filter, the number of addition operations was reduced to 63.5%, 35.7%, and 13.9% of the non-adaptive worst case. Experimental results show that the total power reduction of the filter varies from 3.8% to 9.0% with respect to the worst case. The proposed approximate processing method using variable CSD coefficients is readily applicable to various kinds of filters and is especially suitable for speech and audio applications such as oversampling ADCs and DACs, filter banks, and voice/audio codecs.

  • PDF
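The idea of trading coefficient precision for fewer additions can be illustrated with a minimal sketch of canonic signed digit (CSD) encoding. The conversion below is the standard non-adjacent-form recoding of an integer into digits from {-1, 0, +1}. The function names and the truncation rule (keeping only the most significant nonzero digits) are assumptions introduced for illustration; the abstract does not specify how the VCSD coefficient sets are actually chosen.

```python
def to_csd(n):
    """Return the CSD digits of integer n, least significant digit first.

    Each digit is -1, 0, or +1, and no two adjacent digits are nonzero.
    """
    digits = []
    while n != 0:
        if n % 2:
            d = 2 - (n % 4)  # +1 if n % 4 == 1, -1 if n % 4 == 3
            n -= d
        else:
            d = 0
        digits.append(d)
        n //= 2
    return digits

def from_csd(digits):
    """Evaluate CSD digits (least significant first) back to an integer."""
    return sum(d * (1 << i) for i, d in enumerate(digits))

def truncate_csd(digits, max_nonzero):
    """Keep only the max_nonzero most significant nonzero digits.

    Illustrative assumption: lower precision means fewer nonzero digits,
    and each nonzero digit beyond the first costs one add or subtract.
    """
    kept = 0
    out = [0] * len(digits)
    for i in reversed(range(len(digits))):
        if digits[i] != 0 and kept < max_nonzero:
            out[i] = digits[i]
            kept += 1
    return out

coeff = 187                      # hypothetical integer filter coefficient
csd = to_csd(coeff)              # 256 - 64 - 4 - 1
coarse = truncate_csd(csd, 2)    # 256 - 64
print(csd, from_csd(csd))
print(coarse, from_csd(coarse))
```

In this example the full-precision coefficient has four nonzero digits, while the truncated version has two and evaluates to 192, so a multiplier built from shifts and adds would need fewer additions at the cost of a coefficient error.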

A Study on the Meanings and Roles of Oral History from a Perspective of Archival Science (기록학적 관점에서의 구술의 의미와 역할에 관한 연구)

  • Kim, Myoung-Hun
    • The Korean Journal of Archival Studies / no.24 / pp.73-112 / 2010
  • With the progress of sound and moving-picture recording technology, sound and moving pictures have become tools for evidence and memory of human activities. Accordingly, the importance of oral history as a record is spreading in archival science, and oral records are being produced actively. But for archival institutions to produce oral records, the identity of the oral record needs to be established more firmly. Archival science is the task of delivering the current appearance of life to the future through records; therefore, the production of oral records in archival science must have its own distinctive character. Archival science is also the task of building current memory; therefore, the identity of the oral record needs to be established more firmly. This article explores the meaning and role of the oral record from the perspective of archival science. Until now, theories and methodologies have been developed mainly around written records, under the deep-rooted influence of positivism. But now that records can be created and preserved through 'speech', it should be noted that the oral record is a core tool for conveying the shape of current society and collective memory. This article therefore explores the meaning and role of the oral record as part of an effort to establish its identity.

A Study on the Classification of Unstructured Data through Morpheme Analysis

  • Kim, SungJin;Choi, NakJin;Lee, JunDong
    • Journal of the Korea Society of Computer and Information / v.26 no.4 / pp.105-112 / 2021
  • In the era of big data, interest in data is exploding. In particular, the development of the Internet and social media has led to the creation of new data, enabling the era of big data and artificial intelligence and opening a new chapter in convergence technology. There is also growing demand for analysis of data that programs could not handle in the past. In this paper, an analysis model was designed and verified for the classification of unstructured data, which is often required in the era of big data. Thesis summaries, main keywords, and sub-keywords were crawled from DBPia, a database was created using KoNLP's data dictionary, and words were tokenized through morpheme analysis. In addition, nouns were extracted using KAIST's 9 part-of-speech classification system, TF-IDF values were generated, and an analysis dataset was created by combining training data and Y values. Finally, the adequacy of classification was measured by applying three analysis algorithms (random forest, SVM, decision tree) to the generated analysis dataset. The classification model technique proposed in this paper can be usefully applied in various fields, such as civil complaint classification and text-related analysis, in addition to thesis classification.
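The TF-IDF step in the pipeline above can be sketched in a few lines of plain Python. This is a minimal sketch under stated assumptions, not the authors' implementation: the documents are assumed to be already tokenized into nouns, the toy token lists are invented for illustration, and the weighting used here (relative term frequency times the natural log of N divided by document frequency) is one common variant; the abstract does not say which formula was used.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Compute TF-IDF weights for a list of tokenized documents.

    tf  = term count / document length
    idf = ln(number of documents / number of documents containing the term)
    Returns one {term: weight} dict per document.
    """
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights

# Hypothetical noun lists standing in for morpheme-analyzed abstracts.
docs = [
    ["speech", "recognition", "model"],
    ["speech", "synthesis", "model"],
    ["archive", "record", "speech"],
]
for row in tf_idf(docs):
    print(row)
```

A term that occurs in every document ("speech" here) gets weight zero, while terms confined to one document get the largest weights. The resulting vectors, paired with class labels, are the kind of dataset to which classifiers such as random forest, SVM, or decision tree could then be applied.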