• Title/Summary/Keyword: English Word

검색결과 576건 처리시간 0.023초

남북한 고등학교 영어교과서 4-gram 연어 비교 분석 (Comparative Analysis of 4-gram Word Clusters in South vs. North Korean High School English Textbooks)

  • 김정렬
    • 한국콘텐츠학회논문지
    • /
    • 제20권7호
    • /
    • pp.274-281
    • /
    • 2020
  • 본 연구는 4-gram 연어분석으로 남북한 고등학교 영어교과서를 비교분석하고자 하는 것이 목적이다. N-gram 분석은 그동안 우리가 알고 있는 관습적인 관용어와는 달리 코퍼스를 구성하여 기계적인 방법으로 물리적으로 함께 공기하는 빈도가 높은 낱말군을 객관적인 방법으로 추출하여 분석하는 것이다. 본 연구의 목적은 AntConc의 N-gram 분석 도구로 4-gram 연어를 남북한 영어교과서 코퍼스에서 찾아서 비교 분석해 보는 것이다. 분석의 대상은 북한의 2013 교육개혁에 따른 북한 고등중학교 영어교과서와 남한의 2015교육과정에 따른 고등학교 영어교과서로 구성된 코퍼스에서 구어와 문어의 token과 type을 구분하여 분석 비교한다. 이를 분석대상으로 하여 코퍼스의 4-gram 연어를 문법범주와 기능범주로 나눈 준거를 통해서 분석하였다. 문법범주는 크게 명사구, 동사구, 전치사구, 부분절 그리고 기타로 나누어 범주화하고 기능범주는 지칭, 텍스트의 조직, 입장과 기타로 나누었다. 분석한 결과 4-gram 연어에 나타난 구어와 문어 모두 남한의 영어교과서가 북한의 영어교과서 보다 token과 type의 수가 상대적으로 많았다. 그리고 문법범주에는 남북한 모두 영어교과서에 동사구와 부분절 형태의 4-gram 연어가 가장 많았으며 기능범주에는 남북한 모두 영어교과서에 입장 기능과 관련된 4-gram 연어가 가장 많았다.

The Role of Post-lexical Intonational Patterns in Korean Word Segmentation

  • Kim, Sa-Hyang
    • 음성과학
    • /
    • 제14권1호
    • /
    • pp.37-62
    • /
    • 2007
  • The current study examines the role of post-lexical tonal patterns of a prosodic phrase in word segmentation. In a word spotting experiment, native Korean listeners were asked to spot a disyllabic or trisyllabic word from twelve syllable speech stream that was composed of three Accentual Phrases (AP). Words occurred with various post-lexical intonation patterns. The results showed that listeners spotted more words in phrase-initial than in phrase-medial position, suggesting that the AP-final H tone from the preceding AP helped listeners to segment the phrase-initial word in the target AP. Results also showed that listeners' error rates were significantly lower when words occurred with initial rising tonal pattern, which is the most frequent intonational pattern imposed upon multisyllabic words in Korean, than with non-rising patterns. This result was observed both in AP-initial and in AP-medial positions, regardless of the frequency and legality of overall AP tonal patterns. Tonal cues other than initial rising tone did not positively influence the error rate. These results not only indicate that rising tone in AP-initial and AP_final position is a reliable cue for word boundary detection for Korean listeners, but further suggest that phrasal intonation contours serve as a possible word boundary cue in languages without lexical prominence.

  • PDF

Latent Semantic Analysis Approach for Document Summarization Based on Word Embeddings

  • Al-Sabahi, Kamal;Zuping, Zhang;Kang, Yang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권1호
    • /
    • pp.254-276
    • /
    • 2019
  • Since the amount of information on the internet is growing rapidly, it is not easy for a user to find relevant information for his/her query. To tackle this issue, the researchers are paying much attention to Document Summarization. The key point in any successful document summarizer is a good document representation. The traditional approaches based on word overlapping mostly fail to produce that kind of representation. Word embedding has shown good performance allowing words to match on a semantic level. Naively concatenating word embeddings makes common words dominant which in turn diminish the representation quality. In this paper, we employ word embeddings to improve the weighting schemes for calculating the Latent Semantic Analysis input matrix. Two embedding-based weighting schemes are proposed and then combined to calculate the values of this matrix. They are modified versions of the augment weight and the entropy frequency that combine the strength of traditional weighting schemes and word embedding. The proposed approach is evaluated on three English datasets, DUC 2002, DUC 2004 and Multilingual 2015 Single-document Summarization. Experimental results on the three datasets show that the proposed model achieved competitive performance compared to the state-of-the-art leading to a conclusion that it provides a better document representation and a better document summary as a result.

$\cdot$ 영 동시조음 데이터베이스의 구축 (Speech Coarticulation Database of Korean and English)

  • 김종미
    • 한국음향학회지
    • /
    • 제18권3호
    • /
    • pp.17-26
    • /
    • 1999
  • We present the first speech coarticulation database of Korean, English and Konglish/sup 3)/ named "SORIDA"/sup 4)/, which is designed to cover the maximum number of representations of coarticulation in these languages [1]. SORIDA features a compact database which is designed to contain a maximum number of triphones in a minimum number of prompts. SORIDA contains all consonantal triphones and vowel allophones in 682 Korean prompts of word length and in 717 English prompt words, spoken five times by speakers of balanced genders, dialects and ages. Korean prompts are synthesized lexicons which maximize their coarticulation variation disregarding any stress phenomena, while English prompts are natural words that fully reflect their stress effects with respect to the coarticulation variation. The prompts are designed differently because English phonology has stress while Korean does not. An intermediate language, Konglish has also been modeled by two Korean speakers reading 717 English prompt words. Recording was done in a controlled laboratory environment with an AKG Model C-100 microphone and a Fostex D-5 digital-audio-tape (DAT) recorder. The total recording time lasted four hours. SORIDA CD-ROM is available in one disk of 22.05 kHz sampling rate with a 16 bit sample size. SORIDA digital audio-tapes are available in four 124-minute-tapes of 48 kHz sampling rate. SORIDA′s list of phonetically-rich-words is also available in English and Korean.

  • PDF

An Acoustic Study of English Sentence Stress and Rhythm Produced by Korean Speakers

  • Kim, Ok-Young
    • 음성과학
    • /
    • 제14권1호
    • /
    • pp.121-135
    • /
    • 2007
  • The purpose of this paper is to examine how Korean speakers realize English stress and rhythm at the sentence level, and investigate what different acoustic characteristics of English sentence stress and rhythm Korean speakers have, compared with those of American English speakers. Stressed words in the sentence were analyzed in terms of duration, fundamental frequency, and intensity of the stressed vowel in the word with neutral stress and with emphatic stress, respectively. According to the results, when the words had emphatic stress, both Koreans' and Americans' F0 and intensity of the stressed vowel were higher than those with neutral stress. Korean speakers of English realized the sentence stress with shorter vowel duration and higher F0 than American English speakers when the words had emphatic stress. The analysis of the timing of the sentence with increased unstressed syllables showed that both Americans and Koreans produced the sentence with longer duration as the number of unstressed syllables increased. However, the duration of unstressed syllables between stressed syllables by Koreans was longer than that by Americans. Americans seemed to produce unstressed syllables between stressed syllables faster than Koreans for regular intervals of stressed syllables. This analysis implies that if there are more unstressed syllables between stressed syllables, Koreans might produce unstressed syllables and the whole sentence with longer duration.

  • PDF

Maritime English vs Maritime English Communication

  • 최승희
    • 한국항해항만학회:학술대회논문집
    • /
    • 한국항해항만학회 2015년도 춘계학술대회
    • /
    • pp.272-274
    • /
    • 2015
  • Success of communication at sea is directly linked with clear and complete delivery and receipt of the target message between interlocutors. It can be said that speakers' effective delivery of their intended message and listeners' precise decoding and accurate understanding are the keys to successful maritime communication. From this perspective, the scope of maritime English education and training needs to be reconceptualized and expanded into the area of communication itself, beyond the simple acquisition of, and familiarization with, IMO Standard Maritime Communication Phrases (SMCP). Therefore, in order to make learners' acquisition of marine communication knowledge more feasible, and the knowledge learned more practically applicable, training on effective and clear oral delivery should be also considered within the frame of maritime English education. Thus, critical training elements to realize this goal need to be suggested as guidelines. In this presentation, the theoretical background on this will be introduced in terms of English as a Lingua Franca (ELF) and Lingua Franca Core (LFC), which are the current mainstream forms of English communication in the international business context. Based on this, six key training elements will be discussed; that is, speech rate, word groups, pauses, nuclear stresses, consonants (including consonant clusters), and vowels (specifically long and short vowels). Finally, the practical pedagogical methods of each element, and its actual application into a real ESP classroom, will be suggested.

  • PDF

Phonetic investigation of epenthetic vowels produced by Korean learners of English

  • Shin, Dong-Jin;Iverson, Paul
    • 말소리와 음성과학
    • /
    • 제6권4호
    • /
    • pp.17-26
    • /
    • 2014
  • The present study examined epenthetic vowels produced by Korean learners of English in read sentences, in terms of acoustic measures and extra-phonological factors. The results demonstrated three main findings. First, epenthetic vowels had relatively high F1 values and a wide range of F2 values. Most of the epenthetic vowels were inserted near Korean high central vowels, but some vowels were inserted near front vowels due to co-articulation with surrounding vowels. Second, vowel epenthesis was affected by the context. The results showed that the epenthesis was frequently seen with word junctions between obstruents (e.g., stops-fricatives). Third, Korean learners were not affected by English background and were very weakly affected by orthography. English experience, which is one of the extra-phonological factors, was not related to epenthesis production. However, orthography, the other extra-phonological factor, very weakly affected the amount of epenthesis production. Nine percent of all epenthesis production was affected by the English past-tense suffix '-ed'; approximately 70% of the participants were affected by this suffix. The findings of the present study contributed to understanding vowel epenthesis. First, the study revealed that the epenthetic vowels produced by Korean learners of English were close to the high central vowel, supporting previous studies that the epenthetic vowel is quite close to the shortest vowel. Second, the study examined the various phonetic environments of epenthetic vowels, revealing that vowel epenthesis occurred more frequently in a certain phonetic circumstance.

미국인 남성이 발음한 영어 모음의 포먼트 궤적 (Formant Trajectories of English Vowels Produced by American Males)

  • 양병곤
    • 말소리와 음성과학
    • /
    • 제1권3호
    • /
    • pp.65-72
    • /
    • 2009
  • Formant values are the most important acoustic correlates of English vowels. Classical studies on English vowels reported the first three formant values measured at a single timepoint on a sustained vowel segment. However, many recent studies revealed that partial onset or offset segments with information of dynamic spectral changes may contribute to the exact identification of English vowels with an accuracy almost comparable to that by the whole vowel segment or word. The purpose of this study was to examine formant trajectories of nine English vowels collected by Hillenbrand et al.(1995). Acoustic analysis was systematically made by a Praat script at six equidistant timepoints over the vowel segment. Results showed that the first formant trajectories played an important role in distinguishing each vowel within the front- or back-vowel groups. The second formant trajectories of the back vowels varied more drastically than those of the front vowels. The third formant value was similar except the high vowel /i/. From the vowel space on F1 by F2 axes, the formant trajectories of each vowel clearly showed a transition toward the locus of the following consonant /d/. Other acoustic data revealed that there were some vowel inherent duration or pitch values. From this study we can conclude that the dynamic spectral changes are very important in specifying acoustic characteristics of the English vowels. Further studies on vowels and diphthongs in different contexts are desirable.

  • PDF

Korean-English bilingual children's production of stop contrasts

  • Oh, Eunhae
    • 말소리와 음성과학
    • /
    • 제11권3호
    • /
    • pp.1-7
    • /
    • 2019
  • Korean (L1)-English (L2) bilingual adults' and children's production of Korean and English stops was examined to determine the age effects and L2 experience on the development of L1 and L2 stop contrasts. Four groups of Seoul Korean speakers (experienced and inexperienced adult and child groups) and two groups of age-matched native English speakers participated. The overall results of voice onset time (VOT) and fundamental frequency (F0) of phrase-initial stops in Korean and word-intial stops in English showed a delay in the acquisition of L1 due to the dominant exposure to L2. Significantly longer VOT and lower F0 for aspirated stops as well as high temporal variability across repetitions of lenis stops were interpreted to indicate a strong effect of English on Korean stop contrasts for bilingual children. That is, the heavy use of VOT for Korean stop contrasts shows bilingual children's attention to the acoustic cue that are primarily employed in the dominant L2. Furthermore, inexperienced children, but not adults, were shown to create new L2 categories that are distinctive from the L1 within 6 months of L2 experience, suggesting greater independence between the two phonological systems. The implications of bilinguals' age at the time of testing to the degree and direction of L1-L2 interaction are further discussed.

음운적 양음절성의 허상 (Against Phonological Ambisyllabicity)

  • 김영석
    • 한국영어학회지:영어학
    • /
    • 제1권1호
    • /
    • pp.19-38
    • /
    • 2001
  • The question of how / ... VCV .../ sequences should be syllabified is a much discussed, yet unresolved, issue in English phonology. While most researchers recognize an over-all universal tendency towards open syllables, there seem to be at least two different views as regards the analysis of / ... VCV .../ when the second vowel is unstressed: ambisyllabicity (e.g., Kahn 1976) and resyllabification (e.g., Borowsky 1986). Basically, we adopt the latter view and will present further evidence in its favor. This does not exclude low-level “phonetic” ambisyllabification, however. Following Nespor and Vogel (1986), we also assume that the domain of syllabification or resyllabification is the phonological word. With the new conception of the syllable structure of English, we attempt a reanalysis of Aitkin's Law as well as fe-tensing in New York City and Philadelphia.

  • PDF