• 제목/요약/키워드: American English corpus

검색결과 23건 처리시간 0.018초

A Comparison of the Constructions Make / Take a Decision in Malaysian English with the Supervarieties

  • Christina Sook Beng Ong
    • 아시아태평양코퍼스연구
    • /
    • 제4권1호
    • /
    • pp.43-59
    • /
    • 2023
  • This study aims to compare the structures of light verb constructions (LVCs) taking decision as the deverbal noun in Malaysian English, British English and American English. A general corpus made up of Internet forum threads from Lowyat.Net, was created to represent Malaysian English while the British National Corpus (BNC) and Corpus of Contemporary American English (COCA) were used to represent the supervarieties. Light verbs make and take are found to be heading deverbal noun decision. Differences are observed in the use of articles. The frequency of Malaysian English LVCs without article is the highest while supervarieties LVCs prefer indefinite article. The high occurrences of LVCs without articles in Malaysian English can be attributed to the influence from Malaysian substrate languages. Findings also show that descriptive adjective is the most frequently used modifier in all three varieties of English. This suggests the standard LVC structure, comprising a light verb, the indefinite article, and a deverbal noun is no longer rigidly adhered to even among the native speakers of English.

Reduction and Frequency Analyses of Vowels and Consonants in the Buckeye Speech Corpus

  • Yang, Byung-Gon
    • 말소리와 음성과학
    • /
    • 제4권3호
    • /
    • pp.75-83
    • /
    • 2012
  • The aims of this study were three. First, to examine the degree of deviation from dictionary prescribed symbols and actual speech made by American English speakers. Second, to measure the frequency of vowel and consonant production of American English speakers. And third, to investigate gender differences in the segmental sounds in a speech corpus. The Buckeye Speech Corpus was recorded by forty American male and female subjects for one hour per subject. The vowels and consonants in both the phonemic and phonetic transcriptions were extracted from the original files of the corpus and their frequencies were obtained using codes of a free software R. Results were as follows: Firstly, the American English speakers produced a reduced number of vowels and consonants in daily conversation. The reduction rate from the dictionary transcriptions to the actual transcriptions was around 38.2%. Secondly, the American English speakers used more front high and back low vowels while three-fourths of the consonants accounted for stops, fricatives, and nasals. This indicates that the segmental inventory has nonlinear frequency distribution in the speech corpus. Thirdly, the two gender groups produced vowels and consonants similarly even though there were a few noticeable differences in their speech. From these results we propose that English teachers consider pronunciation education reflecting the actual speech sounds and that linguists find a way to establish unmarked segmentals from speech corpora.

The fundamental frequency (f0) distribution of American speakers in a spontaneous speech corpus

  • Byunggon Yang
    • 말소리와 음성과학
    • /
    • 제16권1호
    • /
    • pp.11-16
    • /
    • 2024
  • The fundamental frequency (f0), representing an acoustic measure of vocal fold vibration, serves as an indicator of the speaker's emotional state and language-specific pattern in daily conversations. This study aimed to examine the f0 distribution in an English corpus of spontaneous speech, establishing normative data for American speakers. The corpus involved 40 participants engaging in free discussions on daily activities and personal viewpoints. Using Praat, f0 values were collected filtering outliers after removing nonspeech sounds and interviewer voices. Statistical analyses were performed with R. Results indicated a median f0 value of 145 Hz for all the speakers. The f0 values for all speakers exhibited a right-skewed, pointy distribution within a frequency range of 216 Hz from 75 Hz to 339 Hz. The female f0 range was wider than that of males, with a median of 113 Hz for males and 181 Hz for females. This spontaneous speech corpus provides valuable insights for linguists into f0 variation among individuals or groups in a language. Further research is encouraged to develop analytical and statistical measures for establishing reliable f0 standards for the general population.

성대진동 및 성별이 미국영어 마찰음에 미치는 효과에 관한 코퍼스 기반 연구 (A corpus-based study on the effects of voicing and gender on American English Fricatives)

  • 윤태진
    • 말소리와 음성과학
    • /
    • 제10권2호
    • /
    • pp.7-14
    • /
    • 2018
  • The paper investigates the acoustic characteristics of English fricatives in the TIMIT corpus, with a special focus on the role of voicing in rendering fricatives in American English. The TIMIT database includes 630 talkers and 2,342 different sentences, and comprises more than five hours of speech. Acoustic analyses are conducted in the domain of spectral and temporal properties by treating gender, voicing, and place of articulation as independent factors. The results of the acoustic analyses revealed that acoustic signals interact in a complex way to signal the gender, place, and voicing of fricatives. Classification experiments using a multiclass support vector machine (SVM) revealed that 78.7% of fricatives are correctly classified. The majority of errors stem from the misclassification of /θ/ as [f] and /ʒ/ as [z]. The average accuracy of gender classification is 78.7%. Most errors result from the classification of female speakers as male speakers. The paper contributes to the understanding of the effects of voicing and gender on fricatives in a large-scale speech corpus.

Buckeye corpus에 나타난 탄설음화 현상 분석 (A study of flaps in American English based on the Buckeye Corpus)

  • 황병후;강석한
    • 말소리와 음성과학
    • /
    • 제10권3호
    • /
    • pp.9-18
    • /
    • 2018
  • This paper presents an acoustic and phonological study of the alveolar flaps in American English. Based on the Buckeye Corpus, the flapping tokens produced by twenty men are analyzed at both lexical and post-lexical levels. The data, analyzed with Pratt speech analysis, include duration, F2 and F3 in voicing during the flap, as well as duration, F1, F2, F3, and f0 in the adjacent vowels. The results provide evidence on two issues: (1) The different ways in which voiced and voiceless alveolar stops give rise to neutralized flapping stops by following lexical and post-lexical levels, (2) The extent to which the vowel features (height, frontness, and tenseness) affect flapping sounds. The results show that flaps are affected by pre-consonantal vowel features at the lexical as well as post-lexical levels. Unlike previous studies, this study uses the Praat method to distinguish flapped from unflapped tokens in the Buckeye Corpus and examines connections between the lexical and post-lexical levels.

A Corpus-based study on the Effects of Gender on Voiceless Fricatives in American English

  • Yoon, Tae-Jin
    • 말소리와 음성과학
    • /
    • 제7권1호
    • /
    • pp.117-124
    • /
    • 2015
  • This paper investigates the acoustic characteristics of English fricatives in the TIMIT corpus, with a special focus on the role of gender in rendering fricatives in American English. The TIMIT database includes 630 talkers and 2342 different sentences, comprising over five hours of speech. Acoustic analyses are conducted in the domain of spectral and temporal properties by treating gender as an independent factor. The results of acoustic analyses revealed that the most acoustic properties of voiceless sibilants turned out to be different between male and female speakers, but those of voiceless non-sibilants did not show differences. A classification experiment using linear discriminant analysis (LDA) revealed that 85.73% of voiceless fricatives are correctly classified. The sibilants are 88.61% correctly classified, whereas the non-sibilants are only 57.91% correctly classified. The majority of the errors are from the misclassification of /ɵ/ as [f]. The average accuracy of gender classification is 77.67%. Most of the inaccuracy results are from the classification of female speakers in non-sibilants. The results are accounted for by resorting to biological differences as well as macro-social factors. The paper contributes to the understanding of the role of gender in a large-scale speech corpus.

Is Compared to Different from Compared with? A Discussion of Prepositions that Are Particularly Difficult for EFL Learners

  • Lee, Seung-Ah
    • 영어영문학
    • /
    • 제55권6호
    • /
    • pp.1057-1085
    • /
    • 2009
  • This paper addresses the question of why prepositions are particularly difficult forEFL learners. The first reason for such difficulty lies in the distinction between seemingly equivalent prepositions such as to and with, as in compared to and compared with. Most monolingual learners' dictionaries regard these two phrases as virtually synonymous. Yet, the results of the corpus analysis conducted in this study indicate that there are differences between the two. A second reason why EFL learners have problems with prepositions is that there are often variations in the inputdata. For example, although from generally follows different, in American English different than is also used. On the other hand, in British English, different to is the second most commonly used construction. This type of regional variation, confirmed in the corpus findings of the present paper, causes confusion in students of English. A learner who is not accustomed to British English may be puzzled by the expression different to. Finally, L1 negative transfer is responsible for the incorrect use of expressions such as discuss about. An error of this sort is the result of interference from the learner's mother tongue. The English verb discuss is not subcategorized for a preposition, whereas the equivalent Korean verb, for example, requires a noun phrase combined with the postposition.

벅아이 코퍼스의 모음 길이 연구 (A Study on the Vowel Duration of the Buckeye Corpus)

  • 정혜정;윤규철
    • 말소리와 음성과학
    • /
    • 제7권4호
    • /
    • pp.103-110
    • /
    • 2015
  • The purpose of this study is to assess the vowel property by examining the vowel duration of the American English vowles found in the Buckeye corpus[6]. The vowel durations were analyzed in terms of various linguistic factors including the number of syllables of the word containing the vowel, the location of the vowel in a word, types of stress, function versus content word, the word frequency in the corpus and the speech rate calculated from the three consecutive words. The findings from this work agreed mostly with those from earlier studies, but with some exceptions. The relationship between the speech rate and the vowel duration proved non-linear.

An Analysis of Korean and American Presidential Addresses: Focusing on Punctuation and Transition

  • Jun, Ki-Suk;Jung, Kyu-Tae
    • 영어어문교육
    • /
    • 제17권2호
    • /
    • pp.1-18
    • /
    • 2011
  • The object of this study is to show some features of English, focused on such mechanics as punctuation and transition, in Korean presidential addresses transcribed in English which are different from those of the United States. Towards that end, the presidential addresses of the United States and Korea from January, 2010 to June, 2010 are collected, made into corpora, and analyzed. Through analyzing the corpora, this paper is to address the following research questions: (1) What features can be regarded as different in terms of punctuation and transition? (2) If there are any differences between the corpora, are they significant enough to pose any problems for Korean and American English users to communicate with each other? (3) If so, what can be done to solve the problems in regard to pedagogical implications? Overall, as for punctuation, both Presidents' addresses share a lot in common, even with some idiosyncratic variations though. However, there are some noticeable differences in transitional devices. It is not clear whether those should be taken as a sign of personal preference, though. Transitional markers are meant to be part of wording in writing. (196 words).

  • PDF

Effects of Corpus Use on Error Identification in L2 Writing

  • Yoshiho Satake
    • 아시아태평양코퍼스연구
    • /
    • 제4권1호
    • /
    • pp.61-71
    • /
    • 2023
  • This study examines the effects of data-driven learning (DDL)-an approach employing corpora for inductive language pattern learning-on error identification in second language (L2) writing. The data consists of error identification instances from fifty-five participants, compared across different reference materials: the Corpus of Contemporary American English (COCA), dictionaries, and no use of reference materials. There are three significant findings. First, the use of COCA effectively identified collocational and form-related errors due to inductive inference drawn from multiple example sentences. Secondly, dictionaries were beneficial for identifying lexical errors, where providing meaning information was helpful. Finally, the participants often employed a strategic approach, identifying many simple errors without reference materials. However, while maximizing error identification, this strategy also led to mislabeling correct expressions as errors. The author has concluded that the strategic selection of reference materials can significantly enhance the effectiveness of error identification in L2 writing. The use of a corpus offers advantages such as easy access to target phrases and frequency information-features especially useful given that most errors were collocational and form-related. The findings suggest that teachers should guide learners to effectively use appropriate reference materials to identify errors based on error types.