• 제목/요약/키워드: lexical approach

검색결과 75건 처리시간 0.019초

A Corpus-based Lexical Analysis of the Speech Texts: A Collocational Approach

  • Kim, Nahk-Bohk
    • 영어어문교육
    • /
    • 제15권3호
    • /
    • pp.151-170
    • /
    • 2009
  • Recently speech texts have been increasingly used for English education because of their various advantages as language teaching and learning materials. The purpose of this paper is to analyze speech texts in a corpus-based lexical approach, and suggest some productive methods which utilize English speaking or writing as the main resource for the course, along with introducing the actual classroom adaptations. First, this study shows that a speech corpus has some unique features such as different selections of pronouns, nouns, and lexical chunks in comparison to a general corpus. Next, from a collocational perspective, the study demonstrates that the speech corpus consists of a wide variety of collocations and lexical chunks which a number of linguists describe (Lewis, 1997; McCarthy, 1990; Willis, 1990). In other words, the speech corpus suggests that speech texts not only have considerable lexical potential that could be exploited to facilitate chunk-learning, but also that learners are not very likely to unlock this potential autonomously. Based on this result, teachers can develop a learners' corpus and use it by chunking the speech text. This new approach of adapting speech samples as important materials for college students' speaking or writing ability should be implemented as shown in samplers. Finally, to foster learner's productive skills more communicatively, a few practical suggestions are made such as chunking and windowing chunks of speech and presentation, and the pedagogical implications are discussed.

  • PDF

A Collocational Analysis of Korean High School English Textbooks and Suggestions for Collocation Instruction

  • Kim, Nahk-Bohk
    • 영어어문교육
    • /
    • 제10권3호
    • /
    • pp.41-66
    • /
    • 2004
  • Under the textbook-driven approach to English education in the Korean selling, the importance of the English textbook can not be overemphasized as the main source of learning materials. Recently, with the development of computer-based language corpora, the recognition of the importance of collocations and the availability of computerized databases of words have caused a resurgence and facilitation in the instruction of collocation. The primary purpose of the present study is to identify the characteristics of lexical collocation and the extent of its use in high school 10th-grade textbooks. From all the analyses, it is revealed that the language materials reflect various constructed collocation in the case of adjective+noun and noun+noun collocations in a natural context. However, verb+noun and adverb+verb collocations are not fully reflected. This is true for delexicalized verbs, and verb and adjective intensifiers. Also the language materials do not provide sufficient support for the lexical syllabus, even though all textbooks may be somewhat adequate in terms of vocabulary size. Finally, based on the analyses of the texts, the suggestions for English collocation instruction are made in the lexical approach.

  • PDF

한국어 어휘 중의성 해소에서 어휘 확률에 대한 효과적인 평가 방법 (An Effective Estimation method for Lexical Probabilities in Korean Lexical Disambiguation)

  • 이하규
    • 한국정보처리학회논문지
    • /
    • 제3권6호
    • /
    • pp.1588-1597
    • /
    • 1996
  • 본 논문은 한국어 어휘 중의성 해소(lexical disambiguation)에서 어휘 확률 (lexical probability) 평가방법에 대해 기술하고 있다. 통계적 접근 방법의 어휘 중 의성 해소에서는 일반적으로 말뭉치(corpus)로부터 추출된 통계 자료에 기초하여 어 휘 확률과 문맥 확률(contextual probability)을 평가한다. 한국어는 어절별로 띄어 쓰기가 이루어지므로 어절 단위로 어휘 확률을 적용하는 것이 바람직하다. 하지만 한 국어는 어절의 다양성이 심하기 때문에 상당히 큰 말뭉치를 사용하더라도 어절 단위 로는 어휘 확률을 직접 평가할 수 없는 경우가 다소 있다. 이러한 문제점을 극복하기 위해 본 연구에서는 어휘 분석 측면에서 어절의 유사성을 정의하고 이에 기반을 둔 한국어 어휘 확률 평가 방법을 제안한다. 이 방법에서는 어떤 어절에 대해 어휘 확률 을 직접 평가할 수 없는 경우 이와 어휘 분석이 유사한 어절들을 통해 간접적으로 평 가한다. 실험결과 제안된 접근방법이 한국어 어휘 중의성 해소에 효과적인 것으로 나 타나고 있다.

  • PDF

Novice Corpus Users' Gains and Views on Corpus-based Lexical Development: A Case Study of COVID-19-related Expressions

  • Chen, Mei-Hua
    • 아시아태평양코퍼스연구
    • /
    • 제2권1호
    • /
    • pp.1-11
    • /
    • 2021
  • Recently, corpus assisted vocabulary instruction has been attracting a lot of interest. Most studies have focused on understanding language learners' receptive vocabulary knowledge. Limited attention has been paid to learners' productive competence. To fill this gap, this study attended to learners' productive lexical development in terms of form, meaning and use respectively. This study introduced EFL learners to the corpus-based language pedagogy to learn COVID-19 theme-based vocabulary. To investigate the gains and views of 33 EFL first-year college students, a sentence completion task and a questionnaire were developed. Learners' productive performances in the three lexical knowledge aspects (i.e., form, meaning and use) were particularly targeted. The results revealed that the students achieved significant gains in all aspects regardless of their proficiency level. In particular, the less proficient students achieved greater knowledge retention compared with their highly proficient counterparts. Meanwhile, students showed positive attitudes towards the corpus-based approach to vocabulary learning.

A Hybrid Approach for the Morpho-Lexical Disambiguation of Arabic

  • Bousmaha, Kheira Zineb;Rahmouni, Mustapha Kamel;Kouninef, Belkacem;Hadrich, Lamia Belguith
    • Journal of Information Processing Systems
    • /
    • 제12권3호
    • /
    • pp.358-380
    • /
    • 2016
  • In order to considerably reduce the ambiguity rate, we propose in this article a disambiguation approach that is based on the selection of the right diacritics at different analysis levels. This hybrid approach combines a linguistic approach with a multi-criteria decision one and could be considered as an alternative choice to solve the morpho-lexical ambiguity problem regardless of the diacritics rate of the processed text. As to its evaluation, we tried the disambiguation on the online Alkhalil morphological analyzer (the proposed approach can be used on any morphological analyzer of the Arabic language) and obtained encouraging results with an F-measure of more than 80%.

음성 질의 처리를 위한 의미 기반 오류 수정 (Semantic-oriented Error Correction for Spoken Query Processing)

  • 정민우;김병창;이근배
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 10월 학술대회지
    • /
    • pp.153-156
    • /
    • 2003
  • Voice input is often required in many new application environments such as telephone-based information retrieval, car navigation systems, and user-friendly interfaces, but the low success rate of speech recognition makes it difficult to extend its application to new fields. Popular approaches to increase the accuracy of the recognition rate have been researched by post-processing of the recognition results, but previous approaches were mainly lexical-oriented ones in post error correction. We suggest a new semantic-oriented approach to correct both semantic level and lexical errors, which is also more accurate for especially domain-specific speech error correction. Through extensive experiments using a speech-driven in-vehicle telematics information application, we demonstrate the superior performance of our approach and some advantages over previous lexical-oriented approaches.

  • PDF

유로워드넷 방식에 기반한 한국어와 영어의 명사 상하위어 정렬 (Alignment of Hypernym-Hyponym Noun Pairs between Korean and English, Based on the EuroWordNet Approach)

  • 김동성
    • 한국언어정보학회지:언어와정보
    • /
    • 제12권1호
    • /
    • pp.27-65
    • /
    • 2008
  • This paper presents a set of methodologies for aligning hypernym-hyponym noun pairs between Korean and English, based on the EuroWordNet approach. Following the methods conducted in EuroWordNet, our approach makes extensive use of WordNet in four steps of the building process: 1) Monolingual dictionaries have been used to extract proper hypernym-hyponym noun pairs, 2) bilingual dictionary has converted the extracted pairs, 3) Word Net has been used as a backbone of alignment criteria, and 4) WordNet has been used to select the most similar pair among the candidates. The importance of this study lies not only on enriching semantic links between two languages, but also on integrating lexical resources based on a language specific and dependent structure. Our approaches are aimed at building an accurate and detailed lexical resource with proper measures rather than at fast development of generic one using NLP technique.

  • PDF

Interface between Morphology and Syntax: A Constraint-Based and Lexicalist Approach

  • Kim, Jong-Bok
    • 한국언어정보학회지:언어와정보
    • /
    • 제2권1호
    • /
    • pp.177-213
    • /
    • 1998
  • conflicting criterial used in identifying words have called the lexical integrity principle into question. That is, cases where the morphological word does not coincide with the syntactic word have notivated the syntactic view of word derivation, as pointed out by Bresnan and Mchombo(1995). Further, the implicit desire to make the clausal structure of Korean parallel to those posited for English(Chomsky 1991) and French(Pollock 1989) has also led most of the current literature on Korean morphology to claim that Korean verbal inflections head their own functional projections such as AgrP, TP, and MP im syntax. In this paper, I will first argue against such a syntactic view. After reviewing some basic properties of Korean verbal inflections, I will show that the evidence from mismatch phenomena supports the lexical integrity principle over the head-movement theories of word derivation. Then, I will propose a theory of lexical grammar which maintains the lexical integrity principle while retaining the effects of functional projections and syntactic movement.

  • PDF

Semantic-Oriented Error Correction for Voice-Activated Information Retrieval System

  • Yoon, Yong-Wook;Kim, Byeong-Chang;Lee, Gary-Geunbae
    • 대한음성학회지:말소리
    • /
    • 제44호
    • /
    • pp.115-130
    • /
    • 2002
  • Voice input is often required in many new application environments, but the low rate of speech recognition makes it difficult to extend its application. Previous approaches were to raise the accuracy of the recognition by post-processing of the recognition results, which were all lexical-oriented. We suggest a new semantic-oriented approach in speech recognition error correction. Through experiments using a speech-driven in-vehicle telematics information application, we show the excellent performance of our approach and some advantages it has as a semantic-oriented approach over a pure lexical-oriented approach.

  • PDF

Categorization and production in lexical pitch accent contrasts of North Kyungsang Korean

  • Kim, Jungsun
    • 말소리와 음성과학
    • /
    • 제10권1호
    • /
    • pp.1-7
    • /
    • 2018
  • Categorical production in language processing helps speakers to produce phonemic contrasts. This categorization and production is utilized for the production-based and imitation-based approach in the present study. Contrastive signals in speakers' speech reflect the shapes of boundaries with categorical characteristics. Signals that provide information about lexical pitch accent contrasts can introduce categorical distinctions for productive and cognitive selection. This experiment was conducted with nine North Kyungsang speakers for a production task and nine North Kyungsang speakers for an imitation task. The first finding of the present study is the rigidity of categorical production, which controls the boundaries of lexical pitch accent contrasts. The categorization of North Kyungsang speakers' production allows them to classify minimal pitch accent contrasts. The categorical production in imitation appeared in two clusters, representing two meaningful contrasts. The second finding of the present study is that there are individual differences in speakers' production and imitation responses. The distinctive performances of individual speakers showed a variety of curves. For the HL-LH patterns, the categorical production tended to be highly distinctive as compared to the other pitch accent patterns (HH-HL and HH-LH), showing that there are more continuous curves than categorical curves. Finally, the present study shows that, for North Kyungsang speakers, imitative production is the core type of categorical production for determining the existence of the lexical pitch accent system. However, several questions remain for defining that categorical production, which leads to ideas for future research.