• 제목/요약/키워드: Corpus-based Study

검색결과 205건 처리시간 0.025초

모음 상승 현상의 음성적 고찰: 어미 {-고}의 실현을 중심으로 (A Phonetic Study of Vowel Raising: A Closer Look at the Realization of the Suffix {-go})

  • 이향원;신지영
    • 한국어학
    • /
    • 제81권
    • /
    • pp.267-297
    • /
    • 2018
  • Vowel raising in Korean has been primarily treated as a phonological, categorical change. This study aims to show how the Korean connective suffix {-go} is realized in various environments, and propose a principle of vowel raising based on both acoustic and perceptual data. To that end, we used a corpus of spoken Korean to analyze the types of syntactic constructions, the realization of prosodic boundaries (IP and PP), and the types of boundary tone associated with {-go}. It was found that the vowel tends to be raised most frequently in utterance-final position, while in utterance-medial position the vowel was raised more when the syntactic and prosodic distance between {-go} and the following constituent was smaller. The results for boundary tone also showed a correlation between vowel raising and the discourse function of the boundary tone. In conclusion, we propose that vowel raising is not simply an optional phenomenon, but rather a type of phonetic reduction related to the comprehension of the following constituent.

'막'의 운율적 특성과 담화적 기능 (Prosodic features and discourse functions of discourse marker 'mak'('막'))

  • 송인성
    • 한국어학
    • /
    • 제65권
    • /
    • pp.211-236
    • /
    • 2014
  • The aim of this study is to investigate categorical characteristics of 'mak' and their discourse functions through analyzed the prosodic features of 'mak'. The previous studies of 'mak' focused on grammatical or semantic characteristics, but this study focuses on the prosodic features of 'mak' based on speech data. As a result, adverb 'mak' and discourse marker 'mak' are distinguished from prosodic boundary, duration, pause and sort of number tonal patterns. Functions of discourse marker 'mak' is as follows: Maintenance of utterance, Attention, Delay, Expression negative manner. These functions have salient prosodic features related to their functions. Consequently prosodic features are important to analyze categorical characteristics and to establish functions of 'mak'.

한국어 추측 표현의 화행 실현 양상과 교수학습 내용 연구 (A Study of the Realization of Speech Act and Teaching-learning Contents of Korean Speculative Expressions)

  • 정미진
    • 한국어학
    • /
    • 제76권
    • /
    • pp.187-211
    • /
    • 2017
  • The purpose of this study is to investigate the speech act realization of speculative expressions and to present their teaching-learning contents. It is hard for Korean learners to use speculative expressions appropriately because there are various similar expressions and their meaning is distinctive in detail. This study describes speech act realizations of '-는 것 같다, -을까, -나 보다, -을걸'. All these forms have the meaning of speculations, so they are mainly used to present uncertain information or thoughts of speaker. But they show distinctive aspects. '-는 것 같다' is mainly used to present contents contrary to their counterparts' opinions or irritating for their counterparts. It is used as polite forms because it conveys meanings of uncertainty. Especially in these contexts, it performs the refusal speech acts. '-을까' has the characteristic feature in the complex forms such as '뭐랄까', '뭐라고 할까' and it performs request speech acts more frequently than '-는 것 같다'. Also it is used to express the speakers' opinions contrary to their counterparts'. '-나 보다' expresses speaker's speculations based on hearer's conditions or his speech, so it is used to respond to hearer actively and express interests unlike other speculative expressions. '-을걸' isn't used to perform request, to express interests to hearer. However, it is mainly used when speaker has the contrary assumptions or expectations to hearer's. Based on the analyze, this study presents and grades teaching-learning contents of speculative expressions.

텍스트 마이닝 기법을 이용한 환경 분야의 ICT 활용 연구 동향 분석 (A Study on Environmental research Trends by Information and Communications Technologies using Text-mining Technology)

  • 박보영;오관영;이정호;윤정호;이승국;이명진
    • 대한원격탐사학회지
    • /
    • 제33권2호
    • /
    • pp.189-199
    • /
    • 2017
  • 본 연구는 텍스트 마이닝 기법을 활용하여 환경 분야에서 ICT의 활용 연구동향을 정량적으로 분석하였다. 이를 위해 환경 분야 키워드 38개, ICT 관련 키워드 16개를 바탕으로 국가과학기술정보센터(NDSL)에서 최근 20년(1996년-2015년)의 논문 359편을 수집하였다. 해당 논문을 대상으로 환경 분야 및 ICT 관련 자연어를 처리하여 말뭉치(Corpus)단위로 분류체계를 재구성하였다. 전술된 분류체계의 키워드를 바탕으로 텍스트 마이닝 분석 기법인 빈도 분석, 키워드 분석, 키워드 간 연관규칙을 확인하였다. 그 결과 '환경 일반' 및 '기후' 분야의 키워드 출현 빈도가 전체의 77 %, ICT는 '공공융합서비스' 및 '산업융합서비스'가 약 30 %의 비율을 차지하였다. 시계열 분석을 통해 환경 분야에서의 ICT 활용 연구는 최근 5년(2011년-2015년)사이에 급증하여 과거(1996년-2010년)과 비교하여 약 2배 이상 관련 연구가 증가된 것으로 나타났다. 키워드 간 연관 규칙을 생성하여 환경 분야를 기준으로 나타내었을 때, '환경 일반'은 16개, '기후'는 '14'개의 ICT 기반 기술을 주로 활용하고 있는 것으로 확인하였다.

CASI 초분광 영상을 이용한 RapidEye 위성영상의 대리복사보정 (Vicarious Radiometric Calibration of RapidEye Satellite Image Using CASI Hyperspectral Data)

  • 장안진;최재완;송아람;김예지;정진하
    • 대한공간정보학회지
    • /
    • 제23권3호
    • /
    • pp.3-10
    • /
    • 2015
  • 지상의 모든 물체는 고유의 분광 반사율을 갖고 있으며, 이러한 특성을 이용하여 지상 물체의 분류와 목표물 탐지 등이 가능하다. 정확한 분석을 위해서는 취득된 원격탐사 자료를 분광 반사율로 변환해야 한다. 이를 위한 절대복사보정 기법으로는 자료 제공 기관에서 명시한 변환 수식을 이용하는 방법, 지상에서 측정한 분광 반사율만으로 단순 경험적 회귀 분석을 이용하는 방법, ATCOR/FLAASH 같은 수학적 모델을 이용하는 방법 등이 있다. 본 연구에서는 CASI 초분광 영상의 분광 반사율 자료를 이용하여 RapidEye 위성영상의 대리복사보정을 수행하고, 그 결과를 다른 복사보정 기법 결과 및 지상 자료와 비교하였다. 실험 결과 제안 기법이 ATCOR 및 New Kurucz 2005 기법보다 높은 유사성을 보였으며, 일반적으로 활용되는 ELM 기법과 유사한 결과를 도출하였다.

한국어 음성합성기의 운율 예측을 위한 의사결정트리 모델에 관한 연구 (A Study of Decision Tree Modeling for Predicting the Prosody of Corpus-based Korean Text-To-Speech Synthesis)

  • 강선미;권오일
    • 음성과학
    • /
    • 제14권2호
    • /
    • pp.91-103
    • /
    • 2007
  • The purpose of this paper is to develop a model enabling to predict the prosody of Korean text-to-speech synthesis using the CART and SKES algorithms. CART prefers a prediction variable in many instances. Therefore, a partition method by F-Test was applied to CART which had reduced the number of instances by grouping phonemes. Furthermore, the quality of the text-to-speech synthesis was evaluated after applying the SKES algorithm to the same data size. For the evaluation, MOS tests were performed on 30 men and women in their twenties. Results showed that the synthesized speech was improved in a more clear and natural manner by applying the SKES algorithm.

  • PDF

First Report of Two Cephalobidae Species (Nematoda: Cephalobomorpha) in South Korea

  • Kim, Taeho;Kim, Jiyeon;Park, Joong-Ki
    • Animal Systematics, Evolution and Diversity
    • /
    • 제34권4호
    • /
    • pp.181-189
    • /
    • 2018
  • Cephalobus aff. quinilineatus (Shavrov, 1968) Anderson and Hooper, 1970 and Eucephalobus hooperi MarinariPalmisano, 1967 from the family Cephalobidae Filipjev, 1934 (Cephalobomorpha) are newly reported from South Korea. Cephalobus aff. quinilineatus is distinguished from other Cephalobus species by its high and rounded labial probolae and five lateral incisures, with three incisures extending to the tail terminus. Eucephalobus hooperi is distinguished from other Eucephalobus species by its three bifurcated labial probolae with pointed termini and by morphometric characters such as body and tail length and the corpus:isthmus ratio. In this study, the morphological characters and morphometrics of C. aff. quinilineatus and E. hooperi Korean population are described and illustrated based on optical and/or scanning electron microscopy.

First report and morphological description of two Acrobeloides species(Nematoda: Rhabditida: Cephalobidae) in South Korea

  • Kim, Taeho;Lee, Yucheol;Park, Joong-Ki
    • Journal of Species Research
    • /
    • 제10권4호
    • /
    • pp.405-411
    • /
    • 2021
  • The genus Acrobeloides(Cobb, 1924) Thorne, 1937 are bacterial feeders and are one of the most abundant and widely distributed nematode groups in various terrestrial environments. Based on morphological and morphometric analyses, we found two Acrobeloides species reported in Korea for the first time: A. bodenheimeri (Steiner, 1936) Thorne, 1937 and A. tricornis (Throne, 1925) Thorne, 1937. These species exhibit morphological characters concordant with typical features of the genus Acrobeloides, such as a fusiform pharyngeal corpus with swollen metacorpus and lateral incisures extending to the tail terminus. However, A. bodenheimeri is distinguished from other acrobeloids by having its low and rounded labial probolae, distinct post-uterine sac and five lateral incisures. Acrobeloides tricornis is distinguished from its congeners by the following characteristics: its high labial probolae with acuate termini, inconspicuous post-uterine sac and five lateral incisures. Morphological characters and their measurements, and illustrations of A. bodenheimeri and A. tricornis are described in this study.

A Study on the Performance Analysis of Entity Name Recognition Techniques Using Korean Patent Literature

  • Gim, Jangwon
    • 한국정보기술학회 영문논문지
    • /
    • 제10권2호
    • /
    • pp.139-151
    • /
    • 2020
  • Entity name recognition is a part of information extraction that extracts entity names from documents and classifies the types of extracted entity names. Entity name recognition technologies are widely used in natural language processing, such as information retrieval, machine translation, and query response systems. Various deep learning-based models exist to improve entity name recognition performance, but studies that compared and analyzed these models on Korean data are insufficient. In this paper, we compare and analyze the performance of CRF, LSTM-CRF, BiLSTM-CRF, and BERT, which are actively used to identify entity names using Korean data. Also, we compare and evaluate whether embedding models, which are variously used in recent natural language processing tasks, can affect the entity name recognition model's performance improvement. As a result of experiments on patent data and Korean corpus, it was confirmed that the BiLSTM-CRF using FastText method showed the highest performance.

텍스트 기반의 훈련 데이터 구축을 위한 자동 데이터 태깅 작업에 대한 연구 (A Study on Automatic Data Tagging for Text-based Training Data Construction)

  • 김나연;소혜령;박준호
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2020년도 추계학술발표대회
    • /
    • pp.1008-1009
    • /
    • 2020
  • 텍스트 기반의 훈련 데이터는 데이터를 수집한 이후에 각 문자별로 태깅 작업이 필요하다. 말뭉치(Corpus)는 언어학에서 주로 이루고 있는 텍스트 집합이다. 말뭉치는 각 단어의 품사 표기에 대한 정보가 태그 형태로 되어 있다. 본 연구에서는 한국어 기반의 태깅 작업을 연구했으며, 기본 한국어 말뭉치가 아닌 기업이나 연구 기관에서 데이터를 수집하여 말뭉치나 별도 학습 데이터를 구축하기 위한 자동 태깅 방법에 대해 알아본다.