• Title/Summary/Keyword: 단어산출 (word production)

Search Result 84, Processing Time 0.019 seconds

A Study on Automatic Indexing of Korean Texts based on Statistical Criteria (통계적기법에 의한 한글자동색인의 연구)

  • Woo, Dong-Chin
    • Journal of the Korean Society for Information Management
    • /
    • v.4 no.1
    • /
    • pp.47-86
    • /
    • 1987
  • The purpose of this study is to present an effective automatic indexing method for Korean texts based on statistical criteria. Titles and abstracts of 299 documents randomly selected from ETRI's DOCUMENT database are used as the experimental data. The data are divided into 4 word groups, each of which is analyzed and evaluated by applying 3 automatic indexing methods: Transition Phenomena of Word Occurrence, Inverse Document Frequency Weighting Technique, and Term Discrimination Weighting Technique.
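As a rough illustration of the second method named in this abstract, the following is a minimal sketch of inverse document frequency weighting. It assumes the common textbook form log(N / df); the abstract does not state which variant the study used, and the function name and the toy documents are hypothetical.

```python
import math
from collections import Counter


def inverse_document_frequency(documents):
    """Return {term: log(N / df)} for a list of tokenized documents.

    N is the number of documents and df is the number of documents
    that contain the term at least once (assumed textbook definition).
    """
    n_docs = len(documents)
    doc_freq = Counter()
    for tokens in documents:
        # Count each term once per document, however often it repeats.
        doc_freq.update(set(tokens))
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


# Hypothetical toy collection standing in for titles and abstracts.
docs = [
    ["automatic", "indexing", "korean"],
    ["automatic", "retrieval"],
    ["automatic", "indexing", "statistics"],
]
idf = inverse_document_frequency(docs)
```

Under this formulation a term that occurs in every document gets weight 0, while rarer terms get larger weights, which is why IDF tends to favor discriminating index terms.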

Cerebral activation related with morphological priming effect in production of Korean Endings (한국어 어말어미 산출관련 대뇌 활성화)

  • Hwang, Yu-Mi;Shin, Jung-Moo;Lim, Soo-Mee;Ryu, Keun-Taek;Khang, Hyun-Soo;Yi, Kwang-Oh;Nam, Ki-Chun
    • Proceedings of the Korean Society for Cognitive Science Conference
    • /
    • 2005.05a
    • /
    • pp.273-277
    • /
    • 2005
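  • This study was conducted to examine the cerebral regions activated during the production of Korean word-final endings. Two experiments were carried out. Experiment 1 was an isolated-word experiment in which participants were given the base form of a word-final ending and produced it in the interrogative and imperative forms. A vowel-change condition (C1) and an Arabic-character viewing condition (C2) were used as control conditions. In Experiment 1, the 'word-final ending - C1' condition showed activation in the temporal and frontal lobes of the left hemisphere, namely the superior temporal gyrus and the inferior frontal gyrus. The 'word-final ending - C2' condition showed occipital activation in the right hemisphere and, in the left hemisphere, activation in the occipital lobe, frontal lobe, lingual gyrus, cuneus, fusiform gyrus, and inferior occipital gyrus. Experiment 2 used the ER-fMRI technique to observe the cerebral regions associated with the morphological priming effect of imperative and interrogative endings. The experimental conditions were a same-ending condition, a stem-repetition condition, and an unrelated condition. After a prime stimulus, a cue was presented, and participants were asked to produce the target word that followed in the interrogative or imperative form. When the activation recorded while viewing "***" was subtracted from the activation recorded while producing the interrogative and imperative forms, Broca's area in the left hemisphere was activated in both cases. When the unrelated condition was subtracted from the same-ending condition within the interrogative and imperative forms, activation was observed in the left superior temporal gyrus. Taken together, these results indicate that the temporal and frontal regions of the left hemisphere are directly related to the production of word-final endings itself. In particular, the activation in Broca's area found in connection with morphological priming during the production of Korean word-final endings is presumed to reflect articulation and motor coordination involved in production rather than ending transformation, and the temporal activation is presumed to reflect a database of morphological and semantic knowledge. The activation observed in the right frontal region is conjectured to be an inhibition-related area.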

Some Characteristics of Language Production Processes: The Effects of Knowledge Types, Text Types, and Production Modes (언어 산출 과정의 몇 가지 특성: - 지식 유형, 텍스트 유형, 산출양식이 언어 산출에 미치는 효과)

  • Rho, Young-Hee;Lee, Jung-Mo
    • Annual Conference on Human and Language Technology
    • /
    • 1993.10a
    • /
    • pp.241-247
    • /
    • 1993
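  • This study examined the effects of knowledge type, text type, and production mode on language production processes. Three experiments investigated what processing load is imposed on language production when 1) prior information about the discourse to be produced is manipulated across three knowledge types (macro-level semantic structure, micro-level semantic structure, and a set of related words), 2) the type of discourse to be produced is varied between narrative and expository discourse, and 3) the production mode is varied among speaking, writing with a pen, and writing on a computer.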

Extracting Multi-type Elements Consisting of Multi-words from Sentences (문장으로부터 여러 단어로 구성된 여러 유형의 요소 추출)

  • Yang, Seon;Ko, Youngjoong
    • Annual Conference on Human and Language Technology
    • /
    • 2014.10a
    • /
    • pp.73-77
    • /
    • 2014
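  • Information extraction, the task of automatically extracting from sentences the elements needed for a particular application, is one of the important tasks in natural language processing and text mining. When an element to be extracted consists of several words rather than a single word, the number of factors to consider during extraction increases considerably. The elements to be extracted are also of several types: in sentiment analysis, for example, elements such as the speaker, object, and attribute must be analyzed, and in comparison mining, elements such as the comparison subject, the comparison counterpart, and the comparative predicate. This paper proposes a method that simultaneously extracts multiple types of elements, each of which may consist of multiple words. The proposed method has the advantage of being very simple to implement: it requires only two steps, morpheme tagging and transformation-based learning, and no separate preprocessing such as parsing or chunking. For evaluation, the method was applied to comparison mining. Three types of comparison elements, each of which may consist of multiple words, were automatically extracted from comparative sentences, and the experiments yielded a strong accuracy of 84.33%.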

Age-related Changes in Word Defining Abilities in Concrete and Abstract Nouns with Normal Elderly (노화에 따른 구체명사와 추상명사의 단어정의하기 능력 변화)

  • Kim, Soo Ryon;Kim, HyangHee
    • 재활복지
    • /
    • v.21 no.3
    • /
    • pp.187-207
    • /
    • 2017
  • The purpose of this study was to explore the characteristics of concrete and abstract noun definitions produced by the elderly. A total of 382 elderly people participated in this study and were classified into four age groups (i.e., over 55 to under 64, over 65 to under 74, over 75 to under 84, and over 85 years old). They performed a word definition task composed of five concrete and five abstract nouns. The total scores and the numbers and ratios of core and supplementary meanings were compared among the four groups. The frequency and ratio of error types were also examined. The results showed that the four groups differed significantly in total scores and in the numbers and ratios of core and supplementary meanings on the concrete noun definition task. In addition, abstract noun definition performance revealed group differences except between the two oldest groups (over 75 to under 84 and over 85 years old). The oldest group showed a sharp increase in error production. The most frequent error type was personal experience in the over 55 to under 64 and the over 65 to under 74 groups, target word repetition in the over 75 to under 84 group, and no response in the over 85 group. In conclusion, both concrete and abstract word defining abilities showed age-related deterioration. This decline results from impairment in the spreading of semantic knowledge within the semantic network, which is vulnerable to aging. Characteristics of word definition in the elderly can provide basic information for understanding various neurolinguistic disorders associated with age.

Convergent Analysis on the Speech Sound of Typically Developing Children Aged 3 to 5 : Focused on Word Level and Connected Speech Level (3-5세 일반아동의 말소리에 대한 융합적 분석: 단어와 자발화를 중심으로)

  • Kim, Yun-Joo;Park, Hyun-Ju
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.6
    • /
    • pp.125-132
    • /
    • 2018
  • This study investigated the speech sound production characteristics and evaluation aspects of preschool children through a word test and a connected speech test. For this, the authors administered the Assessment of Phonology and Articulation for Children (APAC) to 72 typically developing children (24 each of three-, four-, and five-year-olds) and analyzed differences in percent of correct consonants (PCC) and intelligibility according to age and sex, the correlation between PCC and intelligibility, and speech sound error patterns. PCC and intelligibility increased with age, but there was no difference according to sex. The correlation was statistically significant in the 5-year-old group. Speech sound error patterns differed between the two tests. This study showed that children's speech sound production varied according to language unit. Therefore, both types of tests should be administered to assess speech sound production ability properly. This suggests that the current standard of identifying language impairment only by word-level PCC requires review and further study.
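For readers unfamiliar with the PCC measure used in this abstract, here is a minimal sketch of the usual calculation: the number of correctly produced consonants divided by the number of target consonants, times 100. It assumes the target and produced consonants have already been aligned one-to-one, with `None` marking an omission; the function name and the sample data are hypothetical, and APAC's actual scoring rules are not reproduced here.

```python
def percent_correct_consonants(target, produced):
    """Return PCC = 100 * (correct consonants) / (target consonants).

    Both arguments are equal-length sequences of consonant symbols that
    were aligned beforehand (an assumption of this sketch). A produced
    value of None stands for an omitted consonant.
    """
    if len(target) != len(produced):
        raise ValueError("target and produced must be aligned to the same length")
    if not target:
        raise ValueError("at least one target consonant is required")
    correct = sum(1 for t, p in zip(target, produced) if t == p)
    return 100.0 * correct / len(target)


# Hypothetical sample: four target consonants, one substitution (s -> t).
target_consonants = ["k", "m", "s", "n"]
produced_consonants = ["k", "m", "t", "n"]
pcc = percent_correct_consonants(target_consonants, produced_consonants)
```

Because the score depends on which consonants a test elicits, the same child can obtain different PCC values at the word level and in connected speech, which is the comparison the study makes.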

Automatic Keyword Extraction using Hierarchical Graph Model Based on Word Co-occurrences (단어 동시출현관계로 구축한 계층적 그래프 모델을 활용한 자동 키워드 추출 방법)

  • Song, KwangHo;Kim, Yoo-Sung
    • Journal of KIISE
    • /
    • v.44 no.5
    • /
    • pp.522-536
    • /
    • 2017
  • Keyword extraction can be utilized in text mining of massive document collections to efficiently extract subject words or related words from a document. In this study, we propose a hierarchical graph model based on the co-occurrence relationship, the intrinsic dependency relationship between words, and common sub-words in a single document. In addition, an enhanced TextRank algorithm that reflects the influence of outgoing edges as well as incoming edges is proposed. A novel keyword extraction scheme that uses the proposed hierarchical graph model and the enhanced TextRank algorithm is then proposed to extract representative keywords from a single document. In the experiments, various evaluation methods were applied to documents on various subjects in order to verify the accuracy and adaptability of the proposed scheme. As a result, the proposed scheme showed better performance than previous schemes.
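To make the baseline concrete, the following is a minimal sketch of standard TextRank over an undirected word co-occurrence graph. It is not the hierarchical graph model or the enhanced TextRank proposed in the paper, whose details the abstract does not give. The function names, the window size, the damping factor of 0.85, and the toy token list are all assumptions made for illustration.

```python
from collections import defaultdict


def build_cooccurrence_graph(tokens, window=1):
    """Link each word to the words that follow it within `window` positions."""
    graph = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != word:
                graph[word].add(other)
                graph[other].add(word)
    return dict(graph)


def textrank(graph, damping=0.85, iterations=50):
    """Iterate the unweighted TextRank update a fixed number of times."""
    scores = {node: 1.0 for node in graph}
    for _ in range(iterations):
        new_scores = {}
        for node, neighbors in graph.items():
            # Each neighbor shares its score equally among its own links.
            rank = sum(scores[nb] / len(graph[nb]) for nb in neighbors)
            new_scores[node] = (1.0 - damping) + damping * rank
        scores = new_scores
    return scores


# Hypothetical token sequence standing in for a preprocessed document.
tokens = "keyword extraction graph keyword ranking graph keyword".split()
graph = build_cooccurrence_graph(tokens)
scores = textrank(graph)
top_keywords = sorted(scores, key=scores.get, reverse=True)
```

In this undirected form a word's score comes only from the words linked to it, so incoming and outgoing influence are not distinguished; the paper's variant is described as accounting for both.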

Phonological development of children aged 3 to 7 under the condition of sentence repetition (문장 따라말하기 과제에서 3~7세 아동의 말소리발달)

  • Kim, Soo-Jin;Park, Na rae;Chang, Moon Soo;Kim, Young Tae;Shin, Moonja;Ha, Ji-Wan
    • Phonetics and Speech Sciences
    • /
    • v.12 no.1
    • /
    • pp.85-95
    • /
    • 2020
  • Sentence repetition is a way of evaluating speech sound production that addresses the limitations of word tests and spontaneous speech analysis. Speech sounds produced by children can be evaluated using several indicators. This study examined the progression of the percentage of correct consonants-revised (PCC-R) and phonological whole-word measures in different age and gender groups after setting consonants in various vowel contexts and implementing sentence repetition tasks designed to give all phonemes the chance to appear at least three times. For this study, 11 sentence repetition tasks were administered to 535 children aged 3 to 7 across the country, after which the resulting PCC-R and whole-word measures were analyzed. The results showed that all the indicators improved in older age groups and that there were significant differences depending on age. However, no significant differences dependent on gender were found. The sentence repetition data used in this study were collected from across the country, and the age difference between adjacent age groups was six months. This study is noteworthy because it collected a sufficient amount of data from each group, highlighted the limitations of word naming and spontaneous speech analysis, and suggested new evaluation criteria through the analysis of each whole-word measure in sentence repetition, which had not been applied in previous studies.

A Usability Testing of the Word-Prediction Function of the AAC Keyboard for the People with Cerebral Palsy (보완대체의사소통(AAC) 글자판의 단어예측기능에 대한 뇌병변장애인 대상의 사용성 평가)

  • Lee, H.Y.;Hong, K-H.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.9 no.3
    • /
    • pp.209-214
    • /
    • 2015
  • The purpose of this study was to examine (1) the influence of the word-prediction function on sentence generation speed and (2) the necessity, convenience, and satisfaction of the word-prediction function of the AAC keyboard. A total of 10 adults with cerebral palsy participated, and the word-prediction function of the keyboard of a Korean high-tech AAC device called "MyTalkie Smart" was used. Participants were required to generate sentences as voice output using the word-prediction function and the direct letter-input function in turn, and then to rate necessity, convenience, and satisfaction on a five-point Likert scale. Other user requirements were collected through free feedback. The results showed that sentence generation was faster when participants used the word-prediction function than when they used the direct letter-input function. However, the difference between the two input methods was not statistically significant, which might be due to the lack of time to practice with the new device. Participants responded positively regarding the necessity, convenience, and satisfaction of the word-prediction function.

Regional differences in Korean children's development of speech production (우리나라 아동의 지역별 말소리 발달 차이)

  • Shin, Moonja;Ha, Ji-Wan;Kim, Young Tae;Kim, Soo-Jin
    • Phonetics and Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.57-67
    • /
    • 2019
  • This study aimed to investigate regional differences in the development of speech production in Korean children. A total of 619 children aged 2 to 7 years from the Jeolla, Seoul/Gyeonggi, Chungcheong, and Gyeongsang areas were included in this study. The subjects were assessed with the UTAP2 word-level test. In PWC, PMLU, and PWP, the performance was significantly lower in Gyeongsang at 2 years 11 months and in Jeolla and Chungcheong at 3 years 5 months than in Seoul/Gyeonggi. The total PCC of Gyeongsang and Chungcheong and UTAP PCC of Chungcheong were significantly lower at 2 years 11 months compared with those of Seoul/Gyeonggi, while Jeolla and Chungcheong showed significantly lower total PCC and UTAP PCC than Seoul/Gyeonggi at 3 years 5 months. However, no regional difference was observed in any indicators after the age of 3 years 6 months. These results suggest that there are regional differences in the ability to produce speech sounds at a very young age, and that the differences can be explained by the differences between Seoul/Gyeonggi and the other provinces rather than by the individual characteristics of specific regions.