• Title/Abstract/Keywords: 하위범주화 (subcategorization)

99 results (processing time: 0.025 s)

Prediction of Prosodic Break Using Syntactic Relations and Prosodic Features (구문 관계와 운율 특성을 이용한 한국어 운율구 경계 예측)

  • Jung, Youngim;Cho, SunHo;Yoon, Aesun;Kwon, Hyuk-Chul
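    • Annual Conference on Human and Language Technology / Proceedings of the 19th Conference on Hangul and Korean Information Processing, 2007 (Korean Institute of Information Scientists and Engineers, Language Engineering Research Group) / pp.7-14 / 2007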
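  • To predict natural Korean prosodic phrase boundaries, this paper (1) subcategorizes sentence constituents, (2) extracts syntactic phrases using the dependency relations among the subdivided constituents, and (3) defines prosodic phrase boundary prediction rules according to the type of the extracted syntactic phrase. In addition, (4) beyond syntactic information, the system determines variable prosodic phrase boundaries according to the length of the syntactic phrase and of the sentence, the position of the syntactic phrase within the sentence, and the semantic information of the context, resulting in a more natural Korean prosodic phrase boundary prediction system. The system achieved recall and accuracy above 90% in predicting strong prosodic phrase boundaries, which correlate highly with syntactic phrase boundaries, and non-boundaries inside prosodic phrases, and performance above 87% in overall prosodic phrase boundary prediction.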

Implementation of Dependency Parser using Argument Information based on Korean WordNet (한국어 어휘의미망에 기반한 논항 정보를 이용한 의존문법 구문분석기의 구현)

  • Im, Gyeong-Eop;Jung, Youngim;Kwon, Hyuk-Chul
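    • Annual Conference on Human and Language Technology / Proceedings of the 19th Conference on Hangul and Korean Information Processing, 2007 (Korean Institute of Information Scientists and Engineers, Language Engineering Research Group) / pp.158-164 / 2007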
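  • In Korean, a single eojeol (spacing unit) consists of one or more morphemes, which gives rise to local ambiguity. Most previous studies either excluded this local ambiguity or removed it with a tagger. This study attempts to parse every morphological analysis of a sentence. It describes the techniques applied to resolve ambiguity, including dependency grammar rules, chunking, adverb subcategorization, and the use of an argument information dictionary, and reports parsing performance through experiments. In particular, the Korean WordNet is used to avoid the burden of building a separate argument information dictionary for each corpus.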

Automatic Text Categorization Using Passage-based Weight Function and Passage Type (문단 단위 가중치 함수와 문단 타입을 이용한 문서 범주화)

  • Joo, Won-Kyun;Kim, Jin-Suk;Choi, Ki-Seok
    • The KIPS Transactions: Part B / Vol. 12B, No. 6 / pp.703-714 / 2005
  • Research in text categorization has been confined to whole-document-level classification, probably owing to the lack of full-text test collections. However, the full-length documents available today in large quantities have renewed interest in text classification. A document is usually written in an organized structure to present its main topic(s). This structure can be expressed as a sequence of sub-topic text blocks, or passages. To reflect the sub-topic structure of a document, we propose a new passage-level, or passage-based, text categorization model, which segments a test document into several passages, assigns categories to each passage, and merges the passage categories into document categories. Compared with traditional document-level categorization, this model requires two additional steps: passage splitting and category merging. Using four subsets of the Reuters text categorization test collection and a full-text test collection whose documents range from tens to hundreds of kilobytes, we evaluated the proposed model, especially the effectiveness of various passage types and the importance of passage location in category merging. Our results show that simple windows are best for all test collections tested in these experiments. We also found that passages contribute to the main topic(s) to different degrees, depending on their location in the test document.
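
The three steps named in this abstract (passage splitting, per-passage category assignment, category merging) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the classifier, the window size, or the merging function, so the keyword lists, the fixed-size token windows, and the optional `position_weight` callback below are all hypothetical stand-ins chosen only to make the pipeline concrete.

```python
from collections import Counter
from typing import Callable, Dict, List, Optional

# Hypothetical keyword lists standing in for a trained per-passage classifier.
KEYWORDS: Dict[str, set] = {
    "grain": {"wheat", "corn", "harvest", "grain"},
    "crude": {"oil", "crude", "barrel"},
}


def split_into_windows(tokens: List[str], size: int, step: int) -> List[List[str]]:
    """Split tokens into fixed-size windows; the last window may be shorter."""
    if size <= 0 or step <= 0:
        raise ValueError("size and step must be positive")
    windows = []
    for start in range(0, len(tokens), step):
        windows.append(tokens[start:start + size])
        if start + size >= len(tokens):
            break  # the remaining tokens are already covered
    return windows


def keyword_classifier(passage: List[str]) -> Dict[str, float]:
    """Score each category by how many passage tokens are in its keyword list."""
    return {
        category: float(sum(token in words for token in passage))
        for category, words in KEYWORDS.items()
    }


def categorize_document(
    tokens: List[str],
    classify: Callable[[List[str]], Dict[str, float]],
    size: int = 4,
    step: int = 4,
    position_weight: Optional[Callable[[int, int], float]] = None,
) -> Dict[str, float]:
    """Split into passages, score each passage, and merge by weighted sum."""
    passages = split_into_windows(tokens, size, step)
    totals: Counter = Counter()
    for index, passage in enumerate(passages):
        # Assumed merging rule: a weighted sum, with weight 1.0 by default.
        weight = position_weight(index, len(passages)) if position_weight else 1.0
        for category, score in classify(passage).items():
            totals[category] += weight * score
    return dict(totals)


if __name__ == "__main__":
    doc = "wheat corn harvest grain oil crude barrel price".split()
    print(categorize_document(doc, keyword_classifier))
    # Give the first passage double weight to mimic location-sensitive merging.
    print(categorize_document(
        doc, keyword_classifier,
        position_weight=lambda i, n: 2.0 if i == 0 else 1.0,
    ))
```

The `position_weight` hook is one simple way to express the finding that a passage's contribution depends on its location; the weighting scheme actually used in the paper may differ.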

An Exploratory Study on Modalities and Harmful Effects of 'Chinmokjil(Socializing Behavior)' (온라인 커뮤니티에서의 '친목질'의 행태와 폐해에 대한 탐색적 연구)

  • Jung, Seung-Hwan;Kim, Hee-Eun;Kim, Shinwoo
    • Journal of Digital Contents Society / Vol. 19, No. 8 / pp.1471-1480 / 2018
  • This study explored what the phenomenon of 'Chinmokjil (Socializing Behavior)', of which online communities are seriously wary, implies and how it actually affects online communities. Interviews were conducted with 13 people who had experienced Chinmokjil in online communities, and the results were analyzed qualitatively. First, Chinmokjil is conceptualized as the 'privatization or private organizing of an online community'. Second, the actual phenomena of Chinmokjil are sub-categorized into 8 categories. Third, the ultimate negative impacts of Chinmokjil are sub-categorized into 3 categories. Based on the results, it can be explained that the norms unique to online communities differ from those of offline communities.

Development of the Encouraging Language Model for Elementary School Teachers (초등학교 교사를 위한 격려 언어 모형 개발)

  • Seon, Young-Woon;Oh, Ik-Soo
    • The Korean Journal of Elementary Counseling / Vol. 10, No. 1 / pp.39-56 / 2011
  • The purpose of this study is to draw the elements of encouraging language from the literature on encouragement and to develop an encouraging language model for elementary school teachers. To this end, the literature on methods of encouragement was first collected and then categorized according to the main concept each source contained. As a result, 5 categories and 17 subcategories were drawn. The 5 categories were valuing a child as a human being, trusting a child, thinking rationally about a child's mistakes, giving non-evaluative feedback on a child's behavior, and reflecting a child's positive feelings. These 5 categories were established as the elements of encouraging language, and the encouraging language model was developed on the basis of these 5 elements. The model consists of examples of encouraging language for the various situations that elementary school teachers often confront in their classrooms, and it shows examples of encouraging language appropriate to each situation. Every example was constructed on the basis of the elements of encouraging language.

Improving a Korean Spell/Grammar Checker for the Web-Based Language Learning System (웹기반 언어 학습시스템을 위한 한국어 철자/문법 검사기의 성능 향상)

  • 남현숙;김광영;권혁철
    • Korean Journal of Cognitive Science / Vol. 12, No. 3 / pp.1-18 / 2001
  • The goal of this paper is the pedagogical application of a Korean Spell/Grammar Checker to a web-based language learning system for Korean writing. To maximize the instructional efficiency of our learning system 'Urimal Baeumteo', we have to improve our Korean Spell/Grammar Checker. Today an NLP system's performance depends on its semantic processing capability. In our Korean Spell/Grammar Checker, the tasks accomplished at the semantic level are the detection and correction of misused derived and compound nouns in the Korean spell-checking device, and the detection and correction of syntactic and semantic errors in the Korean grammar-checking device. We describe a common approach to partial parsing using collocation rules based on dependency grammar. To provide more detailed semantic rules, we classified nouns according to their concepts and subcategorized verbs according to their syntactic and semantic features. Improving the Korean Spell/Grammar Checker makes our learning system active and intelligent in a web-based environment. We acknowledge the flaws in our system: classifying nouns by their meanings and concepts is a time-consuming task, and the analytic unit of this study is principally limited to the phrases in a sentence, so the accurate parsing of embedded sentences remains a difficult problem. For a web-based language learning system, it is critically important to consider the interface design and the structure of the contents.

The Design.Marketing Strategies for Korean Traditional Sauces by emotion-oriented Categorization (감성지향적 범주화를 통한 장류제품의 디자인.마케팅 전략)

  • Lee, Yu-Ri;Yang, Jong-Youl;Park, Sang-June
    • Science of Emotion and Sensibility / Vol. 10, No. 3 / pp.491-502 / 2007
  • Categorization is very important for product design. Consumers' emotions differ according to the type of categorization, so the design concept and design elements must be combined differently to match those differences. Categorization is especially necessary now that product lines are expanding and product differentiation is unclear. That is, designers must decide on the correct categories and a design concept based on the similarity of emotions, and provide consumer-oriented design. The purpose of this study is to provide a design direction for Korean traditional sauce products by extracting consumers' sensibilities from the overall image of Korean traditional sauces and from the image of each sauce (Korean hot pepper paste, soybean paste, fermented soybean paste, SsamJang, and soy sauce), and then deciding the category of each sauce based on the similarity of the extracted sensibilities. The results show that Korean traditional sauces are not differentiated in consumers' preference images. Our empirical research, an emotional image survey on sauces, concludes that the emotional images "well-being" and "tasty" have a positive influence, whereas "messy and dirty" and "smelly" have a negative influence. We therefore suggest that positive emotional images such as "tasty" be emphasized and negative emotional images such as "messy" be eliminated in the design and marketing strategy for Korean traditional sauces. This research offers a guideline for product design from both academic and practical perspectives.

Interpretation of Noun Sequence using Semantic Information Extracted from Machine Readable Dictionary and Corpus (기계가독형사전과 코퍼스에서 추출한 의미정보를 이용한 명사열의 의미해석)

  • 이경순;김도완;김길창;최기선
    • Korean Journal of Cognitive Science / Vol. 12, No. 1_2 / pp.11-24 / 2001
  • The interpretation of a noun sequence is the task of finding the semantic relations between the nouns in the sequence. To interpret a noun sequence, semantic knowledge about words and the relations between words is required. In this paper, we propose a method for interpreting the semantic relations between the nouns in a noun sequence. We extract semantic information from a machine-readable dictionary (MRD) and a corpus using regular expressions. Based on the extracted information, the semantic relation of a noun sequence is interpreted. We also use verb subcategorization information together with the semantic information from the MRD and the corpus. Previous research uses semantic knowledge extracted only from an MRD, whereas our method uses an MRD, a corpus, and subcategorization information to interpret noun sequences. Experimental results show that our method improves the accuracy rate by 40.30% and the coverage rate by 12.73% over previous research.

Exploring the Image Types of Secondary School Students' Perception about the Talented Person in Convergence (중등학생들이 생각하는 융합인재에 대한 이미지 유형 탐색)

  • Lee, Jun-Ki;Lee, Tae-Kyong;Shin, Sein;Chung, Duk-Ho;Oh, Sang-Wook
    • Journal of The Korean Association For Science Education / Vol. 33, No. 7 / pp.1486-1509 / 2013
  • This study aims to identify the image types in secondary school students' perceptions of the talented person in convergence, and to find differences in the drawn images between students who have taken a STEAM class and those who have not. One hundred eighty-seven middle and high school students in the southern part of South Korea participated in this study and were asked to draw a picture of the talented person in convergence with a brief explanation. Based on the students' pictures, the researchers categorized their perceptions of convergence and of the talented person in convergence using an inductive method. The results indicated that the students' perceptions fell into three categories: convergence as individual cognitive processing, convergence as collective cognitive processing, and convergence as outcomes. The image of the talented person in convergence leaning toward individual cognitive processing was divided into seven types: idea banker, variously talented celebrity, multi-tasking master, multi-talented career, active problem-solver, creative developer, and unrealistic ideal man. The image of collective cognitive processing was split into the expert group type and the interactive-mates group type. The remaining image was the transformer type, a subcategory of convergence as outcomes. This study suggests that secondary school students express various images of the talented person in convergence depending on whether they have experienced STEAM.

Two-Level Clausal Segmentation Algorithm using Sense Information (의미 정보를 이용한 이단계 단문 분할 알고리즘)

  • Park, Hyun-Jae;Lee, Su-Seon;Woo, Yo-Seop
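    • Annual Conference on Human and Language Technology / Proceedings of the 11th Conference on Hangul and Korean Information Processing, 1999 (Korean Institute of Information Scientists and Engineers, Language Engineering Research Group) / pp.237-241 / 1999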
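  • Clausal segmentation divides a sentence around its predicates when the sentence contains more than one predicate as a head. Existing methods give good results for well-formed sentences but show limitations on syntactically complex sentences. To overcome these limitations, this paper proposes a method that segments complex sentences into simple clauses efficiently by using semantic information rather than segmenting on syntactic information. Unlike well-formed sentences, everyday sentences frequently exhibit structural ambiguity and omitted particles, so clausal segmentation at the semantic level is necessary. We show that performing clausal segmentation in the semantic domain can resolve the ambiguities of existing methods. To this end, we propose a two-level clausal segmentation algorithm that first builds the dependency structure between predicates and complements using the semantic information of a subcategorization dictionary and a thesaurus, and then incrementally adds the remaining constituents to the dependency structure using syntactic information and other grammatical knowledge. To demonstrate the usefulness of the proposed method, the dependency structure between predicates and complements is tagged semi-automatically in 20,000 sentences from the ETRI-KONAN corpus, and an experiment compares this tagging with the output of the proposed method.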
