• Title/Summary/Keyword: phrase identification


Korean Noun Phrase Identification using Maximum Entropy Method (최대 엔트로피 모델을 이용한 한국어 명사구 추출)

  • Kang, In-Ho;Jeon, Su-Young;Kim, Gil-Chang
    • Annual Conference on Human and Language Technology / 2000.10d / pp.127-132 / 2000
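  • This paper studies a method for extracting noun phrases, including their modifiers, that exploits the syntactic properties of case particles. Unlike existing methods, which use a contiguous sequence of morphemes as context for deciding whether a span is a noun phrase, this method uses the beginning and end of the noun phrase and the morphemes surrounding it, so that the modifying part and the head noun of the noun phrase serve as contextual information. The various kinds of contextual information are combined into a single probability distribution by the Maximum Entropy Principle. The proposed method first derives noun phrase grammar rules, expressed as part-of-speech sequences, from a corpus annotated with syntactic trees. These rules are then used to extract noun phrase candidates adjacent to case particles. Each candidate is assigned a probability of being interpreted as a noun phrase, based on the probability distribution obtained from the training corpus. The candidate with the highest probability is selected, which yields the noun phrase associated with each case particle. In experiments with the proposed model, noun phrases containing an average of 4.5 phrases could be extracted.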
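The candidate selection step described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the feature names, the weights in `WEIGHTS`, and the helper functions are all hypothetical, and a real model would learn its weights from a training corpus.

```python
import math

# Hypothetical feature weights (lambda values); a real maximum entropy
# model would estimate these from a syntactically annotated corpus.
WEIGHTS = {
    "first_pos=ADJ": 0.8,
    "first_pos=NOUN": 0.3,
    "head_pos=NOUN": 1.2,
    "prev_pos=VERB": 0.5,
}

def maxent_prob(features, weights=WEIGHTS):
    """P(noun phrase | features) under a binary log-linear model.

    The 'not a noun phrase' class is given a score of 0, so the
    normalizer is exp(score) + 1.
    """
    score = sum(weights.get(f, 0.0) for f in features)
    return math.exp(score) / (math.exp(score) + 1.0)

def best_candidate(candidates):
    """Return the (span, probability) pair with the highest probability.

    `candidates` is a list of (span, feature_list) pairs, all adjacent
    to the same case particle.
    """
    scored = [(span, maxent_prob(feats)) for span, feats in candidates]
    return max(scored, key=lambda pair: pair[1])
```

Each candidate is scored independently and the highest-scoring one is kept for the case particle, which mirrors the selection rule stated in the abstract.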


A Language Model and Clue based Machine Learning Method for Discovering Technology Trends from Patent Text (특허 문서 텍스트로부터의 기술 트렌드 탐지를 위한 언어 모델 및 단서 기반 기계학습 방법)

  • Tian, Yingshi;Kim, Young-Ho;Jeong, Yoon-Jae;Ryu, Ji-Hee;Myaeng, Sung-Hyon
    • Journal of KIISE:Software and Applications / v.36 no.5 / pp.420-429 / 2009
  • Patent text is a rich source for discovering technological trends. To automate such a discovery process, we attempt to identify phrases corresponding to a problem and its solution method, which together form a technology. Problem and solution phrases are identified by an SVM classifier using features based on a combination of a language modeling approach and linguistic clues. Based on the occurrence statistics of the phrases, we identify the time span of each problem and solution and finally generate a trend. Our experiment shows that the proposed semantic phrase identification method is promising, with an accuracy of 77% in R-precision. We also show that the unsupervised method for discovering technological trends is meaningful.
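For readers unfamiliar with the R-precision measure cited in this abstract, the sketch below shows its standard definition: the precision of the top R ranked items, where R is the number of relevant items. The function name and inputs are illustrative only and are not taken from the paper.

```python
def r_precision(ranked, relevant):
    """Precision over the top R ranked items, where R = number of relevant items."""
    relevant = set(relevant)
    r = len(relevant)
    if r == 0:
        return 0.0  # convention chosen here: no relevant items gives 0
    top_r = ranked[:r]
    hits = sum(1 for item in top_r if item in relevant)
    return hits / r
```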

Automatic Construction of Foreign Word Transliteration Dictionary from English-Korean Parallel Corpus (영-한 병렬 코퍼스로부터 외래어 표기 사전의 자동 구축)

  • Lee, Jae Sung
    • The Journal of Korean Association of Computer Education / v.6 no.2 / pp.9-21 / 2003
  • This paper proposes a system that automatically constructs a transliteration dictionary from an English-Korean parallel corpus. The system works in three steps: it first extracts all nouns from the Korean documents, then selects the transliterated foreign-word nouns among them using a language identification method, and finally extracts the corresponding English words using a probabilistic alignment method. In particular, the fact that a corresponding English word exists in most cases is used to extract the purely transliterated part from a Korean word phrase, which usually appears combined with Korean endings (Eomi) or particles (Josa). Moreover, words in the two different alphabet systems are compared phonetically in a direct manner, without converting them to a common alphabet system. The experiment showed that performance was influenced by the first and second preprocessing steps; the most efficient model among the manually preprocessed ones achieved 85.4% recall and 91.0% precision, and the most efficient model among the fully automated ones achieved 68.3% recall and 89.2% precision.


Reconsideration of the Formation Process of Current Nagyangchun (현행 낙양춘의 형성과정 재고)

  • Yim, Hyun-taek
    • (The) Research of the performance art and culture / no.43 / pp.79-120 / 2021
  • Nagyangchun is a Dangak that, together with Boheoja, has been handed down to the present; it is a Saak of the Song Dynasty that was introduced in the Goryeo Dynasty. The title and lyrics of Nagyangchun are recorded in the Dangakjo of Goryeosa-akji and in the Jeungbomunheonbigo. The surviving scores containing Nagyangchun include Akjangyoram and Sogagwonbo Vol.4 and Vol.6 from the Joseon Dynasty, and Aakbu-akbo, the 6th Aaksaeng-gyogwacheol, and Leewangjikaakbu-oseonakbo from the Japanese colonial period. In addition, the current melody of Nagyangchun is based on Hangugeumak and Gugakjeonjib, published by the National Gugak Center. Using these scores as research material, this paper examines the process of change through which Nagyangchun, as currently performed at the National Gugak Center, came to have its present structure and form. The results are summarized as follows. First, the song of Nagyangchun, which was originally Saak but had been transmitted as an instrumental piece without lyrics, first appeared in Hangugeumak Vol.16 and Gugakjeonjib Vol.7, published by the National Gugak Center in 1978 and 1979. In this process, the Janggu added by Kim Ki-soo is now disappearing and being replaced by the Jwago. Second, although the five notes 黃, 太, 仲, 林, and 南 have remained unchanged since the Akjangyoram, the pitches of 無/應 and 夾/姑, which appear once each, gradually rose and were unified into 應 and 姑 during the period of Aaksaeng-gyogwacheol or, at the latest, Leewangjikaakbu-oseonakbo, and have remained so to the present. Third, the current melody of Nagyangchun has a structure in which the tones and range of each phrase rise within the form of Mijeonsa (a·b·c·d) and Mihusa (e·b'·c'·d'). In particular, except for the a-type and e-type melodies, which introduce the Mijeonsa and Mihusa, the remaining melodic types show a gradually descending structure within the corresponding phrase, so the ascending and descending structures are generally in harmony. 
Fourth, the Ganeum that have appeared since Aakbu-akbo are currently classified into seven types; they appear in ascending pitches of a 2nd, 3rd, 4th, and 5th and serve to connect the melodic progression smoothly. Fifth, Nagyangchun, which had been handed down after Akjangyoram as an instrumental piece without lyrics, was restored in 1960 by Lee Hye-gu and is now passed down in a form with male and female vocals added to the instrumental accompaniment. The examination of the current Nagyangchun, which was formed through this process of change after Akjangyoram, revealed issues that require reconsideration in the version played at the National Gugak Center, such as the arrangement of the Janggu, the identification of the key, and the investigation of the lyrics. Continued follow-up studies will be able to contribute to the cultural transmission of Nagyangchun.

Identification of On-site Environmental Management Factors and Analysis of Responsible Parties in Public Housing Construction Sites (공공 주택건설사업의 현장환경관리 업무요소 도출 및 수행주체 분석)

  • Sohn, Jeong-Rak;Song, Sang-Hoon;Jun, Myoung-Hoon;Park, Seong-Sik
    • Land and Housing Review / v.4 no.4 / pp.383-393 / 2013
  • Green growth and eco-friendliness have become core development indicators for a sustainable global environment. The Korean government reflected these trends in its national development indices and, through the Third Master Plan for Construction Environment, suggested diverse directions for green construction technologies and a high-quality construction environment. However, efforts to follow these trends during the construction process, that is, the production phase, are not yet being considered sufficiently. In this study, we identified the basic environmental management factors needed to enhance the eco-friendliness of public housing construction sites, and suggested reasonable responsible parties and processes for the respective factors. The results are expected to be a valuable reference for defining the required activities and participants' responsibilities, and for improving the work process for systematic on-site environmental management. In applying the results, further discussion is needed on the executing party of each unit activity and on the assignment of responsibility for each process. At the same time, environment-related legislation and standards need to be amended. Methods for evaluating environmental management activities and technical solutions to environmental problems remain to be reviewed in further research.

Applying Meta-model Formalization of Part-Whole Relationship to UML: Experiment on Classification of Aggregation and Composition (UML의 부분-전체 관계에 대한 메타모델 형식화 이론의 적용: 집합연관 및 복합연관 판별 실험)

  • Kim, Taekyung
    • Journal of Intelligence and Information Systems / v.21 no.1 / pp.99-118 / 2015
  • Object-oriented programming languages have been widely adopted for developing modern information systems. The use of object-oriented (OO) programming concepts has reduced the effort of reusing pre-existing code, and OO concepts have proved useful in interpreting system requirements. In line with this, modern conceptual modeling approaches support features of object-oriented programming. The Unified Modeling Language (UML) has become one of the de facto standards for information system designers, since it provides a set of visual diagrams, comprehensive frameworks, and flexible expressions. In a modeling process, UML users need to consider relationships between classes. Based on an explicit and clear representation of classes, the conceptual model from UML gathers the attributes and methods necessary for guiding software engineers. In particular, identifying an association between a part class and a whole class is included in the standard grammar of UML. Representing part-whole relationships is natural in a real-world domain, since many physical objects are perceived in part-whole terms. In addition, even abstract concepts such as roles are easily identified through part-whole perception. A representation of part-whole in UML therefore seems reasonable and useful. However, it should be admitted that the use of UML is limited by the lack of practical guidelines on how to identify a part-whole relationship and how to classify it as an aggregate or a composite association. Research on developing such procedural knowledge is meaningful and timely, in that misperceptions of part-whole relationships are hard to filter out in initial conceptual modeling and thus result in deterioration of system usability. The current method of identifying and classifying part-whole relationships relies mainly on linguistic expressions. 
This simple approach is rooted in the idea that a phrase expressing has-a constructs a part-whole perception between objects. If the relationship is strong, the association is classified as a composite association. Otherwise, it is an aggregate association. Admittedly, linguistic expressions contain clues to part-whole relationships; therefore, the approach is reasonable and cost-effective in general. Nevertheless, it does not address concerns about accuracy and theoretical legitimacy. Research on guidelines for part-whole identification and classification has not yet accumulated sufficient results to solve this issue. The purpose of this study is to provide step-by-step guidelines for identifying and classifying part-whole relationships in the context of UML use. Based on the theoretical work on Meta-model Formalization, self-check forms that help conceptual modelers work on part-whole classes were developed. To evaluate the performance of the suggested idea, an experimental approach was adopted. The findings show that UML users obtain better results with the guidelines based on Meta-model Formalization than with the natural language classification scheme conventionally recommended by UML theorists. This study contributes to the stream of research on part-whole relationships by extending the applicability of Meta-model Formalization. Compared to traditional approaches that aim to establish criteria for evaluating the result of conceptual modeling, this study expands the scope to the process of modeling. Traditional theories on the evaluation of part-whole relationships in conceptual modeling aim to rule out incomplete or wrong representations. 
Such qualification remains important; however, without a practical alternative, posterior inspection may be less useful for modelers who want to reduce errors or misperceptions in part-whole identification and classification. The findings of this study can be further developed by introducing more comprehensive variables and real-world settings. In addition, it is highly recommended to replicate and extend the suggested idea of utilizing Meta-model Formalization by creating alternative forms of guidelines, including plugins for integrated development environments.
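The distinction between aggregate and composite associations discussed in this abstract can be illustrated with a small sketch. It follows the usual UML reading (a composite part is owned by, and created with, its whole; an aggregate part exists independently), and the class names are hypothetical examples rather than material from the paper.

```python
class Engine:
    def __init__(self, serial):
        self.serial = serial

class Car:
    """Composite association: the Car creates and exclusively owns its Engine."""
    def __init__(self, serial):
        self.engine = Engine(serial)  # the part is created together with the whole

class Player:
    def __init__(self, name):
        self.name = name

class Team:
    """Aggregate association: the Team only references Players that exist on their own."""
    def __init__(self, players):
        self.players = list(players)  # parts are passed in, not created here
```

In this sketch a `Player` outlives any `Team` that references it, whereas an `Engine` is only reachable through the `Car` that built it.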

Function of the Korean String Indexing System for the Subject Catalog (주제목록을 위한 한국용어열색인 시스템의 기능)

  • Yoon Kooho
    • Journal of the Korean Society for Library and Information Science / v.15 / pp.225-266 / 1988
  • Various theories and techniques for the subject catalog have been developed since Charles Ammi Cutter first tried to formulate rules for the construction of subject headings in 1876. However, they do not seem to be appropriate to the Korean language, because its syntax and semantics differ from those of English and other European languages. This study therefore attempts to develop a new Korean subject indexing system, namely the Korean String Indexing System (KOSIS), in order to increase the use of subject catalogs. For this purpose, the advantages and disadvantages of the classed subject catalog and the alphabetical subject catalog, which are the typical subject catalogs in libraries, are investigated, and most of the notable subject indexing systems, in particular PRECIS developed by the British National Bibliography, are reviewed and analysed. KOSIS is a string indexing system based purely on the syntax and semantics of the Korean language, even though a considerable number of PRECIS principles are applied to it. The outlines of KOSIS are as follows: 1) KOSIS is based on the fundamentals of natural language and an ingenious conjunction of human indexing skills and computer capabilities. 2) KOSIS is a string indexing system based on the 'principle of context-dependency.' A string of terms organized according to this principle shows remarkable affinity with certain patterns of words in ordinary discourse. From that point onward, natural language rather than classificatory terms becomes the basic model for indexing schemes. 3) KOSIS uses 24 role operators. One or more operators should be allocated to the index string, which is organized manually by the indexer's intellectual work, in order to establish the most explicit syntactic relationship of index terms. 
4) Traditionally, a single-line entry format is used, in which a subject heading or index entry is presented as a single sequence of words consisting of the entry terms plus, in some cases, an extra qualifying term or phrase. KOSIS, however, employs a two-line entry format that contains three basic positions for the production of index entries. The 'lead' serves as the user's access point, the 'display' contains those terms which are themselves context-dependent on the lead, and the 'qualifier' sets the lead term into its wider context. 5) Each KOSIS entry is co-extensive with the initial subject statement prepared by the indexer, since it displays all the subject specificities. Compound terms are always presented in their natural language order. Inverted headings are not produced in KOSIS. Consequently, the precision ratio of information retrieval can be increased. 6) KOSIS uses 5 relational codes for the system of references among semantically related terms. Semantically related terms are handled by a different set of routines, leading to the production of 'See' and 'See also' references. 7) KOSIS was originally developed for a classified catalog system which requires a subject index, that is, an index which 'translates' subjects expressed in natural language into the appropriate classification numbers. However, KOSIS can also be used for a dictionary catalog system. Accordingly, KOSIS strings can be manipulated to produce either appropriate subject indexes for a classified catalog system or acceptable subject headings for a dictionary catalog system. 8) KOSIS is able to maintain the consistency of index entries and cross references by means of a routine identification of the established index strings and reference system. For this purpose, an individual Subject Indicator Number and Reference Indicator Number are allocated to each new index string and each new index term, respectively. 
9) KOSIS can produce all the index entries, cross references, and authority cards by means of either manual or mechanical methods. Thus, detailed algorithms for the machine production of various outputs are provided for institutions that can use computer facilities.
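The two-line entry format with its lead, qualifier, and display positions can be sketched as below. This is an assumption-laden illustration: it applies PRECIS-style rotation of a context-ordered term string, which the abstract says KOSIS draws on, but the actual KOSIS algorithm, its role operators, and its relational codes are not reproduced here. The function names and the sample terms are hypothetical.

```python
def shunt_entries(terms):
    """Generate (lead, qualifier, display) triples from a context-ordered string.

    Assumes PRECIS-style rotation: each term in turn becomes the lead,
    earlier (wider-context) terms form the qualifier, and later
    (context-dependent) terms form the display.
    """
    entries = []
    for i, lead in enumerate(terms):
        qualifier = list(reversed(terms[:i]))  # wider context, nearest term first
        display = terms[i + 1:]                # terms dependent on the lead
        entries.append((lead, qualifier, display))
    return entries

def format_entry(lead, qualifier, display):
    """Render one entry: lead and qualifier on line one, display indented on line two."""
    first_line = ". ".join([lead] + qualifier)
    if display:
        return first_line + "\n  " + ". ".join(display)
    return first_line
```

For a hypothetical string such as `["Korea", "libraries", "catalogs"]`, the second entry has `libraries` as lead, `Korea` as qualifier, and `catalogs` as display.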


A Review Examining the Dating, Analysis of the Painting Style, Identification of the Painter, and Investigation of the Documentary Records of Samsaebulhoedo at Yongjusa Temple (용주사(龍珠寺) <삼세불회도(三世佛會圖)> 연구의 연대 추정과 양식 분석, 작가 비정, 문헌 해석의 검토)

  • Kang, Kwanshik
    • MISULJARYO - National Museum of Korea Art Journal / v.97 / pp.14-54 / 2020
  • The overall study of Samsaebulhoedo (painting of the Assembly of Buddhas of Three Ages) at Yongjusa Temple has focused on dating it, analyzing the painting style, identifying its painter, and scrutinizing the related documents. However, its greater coherence could be achieved through additional support from empirical evidence and logical consistency. Recent studies on Samsaebulhoedo at Yongjusa Temple that postulate that the painting could have been produced by a monk-painter in the late nineteenth century and that an original version produced in 1790 could have been retouched by a painter in the 1920s using a Western painting style lack such empirical proof and logic. Although King Jeongjo's son was not yet installed as crown prince, the Samsaebulhoedo at Yongjusa Temple contained a conventional written prayer wishing for a long life for the king, queen, and crown prince: "May his majesty the King live long / May her majesty the Queen live long / May his highness the Crown Prince live long" (主上殿下壽萬歲, 王妃殿下壽萬歲, 世子邸下壽萬歲). Later, this phrase was erased using cinnabar and revised to include unusual content in an exceptional order: "May his majesty the King live long / May his highness the King's Affectionate Mother (Jagung) live long / May her majesty the Queen live long / May his highness the Crown Prince live long" (主上殿下壽萬歲, 慈宮邸下壽萬歲, 王妃殿下壽萬歲, 世子邸下壽萬歲). A comprehensive comparison of the formats and contents in written prayers found on late Joseon Buddhist paintings and a careful analysis of royal liturgy during the reign of King Jeongjo reveal Samsaebulhoedo at Yongjusa Temple to be an original version produced at the time of the founding of Yongjusa Temple in 1790. 
According to a comparative analysis of formats, iconography, styles, aesthetic sensibilities, and techniques found in Buddhist paintings and paintings by Joseon court painters from the eighteenth and nineteenth centuries, Samsaebulhoedo at Yongjusa Temple bears features characteristic of paintings produced around 1790, which corresponds to the result of the analysis of the written prayer. Buddhist paintings created up to the early eighteenth century show deities with their sizes determined by their religious status and a two-dimensional conceptual composition based on the traditional perspective of depicting close objects in the lower section and distant objects above. This Samsaebulhoedo, however, systematically places the Buddhist deities within a three-dimensional space constructed by applying a linear perspective. Through the extensive employment of chiaroscuro as found in Western painting, it expresses white highlights and shadows, evoking a feeling that the magnificent world of the Buddhas of the Three Ages actually unfolds in front of viewers. Since the inner order of a linear perspective and the outer illusion of chiaroscuro shading are intimately related to each other, it is difficult to believe that the white highlights were a later addition. Moreover, the creative convergence of highly developed Western painting style and techniques that is on display in this Samsaebulhoedo could only have been achieved by late-Joseon court painters working during the reign of King Jeongjo, including Kim Hongdo, Yi Myeong-gi, and Kim Deuksin. Deungun, the head monk of Yongjusa Temple, wrote Yongjusa sajeok (History of Yongjusa Temple) by compiling the historical records on the temple that had been transmitted since its founding. In Yongjusa sajeok, Deungun recorded that Kim Hongdo painted Samsaebulhoedo as if it were a historical fact. 
The Joseon royal court's official records, Ilseongnok (Daily Records of the Royal Court and Important Officials) and Suwonbu jiryeong deungnok (Suwon Construction Records), indicate that Kim Hongdo, Yi Myeong-gi, and Kim Deuksin all served as a supervisor (gamdong) for the production of Buddhist paintings. Since within Joseon's hierarchical administrative system it was considered improper to allow court painters of government position to create Buddhist paintings which had previously been produced by monk-painters, they were appointed as gamdong in name only to avoid a political liability. In reality, court painters were ordered to create Buddhist paintings. During their reigns, King Yeongjo and King Jeongjo summoned the literati painters Jo Yeongseok and Kang Sehwang to serve as gamdong for the production of royal portraits and requested that they paint these portraits as well. Thus, the boundary between the concept of supervision and that of painting occasionally blurred. Supervision did not completely preclude painting, and a gamdong could also serve as a painter. In this light, the historical records in Yongjusa sajeok are not inconsistent with those in Ilseongnok, Suwonbu jiryeong deungnok, and a prayer written by Hwang Deok-sun, which was found inside the canopy in Daeungjeon Hall at Yongjusa Temple. These records provided the same content in different forms as required for their purposes and according to the context. This approach to the Samsaebulhoedo at Yongjusa Temple will lead to a more coherent explanation of dating the painting, analyzing its style, identifying its painter, and interpreting the relevant documents based on empirical grounds and logical consistency.