• Title/Summary/Keyword: pronoun


Restoring Omitted Sentence Constituents in Encyclopedia Documents Using Structural SVM (Structural SVM을 이용한 백과사전 문서 내 생략 문장성분 복원)

  • Hwang, Min-Kook;Kim, Youngtae;Ra, Dongyul;Lim, Soojong;Kim, Hyunki
    • Journal of Intelligence and Information Systems / v.21 no.2 / pp.131-150 / 2015
  • Omission of noun phrases in obligatory cases is a common phenomenon in Korean and Japanese sentences that is not observed in English. In encyclopedia texts, an argument of a predicate is omitted even more readily when it can be filled with a noun phrase co-referential with the title. The omitted noun phrase is called a zero anaphor or zero pronoun. Encyclopedias like Wikipedia are a major source for information extraction by intelligent application systems such as information retrieval and question answering systems. However, omission of noun phrases degrades the quality of information extraction. This paper deals with the problem of developing a system that can restore omitted noun phrases in encyclopedia documents. The problem is very similar to zero anaphora resolution, one of the important problems in natural language processing. A noun phrase in the text that can be used for restoration is called an antecedent. An antecedent must be co-referential with the zero anaphor. Whereas in zero anaphora resolution the candidates for the antecedent are only noun phrases in the same text, in our problem the title is also a candidate. In our system, the first stage detects the zero anaphor. In the second stage, antecedent search is carried out over the candidates. If antecedent search fails, the third stage attempts to use the title as the antecedent. The main characteristic of our system is its use of a structural SVM for finding the antecedent. The noun phrases that appear in the text before the position of the zero anaphor make up the search space. The main technique in previously proposed methods is to perform binary classification on all the noun phrases in the search space. The noun phrase classified as an antecedent with the highest confidence is selected as the antecedent.
However, we propose in this paper that antecedent search be viewed as the problem of assigning antecedent indicator labels to a sequence of noun phrases. In other words, sequence labeling is employed for antecedent search in the text. We are the first to suggest this idea. To perform sequence labeling, we suggest using a structural SVM that receives a sequence of noun phrases as input and returns a sequence of labels as output. An output label takes one of two values: one indicating that the corresponding noun phrase is the antecedent and the other indicating that it is not. The structural SVM we used is based on the modified Pegasos algorithm, which exploits a subgradient descent methodology used for optimization problems. To train and test our system, we selected a set of Wikipedia texts and constructed an annotated corpus that provides gold-standard answers such as zero anaphors and their possible antecedents. Training examples are prepared from the annotated corpus and used to train the SVMs and test the system. For zero anaphor detection, sentences are parsed by a syntactic analyzer and omitted subject or object cases are identified. The performance of our system therefore depends on that of the syntactic analyzer, which is a limitation. When an antecedent is not found in the text, our system tries to use the title to restore the zero anaphor. This step is based on binary classification using a regular SVM. The experiment showed that our system achieves F1 = 68.58%. This means that a state-of-the-art system can be developed with our technique. Future work that enables the system to utilize semantic information is expected to lead to a significant performance improvement.
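As a rough illustration of the idea in the abstract above, the following Python sketch trains a linear scorer over a sequence of candidate noun phrases with a plain Pegasos-style subgradient update on a structured hinge loss. It is not the authors' implementation and not their "modified" Pegasos variant. It also assumes, purely for simplicity, that exactly one candidate in each sequence is the antecedent, so a label sequence reduces to the index of that candidate. The feature vectors, `TOY_EXAMPLES`, and the function names are hypothetical.

```python
import random


def dot(w, f):
    """Inner product of a weight vector and a feature vector."""
    return sum(wi * fi for wi, fi in zip(w, f))


def predict(w, candidates):
    """Return the index of the highest-scoring candidate noun phrase."""
    return max(range(len(candidates)), key=lambda i: dot(w, candidates[i]))


def train_pegasos(examples, dim, lam=0.1, epochs=50, seed=0):
    """Pegasos-style subgradient training for a structured hinge loss.

    examples: list of (candidates, gold_index) pairs, where candidates is a
    list of feature vectors, one per noun phrase before the zero anaphor.
    Assumption (not from the paper): exactly one gold antecedent per sequence.
    """
    rng = random.Random(seed)
    w = [0.0] * dim
    t = 0
    for _ in range(epochs):
        order = list(range(len(examples)))
        rng.shuffle(order)
        for k in order:
            candidates, gold = examples[k]
            t += 1
            eta = 1.0 / (lam * t)  # Pegasos step size
            # Loss-augmented inference: every wrong index gets a 0/1 loss bonus.
            worst = max(
                range(len(candidates)),
                key=lambda i: dot(w, candidates[i]) + (0.0 if i == gold else 1.0),
            )
            # Subgradient step on the L2 regularizer (shrinks the weights).
            w = [(1.0 - eta * lam) * wi for wi in w]
            # Subgradient step on the hinge term if the margin is violated.
            if worst != gold:
                w = [
                    wi + eta * (g - b)
                    for wi, g, b in zip(w, candidates[gold], candidates[worst])
                ]
    return w


# Hypothetical toy data: feature 0 stands for some cue that marks the
# antecedent, feature 1 for an uninformative score.
TOY_EXAMPLES = [
    ([[1.0, 0.2], [0.0, 0.9], [0.0, 0.5]], 0),
    ([[0.0, 0.1], [1.0, 0.3]], 1),
    ([[0.0, 0.8], [0.0, 0.4], [1.0, 0.6]], 2),
]

if __name__ == "__main__":
    weights = train_pegasos(TOY_EXAMPLES, dim=2)
    for cands, gold in TOY_EXAMPLES:
        print("predicted:", predict(weights, cands), "gold:", gold)
```

A full sequence labeler would instead score whole label sequences and allow a sequence with no antecedent at all, which is the case the abstract hands to the title-based third stage.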

A Study on the Value-Education Use of the Cruel Sorigeuk "Give Me My Leg Back" (잔혹소리극 <내다리내놔>의 가치 교육적 활용에 대한 고찰)

  • Kim, Jeong Sun
    • (The) Research of the performance art and culture / no.32 / pp.595-628 / 2016
  • The cruel sorigeuk "Give Me My Leg Back", staged in 2006 by a creative pansori group, is a creative pansori adaptation of the 'Deokttaegol' legend. 'Deokttaegol', remembered for the line "Give me my leg back", is regarded as a byword for the KBS series Korean Ghost Stories. The work unfolds through singing, shadow play, and drumming, and the audience's experience is built around a sense of cruelty that rests on the emotion of horror. Beneath the fear and revulsion familiar from the Korean Ghost Stories heard in childhood, the story carries the value of filial piety and conveys it to its audience in a short time. This shows that pansori, a genre that reflects the sentiments of everyday life, can support efficient value education. Because pansori offers many stimuli, such as sound, story, and gesture, it can be an effective means of education. Depending on the work performed, pansori can convey a variety of values efficiently and contribute to emotional cultivation. Because it draws on familiar material from everyday life, it makes many values and sentiments easy to approach, and its rhythm, together with the language and customs of each age group, helps sustain the audience's curiosity and interest. Work that does not belong to the traditional pansori repertoire and sets newly written text to pansori tunes is called creative pansori. Creative content that uses pansori together with familiar language, customs, and vocabulary suited to each age group is therefore expected to have a large educational effect when used in education.
Content built on vocabulary suited to each age group is easy to understand, which can give learners an opportunity to absorb its lessons smoothly. It is therefore important to make educational use of the value of creative pansori.