• Title/Summary/Keyword: 객관적 문장

Search Result 43, Processing Time 0.019 seconds

Subject-Balanced Intelligent Text Summarization Scheme (주제 균형 지능형 텍스트 요약 기법)

  • Yun, Yeoil;Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.141-166
    • /
    • 2019
  • Recently, channels like social media and SNS create enormous amount of data. In all kinds of data, portions of unstructured data which represented as text data has increased geometrically. But there are some difficulties to check all text data, so it is important to access those data rapidly and grasp key points of text. Due to needs of efficient understanding, many studies about text summarization for handling and using tremendous amounts of text data have been proposed. Especially, a lot of summarization methods using machine learning and artificial intelligence algorithms have been proposed lately to generate summary objectively and effectively which called "automatic summarization". However almost text summarization methods proposed up to date construct summary focused on frequency of contents in original documents. Those summaries have a limitation for contain small-weight subjects that mentioned less in original text. If summaries include contents with only major subject, bias occurs and it causes loss of information so that it is hard to ascertain every subject documents have. To avoid those bias, it is possible to summarize in point of balance between topics document have so all subject in document can be ascertained, but still unbalance of distribution between those subjects remains. To retain balance of subjects in summary, it is necessary to consider proportion of every subject documents originally have and also allocate the portion of subjects equally so that even sentences of minor subjects can be included in summary sufficiently. In this study, we propose "subject-balanced" text summarization method that procure balance between all subjects and minimize omission of low-frequency subjects. For subject-balanced summary, we use two concept of summary evaluation metrics "completeness" and "succinctness". Completeness is the feature that summary should include contents of original documents fully and succinctness means summary has minimum duplication with contents in itself. Proposed method has 3-phases for summarization. First phase is constructing subject term dictionaries. Topic modeling is used for calculating topic-term weight which indicates degrees that each terms are related to each topic. From derived weight, it is possible to figure out highly related terms for every topic and subjects of documents can be found from various topic composed similar meaning terms. And then, few terms are selected which represent subject well. In this method, it is called "seed terms". However, those terms are too small to explain each subject enough, so sufficient similar terms with seed terms are needed for well-constructed subject dictionary. Word2Vec is used for word expansion, finds similar terms with seed terms. Word vectors are created after Word2Vec modeling, and from those vectors, similarity between all terms can be derived by using cosine-similarity. Higher cosine similarity between two terms calculated, higher relationship between two terms defined. So terms that have high similarity values with seed terms for each subjects are selected and filtering those expanded terms subject dictionary is finally constructed. Next phase is allocating subjects to every sentences which original documents have. To grasp contents of all sentences first, frequency analysis is conducted with specific terms that subject dictionaries compose. TF-IDF weight of each subjects are calculated after frequency analysis, and it is possible to figure out how much sentences are explaining about each subjects. However, TF-IDF weight has limitation that the weight can be increased infinitely, so by normalizing TF-IDF weights for every subject sentences have, all values are changed to 0 to 1 values. Then allocating subject for every sentences with maximum TF-IDF weight between all subjects, sentence group are constructed for each subjects finally. Last phase is summary generation parts. Sen2Vec is used to figure out similarity between subject-sentences, and similarity matrix can be formed. By repetitive sentences selecting, it is possible to generate summary that include contents of original documents fully and minimize duplication in summary itself. For evaluation of proposed method, 50,000 reviews of TripAdvisor are used for constructing subject dictionaries and 23,087 reviews are used for generating summary. Also comparison between proposed method summary and frequency-based summary is performed and as a result, it is verified that summary from proposed method can retain balance of all subject more which documents originally have.

A Criticism of the Epistemological Premise of Kant's Transcendental Logic and that of Lacan's Psychoanalytic Logic, and Justification of Structure-Constructivist Epistemology(1) (칸트의 선험적 논리학과 라캉의 정신분석적 논리학의 인식론적 전제에 대한 비판과 구조-구성주의 인식론 정초(I))

  • Moun, Jean-sou
    • Journal of Korean Philosophical Society
    • /
    • v.137
    • /
    • pp.151-191
    • /
    • 2016
  • Kant and Lacan strongly criticized the epistemological premise of formal logic. However, Lacan was opposed to Kant in terms of subject, object, knowledge and truth. From the viewpoint of Kant's transcendental logic, formal logic does not have the ability to represent the nature of truth. On the other hand, from the viewpoint of Lacan's psychoanalytic logic, Kant's transcendental logic misunderstands or only partially represents the state of things. But I would like to try to criticize the epistemological premise of the two forms of logic. Transcendental logic takes the evident and new function in that it has studied the necessary condition of content rather than the form of thinking which formal logic considers as his object of study. Transcendental logic evidently studies the categories which dominate our way of thinking. Can we say that the 12 categories which Kant provided are sufficient in explaining the necessity of thinking? Lacan's psychoanalytic logics tells us that Kant's categories are only a kind of metaphor related with hypothesis that tries to explain the possibility of synthetical judge a priori. Is Lacan's psychoanalytic logic sufficient in explaining the possibility of science? It is not sufficient in explaining the objectivity and strictness of science, for it depends on metaphor and metonymy which are useful to literature and unconsciousness. I would like to try to synthesize Kant's transcendental and Lacan's psychoanalytic logic in terms of structure-constructivism which combines both formal and dialectical logic, which is consistent with the ideal of human science, and not blinkered science. My conclusion is that Kant's ethical and esthetical theory should be modified though Lacan's psychoanalytic logic, and Lacan's theory of the unconsciousness revised by Kant's transcendental logic.

A Study on HanYongUn's Sijo (한용운 시조의 내면 세계와 표현 미학)

  • Jeon, Jae-Gang
    • Sijohaknonchong
    • /
    • v.43
    • /
    • pp.177-206
    • /
    • 2015
  • This paper is written in order to research for the contents and expression of HanYongUn's Sijo. HanYongUn is very famous as monk and independent campaigner, modern poet in Korea. He wrote many kinds of literary works, for example, many modern poetry, modern novels, Sijo, Chino-Korean Poetry etc. It's very exceptional that he wrote a lot of Korean traditional Sijo and Chino-Korean Poetry. Because he was a many modern poet as same as modern novelist. So studying on his Sijo can help someone to understand the essence of HanYongUn's all literature. That's why I'm studying on HanYongUn's Sijo. The firstly, in aspect of the the contents of HanYongUn's Sijo, HanYongUn was expressing three kinds of themes, that is ideology, reality, daily life in his Sijo. The ideology consists of Buddhism and Confucianism and the reality is related with social conditions, the daily life is deeply connected with Nim. These features of his Sijo are different from his modern poetry and Chino-Korean Poetry which had a simple theme, for example, love with Nim, daily life. The secondly, in aspect of the expression of HanYongUn's Sijo, I studied the expression of HanYongUn's Sijo in three angles, that is, vocabulary and the developing of poet thinking, rhetorics. HanYongUn used essential words for expressing three kinds of themes effectively in his Sijo. And he was developing of his poet thinking by three steps in his Sijo. He applied several representative rhetorics to his Sijo, those are question and answer, exclamation, irony, distich etc. Even though I studied the characteristics of HanYongUn's Sijo in two aspects But there could be the other things to study about these kinds of theme. I might continue researching the other kinds of theme next time in the near future.

  • PDF