• Title/Summary/Keyword: generic summarization

Search Result 6, Processing Time 0.027 seconds

Generic Document Summarization using Coherence of Sentence Cluster and Semantic Feature (문장군집의 응집도와 의미특징을 이용한 포괄적 문서요약)

  • Park, Sun;Lee, Yeonwoo;Shim, Chun Sik;Lee, Seong Ro
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.12
    • /
    • pp.2607-2613
    • /
    • 2012
  • The results of inherent knowledge based generic summarization are influenced by the composition of sentence in document set. In order to resolve the problem, this papser propses a new generic document summarization which uses clustering of semantic feature of document and coherence of document cluster. The proposed method clusters sentences using semantic feature deriving from NMF(non-negative matrix factorization), which it can classify document topic group because inherent structure of document are well represented by the sentence cluster. In addition, the method can improve the quality of summarization because the importance sentences are extracted by using coherence of sentence cluster and the cluster refinement by re-cluster. The experimental results demonstrate appling the proposed method to generic summarization achieves better performance than generic document summarization methods.

Generic Summarization Using Generic Important of Semantic Features (의미특징의 포괄적 중요도를 이용한 포괄적 문서 요약)

  • Park, Sun;Lee, Jong-Hoon
    • Journal of Advanced Navigation Technology
    • /
    • v.12 no.5
    • /
    • pp.502-508
    • /
    • 2008
  • With the increased use of the internet and the tremendous amount of data it transfers, it is more necessary to summarize documents. We propose a new method using the Non-negative Semantic Variable Matrix (NSVM) and the generic important of semantic features obtained by Non-negative Matrix Factorization (NMF) to extract the sentences for automatic generic summarization. The proposed method use non-negative constraints which is more similar to the human's cognition process. As a result, the proposed method selects more meaningful sentences for summarization than the unsupervised method used the Latent Semantic Analysis (LSA) or clustering methods. The experimental results show that the proposed method archives better performance than other methods.

  • PDF

Video Summarization Using Eye Tracking and Electroencephalogram (EEG) Data (시선추적-뇌파 기반의 비디오 요약 생성 방안 연구)

  • Kim, Hyun-Hee;Kim, Yong-Ho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.56 no.1
    • /
    • pp.95-117
    • /
    • 2022
  • This study developed and evaluated audio-visual (AV) semantics-based video summarization methods using eye tracking and electroencephalography (EEG) data. For this study, twenty-seven university students participated in eye tracking and EEG experiments. The evaluation results showed that the average recall rate (0.73) of using both EEG and pupil diameter data for the construction of a video summary was higher than that (0.50) of using EEG data or that (0.68) of using pupil diameter data. In addition, this study reported that the reasons why the average recall (0.57) of the AV semantics-based personalized video summaries was lower than that (0.69) of the AV semantics-based generic video summaries. The differences and characteristics between the AV semantics-based video summarization methods and the text semantics-based video summarization methods were compared and analyzed.

Generic Text Summarization Using Non-negative Matrix Factorization (비음수 행렬 인수분해를 이용한 일반적 문서 요약)

  • Park Sun;Lee Ju-Hong;Ahn Chan-Min;Park Tae-Su;Kim Ja-Woo;Kim Deok-Hwan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2006.05a
    • /
    • pp.469-472
    • /
    • 2006
  • 본 논문은 비음수 행렬 인수분해(NMF, non-negative matrix factorization)를 이용하여 문장을 추출하여 문서를 요약하는 새로운 방법을 제안하였다. 제안된 방법은 문장추출에 사용되는 의미 특징(semantic feature)이 비 음수 값을 갖기 때문에 잠재의미분석에 비해 문서의 내용을 정확하게 요약한다. 또한, 적은 계산비용을 통하여 쉽게 요약 문장을 추출할 수 있는 장점을 갖는다.

  • PDF

Automatic Generic Summarization Based on Non-negative Semantic Variable Matrix (비음수 의미 가변 행렬을 기반으로 한 자동 포괄적 문서 요약)

  • Park Sun;Lee Ju-Hong;Ahn Chan-Min;Park Tae-Su;Kim Deok-Hwan
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.06a
    • /
    • pp.391-393
    • /
    • 2006
  • 인터넷의 급속한 확산과 대량 정보의 이동은 문서의 요약을 더욱 필요로 하고 있다. 본 논문은 비음수 행렬 인수분해로(NMF, non-negative matrix factorization) 얻어진 비음수 의미 가변 행렬(NSVM, non-negative semantic variable matrix)을 이용하여 자동으로 포괄적 문서요약 하는 새로운 방범을 제안하였다. 제안된 방법은 인간의 인식 과정과 유사한 비음수 제약을 사용한다. 이 결과 잠재의미색인에 비해 더욱 의미 있는 문장을 선택하여 문서를 요약할 수 있다. 또한, 비지도 학습에 의한 문서요약으로 사전 전문가에 의한 학습문장이 필요 없으며, 적은 계산비용을 통하여 쉽게 문장을 추출할 수 있는 장점을 갖는다.

  • PDF

Phylogenetic classification of Korean vascular flora according to the recent APG classification system (APG 분류체계에 따른 한국 관속식물상의 계통학적 분류)

  • Kim, Ki-Joong;Kim, Young-Dong;Kim, Joo-Hwan;Park, Seon-Joo;Park, Chong-Wook;Sun, Byung-Yun;Yoo, Ki-Oug;Choi, Byoung-Hee;Kim, Sang Tae
    • Korean Journal of Plant Taxonomy
    • /
    • v.38 no.3
    • /
    • pp.197-222
    • /
    • 2008
  • A recently published Korean Flora, "The genera of vascular plants of Korea (GFK)", includes the descriptions and keys for 217 families, 1,044 genera, and 3,209 species of Korean vascular plants. We reclassified these taxa according to the recent APG classification system, which resulted in 64 orders, 204 families, 1,044 genera and 3,209 species. Twenty-two families from the GFK were abandoned because of changes to the familial delimitations in the APG system. In contrast, the number of families in the Liliaceous group was increased. The Liliaceae in the GFK included 31 genera and 109 species. These taxa are now assigned to 10 families in four different orders including Liliales, Asparagales, Alismatales, and Dioscoreales because of the drastic changes to the monocot classification system in the past 20 years. In addition, the family name of the Aucubaceae was changed to Garryaceae. As a result, the number of families in the GFK has been reduced to 204. The results were summarized in four tables and two figures at the levels of unofficial higher taxonomic hierarchies, orders, families and genera. This new information can provide a guidelines for selecting the phylogenetic analysis unit for the Korean tree of life (KTOL) project. Futhermore, the updated classification system also provides an important summarization for the systematic community for placing the Korean flora in a modern phylogenetic context.