• Title/Summary/Keyword: sentence analysis

Search Result 492, Processing Time 0.019 seconds

Subject-Balanced Intelligent Text Summarization Scheme (주제 균형 지능형 텍스트 요약 기법)

  • Yun, Yeoil;Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.141-166
    • /
    • 2019
  • Recently, channels like social media and SNS create enormous amount of data. In all kinds of data, portions of unstructured data which represented as text data has increased geometrically. But there are some difficulties to check all text data, so it is important to access those data rapidly and grasp key points of text. Due to needs of efficient understanding, many studies about text summarization for handling and using tremendous amounts of text data have been proposed. Especially, a lot of summarization methods using machine learning and artificial intelligence algorithms have been proposed lately to generate summary objectively and effectively which called "automatic summarization". However almost text summarization methods proposed up to date construct summary focused on frequency of contents in original documents. Those summaries have a limitation for contain small-weight subjects that mentioned less in original text. If summaries include contents with only major subject, bias occurs and it causes loss of information so that it is hard to ascertain every subject documents have. To avoid those bias, it is possible to summarize in point of balance between topics document have so all subject in document can be ascertained, but still unbalance of distribution between those subjects remains. To retain balance of subjects in summary, it is necessary to consider proportion of every subject documents originally have and also allocate the portion of subjects equally so that even sentences of minor subjects can be included in summary sufficiently. In this study, we propose "subject-balanced" text summarization method that procure balance between all subjects and minimize omission of low-frequency subjects. For subject-balanced summary, we use two concept of summary evaluation metrics "completeness" and "succinctness". Completeness is the feature that summary should include contents of original documents fully and succinctness means summary has minimum duplication with contents in itself. Proposed method has 3-phases for summarization. First phase is constructing subject term dictionaries. Topic modeling is used for calculating topic-term weight which indicates degrees that each terms are related to each topic. From derived weight, it is possible to figure out highly related terms for every topic and subjects of documents can be found from various topic composed similar meaning terms. And then, few terms are selected which represent subject well. In this method, it is called "seed terms". However, those terms are too small to explain each subject enough, so sufficient similar terms with seed terms are needed for well-constructed subject dictionary. Word2Vec is used for word expansion, finds similar terms with seed terms. Word vectors are created after Word2Vec modeling, and from those vectors, similarity between all terms can be derived by using cosine-similarity. Higher cosine similarity between two terms calculated, higher relationship between two terms defined. So terms that have high similarity values with seed terms for each subjects are selected and filtering those expanded terms subject dictionary is finally constructed. Next phase is allocating subjects to every sentences which original documents have. To grasp contents of all sentences first, frequency analysis is conducted with specific terms that subject dictionaries compose. TF-IDF weight of each subjects are calculated after frequency analysis, and it is possible to figure out how much sentences are explaining about each subjects. However, TF-IDF weight has limitation that the weight can be increased infinitely, so by normalizing TF-IDF weights for every subject sentences have, all values are changed to 0 to 1 values. Then allocating subject for every sentences with maximum TF-IDF weight between all subjects, sentence group are constructed for each subjects finally. Last phase is summary generation parts. Sen2Vec is used to figure out similarity between subject-sentences, and similarity matrix can be formed. By repetitive sentences selecting, it is possible to generate summary that include contents of original documents fully and minimize duplication in summary itself. For evaluation of proposed method, 50,000 reviews of TripAdvisor are used for constructing subject dictionaries and 23,087 reviews are used for generating summary. Also comparison between proposed method summary and frequency-based summary is performed and as a result, it is verified that summary from proposed method can retain balance of all subject more which documents originally have.

Dedicatory Inscriptions on the Amitabha Buddha and Maitreya Bodhisattva Sculptures of Gamsansa Temple (감산사(甘山寺) 아미타불상(阿彌陁佛像)과 미륵보살상(彌勒菩薩像) 조상기(造像記)의 연구)

  • Nam, Dongsin
    • MISULJARYO - National Museum of Korea Art Journal
    • /
    • v.98
    • /
    • pp.22-53
    • /
    • 2020
  • This paper analyzes the contents, characteristics, and historical significance of the dedicatory inscriptions (josanggi) on the Amitabha Buddha and the Maitreya Bodhisattva statues of Gamsansa Temple, two masterpieces of Buddhist sculpture from the Unified Silla period. In the first section, I summarize research results from the past century (divided into four periods), before presenting a new perspective and methodology that questions the pre-existing notion that the Maitreya Bodhisattva has a higher rank than the Amitabha Buddha. In the second section, through my own analysis of the dedicatory inscriptions, arrangement, and overall appearance of the two images, I assert that the Amitabha Buddha sculpture actually held a higher rank and greater significance than the Maitreya Bodhisattva sculpture. In the third section, for the first time, I provide a new interpretation of two previously undeciphered characters from the inscriptions. In addition, by comparing the sentence structures from the respective inscriptions and revising the current understanding of the author (chanja) and calligrapher (seoja), I elucidate the possible meaning of some ambiguous phrases. Finally, in the fourth section, I reexamine the content of both inscriptions, differentiating between the parts relating to the patron (josangju), the dedication (josang), and the prayers of the patrons or donors (balwon). In particular, I argue that the phrase "for my deceased parents" is not merely a general axiom, but a specific reference. To summarize, the dedicatory inscriptions can be interpreted as follows: when Kim Jiseong's parents died, they were cremated and he scattered most of their remains by the East Sea. But years later, he regretted having no physical memorial of them to which to pay his respects. Thus, in his later years, he donated his estate on Gamsan as alms and led the construction of Gamsansa Temple. He then commissioned the production of the two stone sculptures of Amitabha Buddha and Maitreya Bodhisattva for the temple, asking that they be sculpted realistically to reflect the actual appearance of his parents. Finally, he enshrined the remains of his parents in the sculptures through the hole in the back of the head (jeonghyeol). The Maitreya Bodhisattva is a standing image with a nirmanakaya, or "transformation Buddha," on the crown. As various art historians have pointed out, this iconography is virtually unprecedented among Maitreya images in East Asian Buddhist sculpture, leading some to speculate that the standing image is actually the Avalokitesvara. However, anyone who reads the dedicatory inscription can have no doubt that this image is in fact the Maitreya. To ensure that the sculpture properly embodied his mother (who wished to be reborn in Tushita Heaven with Maitreya Bodhisattva), Kim Jiseong combined the iconography of the Maitreya and Avalokitesvara (the reincarnation of compassion). Hence, Kim Jiseong's deep love for his mother motivated him to modify the conventional iconography of the Maitreya and Avalokitesvara. A similar sentiment can be found in the sculpture of Amitabha Buddha. To this day, any visitor to the temple who first looks at the sculptures from the front before reading the text on the back will be deeply touched by the filial love of Kim Jiseong, who truly cherished the memory of his parents.