• Title/Summary/Keyword: General English

Search Result 462, Processing Time 0.018 seconds

Aspect-Based Sentiment Analysis Using BERT: Developing Aspect Category Sentiment Classification Models (BERT를 활용한 속성기반 감성분석: 속성카테고리 감성분류 모델 개발)

  • Park, Hyun-jung;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.1-25
    • /
    • 2020
  • Sentiment Analysis (SA) is a Natural Language Processing (NLP) task that analyzes the sentiments consumers or the public feel about an arbitrary object from written texts. Furthermore, Aspect-Based Sentiment Analysis (ABSA) is a fine-grained analysis of the sentiments towards each aspect of an object. Since having a more practical value in terms of business, ABSA is drawing attention from both academic and industrial organizations. When there is a review that says "The restaurant is expensive but the food is really fantastic", for example, the general SA evaluates the overall sentiment towards the 'restaurant' as 'positive', while ABSA identifies the restaurant's aspect 'price' as 'negative' and 'food' aspect as 'positive'. Thus, ABSA enables a more specific and effective marketing strategy. In order to perform ABSA, it is necessary to identify what are the aspect terms or aspect categories included in the text, and judge the sentiments towards them. Accordingly, there exist four main areas in ABSA; aspect term extraction, aspect category detection, Aspect Term Sentiment Classification (ATSC), and Aspect Category Sentiment Classification (ACSC). It is usually conducted by extracting aspect terms and then performing ATSC to analyze sentiments for the given aspect terms, or by extracting aspect categories and then performing ACSC to analyze sentiments for the given aspect category. Here, an aspect category is expressed in one or more aspect terms, or indirectly inferred by other words. In the preceding example sentence, 'price' and 'food' are both aspect categories, and the aspect category 'food' is expressed by the aspect term 'food' included in the review. If the review sentence includes 'pasta', 'steak', or 'grilled chicken special', these can all be aspect terms for the aspect category 'food'. As such, an aspect category referred to by one or more specific aspect terms is called an explicit aspect. On the other hand, the aspect category like 'price', which does not have any specific aspect terms but can be indirectly guessed with an emotional word 'expensive,' is called an implicit aspect. So far, the 'aspect category' has been used to avoid confusion about 'aspect term'. From now on, we will consider 'aspect category' and 'aspect' as the same concept and use the word 'aspect' more for convenience. And one thing to note is that ATSC analyzes the sentiment towards given aspect terms, so it deals only with explicit aspects, and ACSC treats not only explicit aspects but also implicit aspects. This study seeks to find answers to the following issues ignored in the previous studies when applying the BERT pre-trained language model to ACSC and derives superior ACSC models. First, is it more effective to reflect the output vector of tokens for aspect categories than to use only the final output vector of [CLS] token as a classification vector? Second, is there any performance difference between QA (Question Answering) and NLI (Natural Language Inference) types in the sentence-pair configuration of input data? Third, is there any performance difference according to the order of sentence including aspect category in the QA or NLI type sentence-pair configuration of input data? To achieve these research objectives, we implemented 12 ACSC models and conducted experiments on 4 English benchmark datasets. As a result, ACSC models that provide performance beyond the existing studies without expanding the training dataset were derived. In addition, it was found that it is more effective to reflect the output vector of the aspect category token than to use only the output vector for the [CLS] token as a classification vector. It was also found that QA type input generally provides better performance than NLI, and the order of the sentence with the aspect category in QA type is irrelevant with performance. There may be some differences depending on the characteristics of the dataset, but when using NLI type sentence-pair input, placing the sentence containing the aspect category second seems to provide better performance. The new methodology for designing the ACSC model used in this study could be similarly applied to other studies such as ATSC.

A Study on the Sasang Constitutional Distribution Among the People in the United States of America (북미지역주민(北美地域住民)의 사상체질(四象體質) 분포(分布)에 관(關)한 연구(硏究))

  • Koh, Byung-hee;Kim, Seon-ho;Park, Byung-gwan;Lavelle, Jonathan D;Tecun, Marianne;Anthony Jr., Ross;Hobbs, Ron;Zolli, Frank;Chin, Kyung-hee
    • Journal of Sasang Constitutional Medicine
    • /
    • v.11 no.2
    • /
    • pp.119-150
    • /
    • 1999
  • In spite of recent remarkable recent development in both western and oriental medical sciences, there is still only a shallow understanding of individual differences for various prognoses of incurable diseases and immunopathy diseases. Nevertheless, the care, cure and prevention methods of Sasang Constitutional Medicine are broadly used as an effective treatment of incurable diseases like immunopathy diseases and stress-related diseases and diseases due to aging. In this sense, the establishment of classification norms is urgent and essential for the worldwide application of Sasang Constitutional Medicine(SCM). This study began with the confirmation process of whether Sasang Constitutional types exist in Americans. To accomodate for cultural differences, the distinguishing tool was readjusted so that Sasang Constitutional Types in Americans could be determined. Hence, the selected tool is the new QSCCII+, which is a newly revised English version of the QSCCII. QSCCII was made and standardized by Dept. of SCM in Kyung Hee Medical Center and Dr. Kim7). The evaluation methods of the old version were improved in the new QSCCII+ through necessary statistical manipulation. The original QSCCII was officially authorized by the Korean Society of Sasang Constitutional Medicine as the only computerized version of Sasang diagnostics. This study is the first attempt to design a new diagnostic tool for the classification of Sasang Constitutional types in North Americans with the revision of QSCCII. The subjects of this study were selected from the cooperative people among the students and staffs of the University of Bridgeport and the patients who visited the Clinic in the Health Science Center. This study takes for about 1 year from 1998. 8 to 1999. 8 The conclusions of the study can be summarized as follows: 1. Sasang constitutional types also exist in Americans. It can also naturally be inferred that Sasang Constitutional types exist in all human beings, for there are many different human races in America. 2. There are more So-Yang In's than any other types in American white people. This result confirms the hypothesis that there also exist Sasang Constitutional types in westerners. 3. The result of repetitive tests suggests that the new QSCCII+ is an effective diagnostic tool for westerners when we consider the constant diagnostic results of the QSCCII+. 4. Sasang Constitutional types exit in the sample group regardless of racial difference. 5. The question items that were not often checked by Americans need to be modified into more understandable expressions. 6. The standardization of diagnosis for Americans should be established by use of the QSCCII+ 7. It can be guessed that there are many Tae-yang In's among the 71 persons who could not be clearly classified by the QSCCII+. Due to the scarcity of Tae-yang-In in general, it is important to improve upon the discernability of the QSCC II+. 8. The results of the Sasang Constitutional distribution in North Americans are as follows: The percentage of So-yang In distribution in the sample group is 36.25%(87persons), that of Tae-eum In is 13.75%(33persons), and that of So-eum In is 20.41%(49persons).

  • PDF