• Title/Summary/Keyword: analysis category

Search Result 2,219, Processing Time 0.025 seconds

Identifying the Main Price Ranges of Online Product Category (온라인 상품 카테고리 내 주요 가격대 식별)

  • Kim, Jun Woo;Im, Kwang Hyuk
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.12
    • /
    • pp.733-741
    • /
    • 2012
  • In recent, many consumers visit the online shopping malls or price comparison sites to collect the information on the product category that they are interested in. However, the volumes of the data provided by such web sites are often too enormous, and significant number of consumers have trouble in making purchase decision based on the plethora of products and sellers. In this context, modern online shopping agents need to process the retrieved information in more intelligent way before providing them to the users. This paper proposes a novel approach for identifying the main price ranges hidden in a single product category. To this end, the price of an item in the category is represented as a row vector and k-means clustering analysis is applied to the price vectors to produce the clusters that consists of the product items with similar price vectors. Then, the main price ranges of the product category can be identified from the result of clustering analysis. In general, the price is one of the most important factors in the consumers' purchase decision, and the identified main price ranges will be helpful for the online shoppers to find appropriate items effectively.

Analysis for Typicality of the Leading Brand by Evaluation of Brand Personality (브랜드 개성 평가를 통한 선도브랜드의 전형성 분석 연구)

  • Park, Pumsoon
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.3
    • /
    • pp.568-577
    • /
    • 2018
  • This study investigated the typicality of the leading brand in a specific product category through comparison of personality evaluation. Measuring scales for brand personality was also used to measure the product category's personality which consumers expected generally. By pre-test, an instant coffee category was selected for product category. In the instant coffee category, three brands-'kanu', 'looka', and 'supremo'-were analyzed for this study. As a result, it was found that the leading brand, kanu had the typicality for the instant coffee category. Kanu had the same dimensions of brand personality, which were sincerity, competence, success, and sophistication, as the instant coffee product category had. Comparatively, looka had just three personalities -competence, sincerity, and sophistication-which were similar to personalities in the product category. And, supremo had only two personalities-sophistication and competence-which were similar to personalities in the instant coffee category.

Aspect-Based Sentiment Analysis Using BERT: Developing Aspect Category Sentiment Classification Models (BERT를 활용한 속성기반 감성분석: 속성카테고리 감성분류 모델 개발)

  • Park, Hyun-jung;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.1-25
    • /
    • 2020
  • Sentiment Analysis (SA) is a Natural Language Processing (NLP) task that analyzes the sentiments consumers or the public feel about an arbitrary object from written texts. Furthermore, Aspect-Based Sentiment Analysis (ABSA) is a fine-grained analysis of the sentiments towards each aspect of an object. Since having a more practical value in terms of business, ABSA is drawing attention from both academic and industrial organizations. When there is a review that says "The restaurant is expensive but the food is really fantastic", for example, the general SA evaluates the overall sentiment towards the 'restaurant' as 'positive', while ABSA identifies the restaurant's aspect 'price' as 'negative' and 'food' aspect as 'positive'. Thus, ABSA enables a more specific and effective marketing strategy. In order to perform ABSA, it is necessary to identify what are the aspect terms or aspect categories included in the text, and judge the sentiments towards them. Accordingly, there exist four main areas in ABSA; aspect term extraction, aspect category detection, Aspect Term Sentiment Classification (ATSC), and Aspect Category Sentiment Classification (ACSC). It is usually conducted by extracting aspect terms and then performing ATSC to analyze sentiments for the given aspect terms, or by extracting aspect categories and then performing ACSC to analyze sentiments for the given aspect category. Here, an aspect category is expressed in one or more aspect terms, or indirectly inferred by other words. In the preceding example sentence, 'price' and 'food' are both aspect categories, and the aspect category 'food' is expressed by the aspect term 'food' included in the review. If the review sentence includes 'pasta', 'steak', or 'grilled chicken special', these can all be aspect terms for the aspect category 'food'. As such, an aspect category referred to by one or more specific aspect terms is called an explicit aspect. On the other hand, the aspect category like 'price', which does not have any specific aspect terms but can be indirectly guessed with an emotional word 'expensive,' is called an implicit aspect. So far, the 'aspect category' has been used to avoid confusion about 'aspect term'. From now on, we will consider 'aspect category' and 'aspect' as the same concept and use the word 'aspect' more for convenience. And one thing to note is that ATSC analyzes the sentiment towards given aspect terms, so it deals only with explicit aspects, and ACSC treats not only explicit aspects but also implicit aspects. This study seeks to find answers to the following issues ignored in the previous studies when applying the BERT pre-trained language model to ACSC and derives superior ACSC models. First, is it more effective to reflect the output vector of tokens for aspect categories than to use only the final output vector of [CLS] token as a classification vector? Second, is there any performance difference between QA (Question Answering) and NLI (Natural Language Inference) types in the sentence-pair configuration of input data? Third, is there any performance difference according to the order of sentence including aspect category in the QA or NLI type sentence-pair configuration of input data? To achieve these research objectives, we implemented 12 ACSC models and conducted experiments on 4 English benchmark datasets. As a result, ACSC models that provide performance beyond the existing studies without expanding the training dataset were derived. In addition, it was found that it is more effective to reflect the output vector of the aspect category token than to use only the output vector for the [CLS] token as a classification vector. It was also found that QA type input generally provides better performance than NLI, and the order of the sentence with the aspect category in QA type is irrelevant with performance. There may be some differences depending on the characteristics of the dataset, but when using NLI type sentence-pair input, placing the sentence containing the aspect category second seems to provide better performance. The new methodology for designing the ACSC model used in this study could be similarly applied to other studies such as ATSC.

The Need for Rehabilitation Day Care Program Service of Stroke Survivor's Family (재가 뇌졸중 환자 가족의 주간재활간호 서비스 요구와 관련요인)

  • Suh, Moon-Ja;Kim, Keum-Soon;Kim, In-Ja;Cho, Nam-Ok;Choi, Hee-Jung;Jeong, Seong-Hee
    • The Korean Journal of Rehabilitation Nursing
    • /
    • v.4 no.2
    • /
    • pp.207-218
    • /
    • 2001
  • This study was carried out to find out the basic data required to plan and develop Rehabilitation Day Care Program for the stroke survivor's family in Korea. The subjects comprised of 92 stroke survivor's family who discharged from 4 hospitals in Seoul during the past 2 years. The data were collected from August 3, 1998 to September 18, 1998, through interviews with questionnaires about general characteristics, activities of daily living, depression and service need of rehabilitation day care program at the outpatient clinics by trained nursing graduates. Data were analyzed with descriptive analysis, Pearson's correlation analysis, and Stepwise multiple linear regression analysis using SPSS/WIN 10.0 program. The results obtained are as follows; 1. The mean score of the general need of rehabilitation day care program of stroke survivor's family was 3.10(range 1-4). The highest need among the service categories of the rehabilitation day card program was self-care and restorative activities category(3.30), and health services referral category, recreation category, psychosocial activities category in order. The needs of each category are as follows. In the health services referral category, the need for dental examination and medical examination were highest, followed by the need for physical therapy and occupational therapy. In the psychosocial activities category, the need for family counselling was highest. In the self-care and restorative activities category, the need for ROM exercise training was highest, followed by bowel training, and ambulation training. 2. The need of family for rehabilitation day care program service displayed a correlation with the level of education, ADL, and the level of depression, and a reverse correlation with age, illness intrusiveness, depression, knowledge, subject and object burden and relationship with stroke survivors. 3. The stepwise multiple linear regression analysis revealed following results. For the need for rehabilitation day care program service, 22.6% of the variance was initially explained by level of family's knowledge about caring method for stroke survivors, 8.8% was the level of subjective burden and 5.4% was relationship with stroke survivors. In conclusion, above characteristics should be considered to develop stroke survivors' rehabilitation day care program.

  • PDF

Automatic e-mail classification using Dynamic Category Hierarchy and Principal Component Analysis (주성분 분석과 동적 분류체계를 사용한 자동 이메일 분류)

  • Park, Sun;Kim, Chul-Won;Lee, Yang-weon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.05a
    • /
    • pp.576-579
    • /
    • 2009
  • The amount of incoming e-mails is increasing rapidly due to the wide usage of Internet. Therefore, it is more required to classify incoming e-mails efficiently and accurately. Currently, the e-mail classification techniques are focused on two way classification to filter spam mails from normal ones based mainly on Bayesian and Rule. The clustering method has been used for the multi-way classification of e-mails. But it has a disadvantage of low accuracy of classification. In this paper, we propose a novel multi-way e-mail classification method that uses PCA for automatic category generation and dynamic category hierarchy for high accuracy of classification. It classifies a huge amount of incoming e-mails automatically, efficiently, and accurately.

  • PDF

Various Men's Body Shapes and Drops for Developing Menswear Sizing Systems in the United States

  • HwangShin, Su-Jeong;Istook, Cynthia L.;Lee, Jin-Hee
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.35 no.12
    • /
    • pp.1454-1465
    • /
    • 2011
  • Menswear body types are often labeled on garments (to indicate how the garments are designed to fit) with indicators of a size category such as regular, portly, and stout, athletic, or big and tall. A drop (relationships between the chest and waist girths) is related to the fit of a tailored suit. However, current standards are not designed for various drops or body types. There is not enough information of categorizing men's body shapes for the apparel sizing systems. In this article, a set of men's data from SizeUSA sizing survey was analyzed to investigate men's body shapes and drops. Factor analysis and a cluster analysis method were used to categorize men's body shapes. In the results, twenty-five variables were selected through the factor analysis and found four factors: girth factor, height factor, torso girth factor, and slope degree factor. According to the factor and cluster analysis, various body shapes were found: Slim Shape (SS - tall ectomorphy), Heavy Shape (HS - athletic, big & tall, endomorphy and mesomorphy), Slant Inverted Triangle Shape (SITS - regular, slight ectomorphy and slight mesomorphy weight range from normal to slightly overweight), Short Round Top Shape (SRTS - portly and stout, endomorphy). Body shapes were related to fitting categories. SS and HS were related to big & tall fitting category. SITS was related to regular. SRTS was related to portly and stout. Shape 1 (31%) and Shape 2 (26%) were related to current big & tall category. Shape 3 (34%) were related to regular. Shape 4 (9%) were in portly and stout category. ASTM D 6240 standard was the only available standard that presented a regular fitting category. Various drops were found within a same chest size group; however, this study revealed great variances of drops by body shape.

A Comparison Study of Multiclass SVM Methods in Microarray Data

  • Hwang, Jin-Soo;Lee, Ji-Young;Kim, Jee-Yun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.2
    • /
    • pp.311-324
    • /
    • 2006
  • The Support Vector Machine(SVM) is very functional and efficient classification method to any other classification analysis method. However, its optimal extension to more than two classes is not obvious. In this paper several multi-category SVM methods are introduced and compared using simulation and real data sets. Also comparison with traditional multi-category classification and SVM based methods is performed.

  • PDF

Combinatory Categorial Grammar for Korean

  • Han, Sung-Kook;Park, Chan-Gon
    • Annual Conference on Human and Language Technology
    • /
    • 1990.11a
    • /
    • pp.164-171
    • /
    • 1990
  • A commutative productive category is proposed to the current CCG for the syntactic analysis of free word order languages like Korean. The introduction of this sort of category is quite natural for categorial lexicon and functional operations. We present the theorical basis of productive category and examine the linguistic availability through typical syntactic structures of Korean.

  • PDF

Finite Element Analysis and Evaluation of Casting Defects of Steam Turbine Valve Casings of Power Plants (발전용 증기터빈 밸브 케이싱의 유한요소해석과 주조결함 평가 방법)

  • Lee Boo-Youn;Kim Won-Jin;Shin Hyun-Myung
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.29 no.5
    • /
    • pp.571-578
    • /
    • 2005
  • Stresses of main stop valve and control valve casings for the steam turbines of power plants are analyzed by the finite element method. The stress intensity is obtained to check the results on the basis of the design criteria of ASME boiler and pressure vessel code. To verify accuracy of the finite element analysis. analyzed stresses are compared with those measured during the hydrostatic pressure test. Stress category drawings. which play an important role in evaluating casting defects, are produced from the analysis results, and important points in casting of the valve casings are discussed in terms of the stress category.

Analysis of Similarity of Twitter Topic Categories among Regions

  • Yun, Hong-Won
    • Journal of information and communication convergence engineering
    • /
    • v.10 no.1
    • /
    • pp.27-32
    • /
    • 2012
  • Twitter can spread and share all kinds of information such as facts, opinions, and ideas in real time. In this paper, we empirically compare and analyze the topic categories in Twitter with all top 100 users in each of geographic region. We mainly consider the relationships among regions and selected four regions: Global, Seoul, Tokyo, and Beijing. Each of the top 100 users in Twitter is classified into a specific category and then statistical analysis is conducted. Among eight topic categories, the "Arts" category is the largest and the second is "Life". The correlation between global and Seoul groups has the lowest value among the six pairs of relationships between regional groups, and this difference is statistically significant. We find that the Seoul, Tokyo, and Beijing regional Twitter groups, all in East Asia, have high topical similarity. Based on the correlation analysis, Seoul and Tokyo saliently show a sticky trend. The correlation coefficient presents very a strong positive correlation between Seoul and Tokyo. The correlation between the global group and the East Asian groups is relatively lower than that among the East Asian groups.