• Title/Summary/Keyword: 분류모형 (classification model)

Search Result 1,674, Processing Time 0.037 seconds

Development of Sentiment Analysis Model for the hot topic detection of online stock forums (온라인 주식 포럼의 핫토픽 탐지를 위한 감성분석 모형의 개발)

  • Hong, Taeho;Lee, Taewon;Li, Jingjing
    • Journal of Intelligence and Information Systems / v.22 no.1 / pp.187-204 / 2016
  • Document classification based on emotional polarity has emerged as an important task owing to the explosion of data on the Web. In the big data age, there are too many information sources to consult when making decisions. For example, when considering travel to a city, a person may search for reviews through a search engine such as Google or on social networking services (SNSs) such as blogs, Twitter, and Facebook. The emotional polarity of positive and negative reviews helps a user decide whether or not to make the trip. Sentiment analysis of customer reviews has become an important research topic as data mining technology is widely adopted for text mining of the Web. Sentiment analysis has been used to classify documents through machine learning techniques such as decision trees, neural networks, and support vector machines (SVMs). It is used to determine the attitude, position, and sensibility of people who write articles about various topics published on the Web. Regardless of their polarity, emotional reviews are very helpful material for analyzing customers' opinions. Sentiment analysis helps reveal quickly what customers really want, using automated text mining techniques that extract subjective information from text on the Web. In this study, we developed a model that selects hot topics from user posts at China's online stock forum by using the k-means algorithm and the self-organizing map (SOM). In addition, we developed a detection model to predict hot topics by using machine learning techniques such as logit, decision trees, and SVM. We employed sentiment analysis to develop our model for the selection and detection of hot topics from China's online stock forum. The sentiment analysis calculates a sentiment value for a document by comparing and classifying its words against a polarity sentiment dictionary (positive or negative). The online stock forum was an attractive site because of its information about stock investment. Users post numerous texts about stock movements, analyzing the market according to government policy announcements, market reports, reports from research institutes on the economy, and even rumors. We divided the online forum's topics into 21 categories to apply sentiment analysis. One hundred forty-four topics were selected among the 21 categories of the online stock forum. The posts were crawled to build a positive and negative text database. We ultimately obtained 21,141 posts on 88 topics by preprocessing the text from March 2013 to February 2015. An interest index was defined to select the hot topics, and the k-means algorithm and SOM produced equivalent results on this data. We developed decision tree models to detect hot topics with three algorithms: CHAID, CART, and C4.5. The results of CHAID were subpar compared to the others. We also employed SVM to detect the hot topics from negative data. The SVM models were trained with the radial basis function (RBF) kernel, with parameters chosen by grid search, to detect the hot topics. Detecting hot topics through sentiment analysis gives investors the latest trends and hot topics in the stock forum, so that they no longer need to search the vast amounts of information on the Web. Our proposed model is also helpful for rapidly determining customers' signals or attitudes towards government policy and firms' products and services.
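The dictionary-based scoring step described in the abstract above can be illustrated with a minimal sketch. The word lists, the `sentiment_score` name, and the normalization `(pos - neg) / (pos + neg)` are assumptions made for illustration; the abstract does not state the actual dictionary or formula, and the study worked with Chinese-language posts.

```python
import re

# Hypothetical toy lexicons (assumption); the study used a polarity
# sentiment dictionary that is not reproduced in the abstract.
POSITIVE = {"gain", "rally", "bullish", "growth"}
NEGATIVE = {"loss", "crash", "bearish", "risk"}


def sentiment_score(text: str) -> float:
    """Return (pos - neg) / (pos + neg), a value in [-1, 1].

    Returns 0.0 when the text contains no lexicon word at all.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    pos = sum(token in POSITIVE for token in tokens)
    neg = sum(token in NEGATIVE for token in tokens)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total


if __name__ == "__main__":
    for post in ("Bullish rally after strong growth",
                 "Crash risk, but some gain"):
        print(post, "->", round(sentiment_score(post), 3))
```

Scores like these could then serve as input features for the clustering and classification steps; how the study actually combined them with its interest index is not stated in the abstract.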

Extraction of Primary Factors Influencing Dam Operation Using Factor Analysis (요인분석 통계기법을 이용한 댐 운영에 대한 영향 요인 추출)

  • Kang, Min-Goo;Jung, Chan-Yong;Lee, Gwang-Man
    • Journal of Korea Water Resources Association / v.40 no.10 / pp.769-781 / 2007
  • Factor analysis is commonly employed to reduce the quantity of data and to summarize information on a system or phenomenon. In this methodology, variables are grouped into several factors according to their statistical characteristics, and the results are used to drop variables that carry lower weight than others. In this study, factor analysis was applied to extract the primary factors influencing multi-dam system operation in the Han River basin, where there are two multi-purpose dams, Soyanggang Dam and Chungju Dam, and water is supplied by operating the two dams jointly in the water use season. To carry out the factor analysis, the variables related to the operation of the two dams were first gathered and divided into five groups (Soyanggang Dam: inflow, hydropower production, storage management, storage, and past operation results; Chungju Dam: inflow, hydropower production, water demand, storage, and past operation results). Then, considering their statistical properties, some of the gathered variables were chosen and grouped into five factors: hydrological condition, past dam operation, dam operation in the normal season, water demand, and downstream dam operation. To check the appropriateness and applicability of the factors, a multiple regression equation was constructed using the factors as explanatory variables, and the factors were compared with the terms of the objective function used for optimal operation of water resources in a river basin. These two checks showed that the suggested approach provided satisfactory results. The extracted primary factors are expected to be useful for making dam operation schedules that consider the future situation and previous results.

Development of Needs Extraction Algorithm Fitting for Individuals in Care Management for the Elderly in Home (재가노인 사례관리의 욕구사정 정확도 향상을 위한 욕구추출 알고리즘 개발 - 데이터 마이닝 분석기법을 활용하여 -)

  • Kim, Young-Sook;Jung, Kook-In;Park, So-Rah
    • Korean Journal of Social Welfare
    • /
    • v.60 no.1
    • /
    • pp.187-209
    • /
    • 2008
  • The authors developed 28 needs assessment tools for integrated, needs-centered assessment, which is the core element in care management for the elderly at home. The authors also collected assessment data on 676 elderly persons at home from 120 centers under the Korea Association of Senior Welfare Centers by using the needs assessment tools, and finally developed a needs extraction algorithm through decision tree analysis in data mining to identify their actual needs and provide social welfare services suited to those needs. The needs extraction algorithms for the 28 needs of the elderly at home are summarized in the paper. Need No. 8, "Having need of help in going out", for example, was split in the decision-making model into 80.3% for those asking for help and 11.4% for those not asking for help, with Appeal No. 23 as the major variable. The need increased by 87.9% when the elderly appealed for help to go out and had a caregiver, but decreased by 47.4% when they had no caregiver. When the elderly asked for help in going out, had a caregiver, and needed complete help in cleaning, their need of help in going out was 94.2%. However, among those who answered that they needed complete help in bathing (ADL), even if they did not ask for help in going out, the need of help in going out rose sharply from 11.4% to 80.0%. On the other hand, when they needed only partial help in bathing or bathed independently, the probability of being classified as needing help in going out was as low as 7.7%. In this decision-making model, the minimum numbers of cases for a parent node and a child node were set to 50 and 25, respectively, with a maximum tree depth of 5 as the stopping rule. With these settings, the effectiveness of the decision-making model was found to be 182.13% for the need "Having need of help in going out". The algorithm presented in this study can be useful as systematic and scientific fundamental data in the assessment of the needs of the elderly at home.
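The stopping rule quoted in the abstract above (at least 50 cases in a parent node, at least 25 in each child node, maximum depth 5) can be sketched as a simple pre-pruning check. The function name `may_split` and the convention that the root sits at depth 0 are assumptions for illustration; the abstract does not say which software or depth convention the authors used.

```python
from typing import Sequence

# Thresholds taken from the abstract's stopping rule.
MIN_PARENT_CASES = 50
MIN_CHILD_CASES = 25
MAX_TREE_DEPTH = 5


def may_split(n_parent: int, child_sizes: Sequence[int], depth: int) -> bool:
    """Decide whether a candidate split is allowed under the stopping rule.

    n_parent    -- number of cases in the node being split
    child_sizes -- number of cases each proposed child would receive
    depth       -- depth of the node being split (root = 0, an assumption)
    """
    if depth >= MAX_TREE_DEPTH:
        return False  # children would exceed the maximum depth
    if n_parent < MIN_PARENT_CASES:
        return False  # node too small to act as a parent
    return all(n >= MIN_CHILD_CASES for n in child_sizes)


if __name__ == "__main__":
    print(may_split(676, [400, 276], depth=0))  # a split at the root
    print(may_split(60, [40, 20], depth=2))     # one child below 25 cases
```

A rule like this keeps every reported percentage backed by at least 25 cases, which matters with a sample of 676 persons.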

  • Food-Related Lifestyle Segments of Older Consumers in Seoul and Its Characteristics (서울지역 고령소비자의 식생활 라이프스타일에 근거한 시장세분화 및 특성 규명)

    • Jang, Yoon-Jung
      • Journal of the Korean Society of Food Science and Nutrition / v.39 no.1 / pp.146-153 / 2010
    • The objectives of this study were to explore food-related lifestyle segments of older consumers, to identify their socio-demographic characteristics, and to investigate differences in variables regarding health beliefs. A survey was conducted of adults 55 years of age and older living in Seoul, South Korea, from March 28 to April 10, 2007. Out of the 500 distributed questionnaires, 361 were retained for final analysis, a response rate of 72.2%. Cluster analysis identified five consumer segments: a health-managing group, a diet-unconcerned group, a convenience-oriented group, a taste-oriented group, and an unpracticed group. Significant differences were found among the five segments in terms of socio-demographic characteristics and variables regarding health beliefs (i.e., perceived self-efficacy, perceived barriers, perceived benefits). In the health-managing group and the taste-oriented group, mean scores of perceived self-efficacy (p<0.001) and perceived benefits (p<0.001) were significantly higher than in other groups. However, in the diet-unconcerned group and the convenience-oriented group, the mean scores of perceived barriers (p<0.01) were significantly high. This study shows that foodservice operators targeting older consumers should consider the characteristics of each segment to develop a customized program.

    A Study of the Curriculum Design Modelling Focused on the Combination of National Competency Standards and the Already-Accredited Course in the Department of Social Welfare in the Junior College (과정이수형 자격제도 운영 학과의 NCS 기반 교육과정 설계모형 연구 - 전문대학 사회복지과를 중심으로)

    • Park, Yong Woon;Kim, Kyoung Mee;Yoo, Tae Wan
      • The Journal of the Korea Contents Association / v.16 no.2 / pp.652-665 / 2016
    • National Competency Standards (NCS) is an educational system that emphasizes developing job-related abilities. It can therefore be an effective solution for training a field-oriented workforce if properly applied. However, in departments of social welfare it is not easy to apply NCS to the curriculum, because most academic subjects concerning social welfare focus on theory rather than practice and, in addition, most social welfare departments in junior colleges have an accredited curriculum for the 2nd-degree social worker qualification. This means it is unreasonable to apply NCS to the curriculum without prior changes to the existing qualification system. This paper therefore proposes a draft model for applying NCS to the already-accredited curriculum for 2nd-degree social workers in junior colleges, with details as follows. Firstly, the competency units are customized for the existing academic subjects in the curriculum, rather than new subjects being developed in accordance with NCS competency units. Secondly, some client-related competency units, covering children, seniors, and the disabled, are newly developed and then applied to the curriculum; these are crucial for career development at the junior college level. Thirdly, the competency units are categorized into three types according to their degree of job relevancy: type 1, type 2, and type 3. Fourthly, four out of 11 basic job abilities are selected and then developed into academic subjects. Fifthly, all competency units concerning the main job market are regarded as one virtual competency unit and arranged in the order of type 1, type 2, and type 3, and the scope of their study is then adjusted to the job abilities required in the main job market.

    A Study on the Relationship between Standardization and Technological Innovation: Panel Data and Canonical Correlation Analysis through the use of Standardization Data and Patent Data (표준과 기술혁신의 관계에 관한 연구: 표준 제정·보유정보와 특허정보를 이용한 패널데이터 분석 및 정준상관 분석)

    • Lee, Heesang;Kim, Sooncheon;Jeon, Yejun
      • Journal of Korea Technology Innovation Society / v.19 no.3 / pp.465-482 / 2016
    • Previous research has introduced various ways to analyze the impact of standardization on innovation, but such works are not only few in number but also based on interviews or case studies. This paper addresses the impact of standardization activities within South Korean industries on technological innovation through an empirical analysis of standardization activities and technological innovation. Drawing on the Korean Industrial Standards Classification and panel data from 2003 to 2012, we employed the corresponding data for each industrial classification: number of standards, accumulated number of standards, number of patents applied for in Korea, sales, operating profit, intangible assets, and R&D investment. In the first model, we run panel data models employing the number of patents applied for in Korea as the dependent variable, and the number of standards, accumulated number of standards, sales, and operating profit as independent variables, to observe industrial impacts on the relationship between standards and patents, with time lags taken into account. The result shows that the number of standards has a negative influence on patent applications in the same year, no significant effect in the next two years, and a positive effect in the third year. Meanwhile, the accumulated number of standards turned out to have a positive effect on patent applications in Korea. This implies that it takes time for innovators to embrace newly established standards, while standards have a significant positive effect on technological innovation in the long term. In the second model, we use canonical correlation analysis to find industry-wide characteristics. The result of this model is equivalent to the result of the panel data analysis except in a few industries, where some industry-specific characteristics appear. Our results imply that Korean policy makers have to take industrial effects on standardization into account to promote technological innovation.

    A STUDY OF DENTAL CROWDING AND ITS RELATIONSHIP TO MANDIBULAR INCISOR SHAPE BY MODEL ANALYSIS IN ADOLESCENTS (청소년 석고 모형 분석에 의한 하악절치 형태와 치아밀집의 상관관계에 관한 연구)

    • Surh, Jeong-Eun;Baik, Hyoung-Seon
      • The Korean Journal of Orthodontics / v.25 no.5 s.52 / pp.593-604 / 1995
    • Mandibular incisor crowding is one of the most common features of malocclusion and is an interesting characteristic in view of relapse and stability after orthodontic treatment. There are many potential factors in the etiology of lower anterior crowding. Tooth size variation is one of them, but the biologic significance of the faciolingual width of the teeth has been overlooked. Peck and Peck reported that persons with ideal mandibular incisor alignment had incisors with smaller mesiodistal and larger faciolingual dimensions than persons with incisor crowding. On the basis of these findings they suggested the MD/FL index as a clinical guideline for the assessment of lower incisor crowding. The present study was undertaken to examine the relationship between mandibular incisor crowding and mandibular incisor dimensions, and to determine their correlation with arch length discrepancy. Dental casts of 154 people from 11 to 17 years of age were made and divided into a normal group with an irregularity index less than 1 and a crowding group with an irregularity index greater than 1. The casts were measured and analyzed statistically. The results were as follows. 1. The mean mesiodistal width of the mandibular incisors was larger in the crowding group, with a significant difference in the central incisor measurement. There were no significant differences in the faciolingual width or the MD/FL index. 2. The irregularity index had significant correlation coefficients with mesiodistal width and the MD/FL index of the mandibular incisors in the crowding group, but no correlation with faciolingual width. It also correlated with maxillary and mandibular arch length discrepancy, total tooth material, mandibular intercanine width, and mandibular inter-first-premolar width. 3. Upper and lower arch length discrepancy had significant correlations with the mesiodistal width of the mandibular incisors and with overbite, but no correlation with faciolingual width. Lower arch length discrepancy had a significant correlation with the MD/FL index of the mandibular incisors, and upper arch length discrepancy correlated with the MD/FL index of the mandibular lateral incisor. 4. Significant differences were observed between the normal and crowding groups in mandibular arch length discrepancy and overbite.


    Study of Rainfall-Runoff Variation by Grid Size and Critical Area (격자크기와 임계면적에 따른 홍수유출특성 변화)

    • Ahn, Seung-Seop;Lee, Jeung-Seok;Jung, Do-Joon;Han, Ho-Chul
      • Journal of Environmental Science International / v.16 no.4 / pp.523-532 / 2007
    • This study used the 1:25,000 topographic map, from the National Geographic Information Institute, of the area upstream of the Geum-ho water-level gauge located in the middle reach of the Geum-ho River. For the analysis, the influence of the size of the critical area on the hydro-topographic factors was first examined by changing the grid size to $10\,\mathrm{m} \times 10\,\mathrm{m}$, $30\,\mathrm{m} \times 30\,\mathrm{m}$, and $50\,\mathrm{m} \times 50\,\mathrm{m}$, and the critical area for the formation of a river from $0.01\,\mathrm{km}^2$ to $0.50\,\mathrm{km}^2$. The examination of watershed morphology by grid size shows that the smaller the grid size, the better the resolution and accuracy. The analysis of stream order by minimum critical area for each grid size shows that grid size does not affect stream order, and that the number of streams of 2nd and higher order does not differ remarkably, while the number of 1st-order streams differs greatly. From these results, a critical area of $0.15\,\mathrm{km}^2$ to $0.20\,\mathrm{km}^2$ is considered appropriate for the formation of a river, irrespective of grid size, when extracting the hydro-topographic parameters used in runoff analysis models based on topographic maps. The GIUH model was then applied using the stream order law proposed in this study to explain how the runoff response changes with the critical value chosen for the lowest-order streams. Because peak time and peak discharge are very significant in a flood where there are no undertow facilities, and given the model's ease of application and of data acquisition, it gave good results for forecasting river runoff; the model is therefore judged suitable as an algorithm for deciding the critical value for a river basin.
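To make the interplay of grid size and critical area concrete, the sketch below converts a critical area into the number of grid cells that must drain to a cell before it counts as part of a stream. It assumes a flow-accumulation style of channel extraction, which the abstract does not name, and the function name `threshold_cells` is hypothetical.

```python
def threshold_cells(critical_area_m2: int, grid_size_m: int) -> int:
    """Smallest whole number of cells whose combined area reaches the critical area.

    critical_area_m2 -- critical (minimum contributing) area in square metres
    grid_size_m      -- side length of one square grid cell in metres
    """
    cell_area_m2 = grid_size_m * grid_size_m
    # Integer ceiling division avoids floating-point rounding surprises.
    return (critical_area_m2 + cell_area_m2 - 1) // cell_area_m2


if __name__ == "__main__":
    critical_area_m2 = 150_000  # 0.15 km^2, the lower bound suggested in the abstract
    for grid_size_m in (10, 30, 50):
        print(grid_size_m, "m grid:", threshold_cells(critical_area_m2, grid_size_m), "cells")
```

The same physical threshold corresponds to very different cell counts at each resolution, which is one reason the number of extracted 1st-order streams can be sensitive to these choices.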

    Effect of Capital Market Return On Insurance Coverage : A Financial Economic Approach (투자수익(投資收益)이 보험수요(保險需要)에 미치는 영향(影響)에 관한 이론적(理論的) 고찰(考察))

    • Hong, Soon-Koo
      • The Korean Journal of Financial Management / v.10 no.1 / pp.249-280 / 1993
    • Recent financial theory views insurance policies as financial instruments that are traded in markets and whose prices reflect the forces of supply and demand. This article analyzes individuals' insurance purchasing behavior together with their capital market investment activities, which provides a more realistic look at the tradeoff between insurance and investment in the individual's budget constraint. It is shown that the financial economic concept of insurance cost should reflect the opportunity cost of the insurance premium. The author demonstrates the importance of riskless and risky financial assets in reaching an equilibrium insurance premium. In addition, the paper investigates how investment income could affect the four established theorems of the traditional insurance literature. At present in Korea, price deregulation is being debated as the most important current issue in the insurance industry. In view of the results of this paper, insurance companies should recognize investment income in pricing their coverage if insurance prices are deregulated. Otherwise, price competition may force insurance companies to restrict coverage or to leave the market.


    The Impact of SSM Market Entry on Changes in Market Shares among Retailing Types (기업형 슈퍼마켓(SSM)의 시장진입이 소매업태간 시장점유율 변화에 미친 영향)

    • Choi, Ji-Ho;Yonn, Min-Suk;Moon, Youn-Hee;Choi, Sung-Ho
      • Journal of Distribution Research / v.17 no.3 / pp.115-132 / 2012
    • This study empirically examines the impact of SSM market entry on changes in market shares among retailing types. The data are monthly time series spanning January 2000 to December 2010, and the effect of SSM market entry on the market shares of retailing types is analyzed using several key factors: the number of new SSM monthly entrants, the total number of SSMs, and the proportion of new SSM entrants smaller than $165\,\mathrm{m}^2$ among all new SSM entrants. According to the Korean Standard Industrial Classification codes, retailing types are classified into 5 groups: department stores, retail sale in other non-specialized large stores (big marts), supermarkets, convenience stores, and retail sale in other non-specialized stores with food or beverages predominating (others). The market share of each retailing type is calculated as the ratio of its monthly sales to total monthly retailing sales, where total retailing sales is the sum of the sales of all retailing types. The empirical model controls for size effects with the number of monthly employees for each retailing type and for macroeconomic effects with M2. The empirical model employed in this study is as follows: $$MS_i=f(NewSSM,\;CumSSM,\;employ_i,\;under165,\;M2)$$ where $MS_i$ is the market share of each retailing type (department stores, big marts, supermarkets, convenience stores, and others), NewSSM is the number of new SSM monthly entrants, CumSSM is the total number of SSMs, $employ_i$ is the number of monthly employees for each retailing type, under165 is the proportion of new SSM entrants smaller than $165\,\mathrm{m}^2$ among all new SSM entrants, and M2 is the money supply. The paper reports the correlations among these variables and the descriptive statistics of the sample. Sales is the total monthly revenue of each retailing type, employees is the total number of monthly employees for each retailing type, area is the total floor space of each retailing type ($\mathrm{m}^2$), number of stores is the total number of monthly stores for each retailing type, market share is defined as above, and new monthly SSMs is the total number of new monthly SSM entrants. The paper also reports the empirical results for the effect of new SSM market entry on changes in market shares among retailing types. The dependent variables are the market shares of department stores, big marts, supermarkets, convenience stores, and others. The result shows that the impact of new SSM market entry on changes in the market share of retail sale in other non-specialized large stores (big marts) is statistically significant. The total number of monthly SSM stores has a significant effect on market share, but the magnitude and sign of the effect differ among retailing types. An increase in the number of SSM stores has a negative effect on the market shares of big marts and convenience stores, but a positive impact on the market shares of department stores, supermarkets, and others. This study discusses the theoretical and practical implications of these findings and suggests directions for further analysis.
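The market share definition in the abstract above (each retailing type's monthly sales divided by the sum of sales over all retailing types) can be written out as a short sketch. The function name `market_shares`, the dictionary keys, and the sales figures are made up for illustration; none of them come from the study's data.

```python
from typing import Dict


def market_shares(monthly_sales: Dict[str, float]) -> Dict[str, float]:
    """Return each retailing type's share of total monthly retail sales."""
    total = sum(monthly_sales.values())
    if total <= 0:
        raise ValueError("total monthly sales must be positive")
    return {retail_type: sales / total for retail_type, sales in monthly_sales.items()}


if __name__ == "__main__":
    # Hypothetical sales for one month, in arbitrary currency units.
    sales = {
        "department_stores": 20.0,
        "big_marts": 30.0,
        "supermarkets": 25.0,
        "convenience_stores": 10.0,
        "others": 15.0,
    }
    for retail_type, share in market_shares(sales).items():
        print(f"{retail_type}: {share:.1%}")
```

Because the shares are computed against the sum of the five types, they always add up to one, so a gain for one retailing type necessarily appears as a loss for the others.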


  • (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.