• Title/Summary/Keyword: 로지스틱회귀분석

Search Result 1,645, Processing Time 0.036 seconds

A Case Study on Text Analysis Using Meal Kit Product Review Data (밀키트 제품 리뷰 데이터를 이용한 텍스트 분석 사례 연구)

  • Choi, Hyeseon;Yeon, Kyupil
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.5
    • /
    • pp.1-15
    • /
    • 2022
  • In this study, text analysis was performed on the mealkit product review data to identify factors affecting the evaluation of the mealkit product. The data used for the analysis were collected by scraping 334,498 reviews of mealkit products in Naver shopping site. After preprocessing the text data, wordclouds and sentiment analyses based on word frequency and normalized TF-IDF were performed. Logistic regression model was applied to predict the polarity of reviews on mealkit products. From the logistic regression models derived for each product category, the main factors that caused positive and negative emotions were identified. As a result, it was verified that text analysis can be a useful tool that provides a basis for maximizing positive factors for a specific category, menu, and material and removing negative risk factors when developing a mealkit product.

인공신경망을 이용한 부실기업예측모형 개발에 관한 연구

  • Jung, Yoon;Hwang, Seok-Hae
    • Proceedings of the Korea Database Society Conference
    • /
    • 1999.06a
    • /
    • pp.415-421
    • /
    • 1999
  • Altman의 연구(1965, 1977)나 Beaver의 연구(1986)와 같은 전통적 예측모형은 분석자의 판단에 따른 예측도가 높은 재무비율을 선정하여 다변량판별분석(MDA: multiple discriminant analysis), 로지스틱회귀분석 등과 같은 통계기법을 주로 이용해 왔으나 1980년 후반부터 인공지능 기법인 귀납적 학습방법, 인공신경망모형, 유전모형 둥이 부실기업예측에 응용되기 시작했다. 최근 연구에서는 인공신경망을 활용한 변수 및 모형개발에 관한 보고가 있다. 그러나 지금까지의 연구가 주로 기업의 재무적 비율지표를 고려한 모형에 치중되었으며 정성적 자료인 비재무지표에 대한 검증과 선정이 자의적으로 이루어져온 경향이었다. 또한 너무 많은 입력변수를 사용할 경우 다중공선성 문제를 유발시킬 위험을 내포하고 있다. 본 연구에서는 부실기업예측모형을 수립하기 위하여 정량적 요인인 재무적 지표변수와 정성적요인인 비재무적 지표변수를 모두 고려하였다. 재무적 지표변수는 상관분석 및 요인분석들을 통하여 유의한 변수들을 도출하였으며 비재무적 지표변수는 조직생태학내에서의 조직군내 조직사멸과 관련된 생태적 과정에 대한 요인들 중 조직군 내적요인으로 조직의 연령, 조직의 규모, 조직의 산업밀도를 도출하여 4개의 실험집단으로 분류하여 비재무적 지표변수를 보완하였다. 인공신경망은 다층퍼셉트론(multi-layer perceptrons)과 역방향 학습(back-propagation )알고리듬으로 입력변수와 출력변수, 그리고 하나의 은닉층을 가지는 3층 퍼셉트론(three layer perceptron)을 사용하였으며 은닉충의 노드(node)수는 3개를 사용하였다. 입력변수로 안정성, 활동성, 수익성, 성장성을 나타내는 재무적 지표변수와 조직규모, 조직연령, 그 조직이 속한 산업의 밀도를 비재무적 지표변수로 산정하여 로지스틱회귀 분석과 인공신경망 기법으로 검증하였다. 로지스틱회귀분석 결과에서는 재무적 지표변수 모형의 전체적 예측적중률이 87.50%인 반면에 재무/비재무적 지표모형은 90.18%로서 비재무적 지표변수 사용에 대한 개선의 효과가 나타났다. 표본기업들을 훈련과 시험용으로 구분하여 분석한 결과는 전체적으로 재무/비재무적 지표를 고려한 인공신경망기법의 예측적중률이 높은 것으로 나타났다. 즉, 로지스틱회귀분석의 재무적 지표모형은 훈련, 시험용이 84.45%, 85.10%인 반면, 재무/비재무적 지표모형은 84.45%, 85.08%로서 거의 동일한 예측적중률을 가졌으나 인공신경망기법 분석에서는 재무적 지표모형이 92.23%, 85.10%인 반면, 재무/비재무적 지표모형에서는 91.12%, 88.06%로서 향상된 예측적 중률을 나타내었다.

  • PDF

Comparative Analysis of Determination of Method Location between Classes (클래스 간 메소드 위치 결정 방법의 비교)

  • Jung, Young-Ae;Park, Young-B.
    • The Journal of the Korea Contents Association
    • /
    • v.6 no.12
    • /
    • pp.80-88
    • /
    • 2006
  • In Object-Oriented Paradigm, various cohesion measurements have been studied taking into account reference relation among components - like attributes and methods - that belong to a class. In addition, a number of methods have taken into research utilizing manual analysis, that is performed by developer's intuition and experience, and automatic analysis in refactoring field. The verification of objective criteria is demanded in order to process automatic refactoring. In this paper, we propose a method exploiting logistic regression and neural network for analysis of the relationship between six factors considering reference relation and method location among classes. Experimental results demonstrate that the logistic regression predicts the results up to 97% and the neural network predicts the outcomes up to 90%. Hence, we conclude that the logistic regression based method is more effective to predict the method location. Moreover, more than 90% of experimental results from both methods show that the six factors used in Move Method in refactoring are suitable to be used as an objective criteria.

  • PDF

Graphical regression and model assessment in logistic model (로지스틱모형에서 그래픽을 이용한 회귀와 모형평가)

  • Kahng, Myung-Wook;Kim, Bu-Yong;Hong, Ju-Hee
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.1
    • /
    • pp.21-32
    • /
    • 2010
  • Graphical regression is a paradigm for obtaining regression information using plots without model assumptions. The general goal of this approach is to find lowdimensional sufficient summary plots without loss of important information. Model assessments using residual plots are less likely to be successful in models that are not linear. As an alternative approach, marginal model plots provide a general graphical method for assessing the model. We apply the methods of graphical regression and model assessment using marginal model plots to the logistic regression model.

스플라인을 이용한 스코어 카드

  • Choe, Min-Seong;Gu, Ja-Yong;Choe, Dae-U
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2003.10a
    • /
    • pp.285-288
    • /
    • 2003
  • 신용위험 관리에서 필수적인 방법론이 스코어 카드이며 이를 작성하는 데에 있어서 널리 쓰이는 방법 중의 하나가 로지스틱 회귀분석이다. 본 논문에서는 로지스틱 회귀 방법에 기초한 스플라인 방법론을 소개하고자 한다. 최종 스코어 카드는 연속형 변수를 범주형 변수화 하므로 조각 선형 스플라인을 채택하였다. 모의 실험을 통하여 제안된 방법의 성 능을 규명 하였다.

  • PDF

Analysis for Factors of Predicting Problem Drinking by Logistic Regression Analysis (로지스틱 회귀분석을 이용한 문제음주 예측요인 분석)

  • Kim, Mi-Young
    • Journal of Digital Convergence
    • /
    • v.15 no.5
    • /
    • pp.487-494
    • /
    • 2017
  • The purpose of this study was to identify factors which predict problem drinking on adults. Using the data on the Korea Welfare Panel Study for the 7th year, 3,915 people responded to the demographic factor, psychosocial factors and drinking behavior. And the logistic regression analysis was conducted to identify predictors of problem drinking. As a result, 36 percent of those surveyed showed that the problem drinking group. Gender, age, education, occupation, economic status, self-esteem, depression, and satisfaction of family and social relationships were correlated to alcohol use. In addition, the results of logistic regression, gender, age, education, job, self-esteem, depression were predicted problem drinking. Based on these findings, it is recommended practical counterplan that prevention of the problem drinking.

Development of a Logistic Regression Model for Probabilistic Prediction of Debris Flow (토석류 산사태 예측을 위한 로지스틱 회귀모형 개발)

  • 채병곤;김원영;조용찬;김경수;이춘오;최영섭
    • The Journal of Engineering Geology
    • /
    • v.14 no.2
    • /
    • pp.211-222
    • /
    • 2004
  • In this study, a probabilistic prediction model for debris flow occurrence was developed using a logistic regression analysis. The model can be applicable to metamorphic rocks and granite area. order to develop the prediction model, detailed field survey and laboratory soil tests were conducted both in the northern and the southern Gyeonggi province and in Sangju, Gyeongbuk province, Korea. The seven landslide triggering factors were selected by a logistic regression analysis as well as several basic statistical analyses. The seven factors consist of two topographic factors and five geological and geotechnical factors. The model assigns a weight value to each selected factor. The verification results reveal that the model has 90.74% of prediction accuracy. Therefore, it is possible to predict landslide occurrence in a probabilistic and quantitative manner.

Thermal Comfort in Outdoor Environment by Questionnaire Survey : Using the Logistic Regresstion (로지스틱 회귀분석을 활용한 옥외공간에서의 온열쾌적감에 대한 피험자 설문 분석)

  • Lim, Jong-Yeon;Hwang, Hyo-Keun;Ryu, Min-Kyung;Song, Doo-Sam
    • 한국태양에너지학회:학술대회논문집
    • /
    • 2009.04a
    • /
    • pp.97-101
    • /
    • 2009
  • Calculating and predicting the thermal comfort in outdoor environment are difficult than in indoor environment because composition parameters are variable, interrelations among parameters are very complex and human activities in outdoor are diverse. Moreover, the thermal expectancy of subject in outdoor environment is different from that of indoor environment. The aims of this study are to examine the difference between indoor and outdoor thermal comfort range. With this in mind, field measurement for estimating outdoor thermal environment and a questionnaire survey with simultaneous measurement around the subject were conducted.

  • PDF