• Title/Summary/Keyword: Multinomial logistic

Search Result 226, Processing Time 0.021 seconds

Prediction on Busan's Gross Product and Employment of Major Industry with Logistic Regression and Machine Learning Model (로지스틱 회귀모형과 머신러닝 모형을 활용한 주요산업의 부산 지역총생산 및 고용 효과 예측)

  • Chae-Deug Yi
    • Korea Trade Review
    • /
    • v.47 no.2
    • /
    • pp.69-88
    • /
    • 2022
  • This paper aims to predict Busan's regional product and employment using the logistic regression models and machine learning models. The following are the main findings of the empirical analysis. First, the OLS regression model shows that the main industries such as electricity and electronics, machine and transport, and finance and insurance affect the Busan's income positively. Second, the binomial logistic regression models show that the Busan's strategic industries such as the future transport machinery, life-care, and smart marine industries contribute on the Busan's income in large order. Third, the multinomial logistic regression models show that the Korea's main industries such as the precise machinery, transport equipment, and machinery influence the Busan's economy positively. And Korea's exports and the depreciation can affect Busan's economy more positively at the higher employment level. Fourth, the voting ensemble model show the higher predictive power than artificial neural network model and support vector machine models. Furthermore, the gradient boosting model and the random forest show the higher predictive power than the voting model in large order.

The Determinants of Ginseng Products Purchase during the Trip in Korea (인삼 제품 구매 선택과 결정 요인 분석)

  • Ho-Jung Yoon;Hyun Sung Cho;Sung Ah Lim
    • Journal of Ginseng Culture
    • /
    • v.5
    • /
    • pp.97-114
    • /
    • 2023
  • Despite numerous studies, research on ginseng in aspect of an economic and business perspective are insufficient. Recently, research to reveal the economic cause of ginseng products purchase is drawing attention. The purpose of this study is to analyze empirically the factors of ginseng products purchase by international consumers from a microeconomic perspective. Using the survey data, we empirically investigate the determinants of ginseng products purchase by international consumers visiting Korea. We use a multinomial logistic model to find the determinants that influence the purchase of ginseng products. This study finds the followings. First, the economic factor is an important determinant of ginseng products purchase. The average daily expenditure has a greater impact on ginseng products purchase than household income does. Even though the average daily expenditure is high, they tend to buy less ginseng products when they prefer other products. Second, demographically, gender and age are also important determinants of ginseng products purchase. It has been found that elderly male consumers are more likely to buy ginseng products. Third, international consumers for leisure purposes have a higher probability of buying ginseng products than tourism consumers for other purposes do. Finally, destination attributes such as security (safety), ease of use of mobile/Internet, and ease of finding directions are also important factors affecting ginseng products purchase. In addition, it is found that the convenience of using the mobile/Internet, the ease of finding directions, and the convenience of shopping increase the probability of buying ginseng products by international consumers. This study is meaningful in that it explored the determinants of ginseng products purchase by analyzing individual consumers' ginseng products choices.

Semiparametric mixture of experts with unspecified gate network

  • Jung, Dahai;Seo, Byungtae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.3
    • /
    • pp.685-695
    • /
    • 2017
  • The traditional mixture of experts (ME) modeled the gate network using a certain parametric function. However, if the assumed parametric function does not properly reflect the true nature, the prediction strength of ME would become weak. For example, the parametric ME often uses logistic or multinomial logistic models for the network model. However, this could be very misleading if the true nature of the data is quite different from those models. Although, in this case, we may develop more flexible parametric models by extending the model at hand, we will never be free from such misspecification problems. In order to alleviate such weakness of the parametric ME, we propose to use the semi-parametric mixture of experts (SME) in which the gate network is estimated in a non-parametrical way. Based on this, we compared the performance of the SME with those of ME and neural networks via several simulation experiments and real data examples.

Two-stage imputation method to handle missing data for categorical response variable

  • Jong-Min Kim;Kee-Jae Lee;Seung-Joo Lee
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.6
    • /
    • pp.577-587
    • /
    • 2023
  • Conventional categorical data imputation techniques, such as mode imputation, often encounter issues related to overestimation. If the variable has too many categories, multinomial logistic regression imputation method may be impossible due to computational limitations. To rectify these limitations, we propose a two-stage imputation method. During the first stage, we utilize the Boruta variable selection method on the complete dataset to identify significant variables for the target categorical variable. Then, in the second stage, we use the important variables for the target categorical variable for logistic regression to impute missing data in binary variables, polytomous regression to impute missing data in categorical variables, and predictive mean matching to impute missing data in quantitative variables. Through analysis of both asymmetric and non-normal simulated and real data, we demonstrate that the two-stage imputation method outperforms imputation methods lacking variable selection, as evidenced by accuracy measures. During the analysis of real survey data, we also demonstrate that our suggested two-stage imputation method surpasses the current imputation approach in terms of accuracy.

Trajectories of Self-rated Health among One-person Households: A Latent Class Growth Analysis (1인가구의 주관적 건강상태 변화: 잠재계층성장모형을 활용하여)

  • Kim, Eunjoo;Kim, Hyang;Yoon, Ju Young
    • Research in Community and Public Health Nursing
    • /
    • v.30 no.4
    • /
    • pp.449-459
    • /
    • 2019
  • Purpose: The aim of this study is to explore different types of self-rated health trajectories among one-person households in Korea. Methods: We used five time-point data derived from Korea Health Panel (2011~2015). A latent growth curve modeling was used to assess the overall feature of self-rated health trajectory in one-person households, and a latent class growth modeling was used to determine the number and shape of trajectories. We then applied multinomial logistic regression on each class to explore the predicting variables. Results: We found that the overall slope of self-rated health in one-person households decreases. In addition, latent class analysis demonstrated three classes: 1) High-Decreasing class (i.e., high intercept, significantly decreasing slope), 2) Moderate-Decreasing class (i.e., average intercept, significantly decreasing slope), and 3) Low-Stable class (i.e., low intercept, flat and nonsignificant slope). The multinomial logistic regression analysis showed that the predictors of each class were different. Especially, one-person households with poor health condition early were at greater risk of being Low-Stable class compared with High-Decreasing class group. Conclusion: The findings of this study demonstrate that more attentions to one-person households are needed to promote their health status. Policymakers may develop different health and welfare programs depending on different characteristics of one-person household trajectory groups in Korea.

Association between oral health status and body mass index in older adults (노인의 구강건강상태와 체질량지수의 연관성)

  • Cho, Younyoung;Lee, Yunhwan;Kim, Jinhee
    • Journal of Korean society of Dental Hygiene
    • /
    • v.16 no.1
    • /
    • pp.129-136
    • /
    • 2016
  • Objectives: The purpose of the study is to investigate the relationship between oral health status and body mass index (BMI) in adults over 65 years old. Methods: The study subjects were 4,550 adults over 65 years old from the 5th Korea National Health and Nutrition Examination Survey(KNHANES V) in 2010-2012. Mastication-related oral health status included the number of remaining teeth, and mean number of decayed, missing, and filled permanent teeth(DMFT). Body mass index(BMI, $kg/m^2$) was categorized as underweight(<18.5), normal weight (18.5-22.9), overweight(23.0-24.9), and obese(${\geq}25.0$). Multinomial logistic regression analysis was performed to examine the association of BMI categories with the number of remaining teeth and DMFT. Results: The mean number of DMFT was highest($13.0{\pm}0.7$) in the underweight group and lowest($8.8{\pm}0.3$) in the obese group. Those having less favorable masticatory ability, and fewer number of remaining teeth and no prosthesis, tended to be underweight. Those having a higher number of remaining teeth and prosthetic teeth tended to be overweight or obese. In the multinomial logistic regression analysis, compared with those having 20 or more remaining teeth, including prosthetic teeth, those having less than 20 remaining teeth and no prosthesis had 4.48 times higher odds ratio of being underweight. DMFT was positively associated with underweight, while negatively associated with overweight or obesity. Conclusions: The masticatory ability and dental caries prevention maintained the healthy body weight in adults of old age.

Failure Analysis to Derive the Causes of Abnormal Condition of Electric Locomotive Subsystem (센서 데이터를 이용한 전기 기관차의 이상 상태 요인분석)

  • So, Min-Seop;Jun, Hong-Bae;Shin, Jong-Ho
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.41 no.2
    • /
    • pp.84-94
    • /
    • 2018
  • In recent years, the diminishing of operation and maintenance cost using advanced maintenance technology is attracting many companies' attention. Especially, the heavy machinery industry regards it as a crucial problem since a failure of heavy machinery requires high cost and long downtime. To improve the current maintenance process, the heavy machinery industry tries to develop a methodology to predict failure in advance and to find its causes using usage data. A better analysis of failure causes requires more data so that various kinds of sensor are attached to machines and abundant amount of product usage data is collected through the sensor network. However, the systemic analysis of the collected product usage data is still in its infant stage. Many previous works have focused on failure occurrence as statistical data for reliability analysis. There have been less works to apply product usage data into root cause analysis of product failure. The product usage data collected while failures occur should be considered failure cause analysis. To do this, this study proposes a methodology to apply product usage data into failure cause analysis. The proposed methodology in this study is composed of several steps to transform product usage into failure causes. Various statistical analysis combined with product usage data such as multinomial logistic regression, T-test, and so on are used for the root cause analysis. The proposed methodology is applied to field data coming from operated locomotive and the analysis result shows its effectiveness.

Statistical Analysis of Factors Associated with Facial Bone Fractures (안면골 골절의 발생 인자에 대한 통계학적 분석)

  • Suh, Yong Hoon;Kim, Young Joon
    • Archives of Craniofacial Surgery
    • /
    • v.13 no.1
    • /
    • pp.36-40
    • /
    • 2012
  • Purpose: Statistical analysis of facial bone fractures has been performed in various papers. However, reports on risk factors for facial bone fractures are rare. In order to prevent facial bone fractures, it is important to determine the risk factors for their occurrence. This study seeks to perform a statistical analysis on and identify the risk factors associated with facial bone fractures. Methods: A retrospective study was performed to assess facial bone fractures in patients presenting from October 2009 to January 2011 through a chart review. The data collected included age, gender, etiology, and alcohol consumption. Data was analyzed using multinomial logistic regression analysis. The significance level was set at p<0.05 and SAS ver. 9.2 was used. Results: A total of 489 patients were analyzed. The patients' age ranged from 2 to 85 years (mean age, $31.8{\pm}15.4$ years). The ratio of men to women was 5.0:1. The predominant group was age below 19 years old (30.9%). The main causes of facial bone fractures were assaults (37.8%), falls (27.2%), and sport accidents (19.5%). On multinomial logistic regression analysis, age, especially in the teen group was associated with assaults (p<0.05) resulting in facial bone fractures. Alcohol consumption was significantly associated with assaults and falls (p<0.05) leading to facial bone fractures. Conclusion: Facial bone fracture is a challenging problem, because of its high incidence and financial cost. The findings of this study indicate that more effective policies aimed at reducing alcohol intake and teenage violence are needed.

A Study on Influence of Fishing Villages Experience Program Choice by the Tourist Characteristics (관광객 특성에 따른 어촌체험프로그램 선택의 영향력 분석)

  • Lee, Seo-Gu;Choi, Kyu-Chul;Kim, Jung-Tae
    • Journal of Korean Society of Rural Planning
    • /
    • v.26 no.3
    • /
    • pp.1-12
    • /
    • 2020
  • The purpose of this study is to analysis the influence of fishing villages experience programs choice by the tourist characteristics. As an analysis method, a statistical technique of multinomial logistic regression was used. The dependent variable have typified about 70 fishing experience programs, such as tidal-flat experience, fishery experience, and fishing experience, operated by the fishing village experience recreation villages into 9 programs. The independent variables consisted of 7 groups of people: gender, age, marital status, presence of children, experience of visiting a village in a rural and fishing village experience, preference of a village in a recreational experience, and recognition of a village in a fishing village experience. As a result of analysis, no significant differences were found that the selection group preferring 'fishing culture experience', 'leports experience', 'ecological craft experience', and 'festival and event experience' in the selection of fishing village experience program compared to the group choosing 'rural experience'. On the other hand, the group preferring 'tidal flat experience' analysis that 'married' is about 14 times higher than 'unmarried', and the group preferring 'fishing village experience' is 9.55 times higher than the group preferring 'rural village experience'. In the group preferring 'fishery experience' and 'fishing experience', the group preferring 'fishing experience recreation village' was 9.21 times and 14.34 times higher than the group preferring 'rural experience recreation village'. In the 'food experience', 'married' was 25 times higher than 'unmarried'.

Convergence Study on Research Topics for Thyroid Cancer in Korea (국내 갑상선암 논문 토픽에 대한 융합연구)

  • Yang, Ji-Yeon
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.2
    • /
    • pp.75-81
    • /
    • 2019
  • The purpose of this study was to perform a convergence study for the investigation of the trend of research topics related to thyroid cancer in Korea. We collected related research papers from DBpia and employed LDA-based topic model. In result, we identified four research topics, each of which concerns "Surgery", "Disease aggressiveness", "Survival analysis", and "Well-being of patients". With multinomial logistic regression, we found significant time trend, where "Surgery"-related topic was popular before 2000, topics regarding "Disease aggressiveness" and "Survival analysis" were frequently addressed in the 2000s, and "Survival analysis" and especially "Well-being of patients" have been pursued since 2010. The findings would serve as a reference guide for research directions. Future work may examine whether the recent change in research topics is observed in other diseases.