• Title/Summary/Keyword: Logistic Loss

A Study on the Employee Turnover Prediction using XGBoost and SHAP (XGBoost와 SHAP 기법을 활용한 근로자 이직 예측에 관한 연구)

  • Lee, Jae Jun;Lee, Yu Rin;Lim, Do Hyun;Ahn, Hyun Chul
    • The Journal of Information Systems / v.30 no.4 / pp.21-42 / 2021
  • Purpose: For companies to keep growing, they must properly manage human resources, which are the core of corporate competitiveness. Employee turnover means the loss of talent in the workforce. When an employee voluntarily leaves, the company loses its hiring and training investment, may see key personnel withdraw, and incurs new costs to train a replacement. From an employee's viewpoint, moving to another company is also risky because it can be time-consuming and costly. Therefore, to reduce the social and economic costs caused by employee turnover, it is necessary to accurately predict employee turnover intention, identify the factors affecting it, and manage them appropriately within the company. Design/methodology/approach: Prior studies have mainly used logistic regression and decision trees, which have explanatory power but poor predictive accuracy. To develop a more accurate prediction model, XGBoost is proposed as the classification technique. Then, to compensate for its lack of explainability, SHAP, one of the XAI techniques, is applied. As a result, the prediction accuracy of the proposed model is improved compared to conventional methods such as LOGIT and Decision Trees. By applying SHAP to the proposed model, the factors affecting overall employee turnover intention, as well as a specific sample's turnover intention, are identified. Findings: Experimental results show that the prediction accuracy of XGBoost is superior to that of logistic regression and decision trees. Using SHAP, we find that jobseeking, annuity, eng_test, comm_temp, seti_dev, seti_money, equl_ablt, and sati_safe significantly affect overall employee turnover intention. In addition, it is confirmed that the factors affecting an individual's turnover intention are more diverse. Our research findings imply that companies should adopt a personalized approach for each employee in order to effectively prevent his or her turnover.
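
Since this search is keyed on logistic loss, a minimal sketch of that loss may help when reading the entry above. It assumes a binary (stay/leave) target trained with the logistic objective, which the abstract does not state; the function names are hypothetical. Gradient boosting methods of this family use the first and second derivatives of the loss with respect to the raw score.

```python
import math

def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def logistic_loss(y: int, z: float) -> float:
    """Negative log-likelihood of label y in {0, 1} given raw score z.

    Equals log(1 + exp(z)) - y*z, evaluated without overflow.
    """
    return max(z, 0.0) + math.log1p(math.exp(-abs(z))) - y * z

def grad_hess(y: int, z: float) -> tuple:
    """First and second derivatives of the loss with respect to z."""
    p = sigmoid(z)
    return p - y, p * (1.0 - p)

# Hypothetical example: a leaver (y = 1) scored at z = 0, i.e. p = 0.5.
loss = logistic_loss(1, 0.0)
g, h = grad_hess(1, 0.0)
```

The gradient `p - y` is negative here, so a boosting step would push the raw score for this sample upward.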

Individual and Occupational Factors Associated With Low Back Pain: The First-ever Occupational Health Study Among Bangladeshi Online Professionals

  • Hossian, Mosharop;Nabi, Mohammad Hayatun;Hossain, Ahmed;Hawlader, Mohammad Delwer Hossain;Kakoly, Nadira Sultana
    • Journal of Preventive Medicine and Public Health / v.55 no.1 / pp.98-105 / 2022
  • Objectives: Low back pain (LBP) is a common chronic condition among sedentary workers that causes long-term productivity loss. This study aimed to identify the relationships of individual and occupational factors with LBP among Bangladeshi online professionals. Methods: We conducted a cross-sectional study involving 468 full-time online professionals who usually worked in a sitting position. One-month LBP complaints were assessed using a musculoskeletal subscale of subjective health complaints. The chi-square test was used to measure associations between categorical predictors and LBP, and multivariable logistic regression was conducted to identify the variables significantly associated with LBP. Results: LBP within the last month was reported by 65.6% of participants. Multivariable logistic regression analysis indicated that age >30 years (adjusted odds ratio [aOR], 0.40; 95% confidence interval [CI], 0.23 to 0.70) and being married (aOR, 0.59; 95% CI, 0.36 to 0.97) had significant negative associations with LBP. Significant positive associations were found for spending >50 hours weekly on average working in a sitting position (aOR, 1.61; 95% CI, 1.05 to 2.49), being overweight or obese (aOR, 1.87; 95% CI, 1.16 to 2.99), sleeping on a soft mattress (aOR, 2.01; 95% CI, 1.06 to 3.80), and ex-smoking status (aOR, 3.33; 95% CI, 1.41 to 7.87). Conclusions: A high prevalence of LBP was found among full-time online professionals. Long working hours in a sitting position showed a significant association with developing LBP. Smoking history, body mass index, and sleeping arrangements should also be taken into account when developing solutions to reduce LBP prevalence among online professionals.
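
The odds ratios above can be related back to logistic regression coefficients. The following is a rough sketch, assuming the reported intervals are Wald intervals (symmetric on the log-odds scale), which the abstract does not confirm; the helper names are hypothetical. It recovers the coefficient and standard error implied by the ex-smoking result (aOR, 3.33; 95% CI, 1.41 to 7.87) and rebuilds the interval from them.

```python
import math

Z95 = 1.959964  # two-sided 95% quantile of the standard normal

def or_to_beta_se(odds_ratio: float, lo: float, hi: float) -> tuple:
    """Recover the log-odds coefficient and its standard error from OR and CI."""
    beta = math.log(odds_ratio)
    se = (math.log(hi) - math.log(lo)) / (2 * Z95)
    return beta, se

def beta_se_to_or(beta: float, se: float) -> tuple:
    """Odds ratio with Wald 95% CI: exp(beta), exp(beta -/+ 1.96 * se)."""
    return (math.exp(beta),
            math.exp(beta - Z95 * se),
            math.exp(beta + Z95 * se))

# Ex-smoking status as reported in the abstract above.
beta, se = or_to_beta_se(3.33, 1.41, 7.87)
odds_ratio, lo, hi = beta_se_to_or(beta, se)
```

If the rebuilt bounds match the published ones to rounding precision, the interval is consistent with a Wald construction; an aOR below 1 (such as 0.40 for age >30 years) corresponds to a negative coefficient.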

Development of machine learning model for reefer container failure determination and cause analysis with unbalanced data (불균형 데이터를 갖는 냉동 컨테이너 고장 판별 및 원인 분석을 위한 기계학습 모형 개발)

  • Lee, Huiwon;Park, Sungho;Lee, Seunghyun;Lee, Seungjae;Lee, Kangbae
    • Journal of the Korea Convergence Society / v.13 no.1 / pp.23-30 / 2022
  • Reefer container failures cause large financial losses, but the current reefer container alarm system is inefficient. Prior studies have used simulation data from refrigeration systems, whereas studies using actual operation data from reefer containers are lacking. Therefore, this study classified the causes of failure using actual reefer container operation data. The actual data were imbalanced, and the imbalance problem was addressed by comparing logistic regression analysis combined with ENN-SMOTE and class weights against the 2-stage algorithm developed in this study. The 2-stage algorithm uses XGBoost, LGBoost, and DNN to distinguish faulty from normal operation in the first stage and to classify the cause of the fault in the second stage. The model using LGBoost in the 2-stage algorithm performed best, with 99.16% accuracy. This study proposes a final model based on the two-stage algorithm to address data imbalance, which is expected to be applicable to other industries.
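
Class weighting, one of the imbalance remedies mentioned above, can be sketched as follows. This uses the common "balanced" heuristic (weight = n_samples / (n_classes * class_count)); the abstract does not say which weighting scheme the authors used, and the label counts below are made up for illustration.

```python
from collections import Counter

def balanced_class_weights(labels: list) -> dict:
    """Weight each class inversely to its frequency ("balanced" heuristic)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {c: n_samples / (n_classes * cnt) for c, cnt in counts.items()}

# Hypothetical imbalanced data: 95 normal records (0) and 5 failures (1).
labels = [0] * 95 + [1] * 5
weights = balanced_class_weights(labels)

# Total weight contributed by each class after reweighting.
total_normal = 95 * weights[0]
total_failure = 5 * weights[1]
```

With these weights, each class contributes the same total weight to a weighted loss, so the rare failure class is not drowned out by the majority class.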

Occupational factors affecting the decline in pulmonary function among male farmers using occupational pesticide in Gyeonggi-do, South Korea

  • Sooyeon Lee;Jiyoung Han;Seung Hee Woo;Soo-Jin Lee
    • Annals of Occupational and Environmental Medicine / v.34 / pp.42.1-42.11 / 2022
  • Background: Occupational pesticide exposure is a potential risk for respiratory health effects. Most clinical studies on pesticide exposure were related to acute exposure, and only a few studies on chronic exposure have been conducted. This study investigated the chronic respiratory health status and the chronic effects of occupational pesticide exposures of farmers in Gyeonggi-do. Methods: Surveys and pulmonary function tests were conducted on 1,697 farmers in 16 regions of Gyeonggi-do. The structured questionnaire covered demographic characteristics, medical history, recent respiratory symptoms and diseases, and work-related conditions, and was administered through one-on-one interviews. The prevalence of respiratory diseases was compared using odds ratios (ORs) with 95% confidence intervals (CIs) estimated by logistic regression analysis. Additional multivariate logistic regression analysis was also conducted. Results: Pesticide work groups showed a significant association with an obstructive pattern in the lung function test (unadjusted OR, 2.38; 95% CI, 1.17-5.52). The selected work-related variables of pesticide exposure were 'start age,' 'cumulative duration,' 'mixing pesticides,' and 'protection (goggle).' The obstructive pattern in the lung function test showed significant associations with mixing pesticides (OR, 2.30; 95% CI, 1.07-5.46) and protection (goggle) use (OR, 0.34; 95% CI, 0.12-0.79). Conclusions: Mixing two or more pesticides showed a significant association with an obstructive pattern of lung function. Wearing goggles can be seen as an indicator of awareness and proper use of protective equipment, and loss of pulmonary function can be prevented when appropriate protection is worn.

Development of a Detection Model for the Companies Designated as Administrative Issue in KOSDAQ Market (KOSDAQ 시장의 관리종목 지정 탐지 모형 개발)

  • Shin, Dong-In;Kwahk, Kee-Young
    • Journal of Intelligence and Information Systems / v.24 no.3 / pp.157-176 / 2018
  • The purpose of this research is to develop a detection model for companies designated as administrative issues in the KOSDAQ market using financial data. Administrative issue designation flags companies with a high potential for delisting and gives them time to resolve the grounds for delisting under certain restrictions of the Korean stock market. It acts as an alarm that informs investors and market participants of which companies are likely to be delisted and warns them to invest safely. Despite this importance, there are relatively few studies on administrative issue prediction models compared with the many studies on bankruptcy prediction models. Therefore, this study develops and verifies a detection model for companies designated as administrative issues using financial data of KOSDAQ companies. In this study, logistic regression and decision trees are proposed as the data mining models for detecting administrative issues. According to the results of the analysis, the logistic regression model predicted the companies designated as administrative issues using three variables, ROE (Earnings before tax), Cash flows/Shareholder's equity, and Asset turnover ratio, and its overall accuracy was 86% on the validation dataset. The decision tree (Classification and Regression Trees, CART) model applied classification rules using Cash flows/Total assets and ROA (Net income), and its overall accuracy reached 87%. The implications of the financial indicators selected in our logistic regression and decision tree models are as follows. First, ROE (Earnings before tax) in the logistic detection model shows the profit and loss of the continuing business segments, excluding the revenue and expenses of discontinued businesses. Therefore, a weakening of this variable means that the competitiveness of the core business has weakened. If a large part of the profits is generated from one-off gains, it is very likely that the deterioration of business management will intensify further. As the ROE of a KOSDAQ company decreases significantly, it becomes highly likely that the company will be delisted. Second, cash flows to shareholder's equity represents the firm's ability to generate cash flow when the financial condition of subsidiary companies is excluded. In other words, a weakening of the management capacity of the parent company, excluding the subsidiaries' competence, can be a main reason for an increased possibility of administrative issue designation. Third, a low asset turnover ratio means that current and non-current assets are used ineffectively by the corporation, or that its asset investment is excessive. If the asset turnover ratio of a KOSDAQ-listed company decreases, it is necessary to examine corporate activities in detail from various perspectives, such as weakening sales or changes in inventories. Cash flows/Total assets, a variable selected by the decision tree detection model, is a key indicator of the company's cash condition and its ability to generate cash from operating activities. Cash flow indicates whether a firm can perform its main activities (maintaining its operating ability, repaying debts, paying dividends, and making new investments) without relying on external financial resources. Therefore, if the value of this variable is negative (-), it indicates the possibility that a company has serious problems in its business activities. If the cash flow from operating activities of a specific company is smaller than its net profit, the net profit has not been converted into cash, indicating a serious problem in managing the company's trade receivables and inventory assets. Therefore, it can be understood that as Cash flows/Total assets decreases, the probability of administrative issue designation and the probability of delisting increase. In summary, the logistic regression-based detection model in this study was found to be affected by the company's financial activities, including ROE (Earnings before tax), whereas the decision tree-based detection model predicts the designation based on the company's cash flows.

Association between Subjective Distress Symptoms and Argon Welding among Shipyard Workers in Gyeongnam Province (경남소재 일개조선소 근로자의 건강이상소견과 아르곤 용접과의 관련성)

  • Choi, Woo-Ho;Jin, Seong-Mi;Kweon, Deok-Heon;Kim, Jang-Rak;Kang, Yune-Sik;Jeong, Baek-Geum;Park, Ki-Soo;Hwang, Young-Sil;Hong, Dae-Yong
    • Journal of Korean Society of Occupational and Environmental Hygiene / v.24 no.4 / pp.547-555 / 2014
  • Objective: This study was conducted to investigate the association between subjective distress symptoms and argon welding among workers at a shipyard in Gyeongnam Province. Method: 31 argon and 29 non-argon welding workers were selected as study subjects in order to measure concentrations of personal dust, welding fumes, and other hazardous materials such as ZnO, Pb, Cr, FeO, MnO, Cu, Ni, $TiO_2$, MgO, NO, $NO_2$, $O_3$, $O_2$, $CO_2$, CO, and Ar. An interviewer-administered questionnaire survey was also performed on the same subjects. The items queried were as follows: age, height, weight, working duration, welding time, welding rod amounts used, drinking, smoking, and rate of subjective distress symptoms including headache and other symptoms such as fever, vomiting and nausea, metal fume fever, dizziness, tingling sensations, difficulty in breathing, memory loss, sleep disorders, emotional disturbance, hearing loss, hand tremors, visual impairment, neural abnormality, allergic reaction, runny nose and stuffiness, rhinitis, and suffocation. Statistical analysis was performed using SPSS software, version 18. Data are expressed as the mean $\pm$ SD. A $\chi^2$-test and a Shapiro-Wilk normality test were performed for the above variables. Logistic regression analysis was also conducted to identify the factors that affect the total score for subjective distress symptoms. Result: An association was shown between welding type (argon or non-argon welding) and the total score for subjective distress symptoms. Among the subjective distress symptoms, the complaint rates for vomiting and nausea, difficulty breathing, and allergic reactions were all significantly higher in the argon welding group. Only the concentrations of dust and welding fumes were normally distributed after natural log transformation. According to logistic regression analysis, the associations of working duration and welding type (argon or non-argon) with the total score of subjective distress symptoms were statistically significant (p=0.041, p=0.049, respectively). Conclusion: Our results suggest that argon welding could cause subjective distress symptoms in shipyard workers.

Monitoring of white striping and wooden breast cases and impacts on quality of breast meat collected from commercial broilers (Gallus gallus)

  • Malila, Yuwares;U-chupaj, Juthawut;Srimarut, Yanee;Chaiwiwattrakul, Premsak;Uengwetwanit, Tanaporn;Arayamethakorn, Sopacha;Punyapornwithaya, Veerasak;Sansamur, Chalutwan;Kirschke, Catherine P.;Huang, Liping;Tepaamorndech, Surapun;Petracci, Massimiliano;Rungrassamee, Wanilada;Visessanguan, Wonnop
    • Asian-Australasian Journal of Animal Sciences / v.31 no.11 / pp.1807-1817 / 2018
  • Objective: This study aimed to investigate white striping (WS) and wooden breast (WB) cases in breast meat collected from commercial broilers. Methods: A total of 183 breast samples were collected from male Ross 308 broilers slaughtered at the age of 6 weeks (n = 100) and 7 weeks (n = 83). The breasts were subjected to meat defect inspection, meat quality determination, and histological evaluation. Results: Of the 183 samples, 4 breasts from 6-week-old broilers were classified as non-defective, while the others exhibited WS lesions. Among the 6-week-old birds, the defective samples from the medium-sized birds (carcass weight ≤2.5 kg) showed a mild to moderate degree of WS with no altered meat quality. Some of the breasts from the 6-week-old birds with carcass weight above 2.5 kg exhibited WB accompanied by the WS condition. Apart from a reduction in protein content and increases in collagen content and pH values in the defective samples (p<0.05), no other impaired quality indices were detected within this group. All 7-week-old broilers yielded carcasses weighing above 2.5 kg and showed abnormal characteristics with progressive severity. The breasts affected by severe WS and WB showed the greatest cook loss, hardness, springiness, and chewiness (p<0.05). Development of WB significantly increased drip loss in the samples (p<0.05). Histology indicated necrotic events in the defective myofibers. Based on logistic regression, increasing percent breast weight by one unit raised the chance of WS and WB development with advanced severity by 50.9% and 61.0%, respectively. Delaying slaughter age from 6 to 7 weeks increased the likelihood of greater WS severity by 56.3%. Conclusion: Cases of WS and WB defects in Southeast Asia have been documented. Although severe WS and WB cases were few, such abnormal conditions significantly impaired the technological properties and nutritional quality of broiler breasts.

Migrant Multi-Cultural Family Women's Life Quality Related to Oral Health: Survey in Dae-Gu (다문화가족 이주여성의 구강건강관련 삶의 질: 대구지역 조사)

  • Jeon, Eun-Suk;An, Seo-Young;Choi, Yeon-Hee
    • Journal of dental hygiene science / v.11 no.3 / pp.181-187 / 2011
  • This study conducted oral examinations and individual interviews with migrant multi-cultural family women in Daegu and measured their socio-demographic characteristics, oral health conditions, and OHIP-14 scores to investigate the relationship between the oral health of migrant multi-cultural family women living in large cities and their quality of life. Based on data collected from 189 women, the t-test, ANOVA, and binary logistic regression analysis were conducted, and the conclusions are as follows. The average number of decayed teeth was 2.23, missing teeth 1.48, and treated teeth 5.58. Women from the Philippines had more missing teeth than those from other countries, and women from China had relatively few filled permanent teeth. Oral health-related quality of life was poorer as the number of missing teeth increased. A comparison of oral health-related quality of life by number of missing teeth showed that it was lowest in the areas of mental discomfort, decreased physical ability, decreased mental ability, decreased social ability, and social disadvantage. Oral health-related quality of life was also lower as the number of permanent teeth with decay experience increased and as monthly household income decreased, which shows that these two factors are the ones most closely related to oral health-related quality of life. Because the oral health-related quality of life of migrant multi-cultural family women declines with the number of missing and decayed teeth, it is necessary to develop a program to improve their oral health-related quality of life and to conduct follow-up research to verify its effect.

Low serum 25-hydroxyvitamin D levels, tooth loss, and the prevalence of severe periodontitis in Koreans aged 50 years and older

  • Kim, Hyunju;Shin, Min-Ho;Yoon, Suk-Ja;Kweon, Sun-Seog;Lee, Young-Hoon;Choi, Chang-Kyun;Kim, OkJoon;Kim, Young-Joon;Chung, HyunJu;Kim, Ok-Su
    • Journal of Periodontal and Implant Science / v.50 no.6 / pp.368-378 / 2020
  • Purpose: Vitamin D deficiency may cause bone loss and increased inflammation, which are well-known symptoms of periodontal disease. This study investigated whether serum 25-hydroxyvitamin D (25(OH)D) levels are associated with periodontal disease status and tooth loss. Methods: Cross-sectional data from 5,405 individuals aged ≥50 years (2,253 males and 3,152 females) were obtained from the 2008-2010 Dong-gu study, a prospective cohort study of risk factors for chronic diseases. Periodontal examinations were conducted to evaluate the number of remaining teeth, the periodontal probing depth (PPD), the clinical attachment level (CAL), and bleeding on probing. The percentages of sites with PPD ≥4 mm and CAL ≥4 mm were recorded for each participant. The severity of periodontitis was classified using the Centers for Disease Control and Prevention and the American Academy of Periodontology case definitions. Serum 25(OH)D levels were classified as reflecting severe deficiency, deficiency, insufficiency, or sufficiency. Multivariate linear regression analysis was performed to assess the associations of serum 25(OH)D levels with periodontal parameters and the number of remaining teeth after adjusting for confounders including age, smoking status, alcohol consumption status, month of blood collection, and physical activity. Multivariate logistic regression was used to evaluate the association between serum vitamin D levels and severe periodontitis. An overall statistical analysis and a stratified analysis by sex were performed. Results: Overall, the rates of severe deficiency, deficiency, insufficiency, and sufficiency were 6.5%, 67.9%, 22.4%, and 3.2%, respectively. After adjustment for confounders, vitamin D levels were directly associated with the number of remaining teeth, an association that was significant in males, but not in females. Sufficient serum 25(OH)D was associated with a low frequency of severe periodontitis. 
Conclusions: This population-based cross-sectional study indicates that low serum 25(OH)D is significantly associated with tooth loss and severe periodontitis in Koreans aged 50 years and older.

Receiver Operating Characteristic Analysis for Prediction of Postpartum Metabolic Diseases in Dairy Cows in an Organic Farm in Korea

  • Kim, Dohee;Choi, Woojae;Ro, Younghye;Hong, Leegon;Kim, Seongdae;Yoon, Ilsu;Choe, Eunhui;Kim, Danil
    • Journal of Veterinary Clinics / v.39 no.5 / pp.199-206 / 2022
  • Postpartum diseases should be predicted before calving to prevent productivity loss, especially on organic dairy farms. This study aimed to investigate the incidence of postpartum metabolic diseases on an organic dairy farm in Korea, to confirm the association between diseases and prepartum blood biochemical parameters, and to evaluate the accuracy of these parameters with a receiver operating characteristic (ROC) analysis for identifying vulnerable cows. Data were collected from 58 Holstein cows (16 primiparous and 42 multiparous) that calved over a 2-year period on an organic farm. During the transition period from 4 weeks prepartum to 4 weeks postpartum, blood was collected every 2 weeks for biochemical analysis, together with a physical examination. Thirty-one (53.4%) cows (9 primiparous and 22 multiparous) were diagnosed with at least one postpartum disease. The incidence was 27.6% for subclinical ketosis, 22.4% for subclinical hypocalcemia, 12.1% for retained placenta, 10.3% for displaced abomasum, and 5.2% for clinical ketosis. Between cows with at least one disease and cows with no disease, there were significant differences in the prepartum levels of body condition score (BCS), non-esterified fatty acid (NEFA), total bilirubin (T-bil), direct bilirubin (D-bil), and the NEFA to total cholesterol (T-chol) ratio (p < 0.05). The ROC analysis of each of these prepartum parameters individually yielded an area under the curve (AUC) <0.7. However, the ROC analysis with logistic regression including all these parameters revealed a higher AUC (0.769), with a sensitivity of 71.0% and a specificity of 77.8%. The ROC analysis with logistic regression including the prepartum BCS, NEFA, T-bil, D-bil, and NEFA to T-chol ratio can be used to identify cows that are vulnerable to postpartum diseases with moderate accuracy.
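
The ROC quantities reported above can be illustrated with a small sketch. The labels and predicted probabilities below are made up, not the study's data, and the function names are hypothetical. AUC is computed as the fraction of (diseased, healthy) pairs in which the diseased animal receives the higher score, with ties counted as one half.

```python
def auc(labels: list, scores: list) -> float:
    """Area under the ROC curve via pairwise comparison of scores."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def sens_spec(labels: list, scores: list, threshold: float) -> tuple:
    """Sensitivity and specificity when predicting 1 for score >= threshold."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical cows: 1 = at least one postpartum disease, 0 = none.
labels = [1, 1, 1, 0, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.7, 0.4, 0.3, 0.2, 0.1]  # e.g. logistic regression outputs

area = auc(labels, scores)
sensitivity, specificity = sens_spec(labels, scores, threshold=0.5)
```

Sensitivity and specificity depend on the chosen threshold, whereas AUC summarizes performance across all thresholds, which is why the abstract reports all three.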