• Title/Summary/Keyword: 로지스틱모델

Search Result 240, Processing Time 0.021 seconds

Can Housing Prices Be an Alternative to a Census-based Deprivation Index? An Evaluation Based on Multilevel Modeling (주택가격이 센서스에 기반한 박탈지수의 대안이 될 수 있는가?: 다수준 모델에 기반한 평가)

  • Sohn, Chul;Nakaya, Tomoki
    • Journal of Cadastre & Land InformatiX
    • /
    • v.48 no.2
    • /
    • pp.197-211
    • /
    • 2018
  • We conducted this research to examine how well regional housing prices are suited to use as an alternative to conventional census-based regional deprivation indices in health and medical geography studies. To examine the relative performance of mean regional housing prices compared to conventional census-based regional deprivation indices, we compared several multilevel logistic regression models, where the first level was individuals and the second was health districts in the Seoul Metropolitan Area (SMA) in Korea, for the sake of adjusting the regional clustering tendency of unknown factors. In these models, we predicted two dichotomous variables that represented individuals' after-lunch tooth brushing behavior and use of dental floss by individual characteristics and regional indices. Then, we compared the relative predictive performance of the models using the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC). The results from the estimations showed that mean regional housing prices and census-based deprivation indices were correlated with the two types of dental health behavior in a statistical sense. The results also revealed that the model with mean regional housing prices showed smaller AIC and BIC compared with other models with conventional census-based deprivation indices. These results imply that it is possible for housing prices summarized using aerial units to be used as an alternative to conventional census-based deprivation indices when the census variables employed cannot properly reflect the characteristics of the aerial units.

Corporate Bankruptcy Prediction Model using Explainable AI-based Feature Selection (설명가능 AI 기반의 변수선정을 이용한 기업부실예측모형)

  • Gundoo Moon;Kyoung-jae Kim
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.2
    • /
    • pp.241-265
    • /
    • 2023
  • A corporate insolvency prediction model serves as a vital tool for objectively monitoring the financial condition of companies. It enables timely warnings, facilitates responsive actions, and supports the formulation of effective management strategies to mitigate bankruptcy risks and enhance performance. Investors and financial institutions utilize default prediction models to minimize financial losses. As the interest in utilizing artificial intelligence (AI) technology for corporate insolvency prediction grows, extensive research has been conducted in this domain. However, there is an increasing demand for explainable AI models in corporate insolvency prediction, emphasizing interpretability and reliability. The SHAP (SHapley Additive exPlanations) technique has gained significant popularity and has demonstrated strong performance in various applications. Nonetheless, it has limitations such as computational cost, processing time, and scalability concerns based on the number of variables. This study introduces a novel approach to variable selection that reduces the number of variables by averaging SHAP values from bootstrapped data subsets instead of using the entire dataset. This technique aims to improve computational efficiency while maintaining excellent predictive performance. To obtain classification results, we aim to train random forest, XGBoost, and C5.0 models using carefully selected variables with high interpretability. The classification accuracy of the ensemble model, generated through soft voting as the goal of high-performance model design, is compared with the individual models. The study leverages data from 1,698 Korean light industrial companies and employs bootstrapping to create distinct data groups. Logistic Regression is employed to calculate SHAP values for each data group, and their averages are computed to derive the final SHAP values. The proposed model enhances interpretability and aims to achieve superior predictive performance.

Change Prediction of Future Forestland Area by Transition of Land Use Types in South Korea (로지스틱 회귀모형을 이용한 우리나라 산지면적의 공간변화 예측에 관한 연구)

  • KWAK, Doo-Ahn;PARK, So-Hee
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.24 no.4
    • /
    • pp.99-112
    • /
    • 2021
  • This study was performed to predict spatial change of future forestland area in South Korea at regional level for supporting forest-related plans established by local governments. In the study, land use was classified to three types which are forestland, agricultural land, and urban and other lands. A logistic regression model was developed using transitional interaction between each land use type and topographical factors, land use restriction factors, socioeconomic indices, and development infrastructures. In this model, change probability from a target land use type to other land use types was estimated using raster dataset(30m×30m) for each variable. With priority order map based on the probability of land use change, the total annual amount of land use change was allocated to the cells in the order of the highest transition potential for the spatial analysis. In results, it was found that slope degree and slope standard value by the local government were the main factors affecting the probability of change from forestland to urban and other land. Also, forestland was more likely to change to urban and other land in the conditions of a more gentle slope, lower slope criterion allowed to developed, and higher land price and population density. Consequently, it was predicted that forestland area would decrease by 2027 due to the change from forestland to urban and others, especially in metropolitan and major cities, and that forestland area would increase between 2028 and 2050 in the most local provincial cities except Seoul, Gyeonggi-do, and Jeju Island due to locality extinction with decline in population. Thus, local government is required to set an adequate forestland use criterion for balanced development, reasonable use and conservation, and to establish the regional forest strategies and policies considering the future land use change trends.

A Study of the Nonlinear Characteristics Improvement for a Electronic Scale using Multiple Regression Analysis (다항식 회귀분석을 이용한 전자저울의 비선형 특성 개선 연구)

  • Chae, Gyoo-Soo
    • Journal of Convergence for Information Technology
    • /
    • v.9 no.6
    • /
    • pp.1-6
    • /
    • 2019
  • In this study, the development of a weight estimation model of electronic scale with nonlinear characteristics is presented using polynomial regression analysis. The output voltage of the load cell was measured directly using the reference mass. And a polynomial regression model was obtained using the matrix and curve fitting function of MS Office Excel. The weight was measured in 100g units using a load cell electronic scale measuring up to 5kg and the polynomial regression model was obtained. The error was calculated for simple($1^{st}$), $2^{nd}$ and $3^{rd}$ order polynomial regression. To analyze the suitability of the regression function for each model, the coefficient of determination was presented to indicate the correlation between the estimated mass and the measured data. Using the third order polynomial model proposed here, a very accurate model was obtained with a standard deviation of 10g and the determinant coefficient of 1.0. Based on the theory of multi regression model presented here, it can be used in various statistical researches such as weather forecast, new drug development and economic indicators analysis using logistic regression analysis, which has been widely used in artificial intelligence fields.

Learning Data Model Definition and Machine Learning Analysis for Data-Based Li-Ion Battery Performance Prediction (데이터 기반 리튬 이온 배터리 성능 예측을 위한 학습 데이터 모델 정의 및 기계학습 분석 )

  • Byoungwook Kim;Ji Su Park;Hong-Jun Jang
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.3
    • /
    • pp.133-140
    • /
    • 2023
  • The performance of lithium ion batteries depends on the usage environment and the combination ratio of cathode materials. In order to develop a high-performance lithium-ion battery, it is necessary to manufacture the battery and measure its performance while varying the cathode material ratio. However, it takes a lot of time and money to directly develop batteries and measure their performance for all combinations of variables. Therefore, research to predict the performance of a battery using an artificial intelligence model has been actively conducted. However, since measurement experiments were conducted with the same battery in the existing published battery data, the cathode material combination ratio was fixed and was not included as a data attribute. In this paper, we define a training data model required to develop an artificial intelligence model that can predict battery performance according to the combination ratio of cathode materials. We analyzed the factors that can affect the performance of lithium-ion batteries and defined the mass of each cathode material and battery usage environment (cycle, current, temperature, time) as input data and the battery power and capacity as target data. In the battery data in different experimental environments, each battery data maintained a unique pattern, and the battery classification model showed that each battery was classified with an error of about 2%.

Predictions of PD-L1 Expression Based on CT Imaging Features in Lung Squamous Cell Carcinoma (편평세포폐암에서 CT 영상 소견을 이용한 PD-L1 발현 예측)

  • Seong Hee Yeo;Hyun Jung Yoon;Injoong Kim;Yeo Jin Kim;Young Lee;Yoon Ki Cha;So Hyeon Bak
    • Journal of the Korean Society of Radiology
    • /
    • v.85 no.2
    • /
    • pp.394-408
    • /
    • 2024
  • Purpose To develop models to predict programmed death ligand 1 (PD-L1) expression in pulmonary squamous cell carcinoma (SCC) using CT. Materials and Methods A total of 97 patients diagnosed with SCC who underwent PD-L1 expression assay were included in this study. We performed a CT analysis of the tumors using pretreatment CT images. Multiple logistic regression models were constructed to predict PD-L1 positivity in the total patient group and in the 40 advanced-stage (≥ stage IIIB) patients. The area under the receiver operating characteristic curve (AUC) was calculated for each model. Results For the total patient group, the AUC of the 'total significant features model' (tumor stage, tumor size, pleural nodularity, and lung metastasis) was 0.652, and that of the 'selected feature model' (pleural nodularity) was 0.556. For advanced-stage patients, the AUC of the 'selected feature model' (tumor size, pleural nodularity, pulmonary oligometastases, and absence of interstitial lung disease) was 0.897. Among these factors, pleural nodularity and pulmonary oligometastases had the highest odds ratios (8.78 and 16.35, respectively). Conclusion Our model could predict PD-L1 expression in patients with lung SCC, and pleural nodularity and pulmonary oligometastases were notable predictive CT features of PD-L1.

Risk Factors for Binge-eating and Food Addiction : Analysis with Propensity-Score Matching and Logistic Regression (폭식행동 및 음식중독의 위험요인 분석: 성향점수매칭과 로지스틱 회귀모델을 이용한 분석)

  • Jake Jeong;Whanhee Lee;Jung In Choi;Young Hye Cho;Kwangyeol Baek
    • Journal of the Korean Applied Science and Technology
    • /
    • v.40 no.4
    • /
    • pp.685-698
    • /
    • 2023
  • This study aimed to identify binge-eating behavior and food addiction in Korean population and to determine their associations with obesity, eating behaviors, mental health and cognitive characteristics. We collected clinical questionnaire scores related to eating problems (e.g. binge eating, food addiction, food cravings), mental health (e.g. depression), and cognitive functions (e.g. impulsivity, emotion regulation) in 257 Korean adults in the normal and the obese weight ranges. Binge-eating and food addiction were most frequent in obese women (binge-eating: 46.6%, food addiction: 29.3%) when we divided the participants into 4 groups depending on gender and obesity status. The independence test using the data with propensity score matching confirmed that binge-eating and food addiction were more prevalent in obese individuals. Finally, we constructed the logistic regression models using forward selection method to evaluate the influence of various clinical questionnaire scores on binge-eating and food addiction respectively. Binge-eating was significantly associated with the clinical scales of eating disorders, food craving, state anxiety, and emotion regulation (cognitive reappraisal) as well as food addiction. Food addiction demonstrated the significant effect of food craving, binge-eating, the interaction of obesity and age, and years of education. In conclusion, we found that binge-eating and food addiction are much more frequent in females and obese individuals. Both binge-eating and food addiction commonly involved eating problems (e.g. food craving), but there was difference in mental health and cognitive risk factors. Therefore, it is required to distinguish food addiction from binge-eating and investigate intrinsic and environmental risk factors for each pathology.

Factors Influencing Adolescent Lifetime Smoking and Current Smoking in South Korea: Using data from the 10th (2014) Korea Youth Risk Behavior Web-Based Survey (청소년의 평생 흡연 및 현재 흡연 영향요인: 제10차(2014년) 청소년건강행태온라인조사 통계를 이용하여)

  • Gwon, Seok Hyun;Jeong, Suyong
    • Journal of Korean Academy of Nursing
    • /
    • v.46 no.4
    • /
    • pp.552-561
    • /
    • 2016
  • Purpose: The purpose of this study was to investigate factors influencing lifetime smoking and current smoking among adolescents in South Korea. Methods: Hierarchical logistic regression was conducted based on complex sample analysis using statistics from the 10th (2014) Korea Youth Risk Behavior Web-Based Survey. The study sample comprised 72,060 adolescents aged 12 to 18. Results: The significant factors influencing adolescent lifetime smoking were female gender, older age, higher stress, higher weekly allowance, lower economic status, living apart from parents, parental smoking, sibling smoking, peer smoking, observation of school personnel smoking, and coed school compared to boys' school. The significant factors influencing adolescent current smoking were female gender, older age, higher stress, higher weekly allowance, both higher and lower economic status compared to middle economic status, living apart from parents, parental smoking, sibling smoking, peer smoking, observation of school personnel smoking, and coed school compared to boys' school. Conclusion: Factors identified in this study need to be considered in programs directed at prevention of adolescent smoking and smoking cessation programs, as well as policies.

Style for the Journal of Korean Contents Relation between BMI and Suicide Ideation in Adult : Using Data from the Korea Health Panel 2009~2013 (성인의 체질량지수(BMI)와 자살생각의 관계 -2009~2013년 한국의료패널자료를 활용한 연구-)

  • Lee, Jong-Ik
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.2
    • /
    • pp.616-625
    • /
    • 2018
  • The purpose of this study was to analyze the relationship between the BMI and suicidal ideation using Korean Health Panel data from 2009 to 2013 to identify risk factors for suicide. We conducted a logistic regression analysis using the R statistical package to analyze the relationship between the BMI and the suicidal ideation. The results of this study show that all models with BMI had a statistically significant as a significant variable. It was found that the obese group was more likely to suicide ideation than the other groups. Based on these results, we try to find social implications for suicide prevention and intervention.

Diagnosis and Scheduling Agent Systems for Collaborative Learning (협력학습을 위한 진단과 스케줄링 에이전트 시스템)

  • 한선관
    • Journal of Intelligence and Information Systems
    • /
    • v.6 no.1
    • /
    • pp.83-96
    • /
    • 2000
  • 본연구는 웹 상에서 원격 협력 학습을 위한 수준별 협력 학습자 진단 및 스케줄링 에이전트의 설계와 구현에 관한 연구이다. 원격 협력 학습은 동일한 학습내용에 흥미를 갖는 아동이 동시에 학습할 수 있는 환경이 필요하며 학습자의 지식 또한 비슷한 수준이어야 효과적인 협력학습을 할 수 있다. 분산 환경의 이질적인 학습자를 모으기 위해서는 좀 더 자율적이고 지능적인 시스템이 필요하며 학습자에 대한 지식을 표현하는 학습자 모델이 요구된다. 이를 위해 에이전트 시스템이 적절하게 사용될수 있으며 학습자의 수준을 판단하기 위한 진단 에이전트와 협력학습이 가능한 여러명의 학습자들을 알맞은 시간과 서버에 연결하는 스케줄링 에이전트를 웹 기반 지능형 교수 시스템에 접목하였다. 학습자 수준을 진단하는 진단 에이전트는 확신도를 높이기 위해 3-모수 로지스틱 확신공식과 시간 가중치 확신인자 공식을 적용하여 신뢰도를 높였다 또한 협력학습의 스케줄링을 위해 다양한 제약조건들의 최적해를 구하기 위해 제약 만족 문제(CSP)로 스케줄링 에이전트를 모델링하였다 본 연구에서 설계 구현한 협력학습자 진단 및 스케줄링 에이전트의 효율성을 살펴보기 위해 여러명의 학습자를 대상으로 실험하였다. 실험을 통해 각 학습자의 지식 수준 진단과 다수의 학습자가 적절한 협력학습을 하기 위한 스케줄링이 매우 효율적으로 이루어짐을 볼 수 있었다.

  • PDF