• Title/Summary/Keyword: Regression testing

Search Result 707, Processing Time 0.025 seconds

Use of a multinomial logistic regression model to evaluate risk factors for porcine circovirus type 2 infection on pig farms in the Republic of Korea

  • Kim, Eu-Tteum;Pak, Son-Il
    • Journal of Preventive Veterinary Medicine
    • /
    • v.41 no.3
    • /
    • pp.129-132
    • /
    • 2017
  • The current study identified risk factors associated with porcine circovirus type 2 (PCV2) infection on pig farms in the Republic of Korea using a multinomial logistic regression model to evaluate the PCV2 infection status of pigs at different growth stages. Compulsory disinfection of visitors (odds ratio [OR]: 0.019, 95% confidence interval [CI]: <0.001-0.378, p=0.0095), compulsory registration of visitors (OR: 0.002, 95% CI: <0.001-0.184, p=0.0070), regular blood testing (OR: 0.012, 95% CI: <0.001-0.157, p=0.0007), and running on-farm biosecurity learning programs for workers (OR: 0.156, 95% CI: 0.040-0.604, p=0.0072 and OR: 0.201, 95% CI: 0.055-0.737, p=0.0155, respectively) were identified as factors which could reduce the risk of PCV2 infection. However, visitation by a regular veterinarian (OR: 32.733, 95% CI: 3.768-284.327, p=0.0016) was associated with PCV2 infection.

Determinants of the Small and Medium Enterprises Progress: A Case Study of SME Entrepreneurs in Manado, Indonesia

  • PRAMONO, Rudy;SONDAKH, L.W.;BERNARTO, Innocentius;JULIANA, Juliana;PURWANTO, Agus
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.1
    • /
    • pp.881-889
    • /
    • 2021
  • The purpose of this study is to descriptively reveal the demographic and business profile and personal-entrepreneurial characteristics in Manado, the capital of North Sulawesi, and secondly to associate these profiles and characters to their business progress. A sample size of 21 respondents was drawn - selected from those who warmly welcomed the interviewers for an open-ended structured questionnaire. SPSS 24 has been employed to descriptively reveal the sample distribution according to demographic factors and business entities and to determine the dominant factors affecting the progress of the business by testing the hypothesis on the association of variables under study using specified statistical analytical tools, such as regression analysis, especially stepwise regression formula, between specified dependent variables and independent variables and /or between all variables. The stepwise regression analysis has enabled the researcher to determine which variables are the most important reflecting the personal characteristics theorized as "locus of control": self-efficacy, needs for achievement, personal traits, and barriers to business progress The analysis reveals that the progress of business does have an association and is dependent on the source of capital and education, needs for achievement and locus of control.

Revisiting the Effect of Financial Elements on Stock Performance Using Corporate Social Responsibility Cost Growth

  • JOUHA, Faraj;ALBAKAY, Khalleefah;GHOZALI, Imam;HARTO, Puji
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.1
    • /
    • pp.767-780
    • /
    • 2021
  • The purpose of this research is to analyze the effect of financial elements (asset growth, liability growth, equity growth, revenue growth, and profit growth) on stock price performance and to analyze the growth of Corporate Social Responsibility (CSR) costs as a moderating effect. The technique analysis used is regression analysis. Samples in this analysis are manufacturing firms listed on the Indonesian Stock Exchange (IDX) for the period 2014-2018. The use of regression models for hypothesis testing must fulfill several applicable assumptions such as Normality Test, Heteroscedasticity Test, Multicollinearity Test, Autocorrelation Test, Model Fit Test, Determination Coefficient Test, and Hypothesis Test. Data analysis used two research models, namely model 1 and model 2. Model 1 is without the moderating variable, and model 2 is with the moderating variable, that is, CSR cost growth. Based on the result of the regression analysis, it can be inferred that the asset, revenue, and profit growth have a positive impact on stock price results. Liabilities and equity growth do not affect stock price performance. Operating expense growth has a significant effect on price performance. CSR cost growth can moderate the effect of growth in financial statement elements on stock price performance but is not significant.

Application of Regularized Linear Regression Models Using Public Domain data for Cycle Life Prediction of Commercial Lithium-Ion Batteries (상업용 리튬 배터리의 수명 예측을 위한 고속대량충방전 데이터 정규화 선형회귀모델의 적용)

  • KIM, JANG-GOON;LEE, JONG-SOOK
    • Journal of Hydrogen and New Energy
    • /
    • v.32 no.6
    • /
    • pp.592-611
    • /
    • 2021
  • In this study a rarely available high-throughput cycling data set of 124 commercial lithium iron phosphate/graphite cells cycled under fast-charging conditions, with widely varying cycle lives ranging from 150 to 2,300 cycles including in-cycle temperature and per-cycle IR measurements. We worked out own Python codes which reproduced the various data plots and machine learning approaches for cycle life prediction using early cycles and more details not presented in the article and the supplementary information. Particularly, we applied regularized ridge, lasso and elastic net linear regression models using features extracted from capacity fade curves, discharge voltage curves, and other data such as internal resistance and cell can temperature. We found that due to the limitation in the quantity and quality of the data from costly and lengthy battery testing a careful hyperparameter tuning may be required and that model features need to be extracted based on the domain knowledge.

Healthcare Systems and COVID-19 Mortality in Selected OECD Countries: A Panel Quantile Regression Analysis

  • Jalil Safaei;Andisheh Saliminezhad
    • Journal of Preventive Medicine and Public Health
    • /
    • v.56 no.6
    • /
    • pp.515-522
    • /
    • 2023
  • Objectives: The pandemic caused by coronavirus disease 2019 (COVID-19) has exerted an unprecedented impact on the health of populations worldwide. However, the adverse health consequences of the pandemic in terms of infection and mortality rates have varied across countries. In this study, we investigate whether COVID-19 mortality rates across a group of developed nations are associated with characteristics of their healthcare systems, beyond the differential policy responses in those countries. Methods: To achieve the study objective, we distinguished healthcare systems based on the extent of healthcare decommodification. Using available daily data from 2020, 2021, and 2022, we applied quantile regression with non-additive fixed effects to estimate mortality rates across quantiles. Our analysis began prior to vaccine development (in 2020) and continued after the vaccines were introduced (throughout 2021 and part of 2022). Results: The findings indicate that higher testing rates, coupled with more stringent containment and public health measures, had a significant negative impact on the death rate in both pre-vaccination and post-vaccination models. The data from the post-vaccination model demonstrate that higher vaccination rates were associated with significant decreases in fatalities. Additionally, our research indicates that countries with healthcare systems characterized by high and medium levels of decommodification experienced lower mortality rates than those with healthcare systems involving low decommodification. Conclusions: The results of this study indicate that stronger public health infrastructure and more inclusive social protections have mitigated the severity of the pandemic's adverse health impacts, more so than emergency containment measures and social restrictions.

Machine learning-based analysis and prediction model on the strengthening mechanism of biopolymer-based soil treatment

  • Haejin Lee;Jaemin Lee;Seunghwa Ryu;Ilhan Chang
    • Geomechanics and Engineering
    • /
    • v.36 no.4
    • /
    • pp.381-390
    • /
    • 2024
  • The introduction of bio-based materials has been recommended in the geotechnical engineering field to reduce environmental pollutants such as heavy metals and greenhouse gases. However, bio-treated soil methods face limitations in field application due to short research periods and insufficient verification of engineering performance, especially when compared to conventional materials like cement. Therefore, this study aimed to develop a machine learning model for predicting the unconfined compressive strength, a representative soil property, of biopolymer-based soil treatment (BPST). Four machine learning algorithms were compared to determine a suitable model, including linear regression (LR), support vector regression (SVR), random forest (RF), and neural network (NN). Except for LR, the SVR, RF, and NN algorithms exhibited high predictive performance with an R2 value of 0.98 or higher. The permutation feature importance technique was used to identify the main factors affecting the strength enhancement of BPST. The results indicated that the unconfined compressive strength of BPST is affected by mean particle size, followed by biopolymer content and water content. With a reliable prediction model, the proposed model can present guidelines prior to laboratory testing and field application, thereby saving a significant amount of time and money.

A Study on the Insolvency Prediction Model for Korean Shipping Companies

  • Myoung-Hee Kim
    • Journal of Navigation and Port Research
    • /
    • v.48 no.2
    • /
    • pp.109-115
    • /
    • 2024
  • To develop a shipping company insolvency prediction model, we sampled shipping companies that closed between 2005 and 2023. In addition, a closed company and a normal company with similar asset size were selected as a paired sample. For this study, data of a total of 82 companies, including 42 closed companies and 42 general companies, were obtained. These data were randomly divided into a training set (2/3 of data) and a testing set (1/3 of data). Training data were used to develop the model while test data were used to measure the accuracy of the model. In this study, a prediction model for Korean shipping insolvency was developed using financial ratio variables frequently used in previous studies. First, using the LASSO technique, main variables out of 24 independent variables were reduced to 9. Next, we set insolvent companies to 1 and normal companies to 0 and fitted logistic regression, LDA and QDA model. As a result, the accuracy of the prediction model was 82.14% for the QDA model, 78.57% for the logistic regression model, and 75.00% for the LDA model. In addition, variables 'Current ratio', 'Interest expenses to sales', 'Total assets turnover', and 'Operating income to sales' were analyzed as major variables affecting corporate insolvency.

Using multivariate regression and multilayer perceptron networks to predict soil shear strength parameters

  • Ahmed Cemiloglu
    • Geomechanics and Engineering
    • /
    • v.39 no.2
    • /
    • pp.129-142
    • /
    • 2024
  • The most significant soil parameters that are utilized in geotechnical engineering projects' design and implementations are soil strength parameters including friction (ϕ), cohesion (c), and uniaxial compressive strength (UCS). Understanding soil shear strength parameters can be guaranteed the design success and stability of structures. In this regard, professionals always looking for ways to get more accurate estimations. The presented study attempted to investigate soil shear strength parameters by using multivariate regression and multilayer perceptron predictive models which were implemented on 100 specimens' data collected from the Tabriz region (NW of Iran). The uniaxial (UCS), liquid limit (LL), plasticity index (PI), density (γ), percentage of fine-grains (pass #200), and sand (pass #4) which are used as input parameters of analysis and shear strength parameters predictions. A confusion matrix was used to validate the testing and training data which is controlled by the coefficient of determination (R2), mean absolute (MAE), mean squared (MSE), and root mean square (RMSE) errors. The results of this study indicated that MLP is able to predict the soil shear strength parameters with an accuracy of about 93.00% and precision of about 93.5%. In the meantime, the estimated error rate is MAE = 2.0231, MSE = 2.0131, and RMSE = 2.2030. Additionally, R2 is evaluated for predicted and measured values correlation for friction angle, cohesion, and UCS are 0.914, 0.975, and 0.964 in the training dataset which is considerable.

A Comparison of Statistical Prediction Models in Household Water End-Uses (가정용수의 수요량 예측을 위한 통계적 모형 비교)

  • Myoung, Sung-Min;Lee, Doo-Jin;Kim, Hwa-Soo;Jo, Jin-Nam
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.4
    • /
    • pp.567-573
    • /
    • 2011
  • This study develops a predictive model for household water end-uses based on data that have measured household characteristics, housing characteristics and other items, surveyed over 3 years in Korea. However, the measured data was left-skewed and it was not fitted to normal distribution. The parameter estimate were biased when using a multiple regression model. In addition, the results of the testing for the model were usually of significance due to the tiny residual from a large number of observations. In order to solve the problem, we suggested log-normal regression model and Weibull regression model as alternatives. The results of this study can be utilized in the planning stages of water and waste water facilities.

A Study on the Socio-economic Characteristics of the Angler Population and the Estimation of A Fishing Frequency Function (유어낚시인구의 사회경제학적 특성과 출조빈도함수의 추정에 관한 연구)

  • Park Cheol-Hyung
    • The Journal of Fisheries Business Administration
    • /
    • v.36 no.1 s.67
    • /
    • pp.81-101
    • /
    • 2005
  • This article is to estimate the fishing frequency function in Korean recreational fishery with respect to socio-economic characteristics of anglers. First, the study described the characteristics of the entire angler population on the view points of 9 socio-economic variables. And then, the study divided the total angler population into three groups of in-land, sea, and mixed angler populations in order to investigate the differences in their characteristics. The study could confirm the existence of differences in regions, size of regions, and educational levels between the in - land and the sea angler populations by testing heterogeneity in the frequency table. The fishing frequency function is estimated using Poisson regression model in order to accomodate the count data(non-negative discrete random variable) aspects of the fishing frequency. However, the model specification error is found due to overdispersion of data. The model exhibits the lack of goodness of fit. The negative binomial regression model is adopted to cure the overdispersion of the data as an alternative estimation methodology. Finally, the study can confirm overdispersion does not exist in the model any more and the goodness of fit improved significantly to the reasonable level. The results of estimation of fishing frequency population modeled by the negative binomial regression models are following. The three variables of region, sex, and education have effects on the decision making process of fishing frequency in the case of in-land recreation fishery. On the other hand, the three variables of sex, age, and marriage status do the same job in the case of sea angler population. Among the left-over variables, both income and use of Internet variables now affect on the process in mixed angler population. Finally, the results of whole angler population show that all of the previous variables are proven to be statistically significant due to the summation of data with all three sub-groups of angler population.

  • PDF