• Title/Summary/Keyword: Multi-Collinearity Test

Search Result 10, Processing Time 0.021 seconds

Development of Formative Constructs and Measurements for Performance Evaluation of Information Systems (정보시스템 성과평가를 위한 형성적 구성변수(Constructs) 및 측정지표 개발)

  • Kim, Sanghoon;Kim, Changkyu
    • Journal of Information Technology Services
    • /
    • v.11 no.4
    • /
    • pp.135-151
    • /
    • 2012
  • Traditionally in IS studies, the relationship between construct and its measurement items tends to be assumed to be reflective, meaning that the measurements are a reflection of the construct. In reality, however, the nature of the construct can be often formative, which means that its measurement items describe and define the construct rather than vice versa. The purpose of this study was to investigate theoretical and empirically-analysed differences between formative construct and reflective construct through comprehensive interdisciplinary literature review. And then on the basis of these differences, we intended to derive the rule of specifying whether the construct is formative or reflective and propose the methodology of testing the validity(content validity, construct validity, internal consistency and external construct) of formative construct and its measurements, differentiated from that in the case of reflective construct. Also, we suggested the concrete statistical testing methods such as VTT(Vanishing Tetrad Test), MIMIC(Multiple Indicators and Multiple Causes) test and multi-collinearity test. In order to examine the applicability of this methodology to developing the constructs for performance evaluation of IS(Information Systems), we tried to identify its attribute(formative or reflective) and test the validity for the construct arbitrarily chosen among them which had been derived in our previous IS performance evaluation study by using this methodology. The result of the examination was that the methodology proposed in this study was significantly valid and effective in the area of IS performance evaluation.

Multivariate Procedure for Variable Selection and Classification of High Dimensional Heterogeneous Data

  • Mehmood, Tahir;Rasheed, Zahid
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.6
    • /
    • pp.575-587
    • /
    • 2015
  • The development in data collection techniques results in high dimensional data sets, where discrimination is an important and commonly encountered problem that are crucial to resolve when high dimensional data is heterogeneous (non-common variance covariance structure for classes). An example of this is to classify microbial habitat preferences based on codon/bi-codon usage. Habitat preference is important to study for evolutionary genetic relationships and may help industry produce specific enzymes. Most classification procedures assume homogeneity (common variance covariance structure for all classes), which is not guaranteed in most high dimensional data sets. We have introduced regularized elimination in partial least square coupled with QDA (rePLS-QDA) for the parsimonious variable selection and classification of high dimensional heterogeneous data sets based on recently introduced regularized elimination for variable selection in partial least square (rePLS) and heterogeneous classification procedure quadratic discriminant analysis (QDA). A comparison of proposed and existing methods is conducted over the simulated data set; in addition, the proposed procedure is implemented to classify microbial habitat preferences by their codon/bi-codon usage. Five bacterial habitats (Aquatic, Host Associated, Multiple, Specialized and Terrestrial) are modeled. The classification accuracy of each habitat is satisfactory and ranges from 89.1% to 100% on test data. Interesting codon/bi-codons usage, their mutual interactions influential for respective habitat preference are identified. The proposed method also produced results that concurred with known biological characteristics that will help researchers better understand divergence of species.

Prevalence and risk factors of helminth infections in cattle of Bangladesh

  • Rahman, A.K.M.A.;Begum, N.;Nooruddin, M.;Rahman, Md. Siddiqur;Hossain, M.A.;Song, Hee-Jong
    • Korean Journal of Veterinary Service
    • /
    • v.32 no.3
    • /
    • pp.265-273
    • /
    • 2009
  • A cross-sectional survey was undertaken to identify risk factors and clinical signs associated with parasitic helminth infections of cattle in Mymensignh district of Bangladesh. A nonrandom convenience sampling method was used to select 138 animals from 40 farmers/herds. The eggs per gram of faeces (epg) for nematodes and trematodes were determined by McMaster and Stoll's methods respectively. Animal-level and herd-level data were recorded by means of a questionnaire. Multi-collinearity amongst explanatory variables were assessed using $2{\times}2{\times}\;X^2$ test and one variable in a pair was dropped if $P{\leq}0.05$ formultiple logistic regression models. Association study between outcome and explanatory variables was conducted using classification tree, random forests and multiple logistic regression. A positive epg was considered as infected. Analyses were performed using $STATA^{(R)}$, version 8.0/Intercooled and $R^{(R)}$, Version 2.3.0. Seventy eight percent of the cattle were found to be infected with at least one type of helminth. Twenty four pairs of combinations of explanatory variables showed significant associations. Male animals (OR=3.3, P=.006, 95% CI=1.4, 7.7) were associated with significantly increased prevalence of nematode infection. Female cattle of the study area are mostly cross-breed, kept indoor, fed relatively good diet and not used for draught purpose. Males are used for draught purpose thereby more exposed to nematode infective stage and provided with relatively poor diet. So stressed male cattle may become more susceptible to nematode infection. All of the three statistical techniques selected gender and lumen motility as most important variables in association with nematode infection in cattle. The result of this survey can only be extrapolated to the periurban cattle population of traditional management system.

Estimation of drift force by real ship using multiple regression analysis (다중회귀분석에 의한 실선의 표류력 추정)

  • AHN, Jang-Young;KIM, Kwang-il;KIM, Min-Son;LEE, Chang-Heon
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.57 no.3
    • /
    • pp.236-245
    • /
    • 2021
  • In this study, a drifting test using a experimental vessel (2,966 tons) in the northern waters of Jeju was carried out for the first time in order to obtain the fundamental data for drift. During the test, it was shown that the average leeway speed and direction by GPS position were 0.362 m/s and 155.54° respectively and the leeway rate for wind speed was 8.80%. The analysis of linear regression modes about leeway speed and direction of the experimental vessel indicated that wind or current (i.e. explanatory variable) had a greater influence upon response variable (e.g. leeway speed or direction) with the speed of the wind and current rather than their directions. On the other hand, the result of multiple regression model analysis was able to predict that the direction was negative, and it was demonstrated that predicted values of leeway speed and direction using an experimental vessel is to be more influential by current than wind while the leeway speed through variance and covariance was positive. In terms of the leeway direction of the experimental vessel, the same result of the leeway speed appeared except for a possibility of the existence of multi-collinearity. Then, it can be interpreted that the explanatory variables were less descriptive in the predicted values of the leeway direction. As a result, the prediction of leeway speed and direction can be demonstrated as following equations. Ŷ1= 0.4031-0.0032X1+0.0631X2-0.0010X3+0.4110X4 Ŷ2= 0.4031-0.6662X1+27.1955X2-0.6787X3-420.4833X4 However, many drift tests using actual vessels and various drifting objects will provide reasonable estimations, so that they can help search and rescue fishing gears as well.

Corporate Default Prediction Model Using Deep Learning Time Series Algorithm, RNN and LSTM (딥러닝 시계열 알고리즘 적용한 기업부도예측모형 유용성 검증)

  • Cha, Sungjae;Kang, Jungseok
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.1-32
    • /
    • 2018
  • In addition to stakeholders including managers, employees, creditors, and investors of bankrupt companies, corporate defaults have a ripple effect on the local and national economy. Before the Asian financial crisis, the Korean government only analyzed SMEs and tried to improve the forecasting power of a default prediction model, rather than developing various corporate default models. As a result, even large corporations called 'chaebol enterprises' become bankrupt. Even after that, the analysis of past corporate defaults has been focused on specific variables, and when the government restructured immediately after the global financial crisis, they only focused on certain main variables such as 'debt ratio'. A multifaceted study of corporate default prediction models is essential to ensure diverse interests, to avoid situations like the 'Lehman Brothers Case' of the global financial crisis, to avoid total collapse in a single moment. The key variables used in corporate defaults vary over time. This is confirmed by Beaver (1967, 1968) and Altman's (1968) analysis that Deakins'(1972) study shows that the major factors affecting corporate failure have changed. In Grice's (2001) study, the importance of predictive variables was also found through Zmijewski's (1984) and Ohlson's (1980) models. However, the studies that have been carried out in the past use static models. Most of them do not consider the changes that occur in the course of time. Therefore, in order to construct consistent prediction models, it is necessary to compensate the time-dependent bias by means of a time series analysis algorithm reflecting dynamic change. Based on the global financial crisis, which has had a significant impact on Korea, this study is conducted using 10 years of annual corporate data from 2000 to 2009. Data are divided into training data, validation data, and test data respectively, and are divided into 7, 2, and 1 years respectively. In order to construct a consistent bankruptcy model in the flow of time change, we first train a time series deep learning algorithm model using the data before the financial crisis (2000~2006). The parameter tuning of the existing model and the deep learning time series algorithm is conducted with validation data including the financial crisis period (2007~2008). As a result, we construct a model that shows similar pattern to the results of the learning data and shows excellent prediction power. After that, each bankruptcy prediction model is restructured by integrating the learning data and validation data again (2000 ~ 2008), applying the optimal parameters as in the previous validation. Finally, each corporate default prediction model is evaluated and compared using test data (2009) based on the trained models over nine years. Then, the usefulness of the corporate default prediction model based on the deep learning time series algorithm is proved. In addition, by adding the Lasso regression analysis to the existing methods (multiple discriminant analysis, logit model) which select the variables, it is proved that the deep learning time series algorithm model based on the three bundles of variables is useful for robust corporate default prediction. The definition of bankruptcy used is the same as that of Lee (2015). Independent variables include financial information such as financial ratios used in previous studies. Multivariate discriminant analysis, logit model, and Lasso regression model are used to select the optimal variable group. The influence of the Multivariate discriminant analysis model proposed by Altman (1968), the Logit model proposed by Ohlson (1980), the non-time series machine learning algorithms, and the deep learning time series algorithms are compared. In the case of corporate data, there are limitations of 'nonlinear variables', 'multi-collinearity' of variables, and 'lack of data'. While the logit model is nonlinear, the Lasso regression model solves the multi-collinearity problem, and the deep learning time series algorithm using the variable data generation method complements the lack of data. Big Data Technology, a leading technology in the future, is moving from simple human analysis, to automated AI analysis, and finally towards future intertwined AI applications. Although the study of the corporate default prediction model using the time series algorithm is still in its early stages, deep learning algorithm is much faster than regression analysis at corporate default prediction modeling. Also, it is more effective on prediction power. Through the Fourth Industrial Revolution, the current government and other overseas governments are working hard to integrate the system in everyday life of their nation and society. Yet the field of deep learning time series research for the financial industry is still insufficient. This is an initial study on deep learning time series algorithm analysis of corporate defaults. Therefore it is hoped that it will be used as a comparative analysis data for non-specialists who start a study combining financial data and deep learning time series algorithm.

A Structural Relation Among Self-Efficacy, Self-Esteem and Life Satisfaction of Highly Stressed University Students for Studying after Taking Swim Class in College (학업스트레스가 높은 대학생의 교양 수영 수업 수강에 따른 자기효능감, 자아존중감 및 생활만족도의 구조적 관계)

  • Lee, Young Jun
    • Journal of the Korean Applied Science and Technology
    • /
    • v.37 no.2
    • /
    • pp.192-205
    • /
    • 2020
  • The purpose of this study is to verify the structural relationship empirically among self-esteem, self-efficacy and life satisfaction of college students who answered that they had high academic stress. SPSS 23.0 and AMOS 21.0 were used to achieve the objectives of this study. In SPSS 23.0, frequency analysis to analyze demographic characteristics, correlation analysis to verify multi-collinearity among the questionnaire scales, and reliability verification to determine the reliability of questionnaire questions were conducted. In AMOS 21.0, confirmatory factor analysis was conducted to verify the construct validity of factors and to verify the causal relationship between variables. To determine the goodness of fit of the model, the 𝑥2 test and the goodness-of-fit index were used. Judging. First, the self-efficacy of college students with high academic stress group through swimming class was positively influenced on self-esteem. Second, the self-efficacy of college students with high academic stress group through swimming class was positively influenced on life satisfaction. Third, the self-esteem of college students with high academic stress group through swimming class was found to affect life satisfaction. This study has significance in demonstrating the problem of academic stress of Korean university students and in proposing expansion of the physical education class as a solution.

The Effect of Customer Orientation on Perceived Referral Risk and Referral Intention (보험 영업사원의 고객지향성이 지각된 소개위험과 추천의도에 미치는 영향: 고객성향의 조절효과를 중심으로)

  • Kim, Dong-Hyun;Cha, Jae-Bin;Park, Chan-Wook
    • Journal of Distribution Science
    • /
    • v.15 no.7
    • /
    • pp.61-71
    • /
    • 2017
  • Purpose - This study empirically analyzed the effect of the customer orientation in Insurance Salespersons on the perceived referral risk and referral intention. In the empirical study, we try to provide suggestions for reducing the perceived referral risk of customer oriented selling activities and improving the referral intentions according to customers' tendencies. Research design, data, and methodology - Data collection was conducted through the convenience sampling method for customers who had insurance coverage for about two months from March to May 2015. A total of 700 copies were distributed and 670 copies (95.7% recovery) were collected. Finally, 661 copies were used for final analysis. With the IBM PASW 22.0 statistical program. The interaction effect for the hypothesis test was generated by multiplying the average centralized independent variable and the control variable, and the average centralization variable was used to minimize the multi-collinearity problem of the interaction effect between the independent variable and the control variables. Results - Hypothesis 1 was adopted because the effect of customer-oriented selling activities on perceived referral risk were significantly negative. The effect of customer orientation on perceived referral risk is affected by innovative tendency, risk-taking tendency, and interpersonal tendency Interaction effect was observed. Therefore, Hypothesis 2-2, Hypothesis 2-3, Hypothesis 2-4 were adopted. The effect of customer-oriented selling activities on referral intention was significantly positive, and Hypothesis 3 was adopted. The effect of customer orientation was influenced by the interaction effect of innovative tendency. Therefore, only Hypothesis 4-2 was adopted. Finally, the effect of perceived referral risk on referral intention was significantly negative and hypothesis 5 was adopted. Conclusions - This study suggests that it is important for the salespeople to grasp the customers' propensity in consideration of the perceived referral risk and referral intention according to the moderating effect of customer orientation. In this study, we showed that customer-oriented selling activities positively influence referral intention by inducing perceived referral risk in customers with stronger risk-taking tendencies. It is thought that it will be an important basic data in designing a customer's selling strategy or conducting selling activities.

Development of Ship Valuation Model by Neural Network (신경망기법을 활용한 선박 가치평가 모델 개발)

  • Kim, Donggyun;Choi, Jung-Suk
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.27 no.1
    • /
    • pp.13-21
    • /
    • 2021
  • The purpose of this study is to develop the ship valuation model by utilizing the neural network model. The target of the valuation was secondhand VLCC. The variables were set as major factors inducing changes in the value of ship through prior research, and the corresponding data were collected on a monthly basis from January 2000 to August 2020. To determine the stability of subsequent variables, a multi-collinearity test was carried out and finally the research structure was designed by selecting six independent variables and one dependent variable. Based on this structure, a total of nine simulation models were designed using linear regression, neural network regression, and random forest algorithm. In addition, the accuracy of the evaluation results are improved through comparative verification between each model. As a result of the evaluation, it was found that the most accurate when the neural network regression model, which consist of a hidden layer composed of two layers, was simulated through comparison with actual VLCC values. The possible implications of this study first, creative research in terms of applying neural network model to ship valuation; this deviates from the existing formalized evaluation techniques. Second, the objectivity of research results was enhanced from a dynamic perspective by analyzing and predicting the factors of changes in the shipping. market.

Development of a Model for Predicting Modulus on Asphalt Pavements Using FWD Deflection Basins (FWD 처짐곡선을 이용한 아스팔트 포장구조체의 탄성계수 추정 모형 개발)

  • Park, Seong Wan;Hwang, Jung Joon;Hwang, Kyu Young;Park, Hee Mun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.5D
    • /
    • pp.797-804
    • /
    • 2006
  • A development of regression model for asphalt concrete pavements using Falling Weight Deflectometer deflections is presented in this paper. A backcalculation program based on layered elastic theory was used to generate the synthetic modulus database, which was used to generate 95% confidence intervals of modulus in each layer. Using deflection basins of FWD data used in developing this procedure were collected from Pavement Management System in flexible pavements. Assumptions of back-calculation are that one is 3 layered flexible pavement structure and another is depth to bedrock is finite. It is found that difference of between 95% confidence intervals and modulus ranges of other papers does not exist. So, the data of 95% confidence intervals in each layer was used to develop multiple regression models. Multiple regression equations of each layer were established by SPSS, package of Statics analysis. These models were proved by regression diagnostics, which include case analysis, multi-collinearity analysis, influence diagnostics and analysis of variance. And these models have higher degree of coefficient of determination than 0.75. So this models were applied to predict modulus of domestic asphalt concrete pavement at FWD field test.

The Impact of Human Resource Innovativeness, Learning Orientation, and Their Interaction on Innovation Effect and Business Performance : Comparison of Small and Medium-Sized vs. Large-Sized Companies (인적자원의 혁신성, 학습지향성, 이들의 상호작용이 혁신효과 및 사업성과에 미치는 영향 : 중소기업과 대기업의 비교연구)

  • Yoh, Eunah
    • Korean small business review
    • /
    • v.31 no.2
    • /
    • pp.19-37
    • /
    • 2009
  • The purpose of this research is to explore differences between small and medium-sized companies and large-sized companies in the impact of human resource innovativeness(HRI), learning orientation(LO), and HRI-LO interaction on innovation effect and business performance. Although learning orientation has long been considered as a key factor influencing good performance of a business, little research was devoted to exploring the effect of HRI-LO interaction on innovation effect and business performance. In this study, it is investigated whether there is a synergy effect between innovative human workforce and learning orientation corporate culture, in addition to each by itself, to generate good business performance as well as a success of new innovations in the market. Research hypotheses were as follows, including H1) human resource innovativeness(HRI), learning orientation(LO), and interactions of HRI and LO(HRI-LO interaction) positively affect innovation effect, H2) there is a difference of the effect of HRI, LO, and HRI-LO interaction on innovation effect between large-sized and small-sized companies, H3) HRI, LO, HRI-LO interaction, innovation effect positively affect business performance, and H4) there is a difference of the effect of HRI, LO, HRI-LO interaction, and innovation effect on business performance between large-sized and small-sized companies. Data were obtained from 479 practitioners through a web survey since the web survey is an efficient method to collect a national data at a variety of fields. A single respondent from a company was allowed to participate in the study after checking whether they have more than 5-year work experiences in the company. To check whether a common source bias is existed in the sample, additional data from a convenient sample of 97 companies were gathered through the traditional survey method, and were used to confirm correlations between research variables of the original sample and the additional sample. Data were divided into two groups according to company size, such as 352 small and medium-sized companies with less than 300 employees and 127 large-sized companies with 300 or more employees. Data were analyzed through t-test and regression analyses. HRI which is the innovativeness of human resources in the company was measured with 9 items assessing the innovativenss of practitioners in staff, manager, and executive-level positions. LO is the company's effort to encourage employees' development, sharing, and utilizing of knowledge through consistent learning. LO was measured by 18 items assessing commitment to learning, vision sharing, and open-mindedness. Innovation effect which assesses a success of new products/services in the market, was measured with 3 items. Business performance was measured by respondents' evaluations on profitability, sales increase, market share, and general business performance, compared to other companies in the same field. All items were measured by using 6-point Likert scales. Means of multiple items measuring a construct were used as variables based on acceptable reliability and validity. To reduce multi-collinearity problems generated on the regression analysis of interaction terms, centered data were used for HRI, LO, and Innovation effect on regression analyses. In group comparison, large-sized companies were superior on annual sales, annual net profit, the number of new products/services in the last 3 years, the number of new processes advanced in the last 3 years, and the number of R&D personnel, compared to small and medium-sized companies. Also, large-sized companies indicated a higher level of HRI, LO, HRI-LO interaction, innovation effect and business performance than did small and medium-sized companies. The results indicate that large-sized companies tend to have more innovative human resources and invest more on learning orientation than did small-sized companies, therefore, large-sized companies tend to have more success of a new product/service in the market, generating better business performance. In order to test research hypotheses, a series of multiple-regression analysis was conducted. In the regression analysis examining the impact on innovation effect, important results were generated as : 1) HRI, LO, and HRI-LO affected innovation effect, and 2) company size indicated a moderating effect. Based on the result, the impact of HRI on innovation effect would be greater in small and medium-sized companies than in large-sized companies whereas the impact of LO on innovation effect would be greater in large-sized companies than in small and medium-sized companies. In other words, innovative workforce would be more important in making new products/services that would be successful in the market for small and medium-sized companies than for large-sized companies. Otherwise, learning orientation culture would be more effective in making successful products/services for large-sized companies than for small and medium-sized companies. Based on these results, research hypotheses 1 and 2 were supported. In the analysis of a regression examining the impact on business performance, important results were generated as : 1) innovation effect, LO, and HRI-LO affected business performance, 2) HRI by itself did not have a direct effect on business performance regardless of company size, and 3) company size indicated a moderating effect. Specifically, an effect of the HRI-LO interaction on business performance was stronger in large-sized companies than in small and medium-sized companies. It means that the synergy effect of innovative human resources and learning orientation culture tends to be stronger as company is larger. Referring to these result, research hypothesis 3 was partially supported whereas hypothesis 4 was supported. Based on research results, implications for companies were generated. Regardless of company size, companies need to develop the learning orientation corporate culture as well as human resources' innovativeness together in order to achieve successful development of innovative products and services as well as to improve sales and profits. However, the effectiveness of the HRI-LO interaction would be varied by company size. Specifically, the synergy effect of HRI-LO was stronger to make a success of new products/services in small and medium-sized companies than in large-sized companies. However, the synergy effect of HRI-LO was more effective to increase business performance of large-sized companies than that of small and medium-sized companies. In the case of small and medium-sized companies, business performance was achieved more through the success of new products/services than much directly affected by HRI-LO. The most meaningful result of this study is that the effect of HRI-LO interaction on innovation effect and business performance was confirmed. It was often ignored in the previous research. Also, it was found that the innovativeness of human workforce would not directly influence in generating good business performance, however, innovative human resources would indirectly affect making good business performance by contributing to achieving the development of new products/services that would be successful in the market. These findings would provide valuable managerial implications specifically in regard to the development of corporate culture and education program of small and medium-sized as well as large-sized companies in a variety of fields.