• Title/Summary/Keyword: multivariate modeling

Search Result 115, Processing Time 0.025 seconds

Differences in 25-hydroxy vitamin D and vitamin D-binding protein concentrations according to the severity of endometriosis

  • Baek, Jong Chul;Jo, Jae Yoon;Lee, Seon Mi;Cho, In Ae;Shin, Jeong Kyu;Lee, Soon Ae;Lee, Jong Hak;Cho, Min-Chul;Choi, Won Jun
    • Clinical and Experimental Reproductive Medicine
    • /
    • v.46 no.3
    • /
    • pp.125-131
    • /
    • 2019
  • Objective: To investigate serum 25-hydroxyl vitamin D (25(OH)D) and vitamin D-binding protein (VDBP) concentrations in women with endometriosis according to the severity of disease. Methods: Women with mild endometriosis (n = 9) and advanced endometriosis (n = 7), as well as healthy controls (n = 16), were enrolled in this observational study. Serum total 25(OH)D concentrations were analyzed using the Elecsys vitamin D total kit with the Cobas e602 module. Concentrations of bioavailable and free 25(OH)D were calculated. Concentrations of VDBP were measured using the Human Vitamin D BP Quantikine ELISA kit. Variables were tested for normality and homoscedasticity using the Shapiro-Wilk test and Leven F test, respectively. Correlation analysis was used to identify the variables related to total 25(OH)D and VDBP levels. To assess the effects of total 25(OH)D and VDBP levels in the three groups, multivariate generalized additive modeling (GAM) was performed. Results: Gravidity and parity were significantly different across the three groups. Erythrocyte sedimentation rate (ESR) and CA-125 levels increased as a function of endometriosis severity, respectively (p= 0.051, p= 0.004). The correlation analysis showed that total 25(OH)D levels were positively correlated with gravidity (r = 0.59, p< 0.001) and parity (r = 0.51, p< 0.003). Multivariate GAM showed no significant relationship of total 25(OH)D levels with EMT severity after adjusting for gravidity and ESR. However, the coefficient of total 25(OH)D levels with gravidity was significant (1.87; 95% confidence interval, 0.12-3.63; p= 0.040). Conclusion: These results indicate that vitamin D and VDBP levels were not associated with the severity of endometriosis.

A Study on Building an Integrated Model of App Performance Analysis and App Review Sentiment Analysis (앱 이용실적과 앱 리뷰 감성분석의 통합적 모델 구축에 관한 연구)

  • Kim, Dongwook;Kim, Sungbum
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.1
    • /
    • pp.58-73
    • /
    • 2022
  • The purpose of this study is to construct a predictable estimation model that reflects the relationship between the variables of mobile app performance and to verify how app reviews affect app performance. In study 1 and 2, the relationship between app performance indicators was derived using correlation analysis and random forest regression estimation of machine learning, and app performance estimation modeling was performed. In study 3, sentiment scores for app reviews were by using sentiment analysis of text mining, and it was found that app review sentiment scores have an effect one lag ahead of the number of daily installations of apps when using multivariate time series analysis. By analyzing the dissatisfaction and needs raised by app performance indicators and reviews of apps, companies can improve their apps in a timely manner and derive the timing and direction of marketing promotions.

Principal Discriminant Variate (PDV) Method for Classification of Multicollinear Data: Application to Diagnosis of Mastitic Cows Using Near-Infrared Spectra of Plasma Samples

  • Jiang, Jian-Hui;Tsenkova, Roumiana;Yu, Ru-Qin;Ozaki, Yukihiro
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1244-1244
    • /
    • 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noises present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i. e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess a good stability such that the prediction variance for unclassified objects can be as small as possible. In other words, an optimal classifier should find a balance between the separability and the stability. This is of special significance for multivariate spectroscopy-based classification where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for handling effectively multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations for the PDV methods are suggested, and an effective computing procedure is proposed for a PDV method. Near-infrared (NIR) spectra of blood plasma samples from mastitic and healthy cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). Results obtained demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from mastitic and healthy cows are clearly discriminated between by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional difference, thereby providing a useful means for spectroscopy-based clinic applications.

  • PDF

PRINCIPAL DISCRIMINANT VARIATE (PDV) METHOD FOR CLASSIFICATION OF MULTICOLLINEAR DATA WITH APPLICATION TO NEAR-INFRARED SPECTRA OF COW PLASMA SAMPLES

  • Jiang, Jian-Hui;Yuqing Wu;Yu, Ru-Qin;Yukihiro Ozaki
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1042-1042
    • /
    • 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noises present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i. e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess a good stability such that the prediction variance for unclassified objects can be as small as possible. In other words, an optimal classifier should find a balance between the separability and the stability. This is of special significance for multivariate spectroscopy-based classification where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for handling effectively multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations for the PDV methods are suggested, and an effective computing procedure is proposed for a PDV method. Near-infrared (NIR) spectra of blood plasma samples from daily monitoring of two Japanese cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). Results obtained demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from two cows are clearly discriminated between by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA md FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional difference.

  • PDF

Time-varying modeling of the composite LN-GPD (시간에 따라 변화하는 로그-정규분포와 파레토 합성 분포의 모형 추정)

  • Park, Sojin;Baek, Changryong
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.1
    • /
    • pp.109-122
    • /
    • 2018
  • The composite lognormal-generalized Pareto distribution (LN-GPD) is a mixture of right-truncated lognormal and GPD for a given threshold value. Scollnik (Scandinavian Actuarial Journal, 2007, 20-33, 2007) shows that the composite LN-GPD is adequate to describe body distribution and heavy-tailedness. This paper considers time-varying modeling of the LN-GPD based on local polynomial maximum likelihood estimation. Time-varying model provides significant detailed information of time dependent data, hence it can be applied to disciplines such as service engineering for staffing and resources management. Our work also extends to Beirlant and Goegebeur (Journal of Multivariate Analysis, 89, 97-118, 2004) in the sense of losing no data by including truncated lognormal distribution. Our proposed method is shown to perform adequately in simulation. Real data application to the service time of the Israel bank call center shows interesting findings on the staffing policy.

A Structural Model Analysis of Person-Organization Fit Influencing Job Satisfaction and Turnover Intent Mediated through Goal Commitment - Centered on Five Star Deluxe Hotel Employees - (개인 조직 적합성이 목표 몰입을 매개로 직무만족 및 이직의도에 미치는 영향에 관한 연구 - 특 1급 호텔 근무자를 중심으로 -)

  • Kim, Ji-Eun
    • Culinary science and hospitality research
    • /
    • v.18 no.5
    • /
    • pp.146-164
    • /
    • 2012
  • This study aims to investigate the influence of person-organization(P-O) fit on job satisfaction, and turnover intent mediated through goal commitment using structural equation modeling. An experimental design is applied to test seven hypotheses that reflect the research questions. Five star deluxe hotel employees are targeted for sampling. A total of 180 faithful cases out of 250 cases have been analyzed. To analyze the data, descriptive statistics and multivariate analysis of variance, and structural equation modeling(SEM) are employed using SPSS 19.0 and AMOS 4. The results indicate the hotel employees' perceived P-O fit is positively associated with goal commitment and job satisfaction while negatively linked with turnover intent. Also, goal commitment has a positive effect on job satisfaction while having a negative effect on turnover intent, mediating between P-O fit and organizational outcomes. Therefore, the needs to evaluate P-O fit and goal commitment during recruitment and after organizational entry have been raised.

  • PDF

Visual Analytics Approach for Performance Improvement of predicting youth physical growth model (청소년 신체 성장 예측 모델의 성능 향상을 위한 시각적 분석 방법)

  • Yeon, Hanbyul;Pi, Mingyu;Seo, Seongbum;Ha, Seoho;Oh, Byungjun;Jang, Yun
    • Journal of the Korea Computer Graphics Society
    • /
    • v.23 no.4
    • /
    • pp.21-29
    • /
    • 2017
  • Previous visual analytics researches has focused on reducing the uncertainty of predicted results using a variety of interactive visual data exploration techniques. The main purpose of the interactive search technique is to reduce the quality difference of the predicted results according to the level of the decision maker by understanding the relationship between the variables and choosing the appropriate model to predict the unknown variables. However, it is difficult to create a predictive model which forecast time series data whose overall trends is unknown such as youth physical growth data. In this paper, we pro pose a novel predictive analysis technique to forecast the physical growth value in small pieces of time series data with un certain trends. This model estimates the distribution of data at a particular point in time. We also propose a visual analytics system that minimizes the possible uncertainties in predictive modeling process.

A Study on Polynomial Neural Networks for Stabilized Deep Networks Structure (안정화된 딥 네트워크 구조를 위한 다항식 신경회로망의 연구)

  • Jeon, Pil-Han;Kim, Eun-Hu;Oh, Sung-Kwun
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.66 no.12
    • /
    • pp.1772-1781
    • /
    • 2017
  • In this study, the design methodology for alleviating the overfitting problem of Polynomial Neural Networks(PNN) is realized with the aid of two kinds techniques such as L2 regularization and Sum of Squared Coefficients (SSC). The PNN is widely used as a kind of mathematical modeling methods such as the identification of linear system by input/output data and the regression analysis modeling method for prediction problem. PNN is an algorithm that obtains preferred network structure by generating consecutive layers as well as nodes by using a multivariate polynomial subexpression. It has much fewer nodes and more flexible adaptability than existing neural network algorithms. However, such algorithms lead to overfitting problems due to noise sensitivity as well as excessive trainning while generation of successive network layers. To alleviate such overfitting problem and also effectively design its ensuing deep network structure, two techniques are introduced. That is we use the two techniques of both SSC(Sum of Squared Coefficients) and $L_2$ regularization for consecutive generation of each layer's nodes as well as each layer in order to construct the deep PNN structure. The technique of $L_2$ regularization is used for the minimum coefficient estimation by adding penalty term to cost function. $L_2$ regularization is a kind of representative methods of reducing the influence of noise by flattening the solution space and also lessening coefficient size. The technique for the SSC is implemented for the minimization of Sum of Squared Coefficients of polynomial instead of using the square of errors. In the sequel, the overfitting problem of the deep PNN structure is stabilized by the proposed method. This study leads to the possibility of deep network structure design as well as big data processing and also the superiority of the network performance through experiments is shown.

Longitudinal analysis of the direct and indirect influence of academic self-concept and academic support of teachers and parents on academic achievement in mathematics (학업적 자아개념 및 교사와 부모의 학업적 지원이 수학 학업성취도에 미치는 직·간접적인 영향력에 대한 종단적 분석)

  • Kim, YongSeok
    • The Mathematical Education
    • /
    • v.61 no.1
    • /
    • pp.127-156
    • /
    • 2022
  • This study used the data of students from the 6th grade to the 3rd grade of middle schoolin the Korean Educational Longitudinal Study 2013 and classified them into subgroups with similar longitudinal changes in math academic achievement. In addition, the influence of longitudinal changes in the group's academic self-concept and teachers and parents academic support on the longitudinal changes in math academic achievement were analyzed, either directly or indirectly. As a result of the analysis, it was found that the academic self-concept of each group had a positive influence on the academic achievement in mathematics. In addition, the academic support of teachers and parents was found to have a positive influence on the academic achievement in mathematics through the mediating of the academic self-concept. In terms of direct and indirect influence on academic self-concept and math vertical scale scores, it was found that teachers' academic support had more influence than parents' academic support. The educational implications of these points were discussed.

Self-Modeling Curve Resolution Analysis of On-line Near Infrared Spectra Measured during the Melt-Extrusion Transesterification of Ethylene/Vinylacetate Copolymer

  • Sasic, Slobodan;Kita, Yasuo;Furukawa, Tsuyoshi;Watari, Masahiro;Siesler, Heinz W.;Ozaki, Yukihiro
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1284-1284
    • /
    • 2001
  • The transesterification of molten ethylene/vinylacetate (EVA) copolymers by octanol as a reagent and sodium methoxide as a catalyst in an extruder has been monitored by on-line near infrared (NIR) spectroscopy. A total of 60 NIR spectra were acquired for 37 minutes with the last spectrum recorded 31 minutes after the addition of octanol and catalyst was stopped. The experimental spectra show strong baseline fluctuations which are corrected for by multiplicative scatter correction (MSC). The chemometric methods of orthogonal projection approach (OPA) and multivariate curve resolution (MCR) were used to resolve the spectra and to derive concentration profiles of the species. The detailed analysis reveals the absence of completely pure variables that leads to small errors in the calculation of pure spectra. The initial estimation of a concentration that is necessary as an input parameter for MCR also presents a non-trivial task. We obtained results that were not ideal but applicable for practical concentration control. They enable a fast monitoring of the process in real-time and resolve the spectra of the EVA copolymer and the ethylene/vinyl alcohol (EVAL) copolymer to be very close to the reference spectra. The chemometric methods used and the decomposed spectra are discussed in detail.

  • PDF