• Title/Abstract/Keywords: Multivariate Correlation Analysis

380 search results, processing time 0.025 seconds

Multivariate process control procedure using a decision tree learning technique

  • 정광영;이재헌
    • Journal of the Korean Data and Information Science Society / Vol. 26, No. 3 / pp. 639-652 / 2015
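  • Advances in computing, communications, and networking have made computer-integrated manufacturing possible in modern manufacturing processes. As a result, high-speed production of high-quality products has expanded, and it has become possible to accumulate data on the many quality variables transmitted from the process in real time. Managing these data requires a multivariate statistical process control procedure. Traditional multivariate control charts give a signal when an out-of-control state occurs, but they have the drawback of providing no information on which variables the assignable cause affects or how. Data mining and machine learning techniques can be used to compensate for this. This paper introduces a multivariate process control procedure that uses a decision tree learning technique and examines its efficiency through simulation for the bivariate case. The simulation results show that the ability to detect an out-of-control state is similar across correlation coefficients. Classification accuracy for the out-of-control state varies with the correlation coefficient and the type of assignable cause, but the procedure has the advantage of providing information on the assignable cause that existing multivariate control charts do not.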
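
The limitation of traditional multivariate control charts described in this abstract can be illustrated with a minimal Python sketch. It computes a Hotelling T² statistic for a bivariate process with known in-control parameters. The mean vector, covariance matrix, false-alarm rate, and function names are assumptions chosen for illustration; this is not the authors' decision tree procedure.

```python
import math


def hotelling_t2(x, mean, cov):
    """T^2 = (x - mean)' cov^{-1} (x - mean) for one bivariate observation."""
    d1, d2 = x[0] - mean[0], x[1] - mean[1]
    (s11, s12), (s21, s22) = cov
    det = s11 * s22 - s12 * s21
    # The inverse of a 2x2 matrix is applied directly to the deviation vector.
    return (s22 * d1 * d1 - (s12 + s21) * d1 * d2 + s11 * d2 * d2) / det


def chi2_ucl_2df(alpha):
    """Upper control limit for T^2 with known parameters and two variables.

    With 2 degrees of freedom the chi-square CDF is 1 - exp(-x / 2),
    so the (1 - alpha) quantile is -2 * ln(alpha).
    """
    return -2.0 * math.log(alpha)


# Hypothetical in-control parameters (assumed, not taken from the paper).
MEAN = (0.0, 0.0)
COV = ((1.0, 0.5), (0.5, 1.0))  # correlation coefficient 0.5
UCL = chi2_ucl_2df(0.005)

# Two different assignable causes: a shift in the first variable only,
# and a shift of the same size in the second variable only.
t2_shift_x1 = hotelling_t2((3.0, 0.0), MEAN, COV)
t2_shift_x2 = hotelling_t2((0.0, 3.0), MEAN, COV)

print(t2_shift_x1 > UCL, t2_shift_x2 > UCL)
```

Under these assumed parameters both observations give the same T² value and both exceed the limit, so the chart signals in each case without indicating which variable shifted. That is the gap a classifier such as a decision tree is meant to fill.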

Development and Validation of Generalized Linear Regression Models to Predict Vessel Enhancement on Coronary CT Angiography

  • Masuda, Takanori;Nakaura, Takeshi;Funama, Yoshinori;Sato, Tomoyasu;Higaki, Toru;Kiguchi, Masao;Matsumoto, Yoriaki;Yamashita, Yukari;Imada, Naoyuki;Awai, Kazuo
    • Korean Journal of Radiology / Vol. 19, No. 6 / pp. 1021-1030 / 2018
  • Objective: We evaluated the effect of various patient characteristics and time-density curve (TDC)-factors on the test bolus-affected vessel enhancement on coronary computed tomography angiography (CCTA). We also assessed the value of generalized linear regression models (GLMs) for predicting enhancement on CCTA. Materials and Methods: We performed univariate and multivariate regression analysis to evaluate the effect of patient characteristics and to compare contrast enhancement per gram of iodine on test bolus ($\Delta\mathrm{HU}_{\mathrm{TEST}}$) and CCTA ($\Delta\mathrm{HU}_{\mathrm{CCTA}}$). We developed GLMs to predict $\Delta\mathrm{HU}_{\mathrm{CCTA}}$. GLMs including independent variables were validated with 6-fold cross-validation using the correlation coefficient and Bland-Altman analysis. Results: In multivariate analysis, only total body weight (TBW) and $\Delta\mathrm{HU}_{\mathrm{TEST}}$ maintained their independent predictive value (p < 0.001). In validation analysis, the highest correlation coefficient between $\Delta\mathrm{HU}_{\mathrm{CCTA}}$ and the prediction values was seen in the GLM (r = 0.75), followed by TDC (r = 0.69) and TBW (r = 0.62). The lowest Bland-Altman limit of agreement was observed with GLM-3 (mean difference, $-0.0 \pm 5.1$ Hounsfield units per gram of iodine [HU/gI]; 95% confidence interval [CI], -10.1, 10.1), followed by $\Delta\mathrm{HU}_{\mathrm{CCTA}}$ ($-0.0 \pm 5.9$ HU/gI; 95% CI, -11.9, 11.9) and TBW ($1.1 \pm 6.2$ HU/gI; 95% CI, -11.2, 13.4). Conclusion: We demonstrated that the patient's TBW and $\Delta\mathrm{HU}_{\mathrm{TEST}}$ significantly affected contrast enhancement on CCTA images and that the combined use of clinical information and test bolus results is useful for predicting aortic enhancement.
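
The two validation metrics named in this abstract, the correlation coefficient and Bland-Altman analysis, can be sketched in Python as follows. The data, the function names, and the mean ± 1.96 SD convention for the limits of agreement are assumptions for illustration; none of them is taken from the paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def bland_altman(measured, predicted):
    """Return mean difference, SD of differences, and limits of agreement.

    Limits are computed as mean difference +/- 1.96 * SD, a common
    convention that is assumed here.
    """
    diffs = [p - m for m, p in zip(measured, predicted)]
    n = len(diffs)
    mean_diff = sum(diffs) / n
    sd = math.sqrt(sum((d - mean_diff) ** 2 for d in diffs) / (n - 1))
    return mean_diff, sd, (mean_diff - 1.96 * sd, mean_diff + 1.96 * sd)


# Hypothetical measured and predicted enhancement values (HU/gI).
measured = [20.0, 25.0, 30.0, 35.0, 40.0]
predicted = [22.0, 24.0, 31.0, 33.0, 40.0]

r = pearson_r(measured, predicted)
mean_diff, sd, limits = bland_altman(measured, predicted)
print(round(r, 3), round(mean_diff, 3), round(sd, 3), limits)
```

In a cross-validation setting such as the one the abstract describes, `predicted` would hold the out-of-fold predictions from each model, and a narrower pair of limits would indicate closer agreement with the measured values.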

Serum 25-hydroxyvitamin D3 is associated with homocysteine more than with apolipoprotein B

  • Nam-Kyu, Kim;Min-Ah, Jung;Beom-hee, Choi;Nam-Seok, Joo
    • Nutrition Research and Practice / Vol. 16, No. 6 / pp. 745-754 / 2022
  • BACKGROUND/OBJECTIVES: The incidence of cardiovascular diseases (CVDs) has increased worldwide. Although a low serum vitamin D level is known to be associated with the risk of CVD, the mechanism is not yet well understood. The aim of this study was to determine the relationship of serum 25-hydroxyvitamin D3 (25[OH]D) with homocysteine and apolipoprotein B (ApoB). SUBJECTS/METHODS: Of 777 subjects recruited from one health promotion center for a routine health exam from January 2010 to December 2016, 518 subjects were included in this study. Serum 25(OH)D, serum homocysteine, and other metabolic parameters including ApoB were analyzed. Simple and partial correlations were calculated after adjustments. Simple linear regression analysis was used to examine the correlation between parameters more precisely. Multivariate regression analysis was performed to determine which factor (serum homocysteine or ApoB) was more closely related to serum 25(OH)D after adjustments. Finally, logarithms of homocysteine concentrations were compared across tertiles of serum 25(OH)D. RESULTS: After sex and age adjustments, serum 25(OH)D showed negative correlations with serum homocysteine (r' = -0.114) and ApoB (r' = -0.098). In simple linear regression analysis, serum 25(OH)D showed a significant negative correlation with ApoB (P = 0.035). However, in multivariate regression analysis, serum 25(OH)D was significantly associated with serum homocysteine after adjustments (P = 0.022). In addition, serum homocysteine concentration was significantly higher in the lowest 25(OH)D group (P = 0.046). CONCLUSION: Serum 25(OH)D concentration showed a stronger negative association with serum homocysteine than with ApoB.

Multivariate analysis of the cleaning efficacy of different final irrigation techniques in the canal and isthmus of mandibular posterior teeth

  • Yoo, Yeon-Jee;Lee, WooCheol;Kim, Hyeon-Cheol;Shon, Won-Jun;Baek, Seung-Ho
    • Restorative Dentistry and Endodontics / Vol. 38, No. 3 / pp. 154-159 / 2013
  • Objectives: The aim of this study was to compare the cleaning efficacy of different final irrigation regimens in the canal and isthmus of mandibular molars, and to evaluate the influence of related variables on the cleaning efficacy of the irrigation systems. Materials and Methods: Mesial root canals from 60 mandibular molars were prepared and divided into 4 experimental groups according to the final irrigation technique: Group C, syringe irrigation; Group U, ultrasonic activation; Group SC, VPro StreamClean irrigation; Group EV, EndoVac irrigation. Cross-sections at the 1, 3, and 5 mm levels from the apex were examined to calculate the remaining debris area in the canal and isthmus spaces. Statistical analysis used the Kruskal-Wallis test and Mann-Whitney U test for comparison among groups, and multivariate linear analysis to identify the significant variables (regular replenishment of irrigant, vapor lock management, and ultrasonic activation of irrigant) affecting the cleaning efficacy of the experimental groups. Results: Groups SC and EV showed significantly higher canal cleanliness values than groups C and U at the 1 mm level (p < 0.05), and higher isthmus cleanliness values than group U at the 3 mm level and than group C at all levels (p < 0.05). Multivariate linear regression analysis demonstrated that all variables had statistically significant independent positive correlations at the 1 mm level of the canal and at all levels of the isthmus. Conclusions: Both the VPro StreamClean and EndoVac systems showed favorable results as final irrigation regimens for cleaning debris in complicated root canal systems with curved canals and/or isthmi. Debridement of the isthmi, rather than of the canals, depended significantly on these variables.

Complication After Gastrectomy for Gastric Cancer According to Hospital Volume: Based on Korean Gastric Cancer Association-Led Nationwide Survey Data

  • Sang-Ho Jeong;Moon-Won Yoo;Miyeong Park;Kyung Won Seo;Jae-Seok Min;Information Committee of the Korean Gastric Cancer Association
    • Journal of Gastric Cancer / Vol. 23, No. 3 / pp. 462-475 / 2023
  • Purpose: This study aimed to analyze the incidence and risk factors of complications following gastric cancer surgery in Korea and to examine how complications relate to hospital volume, measured as the annual number of gastrectomies performed. Materials and Methods: A retrospective analysis was conducted using data from 12,244 patients from 64 Korean institutions. Complications were classified using the Clavien-Dindo classification (CDC). Univariate and multivariate analyses were performed to identify the risk factors for severe complications. Results: Postoperative complications occurred in 14% of the patients, severe complications (CDC IIIa or higher) in 4.9%, and postoperative death in 0.2%. Age, stage, American Society of Anesthesiologists (ASA) score, Eastern Cooperative Oncology Group (ECOG) score, hospital stay, approach methods, and extent of gastric resection showed statistically significant differences depending on hospital volume (P < 0.05). In the univariate analysis, patient age, comorbidity, ASA score, ECOG score, approach methods, extent of gastric resection, tumor-node-metastasis (TNM) stage, and hospital volume were significant risk factors for severe complications. However, only age, sex, ASA score, ECOG score, extent of gastric resection, and TNM stage were statistically significant in the multivariate analysis (P < 0.05). Hospital volume was not a significant risk factor in the multivariate analysis (P = 0.152). Conclusions: Hospital volume was not a significant risk factor for complications after gastric cancer surgery. The differences in the frequencies of complications by hospital volume may be attributed to larger hospitals treating patients with younger age, lower ASA scores, better general conditions, and earlier TNM stages.

Relationships Between the Characteristics of Algae Occurrence and Environmental Factors in Lake Juam, Korea

  • 서경애;정수정;박종환;황경섭;임병진
    • 한국물환경학회지 / Vol. 29, No. 3 / pp. 317-328 / 2013
  • This study investigated phytoplankton fluctuations and long-term changes in the water quality of Lake Juam and evaluated the relationship between phytoplankton patterns and environmental factors. Correlation and factor analyses were employed to identify key environmental factors affecting phytoplankton dynamics. Of 18 parameters, pH, temperature, COD, BOD, and T-P were highly correlated with Chl-a. Phytoplankton data showed that cyanobacteria were dominant, accounting for more than 60% of total algal density. Lake Juam was also strongly influenced by the Asian monsoon climate. This study shows the need for multivariate statistical techniques in evaluating the complex Lake Juam data set, with a view to obtaining better information and managing the water source effectively.

Child's Emotional Intelligence: Relationship with Mother's Parenting Efficacy and Child Rearing Stress

  • 이승은;서현
    • 아동학회지 / Vol. 28, No. 4 / pp. 127-144 / 2007
  • Mothers of 101 5- to 6-year-old children were administered the Parenting Efficacy Test (Shin & Jung, 1998; Ann & Park, 2002) and the Parenting Stress Index (Lee, Yeom, & Shin, 2000). Children's emotional intelligence (EI) was measured by the Emotional Intelligence Test for Children (Lee & Lee, 2004b). Data were analyzed by correlation and multivariate analysis of variance (MANOVA). Correlation analysis demonstrated a relationship of parenting efficacy and stress with the child's EI. MANOVA revealed that children whose mothers' parenting efficacy was in the upper thirty percent showed higher EI than those whose mothers' parenting efficacy was in the lower thirty percent, and that children whose mothers' parenting stress was in the upper thirty percent showed lower EI than those whose mothers' stress was in the lower thirty percent.


Air Quality in the Subway Cabins of the Seoul Metropolitan Area and Analysis of Its Influencing Factors Using Multivariate Statistics

  • 박은영;박덕신;조영민;권순박;최경희;권명희
    • 한국대기환경학회지 / Vol. 27, No. 2 / pp. 142-151 / 2011
  • In this study, we measured PM-10 and $CO_2$ concentrations in subway cabins and analyzed the factors affecting air quality using multivariate statistical analysis. The measurements were conducted on Seoul metropolitan subway lines. The results show that the mean concentrations of PM-10 and $CO_2$ inside subway cabins ranged from 62.6 to 108.0 $\mu g/m^3$ and from 907 to 2,008 ppm, respectively. The $CO_2$ level in specific sections during rush hours exceeded the air quality guidelines for public transportation, a situation that requires designated train ventilation controls. Correlation and regression analyses of influencing factors imply that the $CO_2$ level is strongly influenced by the number of passengers and that the PM-10 level is also correlated with the number of passengers. In particular, the PM-10 level in the cabins shows a positive correlation with the outdoor PM-10 level. In addition, the PM concentration was highly affected by the number of passengers and the distance between stations.

Epidemiological Characteristics of Gallbladder Cancer in Jeju Island: A Single-Center, Clinically Based, Age-Sex-Matched, Case-Control Study

  • Cha, Byung Hyo
    • Asian Pacific Journal of Cancer Prevention / Vol. 16, No. 18 / pp. 8451-8454 / 2016
  • Background: Gallbladder cancer (GBC) is a rare but highly invasive malignancy characterized by poor survival. In a national cancer survey, the age-standardized incidence rate of GBC was highest in Jeju Island among the 15 provinces in South Korea. The aim of this descriptive epidemiological study was to suggest modifiable risk factors for this rare malignant disease in Jeju Island by performing an age-sex-matched case-control study. Materials and Methods: The case group included patients diagnosed with GBC at the Department of Internal Medicine of Cheju Halla General Hospital, Jeju, South Korea, within the 5-year study period. The control group consisted of age-sex-matched subjects selected from among the participants of the health promotion center at the same institute and in the same period. We compared 78 case-control pairs in terms of clinical variables such as histories of hypertension, diabetes, vascular occlusive disorders, alcohol consumption and smoking, obesity, and combined polypoid lesions of the gallbladder (PLG) or gallstone diseases (GSDs). Results: Among the relevant risk factors, alcohol consumption, parity $\geq 2$, PLG, and GSDs were significant risk factors in the univariate analysis. PLG (p < 0.01; OR, 51.1; 95% confidence interval [CI], 2.98-875.3) and GSD (p < 0.01; OR, 54.9; 95% CI, 3.00-1001.8) were associated risk factors of GBC in the multivariate analysis with the conditional logistic regression model. However, we failed to find any correlation between obesity and GBC. We also found a negative correlation between alcohol consumption history and GBC in the multivariate analysis (p < 0.01; OR, 0.06; 95% CI, 0.01-0.31). Conclusions: These results suggest that combined PLG and GSDs are strongly associated with GBC in Jeju Island and that mild to moderate alcohol consumption may negatively correlate with GBC risk.

Correlation among Ownership of Home Appliances Using Multivariate Probit Model

  • 김창섭;신정우;이미숙;이종수
    • 마케팅과학연구 / Vol. 19, No. 2 / pp. 17-26 / 2009
  • As consumers' lifestyles change and the need for various products increases, new products are being developed in the market. Each household owns various home appliances, each purchased through a decision maker's choice. These appliances include not only large products such as TVs, refrigerators, and washing machines, but also small products such as microwave ovens and air cleaners. There is latent correlation among the possession of home appliances, even though they are purchased independently. The purpose of this research is to analyze the effect of demographic factors on the purchase and possession of each home appliance and to derive relationships among the appliances. To this end, the present status of possession of each home appliance is investigated through consumer survey data on electric and energy products, and a multivariate probit (MVP) model is applied for the empirical analysis. In the estimation results, some appliances show a substitutive or complementary pattern as expected, while others that look unrelated are correlated by coincidence. This research has several advantages over the previous literature on home appliances. First, it focuses on the various products purchased by each household, whereas previous studies such as Matsukawa and Ito (1998) and Yoon (2007) focus on a particular product. Second, its methodology can consider the choice process for each product and the correlation among products simultaneously. Lastly, it can analyze not only substitutive or complementary relationships within the same category but also the correlation among products in different categories. Because data on the possession of home appliances in each household have the character of multiple choice rather than single choice, an MVP model is used for the empirical analysis.
An MVP model is derived from a random utility model and has an advantage over a multinomial logit model in that the correlation among error terms can be derived (Manchanda et al., 1999; Edwards and Allenby, 2003). The error term is assumed to have a normal distribution with zero mean and variance-covariance matrix ${\Omega}$. Hence, the sign and value of the correlation coefficients indicate the relationship between two alternatives (Manchanda et al., 1999). This research uses data from the 'TEMEP Household ICT/Energy Survey (THIES) 2008', conducted by the Technology Management, Economics and Policy Program at Seoul National University. The empirical analysis is carried out in two steps. First, an MVP model with demographic variables is estimated to analyze the effect of household characteristics on the purchase of each home appliance. The variables considered include education level, region, family size, average income, and type of house. Second, an MVP model excluding demographic variables is estimated to analyze the correlation among the home appliances. According to the estimated variance-covariance matrix, households tend to own certain appliances together, such as washing machine-refrigerator-cleaner-microwave oven and air conditioner-dish washer-washing machine. On the other hand, several product pairs, such as analog Braun tube TV-digital Braun tube TV and desktop PC-portable PC, show a substitutive pattern. Lastly, a correlation map of home appliances is derived using the multi-dimensional scaling (MDS) method based on the estimated variance-covariance matrix. This research can provide significant implications for firms' marketing strategies such as bundling, pricing, and display. In addition, it can provide significant information for the development of convergence products and related technologies.
A convergence product can reduce its market uncertainty if it integrates two products that consumers tend to purchase together. The results of this research are more meaningful because they are based on the possession status of each household obtained through the survey data.
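
The model structure this abstract describes can be written out in a standard multivariate probit form. The notation below is an assumption for illustration and is not taken from the paper: $i$ indexes households, $j = 1, \dots, J$ indexes appliances, $x_i$ is a vector of demographic variables, $\beta_j$ is an appliance-specific coefficient vector, and $\omega_{jk}$ is the $(j, k)$ element of $\Omega$.

```latex
\begin{aligned}
y_{ij}^{*} &= x_i^{\top}\beta_j + \varepsilon_{ij}, \qquad
y_{ij} = \begin{cases} 1 & \text{if } y_{ij}^{*} > 0 \\ 0 & \text{otherwise} \end{cases} \\
\varepsilon_i &= (\varepsilon_{i1}, \dots, \varepsilon_{iJ})^{\top} \sim N(0, \Omega), \qquad
\rho_{jk} = \frac{\omega_{jk}}{\sqrt{\omega_{jj}\,\omega_{kk}}}
\end{aligned}
```

Under this reading, a positive $\rho_{jk}$ would mean that appliances $j$ and $k$ tend to be owned together (a complementary pattern), and a negative $\rho_{jk}$ would indicate a substitutive pattern, which matches how the abstract interprets the sign of the correlation coefficients.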
