• Title/Summary/Keyword: Multivariate Statistical Analysis

Search Result 632, Processing Time 0.028 seconds

A Study on the Categorization of Korean Foot Shapes (한국인 발 형상 분류에 관한 연구)

  • Seong, Deok-Hyeon;Jeong, Ui-Seung;Jo, Yong-Ju
    • Journal of the Ergonomics Society of Korea
    • /
    • v.25 no.2
    • /
    • pp.107-118
    • /
    • 2006
  • Recently, Korean's 3-D foot data have been extensively collected through 5th national anthropometric survey known as 'Size Korea'. In this study, Korean foot shape was investigated and subsequently classified, based on the existing standard for foot shaping. This study analyzed and categorized Korean foot shapes through the following methods. Although the data used in this study were limited to those of Korean adults, major factors affecting the foot shape were deduced and then categorically grouped by the multivariate statistical analysis. For those whose age ranged from 14 to 70, major factors affecting the foot shape for the male were related to foot breadth, ankle thickness, 1st toe shape, malleolus height, heel to top of the foot length, the ratio between toe-side and heel-side and 5th toe shape. For the female, the ball of foot height was added to the above factors. From the factors extracted, the Korean foot shape was categorized into three groups for the male and four groups for the female. They were the ladder type, the inverted triangle type and the square type. For the female, the triangular type was added to the three types. These findings will serve as useful information for the footwear production industry in Korea.

Study on the Relationship between Weather Conditions, Sewage and Operational Variables of WWTPs using Multivariate Statistical Methods (기상조건이 하수발생량 및 하수처리장 운전인자에 미치는 영향에 관한 통계적 분석)

  • Lee, Jae-Hyun
    • Journal of Korean Society on Water Environment
    • /
    • v.28 no.2
    • /
    • pp.285-291
    • /
    • 2012
  • Generally, the rainfall and the influent of wastewater treatment plants (WWTPs) have strong relationship at the case of combined sewers. With the fact that the influent variations in terms of quantity and sewage quality is the most common and significant disturbance, the impact factor to the characteristics of sewage should be searched for. In this paper, the relationship between weather conditions such as humidity, temperature and rainfall and influent flowrate and contaminant concentration was analysed using factor analysis. Additionally, 3 influent types were deduced using cluster analysis and the distributions of operational variables were compared to the each groups by one-way ANOVA. The applied dataset were clustered to three groups that have the similar weather and influent conditions. These different conditions can cause the different operating conditions at WWTPs. That is, the Group 1 is for the condition with high humidity and rainfall, so DO concentration in the reactor was very high but MLSS concentration was very low because of too large flowrate. However, the Group 3 is classified to the case having low humidity, temperature, and rainfall, therefore, the SRT was the longest and the SVI was the highest due to the worst settleability in the winter for a year.

KCYP data analysis using Bayesian multivariate linear model (베이지안 다변량 선형 모형을 이용한 청소년 패널 데이터 분석)

  • Insun, Lee;Keunbaik, Lee
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.6
    • /
    • pp.703-724
    • /
    • 2022
  • Although longitudinal studies mainly produce multivariate longitudinal data, most of existing statistical models analyze univariate longitudinal data and there is a limitation to explain complex correlations properly. Therefore, this paper describes various methods of modeling the covariance matrix to explain the complex correlations. Among them, modified Cholesky decomposition, modified Cholesky block decomposition, and hypersphere decomposition are reviewed. In this paper, we review these methods and analyze Korean children and youth panel (KCYP) data are analyzed using the Bayesian method. The KCYP data are multivariate longitudinal data that have response variables: School adaptation, academic achievement, and dependence on mobile phones. Assuming that the correlation structure and the innovation standard deviation structure are different, several models are compared. For the most suitable model, all explanatory variables are significant for school adaptation, and academic achievement and only household income appears as insignificant variables when cell phone dependence is a response variable.

Estimation of Water Quality of Fish Farms using Multivariate Statistical Analysis

  • Ceong, Hee-Taek;Kim, Hae-Ran
    • Journal of information and communication convergence engineering
    • /
    • v.9 no.4
    • /
    • pp.475-482
    • /
    • 2011
  • In this research, we have attempted to estimate the water quality of fish farms in terms of parameters such as water temperature, dissolved oxygen, pH, and salinity by employing observational data obtained from a coastal ocean observatory of a national institution located close to the fish farm. We requested and received marine data comprising nine factors including water temperature from Korea Hydrographic and Oceanographic Administration. For verifying our results, we also established an experimental fish farm in which we directly placed the sensor module of an optical mode, YSI-6920V2, used for self-cleaning inside fish tanks and used the data measured and recorded by a environment monitoring system that was communicating serially with the sensor module. We investigated the differences in water temperature and salinity among three areas - Goheung Balpo, Yeosu Odongdo, and the experimental fish farm, Keumho. Water temperature did not exhibit significant differences but there was a difference in salinity (significance <5%). Further, multiple regression analysis was performed to estimate the water quality of the fish farm at Keumho based on the data of Goheung Balpo. The water temperature and dissolved-oxygen estimations had multiple regression linear relationships with coefficients of determination of 98% and 89%, respectively. However, in the case of the pH and salinity estimated using the oceanic environment with nine factors, the adjusted coefficient of determination was very low at less than 10%, and it was therefore difficult to predict the values. We plotted the predicted and measured values by employing the estimated regression equation and found them to fit very well; the values were close to the regression line. We have demonstrated that if statistical model equations that fit well are used, the expense of fish-farm sensor and system installations, maintenances, and repairs, which is a major issue with existing environmental information monitoring systems of marine farming areas, can be reduced, thereby making it easier for fish farmers to monitor aquaculture and mariculture environments.

Knowledge Extraction from Affective Data using Rough Sets Model and Comparison between Rough Sets Theory and Statistical Method (러프집합이론을 중심으로 한 감성 지식 추출 및 통계분석과의 비교 연구)

  • Hong, Seung-Woo;Park, Jae-Kyu;Park, Sung-Joon;Jung, Eui-S.
    • Journal of the Ergonomics Society of Korea
    • /
    • v.29 no.4
    • /
    • pp.631-637
    • /
    • 2010
  • The aim of affective engineering is to develop a new product by translating customer affections into design factors. Affective data have so far been analyzed using a multivariate statistical analysis, but the affective data do not always have linear features assumed under normal distribution. Rough sets model is an effective method for knowledge discovery under uncertainty, imprecision and fuzziness. Rough sets model is to deal with any type of data regardless of their linearity characteristics. Therefore, this study utilizes rough sets model to extract affective knowledge from affective data. Four types of scent alternatives and four types of sounds were designed and the experiment was performed to look into affective differences in subject's preference on air conditioner. Finally, the purpose of this study also is to extract knowledge from affective data using rough sets model and to figure out the relationships between rough sets based affective engineering method and statistical one. The result of a case study shows that the proposed approach can effectively extract affective knowledge from affective data and is able to discover the relationships between customer affections and design factors. This study also shows similar results between rough sets model and statistical method, but it can be made more valuable by comparing fuzzy theory, neural network and multivariate statistical methods.

Comparing Role of Two Chemotherapy Regimens, CMF and Anthracycline-Based, on Breast Cancer Survival in the Eastern Mediterranean Region and Asia by Multivariate Mixed Effects Models: a Meta-Analysis

  • Ghanbari, Saeed;Ayatollahi, Seyyed Mohammad Taghi;Zare, Najaf
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.14
    • /
    • pp.5655-5661
    • /
    • 2015
  • Purpose: To assess the role of two adjuvant chemotherapy regimens, anthracycline-based and CMF on disease free survival and overall survival breast cancer patients by meta-analysis approach in Eastern Mediterranean and Asian countries to determine which is more effective and evaluate the appropriateness and efficiency of two different proposed statistical models. Materials and Methods: Survival curves were digitized and the survival proportions and times were extracted and modeled to appropriate covariates by two multivariate mixed effects models. Studies which reported disease free survival and overall survival curves for anthracycline-based or CMF as adjuvant chemotherapy that were published in English in the Eastern Mediterranean region and Asia were included in this systematic review. The two transformations of survival probabilities (Ln (-Ln(S)) and Ln(S/ (1-S))) as dependent variables were modeled by a multivariate mixed model to same covariates in order to have precise estimations with high power and appropriate interpretation of covariate effects. The analysis was carried out with SAS Proc MIXED and STATA software. Results: A total of 32 studies from the published literature were analysed, covering 4,092 patients who received anthracycline-based and 2,501 treated with CMF for the disease free survival and in order to analyze the overall survival, 13 studies reported the overall survival curves in which 2,050 cases were treated with anthracycline-based and 1,282 with CMF regimens. Conclusions: The findings illustrated that the model with dependent variable Ln (-Ln(S)) had more precise estimations of the covariate effects and showed significant difference between the effects of two adjuvant chemotherapy regimens. Anthracycline-based treatment gave better disease free survival and overall survival. As an IPD meta-analysis in the Italy the results of Angelo et al in 2011 also confirmed that anthracycline-based regimens were more effective for survival of breast cancer patients. The findings of Zare et al 2012 on disease free survival curves in Asia also provided similar evidence.

Robust Design for Multiple Quality Characteristics using Principal Component Analysis

  • Kwon, Yong-Man;Hong, Yeon-Woong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.3
    • /
    • pp.545-551
    • /
    • 2003
  • Robust design is to identify appropriate settings of control factors that make the system's performance robust to changes in the noise factors that represent the source of variation. In this paper we propose how to simultaneously optimize multiple quality characteristics using the principal component analysis of multivariate statistical analysis. An example is illustrated to compare it with already proposed method.

  • PDF

Resistant Principal Factor Analysis

  • Park, Youg-Seok;Byun, Ho-Seon
    • Journal of the Korean Statistical Society
    • /
    • v.25 no.1
    • /
    • pp.67-80
    • /
    • 1996
  • Factor analysis is a multivariate technique for describing the in-terrelationship among many variables in terms of a few underlying but unobservable random variables called factors. There are various approaches for this factor analysis. In particular, principal factor analysis is one of the most popular methods. This follows the mathematical algorithm of the principal component analysis based on the singular value decomposition. But it is known that the singular value decomposition is not resistant, i.e., it is very sensitive to small changes in the input data. In this article, using the resistant singular value decomposition of Choi and Huh (1994), we derive a resistant principal factor analysis relatively little influenced by notable observations.

  • PDF

A Study on Hydrologic Clustering for Standard Watersheds of Korea Water Resources Unit Map Using Multivariate Statistical Analysis (다변량 통계분석기법을 이용한 전국 표준유역 대상 수문학적 군집화 연구)

  • Ahn, So-Ra;Kim, Sang-Ho;Kim, Seong-Joon
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.17 no.1
    • /
    • pp.91-106
    • /
    • 2014
  • This study tries to cluster the 795 standard watersheds of Korea Water Resources Unit Map using multivariate statistical analysis technique. The 30 factors of watershed characteristics related to topography, stream, meteorology, soil, land cover and hydrology were selected for comprehensive analysis. From the factor analysis, 16 representative factors were selected. The significant factors in order were the pedological feature, scale and geological location and meteorological and hydrological features of the watershed. As a next step, the 73 gauged watersheds were selected for cluster analysis. They are scattered properly to the whole country and the discharge data were within a confidential level. Based on the 73 watersheds, the other ungaged watersheds were clustered by applying the 16 factors and calculating Euclidian distances. The clustering results showed that the similarity between standard watersheds within the same river basin were 87%, 69%, 41%, 52%, and 27% for Han, Nakdong, Geum, Seomjin, and Yeongsan river basins respectively.

A Study on the Construction and Analysis of Fractional Designs by Using Arrays for Factorial Experiments (배열을 이용한 효과적인 일부실시법의 설계 및 분석방법에 관한 연구)

  • Kim, Sang-Ik
    • Journal of Korean Society for Quality Management
    • /
    • v.40 no.1
    • /
    • pp.15-24
    • /
    • 2012
  • For the construction of fractional factorial designs, the various arrays can be widely used. In this paper we review the statistical properties of fractional designs constructed by two arrays such as orthogonal array and partially balanced array, and develop a quick and easy method for analyzing unreplicated saturated designs. The proposed method can be characterized that we control the error rate by experiment-wise way and exploit the multivariate Student $t$-distribution. Especially the proposed method can be used efficiently together with some exploratory analysis methods, such as half normal probability plot method.