• Title/Summary/Keyword: Biplots

Search Result 14, Processing Time 0.018 seconds

Comparison of Shape Variability in Principal Component Biplot with Missing Values

  • Shin, Sang-Min;Choi, Yong-Seok;Lee, Nae-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.6
    • /
    • pp.1109-1116
    • /
    • 2008
  • Biplots are the multivariate analogue of scatter plots. They are useful for giving a graphical description of the data matrix, for detecting patterns and for displaying results found by more formal methods of analysis. Nevertheless, when some values are missing in data matrix, most biplots are not directly applicable. In particular, we are interested in the shape variability of principal component biplot which is the most popular in biplots with missing values. For this, we estimate the missing data using the EM algorithm and mean imputation according to missing rates. Even though we estimate missing values of biplot of incomplete data, we have different shapes of biplots according to the imputation methods and missing rates. Therefore we propose a RMS(root mean square) for measuring and comparing the shape variability between the original biplots and the estimated biplots.

The Development of Biplots System (행렬도 시스템(BIPLOTS SYSTEM)의 개발)

  • 최용석;현기홍
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.297-306
    • /
    • 2000
  • Many users have made the most often use of the SAS system in statistical data analysis all over the world. But it is difficult to use the grand procedures and language of the SAS system. Therefore the side of program development has changed in the graphic-oriented and menu-centered way like SASj ASSIST, SASjINSICHT after a version 6.08 turned into the Window environment. A biplots is a multivariate data analysis technique that graphically describes both relationships among the multidimensional observations and relationships among the variables. But there were not the procedure and graphic interface for a biplots algorithm in the SAS system. In this paper, there are two objects. First, we supply users with the convenience of the environment of CLI, which is constructed with SASj AF and SCL, to solve the problem that we have programed a biplots algorithm with the SASjIML one by one. Second, we reflect the current of the Information Age which means the spread of various kinds of system construction to extract useful information from data with the help of the development of hardware and software.

  • PDF

Applications of Cluster Analysis in Biplots (행렬도에서 군집분석의 활용)

  • Choi, Yong-Seok;Kim, Hyoung-Young
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.1
    • /
    • pp.65-76
    • /
    • 2008
  • Biplots are the multivariate analogue of scatter plots. They approximate the multivariate distribution of a sample in a few dimensions, typically two, and they superimpose on this display representations of the variables on which the samples are measured(Gower and Hand, 1996, Chapter 1). And the relationships between the observations and variables can be easily seen. Thus, biplots are useful for giving a graphical description of the data. However, this method does not give some concise interpretations between variables and observations when the number of observations are large. Therefore, in this study, we will suggest to interpret the biplot analysis by applying the K-means clustering analysis. It shows that the relationships between the clusters and variables can be easily interpreted. So, this method is more useful for giving a graphical description of the data than using raw data.

A Study on Shape Variability in Canonical Correlation Biplot with Missing Values (결측값이 있는 정준상관 행렬도의 형상변동 연구)

  • Hong, Hyun-Uk;Choi, Yong-Seok;Shin, Sang-Min;Ka, Chang-Wan
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.5
    • /
    • pp.955-966
    • /
    • 2010
  • Canonical correlation biplot is a useful biplot for giving a graphical description of the data matrix which consists of the association between two sets of variables, for detecting patterns and displaying results found by more formal methods of analysis. Nevertheless, when some values are missing in data, most biplots are not directly applicable. To solve this problem, we estimate the missing data using the median, mean, EM algorithm and MCMC imputation methods according to missing rates. Even though we estimate the missing values of biplot of incomplete data, we have different shapes of biplots according to the imputation methods and missing rates. Therefore we use a RMS(root mean square) which was proposed by Shin et al. (2007) and PS(procrustes statistic) for measuring and comparing the shape variability between the original biplots and the estimated biplots.

Biplot method algorithm and application in tire engineering (Biplot 이론과 타이어 제조공학에의 응용)

  • 조완현
    • The Korean Journal of Applied Statistics
    • /
    • v.9 no.2
    • /
    • pp.55-72
    • /
    • 1996
  • It is essential in modern industry that quality and procuctivity are improved continuously. To accomplish this purpose, quality control must be maintained in all parts of a company. Recently, some tire manufacture companies are beginning to show interest in quality control. They have tried to achive some results through the statistical analysis for the experimental data which has accumulated up to now and then they strive to determine the structural relationship between the design factors in tire construction and tire performance characteristics. The measurement data obtained from the construction engineering is given in multivariate form owing to the various properties found in tire design components as wll as in performance. Also it may be existed the relationship among the multimple response variables. Thus we proposes the use of the biplot graphical display as an analytic tool of data matrices with complex respects. The proposed biplots are also availalbe to understand both the underlying structure of the data and the roles played by the different components. In particular, we consider the matter of how best to use the biplots in the maltivariate analysis of variance and multiple response data. Finally we apply this method to analyze the actual data.

  • PDF

Biplots of Multivariate Data Guided by Linear and/or Logistic Regression

  • Huh, Myung-Hoe;Lee, Yonggoo
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.2
    • /
    • pp.129-136
    • /
    • 2013
  • Linear regression is the most basic statistical model for exploring the relationship between a numerical response variable and several explanatory variables. Logistic regression secures the role of linear regression for the dichotomous response variable. In this paper, we propose a biplot-type display of the multivariate data guided by the linear regression and/or the logistic regression. The figures show the directional flow of the response variable as well as the interrelationship of explanatory variables.

Robust Simple Correspondence Analysis

  • Park, Yong-Seok;Huh, Myung-Hoe
    • Journal of the Korean Statistical Society
    • /
    • v.28 no.3
    • /
    • pp.337-346
    • /
    • 1999
  • Simple correspondence analysis is a technique for giving a joint display of points representing both the rows and columns of an n$\times$p two-way contigency table. In simple correspondence analysis, the singular value decomposition is the main algebraic tool. But, Choi and Huh(1996) pointed out the singular value decomposition is not robust. Instead, they developed a robust singular value decomposition and provided applications in principal component analysis and biplots. In this article, by using the analogous procedures of Choi and Huh(1996), we derive a robust version of simple correspondence analysis.

  • PDF

Local Projective Display of Multivariate Numerical Data

  • Huh, Myung-Hoe;Lee, Yong-Goo
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.4
    • /
    • pp.661-668
    • /
    • 2012
  • For displaying multivariate numerical data on a 2D plane by the projection, principal components biplot and the GGobi are two main tools of data visualization. The biplot is very useful for capturing the global shape of the dataset, by representing $n$ observations and $p$ variables simultaneously on a single graph. The GGobi shows a dynamic movie of the images of $n$ observations projected onto a sequence of unit vectors floating on the $p$-dimensional sphere. Even though these two methods are certainly very valuable, there are drawbacks. The biplot is too condensed to describe the detailed parts of the data, and the GGobi is too burdensome for ordinary data analyses. In this paper, "the local projective display(LPD)" is proposed for visualizing multivariate numerical data. Main steps of the LDP are 1) $k$-means clustering of the data into $k$ subsets, 2) drawing $k$ principal components biplots of individual subsets, and 3) sequencing $k$ plots by Hurley's (2004) endlink algorithm for cognitive continuity.

A Study of Applications of Sequential Biplots in Multiresponse Data (다중반응치 자료에 대한 순차적 BIPLOT활용에 대한 연구)

  • 장대흥
    • The Korean Journal of Applied Statistics
    • /
    • v.11 no.2
    • /
    • pp.451-459
    • /
    • 1998
  • The analysis of data from a multiresponse experiment requires careful consideration of the multivariate nature of the data. In a multiresponse sitation, the optimization problem is more complex than in the single response case. The biplot is a graphical tool which make the analyst to understand the correlation of the response variables, the relation of the response variables arid the explanatory variables and the relative importance of the explanatory variables. In case of good fitting of the first order model, we can draw the biplot with the first order experimental design. Otherwise, we can make the biplot with the second order experimental design by adding other experimental points.

  • PDF

Semi-Partial Canonical Correlation Biplot

  • Lee, Bo-Hui;Choi, Yong-Seok;Shin, Sang-Min
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.3
    • /
    • pp.521-529
    • /
    • 2012
  • Simple canonical correlation biplot is a graphical method to investigate two sets of variables and observations in simple canonical correlation analysis. If we consider the set of covariate variables that linearly affects two sets of variables, we can apply the partial canonical correlation biplot in partial canonical correlation analysis that removes the linear effect of the set of covariate variables on two sets of variables. On the other hand, we consider the set of covariate variables that linearly affect one set of variables but not the other. In this case, if we apply the simple or partial canonical correlation biplot, we cannot clearly interpret other two sets of variables. Therefore, in this study, we will apply the semi-partial canonical correlation analysis of Timm (2002) and remove the linear effect of the set of covariate variables on one set of variables but not the other. And we suggest the semi-partial canonical correlation biplot for interpreting the semi-partial canonical correlation analysis. In addition, we will compare shapes and shape the variabilities of the simple, partial and semi-partial canonical correlation biplots using a procrustes analysis.