• Title/Summary/Keyword: Statistical graphics

Regular Polyprism Parallel Coordinate Plot as a Statistical Graphics Tool (통계적 그래픽스 도구로서의 정다각기둥평행좌표그림)

  • Jang, Dae-Heung
    • The Korean Journal of Applied Statistics / v.21 no.4 / pp.695-704 / 2008
  • The parallel coordinate plot is a graphical data analysis technique for plotting multivariate data. It overcomes the visualization problem of the Cartesian coordinate system for dimensions greater than 4. However, different orderings of the coordinate axes in a parallel coordinate plot of the same data can lead to different interpretations. The regular polyprism parallel coordinate plot is therefore proposed as an alternative that overcomes this variable arrangement problem.
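
The axis-ordering problem described in this abstract can be illustrated with a short Python sketch. It is not the polyprism method of the paper; it only shows, under the assumption that a parallel coordinate plot directly displays the relationship between variables on neighbouring axes, that two orderings of the same four variables expose different variable pairs. The variable names and the helper functions `adjacent_pairs` and `distinct_orderings` are hypothetical and chosen for illustration.

```python
from itertools import permutations

def adjacent_pairs(order):
    """Return the unordered variable pairs that sit on neighbouring axes."""
    return {frozenset(pair) for pair in zip(order, order[1:])}

def distinct_orderings(variables):
    """List axis orderings, counting an ordering and its mirror image once."""
    seen = set()
    result = []
    for perm in permutations(variables):
        if perm[::-1] in seen:
            continue  # the mirrored plot shows the same adjacent pairs
        seen.add(perm)
        result.append(perm)
    return result

# Hypothetical variable names, used only for illustration.
variables = ["x1", "x2", "x3", "x4"]
order_a = ("x1", "x2", "x3", "x4")
order_b = ("x1", "x3", "x2", "x4")

pairs_a = adjacent_pairs(order_a)
pairs_b = adjacent_pairs(order_b)
shared = pairs_a & pairs_b  # pairs visible in both plots
n_orderings = len(distinct_orderings(variables))

print(sorted(sorted(p) for p in pairs_a))
print(sorted(sorted(p) for p in pairs_b))
print(n_orderings)
```

With four variables there are six possible pairs, yet any single ordering places only three of them on neighbouring axes, which is why the choice of ordering can change what a reader sees.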

A study on rethinking EDA in digital transformation era (DX 전환 환경에서 EDA에 대한 재고찰)

  • Seoung-gon Ko
    • The Korean Journal of Applied Statistics / v.37 no.1 / pp.87-102 / 2024
  • Digital transformation refers to the process by which a company or organization changes or innovates its existing business model or sales activities using digital technology. This requires the use of various digital technologies (cloud computing, IoT, artificial intelligence, etc.) to strengthen competitiveness in the market, improve customer experience, and discover new businesses. In addition, to derive knowledge and insight about the market, customers, and production environment, it is necessary to select the right data, preprocess the data into an analyzable state, and establish a process for systematic analysis suited to the purpose. The usefulness of such digital data depends on careful preprocessing and on the correct application of exploratory data analysis (EDA), which supports the exploration of information and hypotheses and the visualization of knowledge and insights. In this paper, we reexamine the philosophy and basic concepts of EDA and discuss, for effective visualization, key visualization information, information expression methods based on the grammar of graphics, and the ACCENT principle, which serves as the final visualization review standard.

Optimization of Mutual Information for Multiresolution Image Registration (다해상도 영상정합을 위한 상호정보 최적화)

  • Hong, Helen;Kim, Myoung-Hee
    • Journal of the Korea Computer Graphics Society / v.7 no.1 / pp.37-49 / 2001
  • We propose an optimization of mutual information for multiresolution image registration, which integrates the complementary information of multimodality images into a single useful representation. The method uses mutual information as the cost function to measure the statistical dependency, or information redundancy, between the image intensities of corresponding pixels in both images; this quantity is assumed to be maximal when the images are geometrically aligned. In the experiments, we validate accuracy by visual inspection and robustness by changing the initial conditions and adding additive noise. Since our method uses the native images rather than prior feature extraction, little user interaction is required to perform the registration. In addition, applying non-parametric density estimation and stochastic multiresolution optimization leads to robust density estimation and convergence.
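
As a rough illustration of the cost function named in this abstract, the following Python sketch computes mutual information between the intensities of corresponding pixels. It is a simplification and not the authors' implementation: it estimates the joint distribution by plain counting rather than the non-parametric density estimation the abstract mentions, it performs no optimization, and the tiny one-dimensional "images" and the function name `mutual_information` are assumptions made for the example.

```python
from collections import Counter
from math import log2

def mutual_information(a, b):
    """Mutual information (in bits) between two equal-length intensity sequences.

    Probabilities are estimated by counting co-occurring intensities,
    a deliberately simple stand-in for a proper density estimate.
    """
    if len(a) != len(b) or not a:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(a)
    joint = Counter(zip(a, b))   # counts of intensity pairs at the same pixel
    count_a = Counter(a)         # marginal counts for the first image
    count_b = Counter(b)         # marginal counts for the second image
    mi = 0.0
    for (x, y), c in joint.items():
        p_xy = c / n
        # p_xy / (p_x * p_y), written in terms of raw counts
        mi += p_xy * log2(p_xy * n * n / (count_a[x] * count_b[y]))
    return mi

# Hypothetical data: the same structure seen by two modalities with
# different intensity scales, once aligned and once shifted by one pixel.
fixed = [0, 0, 1, 1, 2, 2, 3, 3]
aligned = [10, 10, 20, 20, 30, 30, 40, 40]
shifted = aligned[-1:] + aligned[:-1]

mi_aligned = mutual_information(fixed, aligned)
mi_shifted = mutual_information(fixed, shifted)
print(mi_aligned, mi_shifted)
```

In this toy case the aligned pair yields a higher value than the shifted pair, which matches the assumption stated in the abstract that mutual information is maximal when the images are geometrically aligned.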

A Statistical Program for Measurement Process Capability Analysis based on KS Q ISO 22514-7 Using R (R을 이용한 KS Q ISO 22514-7 측정 프로세스 능력 분석용 프로그램)

  • Lee, Seung-Hoon;Lim, Keun
    • Journal of Korean Society for Quality Management / v.47 no.4 / pp.713-723 / 2019
  • Purpose: The purpose of this study is to develop a statistical program for capability analysis of measuring systems and measurement processes based on KS Q ISO 22514-7. Methods: R is a powerful open source functional programming language that provides high-level graphics and interfaces to other languages, so the statistical program in this study is developed in R. Results: The R program developed in this study consists of the following five modules. ① Measuring system capability analysis with Type 1 study data: MSCA_Type1.R ② Measuring system capability analysis with Linearity study (Type 4 study) data: MSCA_Type4.R ③ Measurement process capability analysis with Type 1 study & Gage R&R study data: MPCA_T1GRR.R ④ Measurement process capability analysis with Type 4 study & Gage R&R study data: MPCA_T4GRR.R ⑤ Attribute measurement process capability analysis: AttributeMP.R Conclusion: KS Q ISO 22514-7 evaluates measuring systems and measurement processes on the basis of the measurement uncertainty determined according to the GUM (KS Q ISO/IEC Guide 98-3). KS Q ISO 22514-7 offers precise procedures; however, the computations are more intensive. The R program of this study will help to evaluate the measurement process.

Outlier Detection Using Dynamic Plots (동적 그림을 이용한 이상치 검색)

  • Ahn, Byung-Jin;Seo, Han-Son
    • The Korean Journal of Applied Statistics / v.24 no.5 / pp.979-986 / 2011
  • Linear regression is commonly used to analyze data because of its simplicity and applicability; however, it is well known that data may contain outliers and influential cases that can harm a statistical analysis. Detecting and examining outliers and influential cases is therefore an important part of data analysis. When multiple outliers are present, masking effects usually occur and make it difficult to identify the true outliers. We propose dynamic plots as a method that is resistant to the masking effect. The procedure using dynamic plots is useful for finding an appropriate basic set from which a dependent outlier detection method can start, and for detecting the true set of outliers. Examples are given to demonstrate the effectiveness of the suggested idea.

Dynamic graphic approach for regression diagnostics system (REDS) (동적그래픽스에 의한 회귀진단시스템(REDS)의 구현)

  • 유종영;안기수;허문열
    • The Korean Journal of Applied Statistics / v.10 no.2 / pp.241-251 / 1997
  • Several studies have been done on dynamic graphical methods for regression diagnostics. The main purpose of these methods was to investigate (1) the effects of changes in the data, or (2) the effects of changes in the regression coefficients on the regression model. By contrast, we can also investigate the effects of changes in the regression residuals on the regression model. This approach can be used to fit a certain set of observations to a regression model better than the other observations. Our research team approaches regression diagnostics by using dynamic graphics (REDS), and we introduce REDS in this paper.

Nonparametric estimation of the derivative of function via the Bezier curve (베지에 곡선을 이용한 함수의 미분에 대한 비모수적 추정)

  • 김충락;정미선;김형순
    • The Korean Journal of Applied Statistics / v.11 no.1 / pp.193-204 / 1998
  • We quite often have to estimate the derivative of a regression function. The Bezier curve, little known to statisticians, is very popular in computer graphics. In this paper, we use a nonparametric method based on the Bezier curve and apply it to a real data set. The method is very easy to compute and can easily be applied to other smoothing techniques.
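
The property that makes the Bezier curve convenient for derivative estimation can be sketched in a few lines of Python. This is not the estimator of the paper, whose details are not given in the abstract; it only demonstrates the standard identity that the derivative of a degree-n Bezier curve is itself a Bezier curve of degree n-1 whose control values are n times the successive differences of the original ones. The function names and the control values are assumptions made for the example.

```python
from math import comb

def bezier(control, t):
    """Evaluate a Bezier curve with scalar control values at t in [0, 1]."""
    n = len(control) - 1
    return sum(
        comb(n, i) * (1 - t) ** (n - i) * t ** i * c
        for i, c in enumerate(control)
    )

def bezier_derivative(control, t):
    """Evaluate the derivative with respect to t of the same Bezier curve.

    Uses the identity B'(t) = Bezier(n * (c[i+1] - c[i]), t), a curve of
    one degree lower built from scaled differences of control values.
    """
    n = len(control) - 1
    diffs = [n * (control[i + 1] - control[i]) for i in range(n)]
    return bezier(diffs, t)

# Hypothetical control values: with (0, 0, 1) the curve equals t**2,
# so its derivative should equal 2*t.
control = [0.0, 0.0, 1.0]
value_at_half = bezier(control, 0.5)
slope_at_half = bezier_derivative(control, 0.5)
print(value_at_half, slope_at_half)
```

Because the derivative is obtained from differences of the control values alone, no separate smoothing step is needed to differentiate the curve, which is consistent with the abstract's remark that the method is easy to compute.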

QCanvas: An Advanced Tool for Data Clustering and Visualization of Genomics Data

  • Kim, Nayoung;Park, Herin;He, Ningning;Lee, Hyeon Young;Yoon, Sukjoon
    • Genomics & Informatics / v.10 no.4 / pp.263-265 / 2012
  • We developed a user-friendly, interactive program to simultaneously cluster and visualize omics data, such as DNA and protein array profiles. This program provides diverse algorithms for the hierarchical clustering of two-dimensional data. The clustering results can be interactively visualized and optimized on a heatmap. The present tool does not require any prior knowledge of scripting languages to carry out the data clustering and visualization. Furthermore, the heatmaps allow the selective display of data points satisfying user-defined criteria. For example, a clustered heatmap of experimental values can be differentially visualized based on statistical values, such as p-values. Including diverse menu-based display options, QCanvas provides a convenient graphical user interface for pattern analysis and visualization with high-quality graphics.

Moving Data Pictures (움직이는 데이터 그림)

  • Huh, Myung-Hoe
    • The Korean Journal of Applied Statistics / v.26 no.6 / pp.999-1007 / 2013
  • This research shows several types of moving pictures made from data: 1) the word cloud of Korean texts, 2) the heat map of $n \times p$ matrices, 3) the moving image of a $p \times p$ scatterplot matrix, 4) the local projective display of k clusters (Huh and Lee, 2012). Moving pictures may reveal the hidden information and beauty of the datasets and ignite the curiosity of information consumers. Video files are attached.

Hangul Component Decomposition in Outline Fonts (한글 외곽선 폰트의 자소 분할)

  • Koo, Sang-Ok;Jung, Soon-Ki
    • Journal of the Korea Computer Graphics Society / v.17 no.4 / pp.11-21 / 2011
  • This paper proposes a method for decomposing a Hangul glyph in an outline font into its initial, medial, and final components using statistical-structural information. Within a font family, the positions of components are statistically consistent, and the stroke relationships of a Hangul character reflect its structure. First, we create component histograms that accumulate the shapes and positions of identical components. Second, we form pixel clusters from the character image based on pixel direction probabilities and extract candidate strokes using the position, direction, and size of the clusters and the adjacencies between them. Finally, we find the best structural match between the candidate strokes and a predefined character model by relaxation labeling. The proposed method can be used to study the formative characteristics of Hangul fonts and to build font classification and retrieval systems.