• Title/Abstract/Keywords: statistical data analysis

가이드 맵과 인터랙티브 시각화를 이용한 의료 통계분석 시스템 (A System for Medical Statistical Analysis Using Guide Maps and Interactive Visualization)

  • 이돈수;최수미
    • 한국멀티미디어학회논문지
    • /
    • Vol. 8, No. 7
    • /
    • pp.1000-1011
    • /
    • 2005
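  • This paper develops a medical statistical analysis system for clinicians with limited statistical knowledge, so that they can analyze data more easily and accurately. The system suggests an appropriate analysis method according to the distribution of the sample data and provides a guide map that represents the analysis process as a tree of icons. It includes analysis methods frequently used in the medical field, such as commonly used statistical methods, statistical methods for repeated-measures data, and survival analysis. In addition, it presents results interactively using 3D glyphs and visualizes uncertainty, making the analysis results easier to understand.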

An Identification of Outlying Cells in Contingency Table via Correspondence Analysis Map

  • Hong, Chong Sun;Lee, Jong Cheol
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 8, No. 1
    • /
    • pp.39-49
    • /
    • 2001
  • When an appropriate model is fitted to explain categorical data, outlying cell detection plays a very important role in reducing the lack of fit. Many statistical methods exist to identify outlying cells in a contingency table. In this paper, correspondence analysis is applied to identify one or two outlying cells. By exploring the correspondence between the row and column categories, we find that outlying cells can be identified via the correspondence analysis map.

수질자료의 추세분석을 위한 비모수적 통계검정에 관한 연구 (A Study of Non-parametric Statistical Tests to Analyze Trend in Water Quality Data)

  • 이상훈
    • 환경영향평가
    • /
    • Vol. 4, No. 2
    • /
    • pp.93-103
    • /
    • 1995
  • This study was carried out to suggest the best statistical test for analyzing the trend in monthly water quality data. Traditional parametric tests such as the t-test and regression analysis assume that the underlying population has a normal distribution, and regression analysis additionally assumes that residual errors are independent. An analysis of nine years of monthly COD data collected at Paldang on the Han River showed that the underlying population was neither normal nor independent. Parametric tests are therefore invalid for trend detection in these data. Four kinds of nonparametric statistical tests (the Run test, the Daniel test, the Mann-Kendall test, and Time Series Residual Analysis) were applied to analyze the trend in the COD data. The Daniel test and the Mann-Kendall test indicated an upward trend in the COD data. The Daniel test was suggested as the best nonparametric test because it is simple to compute and its meaning is intuitively easy to understand.
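
As a rough illustration of one of the tests named in this abstract, the following is a minimal sketch of the Mann-Kendall trend test in Python. It is not the author's procedure: it assumes no tied values, applies no correction for seasonality or serial correlation, and uses the normal approximation for the p-value. The function name `mann_kendall` and the sample series are hypothetical; the series is made up and is not the Paldang COD data.

```python
import math


def mann_kendall(values):
    """Return (S, Z, two-sided p-value) for the Mann-Kendall trend test.

    Simplifying assumptions: no tie correction, no seasonal or
    autocorrelation adjustment, normal approximation for the p-value.
    """
    n = len(values)
    # S = (number of increasing pairs) - (number of decreasing pairs),
    # taken over all pairs (i, j) with i earlier than j.
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = values[j] - values[i]
            s += (diff > 0) - (diff < 0)
    # Variance of S under the no-trend hypothesis, without ties.
    var_s = n * (n - 1) * (2 * n + 5) / 18.0
    # Continuity-corrected standard normal statistic.
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    # Two-sided p-value: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2))
    return s, z, p


# Made-up illustrative series (hypothetical values).
series = [2.1, 2.3, 2.2, 2.6, 2.8, 2.7, 3.0, 3.2, 3.1, 3.5]
s, z, p = mann_kendall(series)
print(s, round(z, 3), p)
```

A positive S with a small p-value points to an upward trend. For monthly data with seasonality or serial dependence, as the abstract reports for the COD series, a seasonal or autocorrelation-adjusted variant would likely be more appropriate than this plain version.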

R에 의한 통계그래픽스 : 강의 내용 및 방법의 논의 (Teaching Statistical Graphics using R)

  • 박동련
    • 응용통계연구
    • /
    • Vol. 20, No. 3
    • /
    • pp.619-634
    • /
    • 2007
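  • The use of graphs is essential in data analysis. Appropriate use of the many graphical techniques that have been developed makes a higher level of statistical analysis possible, and in this respect statistical graphics is an essential course for students majoring in statistics. To appreciate the full power of these techniques, the choice of suitable statistical software is very important, and R, with its excellent graphics capabilities, is the most desirable choice for implementing a variety of graphical techniques efficiently. This paper proposes suitable course content for those who wish to open a course that implements statistical graphics in R, and offers an opportunity to consider which teaching methods are most effective.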

Cubic normal distribution and its significance in structural reliability

  • Zhao, Yan-Gang;Lu, Zhao-Hui
    • Structural Engineering and Mechanics
    • /
    • Vol. 28, No. 3
    • /
    • pp.263-280
    • /
    • 2008
  • Information on the distribution of the basic random variable is essential for the accurate analysis of structural reliability. The usual method for determining the distribution is to fit a candidate distribution to the histogram of the available statistical data of the variable and perform approximate goodness-of-fit tests. Generally, such a candidate distribution has parameters that can be evaluated from the statistical moments of the data. In the present paper, a cubic normal distribution, whose parameters are determined using the first four moments of the available sample data, is investigated. A parameter table based on the first four moments, which simplifies parameter estimation, is given. The simplicity, generality, flexibility and advantages of this distribution in statistical data analysis and its significance in structural reliability evaluation are discussed. Numerical examples are presented to demonstrate these advantages.

Symbolic Cluster Analysis for Distribution Valued Dissimilarity

  • Matsui, Yusuke;Minami, Hiroyuki;Misuta, Masahiro
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 21, No. 3
    • /
    • pp.225-234
    • /
    • 2014
  • We propose a novel hierarchical clustering method for distribution valued dissimilarities. Analysis of large and complex data has attracted significant interest. Symbolic Data Analysis (SDA), proposed by Diday in the 1980s, provides a new framework for statistical analysis. In SDA, we analyze an object with internal variation, such as an interval, a histogram or a distribution, called a symbolic object. In this study, we focus on cluster analysis for distribution valued dissimilarities, one kind of symbolic object. Hierarchical clustering generally has two steps: a find-out step and an update step. In the find-out step, we find the nearest pair of clusters. We extend this step to distribution valued dissimilarities by introducing a measure on their order relations. In the update step, dissimilarities between clusters are redefined by a mixture of distributions with a mixing ratio. We show an actual example of the proposed method and a simulation study.
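
The two-step loop described in this abstract can be sketched for the ordinary case of scalar dissimilarities. The Python sketch below is not the authors' method: it uses plain numbers rather than distribution valued dissimilarities, and its update step is a cluster-size-weighted average, chosen here only as a scalar stand-in for the mixing ratio. The function name `agglomerate` and the sample points are hypothetical.

```python
def agglomerate(dissim, n_clusters=1):
    """Agglomerative clustering on a symmetric scalar dissimilarity matrix.

    dissim[i][j] is the dissimilarity between items i and j.
    Returns the merge history as (members_a, members_b, dissimilarity).
    """
    clusters = {i: (i,) for i in range(len(dissim))}
    # Pairwise dissimilarities keyed by (smaller id, larger id).
    d = {(i, j): dissim[i][j] for i in clusters for j in clusters if i < j}
    history = []
    next_id = len(dissim)
    while len(clusters) > n_clusters:
        # Find-out step: locate the nearest pair of clusters.
        (a, b), best = min(d.items(), key=lambda kv: kv[1])
        history.append((clusters[a], clusters[b], best))
        # Update step: redefine dissimilarities to the merged cluster as a
        # size-weighted average (scalar stand-in for a mixing ratio).
        na, nb = len(clusters[a]), len(clusters[b])
        new = {}
        for c in clusters:
            if c in (a, b):
                continue
            dac = d[(min(a, c), max(a, c))]
            dbc = d[(min(b, c), max(b, c))]
            new[c] = (na * dac + nb * dbc) / (na + nb)
        merged = clusters[a] + clusters[b]
        # Drop every entry that involves either of the merged clusters.
        d = {k: v for k, v in d.items() if a not in k and b not in k}
        del clusters[a], clusters[b]
        for c, v in new.items():
            d[(c, next_id)] = v  # existing ids are always below next_id
        clusters[next_id] = merged
        next_id += 1
    return history


# Hypothetical example: four points on a line, absolute differences.
points = [0, 1, 5, 7]
dissim = [[abs(p - q) for q in points] for p in points]
history = agglomerate(dissim)
for members_a, members_b, dist in history:
    print(members_a, members_b, dist)
```

With scalar values, the nearest pair is simply the minimum entry. For distribution valued dissimilarities, that comparison is not directly available, which is why the abstract introduces a measure on their order relations.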

Cluster Analysis of Car Parking Data, and Development of their Web Applications

  • Kubota, Takafumi;Hayashi, Takayuki;Tarumi, Tomoyuki
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 18, No. 4
    • /
    • pp.549-557
    • /
    • 2011
  • In this paper, we apply cluster analysis to the "Okayama parking data", a spatial point pattern data set that includes the locations and fare structures of car parking spaces in the Okayama central area. This study classifies the characteristics of small areas using the Okayama parking data and visualizes the results of the cluster analysis. We develop web applications that connect the results of the cluster analysis with overlay objects, including balloon points and small-area rectangles, on a map of the Okayama central area.

웹기반 임상자료의 동적 통계분석 시스템 개발 (Development of web-based system for dynamic statistical analysis of clinical data)

  • 신임희;곽상규;박전우
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 25, No. 1
    • /
    • pp.27-36
    • /
    • 2014
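  • In many application areas, statistical analysis is used to obtain information that supports decision making. However, PC-based statistical analysis programs impose a financial burden and are restricted in time and location. To minimize these limitations, web-based systems have been developed that either use a statistical analysis program on a server PC over the Internet or make a statistical analysis program available through a web browser. Existing web-based systems, however, either required a specific statistical analysis program or worked only on data stored on the server. When data were modified or newly created, statistical analysis was possible only after the server administrator uploaded the data again. To improve on this, we developed a web-based system that also supports statistical analysis of dynamic data, using languages used on the web such as HTML, Java, and JSP.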

Receiver Operating Characteristic Analysis by Data Mining

  • 이성원;이제영
    • 한국통계학회:학술대회논문집
    • /
    • Proceedings of the Korean Statistical Society 2001 Autumn Conference
    • /
    • pp.195-197
    • /
    • 2001
  • Data mining is used to discover patterns and relationships in huge amounts of data. Researchers in many different fields have shown great interest in data mining analysis. Using the classification technique of data mining analysis, we present an available model for the Receiver Operating Characteristic (ROC) method. We suggest that this may help in analyzing the results of data mining techniques.

국내정수장의 잔류염소농도에 대한 조사연구 (Statistical Analysis of Chlorine Residual in Korean Drinking Water)

  • 손진식;강효순
    • 상하수도학회지
    • /
    • Vol. 20, No. 2
    • /
    • pp.281-287
    • /
    • 2006
  • Maintaining an adequate chlorine residual is crucial in water treatment facilities. The Treatment Technique, a newly promulgated regulation, requires sufficient disinfection to control more resistant microorganisms such as viruses and Giardia lamblia. Each water treatment plant must report various water quality measures, including chlorine residual and disinfection by-products, so a large amount of data has been generated. Although statistical analysis of these data is needed to investigate the status and effects of water quality in water facilities, very little research has been performed in Korea. This study performed a statistical analysis of three years of chlorine residual data for Korean drinking water. The average chlorine residual concentrations were 0.701 mg/L, 0.738 mg/L, and 0.763 mg/L in 2002, 2003, and 2004, respectively. Monthly variation in chlorine residual was not significant. The ANOVA results showed that the yearly variation in chlorine residual differed only for plants with a water treatment capacity of less than $5000\,m^3/day$. Such statistical analysis can help the government establish new regulations on a scientific basis.
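
The abstract does not describe the ANOVA design, so the following Python sketch only illustrates the general idea under an assumption: a one-way ANOVA that compares chlorine residual across years. The function name `one_way_anova` and the per-year values are hypothetical and are not the study's data. Only the F statistic and its degrees of freedom are computed; a p-value would need an F-distribution routine from a statistics library.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    groups: list of lists, one list of observations per group.
    """
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    # Between-group sum of squares: spread of group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Within-group sum of squares: spread of observations around their group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df_between, df_within = k - 1, n - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical chlorine residual readings in mg/L, grouped by year.
by_year = [
    [0.68, 0.70, 0.72],  # year 1 (made-up values)
    [0.72, 0.74, 0.76],  # year 2 (made-up values)
    [0.74, 0.76, 0.78],  # year 3 (made-up values)
]
f_stat, df_between, df_within = one_way_anova(by_year)
print(round(f_stat, 3), df_between, df_within)
```

A large F relative to the F distribution with (df_between, df_within) degrees of freedom suggests that the yearly means differ. Repeating the comparison within each treatment-capacity class would mirror the kind of result the abstract reports.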