• 제목/요약/키워드: Statistical Data

검색결과 14,775건 처리시간 0.042초

Exploratory Data Analysis for microarray experiments with replicates

  • Lee, Eun-Kyung;Yi, Sung-Gon;Park, Tae-Sung
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2005년도 추계 학술발표회 논문집
    • /
    • pp.37-41
    • /
    • 2005
  • Exploratory data analysis(EDA) is the initial stage of data analysis and provides a useful overview about the whole microarray experiment. If the experiments are replicated, the analyst should check the quality and reliability of microarray data within same experimental condition before the deeper statistical analysis. We shows EDA method focusing on the quality and reproducibility for replicates.

  • PDF

Robustness, Data Analysis, and Statistical Modeling: The First 50 Years and Beyond

  • Barrios, Erniel B.
    • Communications for Statistical Applications and Methods
    • /
    • 제22권6호
    • /
    • pp.543-556
    • /
    • 2015
  • We present a survey of contributions that defined the nature and extent of robust statistics for the last 50 years. From the pioneering work of Tukey, Huber, and Hampel that focused on robust location parameter estimation, we presented various generalizations of these estimation procedures that cover a wide variety of models and data analysis methods. Among these extensions, we present linear models, clustered and dependent observations, times series data, binary and discrete data, models for spatial data, nonparametric methods, and forward search methods for outliers. We also present the current interest in robust statistics and conclude with suggestions on the possible future direction of this area for statistical science.

A Study for the Features of Data Analysis Methods Used in Medical Research

  • 신재경;장덕준;문승호
    • Journal of the Korean Data and Information Science Society
    • /
    • 제14권2호
    • /
    • pp.257-264
    • /
    • 2003
  • The perception of the importance of statistical methods for processing medical data in Korea's medical research and the practical use of the analysis method are insufficient. From this standpoint, in order to examine the features of the data analysis method used in the medical journals of Korea and America, we have examined the research papers which has been published in the exemplary medical journals of both countries. It showed that there was a large difference in the quantity and quality between Korea and America. Especially in the medical research of Korea, we could notice that the use of statistical methods were comparatively low. Hence the researchers in the medical area are encouraged to use more statistical methods in processing medical data.

  • PDF

Comparative Study on Statistical Packages for using Multivariate Q-technique

  • Choi, Yong-Seok;Moon, Hee-jung
    • Communications for Statistical Applications and Methods
    • /
    • 제10권2호
    • /
    • pp.433-443
    • /
    • 2003
  • In this study, we provide a comparison of multivariate Q-techniques in the up-to-date versions of SAS, SPSS, Minitab and S-plus well known to those who study statistics. We can analyze data through the direct Input method(command) in SAS and use of menu method in SPSS, Minitab and S-plus. The analysis performance method is chosen by the high frequency of use. Widely we compare with each Q-techniques form according to input data, input option, statistical chart and statistical output.

R에 의한 통계그래픽스 : 강의 내용 및 방법의 논의 (Teaching Statistical Graphics using R)

  • 박동련
    • 응용통계연구
    • /
    • 제20권3호
    • /
    • pp.619-634
    • /
    • 2007
  • 자료분석과정에서 그래프의 이용은 필수적이라고 하겠다. 다양하게 개발된 수많은 그래픽 기법들을 적절하게 사용할 수 있다면 한 단계 업그레이드된 통계분석이 가능할 것이며, 이런 면에서 볼 때 통계그래픽스는 통계학을 전공하는 학생들에게 꼭 필요한 강좌라고 할 수 있다. 다양하게 개발된 그래픽 기법의 막강한 파워를 제대로 느끼기 위해서는 적절한 통계 소프트웨어의 선택이 매우 중요한 문제라고 할 수 있는데, 뛰어난 그래픽 기능이 있는 R을 사용하는 것이 효율적으로 다양한 그래픽 기법을 구현할 수 있는 가장 바람직한 선택이라고 하겠다. 이 논문에서는 통계 그래픽스를 R을 이용하여 구현하는 강좌를 개설하고자 하는 경우에 사용할 수 있는 적절한 교과내용을 제안하고, 어떤 방식으로 강의하는 것이 가장 효과적인지에 대한 고민을 함께 해 볼 수 있는 기회를 제공하고자 한다.

Bayesian Methods for Generalized Linear Models

  • Paul E. Green;Kim, Dae-Hak
    • Communications for Statistical Applications and Methods
    • /
    • 제6권2호
    • /
    • pp.523-532
    • /
    • 1999
  • Generalized linear models have various applications for data arising from many kinds of statistical studies. Although the response variable is generally assumed to be generated from a wide class of probability distributions we focus on count data that are most often analyzed using binomial models for proportions or poisson models for rates. The methods and results presented here also apply to many other categorical data models in general due to the relationship between multinomial and poisson sampling. The novelty of the approach suggested here is that all conditional distribution s can be specified directly so that staraightforward Gibbs sampling is possible. The prior distribution consists of two stages. We rely on a normal nonconjugate prior at the first stage and a vague prior for hyperparameters at the second stage. The methods are demonstrated with an illustrative example using data collected by Rosenkranz and raftery(1994) concerning the number of hospital admissions due to back pain in Washington state.

  • PDF

지역통계 데이타베이스 구축및 활용방안 (A Study on Database of Region Statistic and Application)

  • 이희춘;김승구
    • 산업경영시스템학회지
    • /
    • 제19권38호
    • /
    • pp.199-205
    • /
    • 1996
  • The purpose of this study, therefore, was to construct the region statistical information to present methods of the data. The results of this paper are as follows: First, the construction of region statistical data is much in need of utilizing the server of regional information center, or the database to the server of public institutions, Second, there are some difficulties to receive the region statistical data because of only depending on the main source of KOSIS provided by national units from National Statistical Office. Third, as there is another problem which is text searching system served by KOSIS, GU system should be established for the user's satisfaction served by easier accessing screen. Fourth, there should be a standard software production to suit for the accessing software of the region statistical data.

  • PDF

Statistical Evaluation of Fracture Characteristics of RPV Steels in the Ductile-Brittle Transition Temperature Region

  • Kang, Sung-Sik;Chi, Se-Hwan;Hong, Jun-Hwa
    • Nuclear Engineering and Technology
    • /
    • 제30권4호
    • /
    • pp.364-376
    • /
    • 1998
  • The statistical analysis method was applied to the evaluation of fracture toughness in the ductile-brittle transition temperature region. Because cleavage fracture in steel is of a statistical nature, fracture toughness data or values show a similar statistical trend. Using the three-parameter Weibull distribution, a fracture toughness vs. temperature curve (K-curve) was directly generated from a set of fracture toughness data at a selected temperature. Charpy V-notch impact energy was also used to obtain the K-curve by a $K_{IC}$ -CVN (Charpy V-notch energy) correlation. Furthermore, this method was applied to evaluate the neutron irradiation embrittlement of reactor pressure vessel (RPV) steel. Most of the fracture toughness data were within the 95% confidence limits. The prediction of a transition temperature shift by statistical analysis was compared with that from the experimental data.

  • PDF

데이터 마이닝과 통계적 기법을 통합한 최적화 기법 (Optimization Methodology Integrated Data Mining and Statistical Method)

  • 정혜진;송서일
    • 한국품질경영학회:학술대회논문집
    • /
    • 한국품질경영학회 2006년도 추계 학술대회
    • /
    • pp.205-210
    • /
    • 2006
  • Nowaday manufacture technology and manufacture environment are changing rapidly. By development of computer and enlargement of technique, most of manufacture field are computerized. It is measured automatically do much quality characteristics thereby and great many data happen in a day. corporations is important if have gotten fast information that are useful from wide data to go first in international competition according to these change. Statistical process control(SPC) techniques are used as a problem solution tool at manufacturing process until present. However, this statistical methods is not applied more extensively because have much restrictions in realistic problem. In this paper, wish to develop more realistic and scientific new statistical design techniques doing to integrate data mining(DM) and statistical methods by the alternative to cope these problem. First step selects significant factor using DM techniques from datas of manufacturing process including much factors and second step wish to find optimum of process after get the estimated response function through response surf ace methodology(RSM) that is statistical techniques.

  • PDF

한국한의학연구원 논문집에 사용된 통계기법의 평가 (An Evaluation of the Statistical Techniques Used in the 1995-2007 Editions of the Korea Institute of Oriental Medicine)

  • 강경원;강병갑;고미미;신선화;최선미
    • 한국한의학연구원논문집
    • /
    • 제13권2호통권20호
    • /
    • pp.121-125
    • /
    • 2007
  • Background and Purpose : The purpose of this study was done to investigate what kinds of statistical techniques have been used to analyze data from oriental medicine research Methods : 135 original articles which used statistical techniques in their data analysis were selected from the articles published in The Journal of Korea Institute of Oriental Medicine(JKIOM) between 1995 to 2007. Results : Among 135 articles, 59 articles used descriptive statistics while 76 articles used inferential statistics for data analysis. For that 76 articles, two-sample t-test(33 articles), analysis of variance(29 articles), regression(9 articles), chi-square test(5 articles), nonparametic test(4 articles), Fisher's exact test(3 articles), and other test(9 articles) were chosen to analyze the data. SAS and SPSS statistical softwares(82.50%) were mostly used to analyze the data. Nonparametic tests were used to 4 articles(6.97%) of 67 articles and parametic tests were used to 63 articles(93.03%) of 67 articles. Among 29 articles used analysis of variance, duncan(8 articles), dunnet(4 articles), bonferroni(4 articles), turkey(3 articles), scheff(1 article) were used to do multiple comparison. 9 articles did not carry out the multiple comparison. Conclusions : It was found that the frequencies of statistical package used and statistical analysis used were not much by now. High level statistical analyses were not used most for oriental medicine research.

  • PDF