• Title/Summary/Keyword: Statistical data

Exploratory Data Analysis for microarray experiments with replicates

  • Lee, Eun-Kyung;Yi, Sung-Gon;Park, Tae-Sung
    • Proceedings of the Korean Statistical Society Conference / 2005.11a / pp.37-41 / 2005
  • Exploratory data analysis (EDA) is the initial stage of data analysis and provides a useful overview of the whole microarray experiment. If the experiments are replicated, the analyst should check the quality and reliability of the microarray data within the same experimental condition before any deeper statistical analysis. We show EDA methods focusing on the quality and reproducibility of replicates.

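A minimal sketch of the kind of replicate check the abstract above describes, assuming that reproducibility is judged by the pairwise Pearson correlation of log-intensities between replicate arrays. The abstract does not say which diagnostics the authors use, so the function names, the choice of correlation, and the data are all hypothetical.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def replicate_correlations(replicates):
    """Correlation for every pair of replicates.

    `replicates` maps a replicate name to its log2 intensities,
    with genes in the same order in every list.
    """
    return {
        (a, b): pearson(replicates[a], replicates[b])
        for a, b in combinations(sorted(replicates), 2)
    }


# Made-up log2 intensities for five genes; rep3 is deliberately discordant.
replicates = {
    "rep1": [8.1, 9.4, 7.2, 10.5, 6.8],
    "rep2": [8.0, 9.6, 7.1, 10.4, 6.9],
    "rep3": [9.9, 7.0, 8.5, 7.3, 9.1],
}

corr = replicate_correlations(replicates)
for pair, r in corr.items():
    print(pair, round(r, 3))
```

A replicate whose correlations with all the others are low, like rep3 here, would be a candidate for closer inspection before further analysis.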

Robustness, Data Analysis, and Statistical Modeling: The First 50 Years and Beyond

  • Barrios, Erniel B.
    • Communications for Statistical Applications and Methods / v.22 no.6 / pp.543-556 / 2015
  • We present a survey of contributions that defined the nature and extent of robust statistics over the last 50 years. Starting from the pioneering work of Tukey, Huber, and Hampel, which focused on robust estimation of location parameters, we present various generalizations of these estimation procedures that cover a wide variety of models and data analysis methods. Among these extensions, we present linear models, clustered and dependent observations, time series data, binary and discrete data, models for spatial data, nonparametric methods, and forward search methods for outliers. We also present the current interest in robust statistics and conclude with suggestions on the possible future direction of this area for statistical science.

A Study for the Features of Data Analysis Methods Used in Medical Research

  • Sin, Jae-Gyeong;Jang, Deok-Jun;Mun, Seung-Ho
    • Journal of the Korean Data and Information Science Society / v.14 no.2 / pp.257-264 / 2003
  • In Korea's medical research, both the perceived importance of statistical methods for processing medical data and the practical use of such methods are insufficient. From this standpoint, to examine the features of the data analysis methods used in the medical journals of Korea and America, we examined research papers published in exemplary medical journals of both countries. The results showed a large difference in quantity and quality between Korea and America. In Korean medical research in particular, the use of statistical methods was comparatively low. Hence researchers in the medical area are encouraged to use more statistical methods in processing medical data.


Comparative Study on Statistical Packages for using Multivariate Q-technique

  • Choi, Yong-Seok;Moon, Hee-jung
    • Communications for Statistical Applications and Methods / v.10 no.2 / pp.433-443 / 2003
  • In this study, we compare multivariate Q-techniques in the up-to-date versions of SAS, SPSS, Minitab and S-plus, which are well known to those who study statistics. Data can be analyzed through direct command input in SAS and through menus in SPSS, Minitab and S-plus. The analysis method used in each package was chosen according to how frequently it is used. We broadly compare the Q-techniques of each package in terms of input data, input options, statistical charts and statistical output.

Teaching Statistical Graphics using R (R에 의한 통계그래픽스 : 강의 내용 및 방법의 논의)

  • Park, Dong-Ryeon
    • The Korean Journal of Applied Statistics / v.20 no.3 / pp.619-634 / 2007
  • It is well known that graphical display is critical to data analysis. Much research on data visualization has been done, so many effective graphical tools are now available. With proper use of these tools, we can easily penetrate the complex structure of a data set. To enjoy the benefits of powerful graphical display, the choice of statistical software is crucial. R is a popular open source software tool for statistical analysis and graphics, and provides very powerful graphics facilities. Moreover, many researchers believe that R is the best software for statistical graphics. In this paper, we discuss what we teach and how we teach it in a statistical graphics course using R.

Bayesian Methods for Generalized Linear Models

  • Paul E. Green;Kim, Dae-Hak
    • Communications for Statistical Applications and Methods / v.6 no.2 / pp.523-532 / 1999
  • Generalized linear models have various applications for data arising from many kinds of statistical studies. Although the response variable is generally assumed to be generated from a wide class of probability distributions, we focus on count data, which are most often analyzed using binomial models for proportions or Poisson models for rates. The methods and results presented here also apply to many other categorical data models in general, due to the relationship between multinomial and Poisson sampling. The novelty of the approach suggested here is that all conditional distributions can be specified directly, so that straightforward Gibbs sampling is possible. The prior distribution consists of two stages. We rely on a normal nonconjugate prior at the first stage and a vague prior for hyperparameters at the second stage. The methods are demonstrated with an illustrative example using data collected by Rosenkranz and Raftery (1994) concerning the number of hospital admissions due to back pain in Washington State.


A Study on Database of Region Statistic and Application (지역통계 데이타베이스 구축및 활용방안)

  • 이희춘;김승구
    • Journal of Korean Society of Industrial and Systems Engineering / v.19 no.38 / pp.199-205 / 1996
  • The purpose of this study, therefore, was to construct regional statistical information and to present methods for using the data. The results of this paper are as follows. First, constructing regional statistical data requires utilizing the server of a regional information center or linking the database to the servers of public institutions. Second, regional statistical data are difficult to obtain because the only main source is KOSIS, which the National Statistical Office provides in national units. Third, because the text search system served by KOSIS is a further problem, a GU system with easier-to-access screens should be established for user satisfaction. Fourth, standard software should be produced to serve as the access software for regional statistical data.


Statistical Evaluation of Fracture Characteristics of RPV Steels in the Ductile-Brittle Transition Temperature Region

  • Kang, Sung-Sik;Chi, Se-Hwan;Hong, Jun-Hwa
    • Nuclear Engineering and Technology / v.30 no.4 / pp.364-376 / 1998
  • A statistical analysis method was applied to the evaluation of fracture toughness in the ductile-brittle transition temperature region. Because cleavage fracture in steel is of a statistical nature, fracture toughness data or values show a similar statistical trend. Using the three-parameter Weibull distribution, a fracture toughness vs. temperature curve (K-curve) was directly generated from a set of fracture toughness data at a selected temperature. Charpy V-notch impact energy was also used to obtain the K-curve by a $K_{IC}$-CVN (Charpy V-notch energy) correlation. Furthermore, this method was applied to evaluate the neutron irradiation embrittlement of reactor pressure vessel (RPV) steel. Most of the fracture toughness data were within the 95% confidence limits. The prediction of a transition temperature shift by statistical analysis was compared with that from the experimental data.

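The three-parameter Weibull distribution mentioned in the abstract above can be sketched as follows. The general form of the distribution function is standard, $F(K) = 1 - \exp\{-[(K - K_{min})/\eta]^{m}\}$ for $K > K_{min}$, but the parameter values below are illustrative assumptions and are not taken from the paper. Reading the 95% limits as the 2.5% and 97.5% quantiles of the fitted distribution is also an assumption.

```python
import math


def weibull3_cdf(k, shape, scale, location):
    """Cumulative failure probability at toughness k (zero at or below `location`)."""
    if k <= location:
        return 0.0
    return 1.0 - math.exp(-(((k - location) / scale) ** shape))


def weibull3_quantile(p, shape, scale, location):
    """Toughness value at which the cumulative failure probability equals p."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    return location + scale * (-math.log(1.0 - p)) ** (1.0 / shape)


# Illustrative parameters only (units of MPa*sqrt(m) for scale and location).
shape, scale, location = 4.0, 80.0, 20.0

lower = weibull3_quantile(0.025, shape, scale, location)
median = weibull3_quantile(0.5, shape, scale, location)
upper = weibull3_quantile(0.975, shape, scale, location)

print(f"2.5%: {lower:.1f}  median: {median:.1f}  97.5%: {upper:.1f}")
```

Repeating this calculation with parameters estimated at each test temperature would give a median curve and bounds as functions of temperature, which is one way to read the K-curve in the abstract.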

Optimization Methodology Integrated Data Mining and Statistical Method (데이터 마이닝과 통계적 기법을 통합한 최적화 기법)

  • Jung, Hey-Jin;Song, Suh-Ill
    • Proceedings of the Korean Society for Quality Management Conference / 2006.11a / pp.205-210 / 2006
  • Nowadays manufacturing technology and the manufacturing environment are changing rapidly. With the development of computers and the spread of technology, most manufacturing fields have been computerized. As a result, many quality characteristics are measured automatically and a great deal of data is generated every day. To lead in international competition under these changes, it is important for corporations to quickly extract useful information from large amounts of data. Statistical process control (SPC) techniques have been used as a problem-solving tool in manufacturing processes until now. However, these statistical methods have not been applied more extensively because they have many restrictions in realistic problems. In this paper, as an alternative to cope with these problems, we aim to develop more realistic and scientific new statistical design techniques that integrate data mining (DM) and statistical methods. The first step selects significant factors using DM techniques from manufacturing process data that include many factors, and the second step finds the process optimum after obtaining the estimated response function through response surface methodology (RSM), a statistical technique.

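The two-step procedure in the abstract above can be sketched roughly as follows. The abstract does not name the data mining technique used for factor selection, so a simple correlation screen stands in for it here. That screen, the 0.5 threshold, the single-factor quadratic response function, and the data are all assumptions made for illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def solve_linear(a, b):
    """Solve a small dense system by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    z = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * z[j] for j in range(i + 1, n))
        z[i] = (m[i][n] - s) / m[i][i]
    return z


def fit_quadratic(x, y):
    """Least-squares fit of y = b0 + b1*x + b2*x**2 via the normal equations."""
    powers = [sum(v ** k for v in x) for k in range(5)]
    a = [[powers[i + j] for j in range(3)] for i in range(3)]
    b = [sum((v ** i) * w for v, w in zip(x, y)) for i in range(3)]
    return solve_linear(a, b)


# Made-up process data: x1 drives the response, x2 is a nuisance factor.
factors = {"x1": [0, 1, 2, 3, 4], "x2": [1, 0, 1, 0, 1]}
response = [1, 6, 9, 10, 9]

# Step 1 (stand-in for the DM step): keep factors strongly correlated with the response.
selected = [name for name, values in factors.items()
            if abs(pearson(values, response)) >= 0.5]

# Step 2 (RSM): fit a quadratic response function and locate its stationary point.
b0, b1, b2 = fit_quadratic(factors[selected[0]], response)
optimum = -b1 / (2 * b2)
print(selected, round(optimum, 3))
```

A linear correlation screen would miss a factor whose effect is symmetric about the centre of its range, which is one reason a real application would likely use a more flexible selection method in the first step.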

An Evaluation of the Statistical Techniques Used in the 1995-2007 Editions of the Korea Institute of Oriental Medicine (한국한의학연구원 논문집에 사용된 통계기법의 평가)

  • Kang, Kyung-Won;Kang, Byung-Gab;Go, Mi-Mi;Shin, Sun-Hwa;Choi, Sun-Mi
    • Korean Journal of Oriental Medicine / v.13 no.2 s.20 / pp.121-125 / 2007
  • Background and Purpose: The purpose of this study was to investigate what kinds of statistical techniques have been used to analyze data from oriental medicine research. Methods: 135 original articles that used statistical techniques in their data analysis were selected from the articles published in The Journal of Korea Institute of Oriental Medicine (JKIOM) between 1995 and 2007. Results: Among the 135 articles, 59 used descriptive statistics while 76 used inferential statistics for data analysis. In those 76 articles, the two-sample t-test (33 articles), analysis of variance (29 articles), regression (9 articles), chi-square test (5 articles), nonparametric tests (4 articles), Fisher's exact test (3 articles), and other tests (9 articles) were chosen to analyze the data. The SAS and SPSS statistical packages (82.50%) were mostly used to analyze the data. Nonparametric tests were used in 4 articles (6.97%) of 67 articles and parametric tests were used in 63 articles (93.03%) of 67 articles. Among the 29 articles that used analysis of variance, Duncan (8 articles), Dunnett (4 articles), Bonferroni (4 articles), Tukey (3 articles), and Scheffé (1 article) were used for multiple comparison. 9 articles did not carry out multiple comparison. Conclusions: It was found that statistical packages and statistical analyses have not been used frequently so far. High-level statistical analyses were mostly not used in oriental medicine research.
