• 제목/요약/키워드: Statistical data analyses

검색결과 1,094건 처리시간 0.03초

서울시 지역별 $SO_2$ 오염도 분석에 관한 연구 (A Study on the Analysis of $SO_2$ Concentration in the Metro Seoul)

  • 송동웅;김원만
    • 한국대기환경학회지
    • /
    • 제7권1호
    • /
    • pp.23-30
    • /
    • 1991
  • The major purpose of this study is to delineate and assess the regional $SO_2$ levels in Seoul. This study is based on 1988 year-round data from 20 air quality continuous monitoring stations in Seoul. Statistical analyses were attempted, statistical parameters such as average concentration, standard deviation, maximum concentration, minimum concentration and monthly highest concentration were included in the analyses. In addition, Larsen's averaging time analysis was evaluated in terms of 24-hr concentration. The $SO_2$ levels in 1988 were that most stations except Daechidong, Sinlimdong, Jamsil 2, Bangidong violate the long-term standard (annual average 0.05 ppm) and the percentage of number of days within a year in which the 24-hr average concentration observed exceeds short-term standard (0.15 ppm) are; 37% at Deungchondong, 30% at Sinseoldong, mor than 20% at Ssangmundong, Myunmogdong and Oryudong.

  • PDF

PREDICTION OF DAILY MAXIMUM X-RAY FLUX USING MULTILINEAR REGRESSION AND AUTOREGRESSIVE TIME-SERIES METHODS

  • Lee, J.Y.;Moon, Y.J.;Kim, K.S.;Park, Y.D.;Fletcher, A.B.
    • 천문학회지
    • /
    • 제40권4호
    • /
    • pp.99-106
    • /
    • 2007
  • Statistical analyses were performed to investigate the relative success and accuracy of daily maximum X-ray flux (MXF) predictions, using both multilinear regression and autoregressive time-series prediction methods. As input data for this work, we used 14 solar activity parameters recorded over the prior 2 year period (1989-1990) during the solar maximum of cycle 22. We applied the multilinear regression method to the following three groups: all 14 variables (G1), the 2 so-called 'cause' variables (sunspot complexity and sunspot group area) showing the highest correlations with MXF (G2), and the 2 'effect' variables (previous day MXF and the number of flares stronger than C4 class) showing the highest correlations with MXF (G3). For the advanced three days forecast, we applied the autoregressive timeseries method to the MXF data (GT). We compared the statistical results of these groups for 1991 data, using several statistical measures obtained from a $2{\times}2$ contingency table for forecasted versus observed events. As a result, we found that the statistical results of G1 and G3 are nearly the same each other and the 'effect' variables (G3) are more reliable predictors than the 'cause' variables. It is also found that while the statistical results of GT are a little worse than those of G1 for relatively weak flares, they are comparable to each other for strong flares. In general, all statistical measures show good predictions from all groups, provided that the flares are weaker than about M5 class; stronger flares rapidly become difficult to predict well, which is probably due to statistical inaccuracies arising from their rarity. Our statistical results of all flares except for the X-class flares were confirmed by Yates' $X^2$ statistical significance tests, at the 99% confidence level. Based on our model testing, we recommend a practical strategy for solar X-ray flare predictions.

반복측정자료 분석에 대한 고찰: 신장이식 환자의 신기능 부전 연구를 중심으로 (Analysis of Repeated Measures Data: Chronic Renal Allograft Dysfunction Data from the Renal Transplanted Patients)

  • 박태성;이승연;성건형;강종명;강경원
    • 응용통계연구
    • /
    • 제11권2호
    • /
    • pp.205-219
    • /
    • 1998
  • 신장이식 수술을 받은 114명의 환자를 대상으로 이식 수술 후에 나타나는 신기능 부전에 영향을 미치는 인자들을 밝히기 위한 통계분석을 실시하였다. 신기능의 변화는 혈청 크레아터닌값을 통해 조사하였고 각 환자들로부터 이식 후 1년에서 5년 사이의 혈청 크레아티닌값을 평균 3개월 간격으로 반복적으로 측정하였다. 크레아티닌의 역수값에 영향을 미치는 인자를 조사하기 위해 반복측정 자료 분석에 사용되는 회귀 모형을 사용하였다. 본 논문에서는 이러한 반복측정자료의 분석 시에 발생하는 상관행렬의 선택에 관한 통계 적 인 문제 점 들을 고찰해 보았다.

  • PDF

Introduction to Mediation Analysis and Examples of Its Application to Real-world Data

  • Jung, Sun Jae
    • Journal of Preventive Medicine and Public Health
    • /
    • 제54권3호
    • /
    • pp.166-172
    • /
    • 2021
  • Traditional epidemiological assessments, which mainly focused on evaluating the statistical association between two major components-the exposure and outcome-have recently evolved to ascertain the in-between process, which can explain the underlying causal pathway. Mediation analysis has emerged as a compelling method to disentangle the complex nature of these pathways. The statistical method of mediation analysis has evolved from simple regression analysis to causal mediation analysis, and each amendment refined the underlying mathematical theory and required assumptions. This short guide will introduce the basic statistical framework and assumptions of both traditional and modern mediation analyses, providing examples conducted with real-world data.

A Penalized Spline Based Method for Detecting the DNA Copy Number Alteration in an Array-CGH Experiment

  • Kim, Byung-Soo;Kim, Sang-Cheol
    • 응용통계연구
    • /
    • 제22권1호
    • /
    • pp.115-127
    • /
    • 2009
  • The purpose of statistical analyses of array-CGH experiment data is to divide the whole genome into regions of equal copy number, to quantify the copy number in each region and finally to evaluate its significance of being different from two. Several statistical procedures have been proposed which include the circular binary segmentation, and a Gaussian based local regression for detecting break points (GLAD) by estimating a piecewise constant function. We propose in this note a penalized spline regression and its simultaneous confidence band(SCB) approach to evaluate the statistical significance of regions of genetic gain/loss. The region of which the simultaneous confidence band stays above 0 or below 0 can be considered as a region of genetic gain or loss. We compare the performance of the SCB procedure with GLAD and hidden Markov model approaches through a simulation study in which the data were generated from AR(1) and AR(2) models to reflect spatial dependence of the array-CGH data in addition to the independence model. We found that the SCB method is more sensitive in detecting the low level copy number alterations.

통계적 지반 공간 정보 기법을 이용한 지층구조 분석 (Application of Statistical Geo-Spatial Information Technology to Soil Stratification)

  • 김한샘;김현기;신시열;정충기
    • 한국지반공학회논문집
    • /
    • 제27권7호
    • /
    • pp.59-68
    • /
    • 2011
  • 지반조사 결과 자료는 지반의 불균질성과 낮은 신뢰성을 갖는 일부 자료로 인해 불확실성을 갖게 된다. 이에 따라 지반조사 결과 자료를 활용하여 지반 특성을 해석할 경우 합리적인 공학적 판단을 위해 적절한 통계분석이 요구된다. 본 연구에서는 이상치 분석기법과 교차검증기법을 접목한 통계적 지반 공간 정보 분석 기법을 이용하여 대상지역 지반조사 자료 중 경향성을 상회하는 지반조사 지점을 선별하는 전문가시스템을 개발하였다. 개발 시스템을 이용하여 서울시 여의도 지역의 시추조사 자료를 바탕으로 지층구조 분석을 수행하였다. 그 결과 신뢰도가 낮은 것으로 판단되는 시추조사 자료를 결정하고, 본 지점을 제외한 여의도 지역의 기반암 섬도 분포를 확인하였다.

Validity Study of Kohonen Self-Organizing Maps

  • Huh, Myung-Hoe
    • Communications for Statistical Applications and Methods
    • /
    • 제10권2호
    • /
    • pp.507-517
    • /
    • 2003
  • Self-organizing map (SOM) has been developed mainly by T. Kohonen and his colleagues as a unsupervised learning neural network. Because of its topological ordering property, SOM is known to be very useful in pattern recognition and text information retrieval areas. Recently, data miners use Kohonen´s mapping method frequently in exploratory analyses of large data sets. One problem facing SOM builder is that there exists no sensible criterion for evaluating goodness-of-fit of the map at hand. In this short communication, we propose valid evaluation procedures for the Kohonen SOM of any size. The methods can be used in selecting the best map among several candidates.

세라믹스의 피로수명에 대한 통계적 분석 (Statistical Analysis for Fatigue Lifetime of Ceramics)

  • 박성은;김성욱;이홍림
    • 한국세라믹학회지
    • /
    • 제34권9호
    • /
    • pp.927-934
    • /
    • 1997
  • Static and cyclic fatigue tests were carried out for alumina specimen to study the statistical analyses (normal, lognormal and Weibull distribution) of fatigue lifetime data and nominal initial crack length data. Fatigue lifetime data followed Weibull distribution better than normal or lognormal distribution, for the shape parameter of the notched specimen was larger than that of the unnotched specimen. The nominal initial crack length data obtained from fatigue lifetime followed the lognormal and Weibull distribution better than normal distribution, for the coefficient of variation of the unnotched specimen was larger than that of the notched specimen, and shape parameter of unnotched specimen was smaller than that of the notched specimen.

  • PDF

Performance Comparison of Classication Methods with the Combinations of the Imputation and Gene Selection Methods

  • Kim, Dong-Uk;Nam, Jin-Hyun;Hong, Kyung-Ha
    • 응용통계연구
    • /
    • 제24권6호
    • /
    • pp.1103-1113
    • /
    • 2011
  • Gene expression data is obtained through many stages of an experiment and errors produced during the process may cause missing values. Due to the distinctness of the data so called 'small n large p', genes have to be selected for statistical analysis, like classification analysis. For this reason, imputation and gene selection are important in a microarray data analysis. In the literature, imputation, gene selection and classification analysis have been studied respectively. However, imputation, gene selection and classification analysis are sequential processing. For this aspect, we compare the performance of classification methods after imputation and gene selection methods are applied to microarray data. Numerical simulations are carried out to evaluate the classification methods that use various combinations of the imputation and gene selection methods.

경영정보학 분야의 구조방정식모형 적용분석 : Lisrel과 PLS 방법을 중심으로 (The Structural Equation Modeling in MIS : The Perspectives of Lisrel and PLS Applications)

  • 김인재;민금영;심형섭
    • 한국IT서비스학회지
    • /
    • 제10권2호
    • /
    • pp.203-221
    • /
    • 2011
  • The purpose of this study is to investigate the applications of Structural Equation Modeling(SEM) into MIS area in recent years. Two methodologies, Lisrel and PLS, are adopted for the method comparison. A research model, based upon TAM(Technology Acceptance Model) is used for the analysis of the data set of a previous study. The research model includes six research variables that are composed of twenty-eight question items. 272 data are used for data analyses through Lisrel v.8.72 and Visual PLS v.1.04. This study shows the statistical results of Lisrel are the same to those of PLS. The contribution of this study can be suggested as the followings; (1) A theoretical comparison of two methodologies is shown, (2) A statistical analysis is done at a real-situated data set, and (3) Several implications are suggested.