한국데이터정보과학회 2003년도 춘계학술대회
-
In man cases, the measurement error variances may be functions of the unknown true values or related covariate. In some cases, the measurement error variances increase in proportion to the value of predictor. This paper develops estimators of the parameters of a linear measurement error variance function under stratified multistage random sampling design and additional conditions. Also, this paper evaluates and compares the power of an asymptotically unbiased test with that of an asymptotically biased test. The proposed method are applied to blood sample measurements from the U.S. Third National Health and Nutrition Examination Survey(NHANES III)
-
부품 및 소재산업의 육성을 위하여 2003년 4월부터 운영되는 신뢰성보험사업과 관련된 각종 제도를 검토하며, 신뢰성보험의 개념, 담보하는 위험의 분류, 운영체계 및 상품의 설계내용에 대하여 알아본다.
-
데이터베이스에 내재된 패턴이나 관계를 묘사한 것만으로도 의사결정에 필요한 정보를 제공할 수 있는데 이 데이터들의 변수들을 비슷한 특징을 가지는 소그룹으로 나누어 패턴을 찾는 것을 군집분석이라 한다. 이러한 군집 분석에는 분리군집방법과 계층적군집방법이 있는데, 재할당이 가능한 분리군집방법의 여러 알고리즘에 대해 비교해보자. 분리군집알고리즘에는 중심을 평균으로 하는 k-평균 알고리즘과, 중심을 메도이드로하는 PAM, CLARA, CLARANS 알고리즘이 있다. 이러한 알고리즘에 대한 이론과, 장단점을 설명하고, 분산과 중심들간의 평균 거리로 비교해 본다.
-
In this paper, we develop noninformative priors for two parameter Pareto distribution. Specially, we derive Jeffrey's prior, probability matching prior and reference prior for the parameter of interest. In our case, the probability matching prior is only a first order and there does not exist a second order matching prior. Some simulation reveals that the matching prior performs better to achieve the coverage probability. And a real example will be given.
-
한우 6번 염색체 유전자 지도에서 한우의 질을 높이기 위한 QTL(quantitative trait loci)분석을 실시하여 선별된 Loci 값을 Permutation Test를 이용하여 계산하였다. 한편, 경제적으로 주요한 한우의 특성부위(질적부위와 육량등)에 따른, 우수 경제형질 DNA marker를 K-평균 군집법을 실시 파악하였다. 이들 QTL과 K-평균법에 의해 한우의 염색체 6번, ILST035의 주요 경제 형질별 DNA marker들을 선별하여, Bootstrap BCa방법을 이용하여 각 DNA marker들의 신뢰구간을 구했다.
-
Korea has the most level of Internet Infrastructure in the world. But, in the educational aspect, it does not have an enough foundation about Statistical Education. In this paper we consider the methods of activation about statistics. Also, we present what is the Enterprise Guide and what does it have characteristics as statistical analysis tool from educational point of view. And we suggest a new paradigm in statistical education.
-
시대적으로 요구되는 수요자(학습자) 중심 교육의 구현과 교육 공급자(학교)는 학습자의 적성, 능력, 흥미, 진로에 부합하는 교육과정을 개방하고 이에 적합한 교육환경을 제공하여 학습 주체인 학습자가 교육을 통해 생활에 필요한 능력과 적성에 맞는 진로를 찾을 수 있도록 교육과정을 운영하는 것이 제7차 교육 과정의 핵심이다. 1997년부터 시행되어 2002년 고등학교에서 처음 실시한 수학과교육과정 속의 초등 및 중등학교 통계교육과정의 개요와 특성을 살펴보고 문제점을 제시한다.
-
In the Taguchi parameter design, the product array approach using orthogonal arrays is mainly used. However, it often requires an excessive number of experiments. An alternative approach, which is called the combined array approach, was suggested by Welch et. al. (1990) and studied by others. In these studies, only single response variable was considered. We propose how to simultaneously optimize multiple responses when we use the combined array approach.
-
Cluster analysis has been widely used in many applications, such that data analysis, pattern recognition, image processing, etc. But clustering requires many hours to get clusters that we want, because it is more primitive, explorative and we make many data an object of cluster analysis. In this paper we propose a new clustering method, 'Clustering algorithm using a center of gravity for grid-based sample'. It is more fast than any traditional clustering method and maintains accuracy. It reduces running time by using grid-based sample and keeps accuracy by using representative point, a center of gravity.
-
Cumulative sum(CUSUM) control charts for monitoring dispersion matrix under multivariate normal process are proposed. Performances of the proposed CUSUM charts are measured in terms of average run length(ARL) by simulation. Numerical results show that small reference values of the proposed CUSUM chart is more efficient for small shifts in the production process.
-
Cluster analysis has been widely used in many applications, such that pattern analysis or recognition, data analysis, image processing, market research on on-line or off-line and so on. Clustering can identify dense and sparse regions among data attributes or object attributes. But it requires many hours to get clusters that we want, because of clustering is more primitive, explorative and we make many data an object of cluster analysis. In this paper we propose a new method of clustering using sample based on grid. It is more fast than any traditional clustering method and maintains its accuracy. It reduces running time by using grid-based sample. And other clustering applications can be more effective by using this methods with its original methods.
-
In this paper we propose a local smoothing of the Nelson type estimator for the survival function based on an approximation by the Weibull distribution function. It appears that Mean Square Error and Bias of the smoothed estimator of the Nelson type survival function estimator is significantly smaller then that of the smoothed estimator of the Kaplan-Meier survival function estimator.
-
In this paper, we consider two-components system which the lifetimes follow bivariate pareto model with censored data. We develop large sample tests for testing independence between two-components. Also we present simulated study which is the test based on asymptotic normal distribution in testing independence.
-
본 연구에서는 국내 A 사의 자동차용 와이퍼 모터의 수명분포를 와이블이라 가정하고 이를 추정하였다. 와이퍼 모터의 고장에 영향을 미치는 요인은 성능시험 요인과 수명시험 요인이 있는데, 이들 요인별 시험 조건을 국내와 관련 규격을 중심으로 정리하였다.
-
A data set having missing observations is often completed by using imputed values. In this paper, performances and accuracy of five imputation procedures are evaluated when missing values exist only on the response variable in the exponential regression model. Our simulation results show that adjusted exponential regression imputation procedure can be well used to compensate for missing data, in particular, compared to other imputation procedures. An illustrative example using real data is provided.
-
불완전 데이터 즉, 결측값을 가지는 데이터를 분석할 경우 결측데이터에 대해서 어떠한 처리를 해야할 필요가 있다. 결측데이터에 대한 처리로서 주로 이용되어온 방법으로는 결측값을 포함한 관측값(case)을 제외하는 방법이었다. 이후 여러 방법들이 제안되어 EM알고리즘이나 회귀알고리즘에 의한 추정을 바탕으로 결측값에 대한 추정을 해서 그 추정값으로 결측값을 대치하는 방법을 사용할 수 있게되었다. 본 논문에서는 복수 개의 데이터세트를 생성해서 대치하는 다중대입 소프트인 SOLAS를 소개한다.
-
This Paper analyzed simultaneous activities of the time use survey by Korea National Statistical Office to use data mining‘s association rule. The survey of National Statistical Office in 1999 considered general analysis for simultaneous activities. But if we use the association rule, we can found the ratio of particular activities at the same time. And we found the probability that another activities practise if we act one particular activity. Using this association rule of data mining we can do more developed and analytical sociological study.
-
This paper deals with the problem of predicting order statistics in samples from a Rayleigh population when an outlier is present. Bayesian predictive distribution and prediction bounds of the p-th order statistics is obtained where an outlier of type
$\theta\delta$ is present. In this connection, some identies are derived.