• Title/Summary/Keyword: Multivariate statistical models

검색결과 126건 처리시간 0.025초

주성분을 이용한 다변량 고빈도 실현 변동성의 주기 선택 (Choice of frequency via principal component in high-frequency multivariate volatility models)

  • 진민경;윤재은;황선영
    • 응용통계연구
    • /
    • 제30권5호
    • /
    • pp.747-757
    • /
    • 2017
  • 본 논문은 다변량 실현 변동성 계산에서 주기 선택 방안에 대해 연구하고 있다. 고빈도(high frequency) 시계열 자료에 기초한 일간 변동성인 실현변동성을 계산하고 차원 축소 방법인 주성분을 도입하였다. Cholesky 모형을 포함한 다양한 다변량 변동성모형을 주성분을 통해 비교하였으며 KOSPI/삼성전자/현대차 고빈도 수익률 자료를 이용하여 예시하였다.

회귀모형의 기울기에 대한 품행성 검정 (Parallelism Test of Slope in Simple Linear Regression Models)

  • 박현욱;김동재
    • Communications for Statistical Applications and Methods
    • /
    • 제16권1호
    • /
    • pp.75-83
    • /
    • 2009
  • 단순선형 회귀모형의 기울기에 대한 평행성 검정법을 제안하였다. 세 군 이상에서 기울기에 대하여 Tukey (1953)가 제안한 HSD방법을 이용한 모수적 검정법과 Kruskal-Wallis (1952) 검정법을 이용한 비모수적 검정법을 각각 제안하였다. 또한 모의실험을 통하여 기존의 검정법과 제안한 검정법의 검정력을 비교하였다.

Multiple imputation for competing risks survival data via pseudo-observations

  • Han, Seungbong;Andrei, Adin-Cristian;Tsui, Kam-Wah
    • Communications for Statistical Applications and Methods
    • /
    • 제25권4호
    • /
    • pp.385-396
    • /
    • 2018
  • Competing risks are commonly encountered in biomedical research. Regression models for competing risks data can be developed based on data routinely collected in hospitals or general practices. However, these data sets usually contain the covariate missing values. To overcome this problem, multiple imputation is often used to fit regression models under a MAR assumption. Here, we introduce a multivariate imputation in a chained equations algorithm to deal with competing risks survival data. Using pseudo-observations, we make use of the available outcome information by accommodating the competing risk structure. Lastly, we illustrate the practical advantages of our approach using simulations and two data examples from a coronary artery disease data and hepatocellular carcinoma data.

블록 반복측정을 이용한 품질통계 모형의 유형화 (Model Classification of Quality Statistics Using Block Repeated Measures)

  • 최성운
    • 대한안전경영과학회지
    • /
    • 제9권3호
    • /
    • pp.165-171
    • /
    • 2007
  • Dependent models in quality statistics are classified as serially autocorrelated model, multivariate model and dependent sample model. Dependent sample model is most efficient in time and cost to obtain samples among the above models. This paper proposes to implement parametric and nonparametric models into production system depended on demand pattern. Nonparametric models have distribution free and asymptotic distribution free techniques. Quality statistical models are classified into two categories ; the number of dependent sample and the type of data. The type of data consists of nominal, ordinal, interval and ratio data. The number of dependent sample divides into 2 samples and more than 3 samples.

Bayesian mixed models for longitudinal genetic data: theory, concepts, and simulation studies

  • Chung, Wonil;Cho, Youngkwang
    • Genomics & Informatics
    • /
    • 제20권1호
    • /
    • pp.8.1-8.14
    • /
    • 2022
  • Despite the success of recent genome-wide association studies investigating longitudinal traits, a large fraction of overall heritability remains unexplained. This suggests that some of the missing heritability may be accounted for by gene-gene and gene-time/environment interactions. In this paper, we develop a Bayesian variable selection method for longitudinal genetic data based on mixed models. The method jointly models the main effects and interactions of all candidate genetic variants and non-genetic factors and has higher statistical power than previous approaches. To account for the within-subject dependence structure, we propose a grid-based approach that models only one fixed-dimensional covariance matrix, which is thus applicable to data where subjects have different numbers of time points. We provide the theoretical basis of our Bayesian method and then illustrate its performance using data from the 1000 Genome Project with various simulation settings. Several simulation studies show that our multivariate method increases the statistical power compared to the corresponding univariate method and can detect gene-time/ environment interactions well. We further evaluate our method with different numbers of individuals, variants, and causal variants, as well as different trait-heritability, and conclude that our method performs reasonably well with various simulation settings.

단독주택가격 추정을 위한 기계학습 모형의 응용 (Application of machine learning models for estimating house price)

  • 이창로;박기호
    • 대한지리학회지
    • /
    • 제51권2호
    • /
    • pp.219-233
    • /
    • 2016
  • 수리 또는 계량적 모형을 사용하는 사회과학연구에서 분석의 초점은 종속변수와 설명변수의 관계를 밝히는 것, 즉 설명 중심의 모형(explanatory modeling)이 지금까지 주류를 이루었다. 반면 예측(prediction) 능력 제고에 초점을 맞춘 분석은 드물었다. 본 연구에서는 이론 및 가설을 검증하거나 변수 간의 관계를 밝히는 설명 중심의 모형이 아니라 신규 관찰치에 대한 예측 오차를 줄이는, 예측 중심의 비모수 모형(non-parametric model)을 검토하였다. 서울시 강남구를 사례지역으로 선정한 후, 2011년부터 2014년까지 신고된 단독주택 실거래가를 기초자료로 하여 주택가격을 추정하였다. 적용한 비모수 모형은 기계학습 분야에서 제시된 일반가산모형(generalized additive model), 랜덤 포리스트, MARS(multivariate adaptive regression splines), SVM(support vector machines) 등이며 비교적 최근에 개발된 MARS나 SVM의 예측력이 뛰어남을 확인할 수 있었다. 마지막으로 이러한 비모수 모형에 공간적 자기상관성을 추가적으로 반영한 결과, 모형의 가격 예측력이 보다 개선되었음을 알 수 있었다. 본 연구를 계기로 그간 모수 모형에 집중되었던 부동산 가격추정 방법론이 비모수 모형으로 확대 및 다양화되기를 기대한다.

  • PDF

Application of metabolic profiling for biomarker discovery

  • Hwang, Geum-Sook
    • 한국응용약물학회:학술대회논문집
    • /
    • 한국응용약물학회 2007년도 Proceedings of The Convention
    • /
    • pp.19-27
    • /
    • 2007
  • An important potential of metabolomics-based approach is the possibility to develop fingerprints of diseases or cellular responses to classes of compounds with known common biological effect. Such fingerprints have the potential to allow classification of disease states or compounds, to provide mechanistic information on cellular perturbations and pathways and to identify biomarkers specific for disease severity and drug efficacy. Metabolic profiles of biological fluids contain a vast array of endogenous metabolites. Changes in those profiles resulting from perturbations of the system can be observed using analytical techniques, such as NMR and MS. $^1H$ NMR was used to generate a molecular fingerprint of serum or urinary sample, and then pattern recognition technique was applied to identity molecular signatures associated with the specific diseases or drug efficiency. Several metabolites that differentiate disease samples from the control were thoroughly characterized by NMR spectroscopy. We investigated the metabolic changes in human normal and clinical samples using $^1H$ NMR. Spectral data were applied to targeted profiling and spectral binning method, and then multivariate statistical data analysis (MVDA) was used to examine in detail the modulation of small molecule candidate biomarkers. We show that targeted profiling produces robust models, generates accurate metabolite concentration data, and provides data that can be used to help understand metabolic differences between healthy and disease population. Such metabolic signatures could provide diagnostic markers for a disease state or biomarkers for drug response phenotypes.

  • PDF

PROCESS ANALYSIS OF AUTOMOTIVE PARTS USING GRAPHICAL MODELLING

  • IRIKURA Norio;KUZUYA Kazuyoshi;NISHINA Ken
    • 한국품질경영학회:학술대회논문집
    • /
    • 한국품질경영학회 1998년도 The 12th Asia Quality Management Symposium* Total Quality Management for Restoring Competitiveness
    • /
    • pp.295-300
    • /
    • 1998
  • Recently graphical modelling is being studied as a useful process analysis tool for exploratory causal analysis. Graphical modelling is a presentation method that uses graphs to describe statistical models of the structures of multivariate data. This paper describes an application of this graphical modeling with two cases from the automotive parts industry. One case is the unbalance problem of the pulley, an automotive generator part. There is multivariate data of the product from each of the processes which are connected in the series. By means of exploratory causal analysis between the variables using graphical modeling, the key processes which causes the variation of the final characteristics and their mechanism of the causal relationship have become clear. Another case is, also, the unbalanced problem of automotive starter parts which consists of many parts and is manufactured by complex machinery and assembling process. By means of the similar technique, the key processes are obtained easily and the results are reasonable from technical knowledge.

  • PDF

품질향상을 통한 고객만족과 기업윤리차원의 기업이미지 전략 -소수의 관측치들의 활용을 위한 모형들 중심으로- (Corporate Image Strategy of Corporate Ethics and Customer Satisfaction through Quality Improvement -Discriminant Models based on the Utilization of a Small Number of Observed Values-)

  • 김종순
    • 품질경영학회지
    • /
    • 제24권4호
    • /
    • pp.168-189
    • /
    • 1996
  • In order for the corporation to get a good image from the customers it should consider several variables, but especially important are corproate ethics and customer satisfaction through quality improvement. Standard multivariate data analysis can be applied to find out the importance of customer satisfaction and corporate ethics as influence factors in the corporate competitive strategy. When applying this Methodology, multivariate normal distributions density function and the identical covariance between groups assumptions have to be satisfied. By using the evaluation result from a small number of specialists in an attempt to decide on the strategical factors that will create a better company image than its competitor, if it chooses to use statistical discriminant analysis method, it would be difficult to satisfy the two assumptions mentioned above. This thesis introduces discriminant analysis method that uses LP/GP effectively which is applicable to this particular situation.

  • PDF

A Bayesian Approach to Dependent Paired Comparison Rankings

  • Kim, Hea-Jung;Kim, Dae-Hwang
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2003년도 춘계 학술발표회 논문집
    • /
    • pp.85-90
    • /
    • 2003
  • In this paper we develop a method for finding optimal ordering of K statistical models. This is based on a dependent paired comparison experimental arrangement whose results can naturally be represented by a completely oriented graph (also so called tournament graph). Introducing preference probabilities, strong transitivity conditions, and an optimal criterion to the graph, we show that a Hamiltonian path obtained from row sum ranking is the optimal ordering. Necessary theories involved in the method and computation are provided. As an application of the method, generalized variances of K multivariate normal populations are compared by a Bayesian approach.

  • PDF