• Title/Summary/Keyword: 다중차원축소방법

Search Result 29, Processing Time 0.026 seconds

Support vector machine and multifactor dimensionality reduction for detecting major gene interactions of continuous data (서포트 벡터 머신 알고리즘을 활용한 연속형 데이터의 다중인자 차원축소방법 적용)

  • Lee, Jea-Young;Lee, Jong-Hyeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1271-1280
    • /
    • 2010
  • We have used multifactor dimensionality reduction (MDR) method to study genegene interaction effect of statistical model in general. But, MDR method could not be applied in the continuous data. In this paper, continuous-type data by the support vector machine (SVM) algorithm are proposed to the MDR method which provides an introduction to the technique. Also we apply the method on the identify major interaction effects of single nucleotide polymorphisms (SNPs) responsible for economic traits in a Korean cattle population.

Power of Expanded Multifactor Dimensionality Reduction with CART Algorithm (CART 알고리즘을 활용한 확장된 다중인자 차원축소방법의 검정력 평가)

  • Lee, Jea-Young;Lee, Jong-Hyeong;Lee, Ho-Guen
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.5
    • /
    • pp.667-678
    • /
    • 2010
  • It is important to detect the gene-gene interaction in GWAS(Genome-Wide Association Study). There are many studies about detecting gene-gene interaction. The one is Multifactor dimensionality reduction method. But MDR method is not applied continuous data and expanded multifactor dimensionality reduction(E-MDR) method is suggested. The goal of this study is to evaluate the power of E-MDR for identifying gene-gene interaction by simulation. Also we applied the method on the identify interaction e ects of single nucleotid polymorphisms(SNPs) responsible for economic traits in a Korean cattle population (real data).

An extension of multifactor dimensionality reduction method for detecting gene-gene interactions with the survival time (생존시간과 연관된 유전자 간의 교호작용에 관한 다중차원축소방법의 확장)

  • Oh, Jin Seok;Lee, Seung Yeoun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1057-1067
    • /
    • 2014
  • Many genetic variants have been identified to be associated with complex diseases such as hypertension, diabetes and cancers throughout genome-wide association studies (GWAS). However, there still exist a serious missing heritability problem since the proportion explained by genetic variants from GWAS is very weak less than 10~15%. Gene-gene interaction study may be helpful to explain the missing heritability because most of complex disease mechanisms are involved with more than one single SNP, which include multiple SNPs or gene-gene interactions. This paper focuses on gene-gene interactions with the survival phenotype by extending the multifactor dimensionality reduction (MDR) method to the accelerated failure time (AFT) model. The standardized residual from AFT model is used as a residual score for classifying multiple geno-types into high and low risk groups and algorithm of MDR is implemented. We call this method AFT-MDR and compares the power of AFT-MDR with those of Surv-MDR and Cox-MDR in simulation studies. Also a real data for leukemia Korean patients is analyzed. It was found that the power of AFT-MDR is greater than that of Surv-MDR and is comparable with that of Cox-MDR, but is very sensitive to the censoring fraction.

The study on risk factors for diagnosis of metabolic syndrome and odds ratio using multifactor dimensionality reduction method (다중인자 차원 축소 방법에 의한 대사증후군의 위험도 분석과 오즈비)

  • Jin, Mi-Hyun;Lee, Jea-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.4
    • /
    • pp.867-876
    • /
    • 2013
  • Metabolic syndrome has been known as a major factor of cardiovascular disease. Several metabolic disorders, particularly chronic disease is complex, and from individuals that appear in our country, the prevalence of the metabolic syndrome is increasing gradually. Therefore, this study, using a multi-factor dimensionality reduction method, checks the major single risk factor of metabolic syndrome and suggests a new diagnosis results of metabolic syndrome. Data of 3990 adults who responded to all the questionnaires of health interview are used from the database of the 5th Korea national health and nutrition examination survey conducted in 2010. As the result, the most dangerous single risk factor for metabolic syndrome was waist circumference and the most dangerous combination factors were waist circumference, triglyceride, and hypertension. This is the result of a new diagnosis of the metabolic syndrome. Especially, waist circumference, low HDL-cholesterol and hypertension were the most dangerous combination for male. In particular, the combination of waist circumference, triglyceride and diabetes was dangerous for obese people.

Power and major gene-gene identification of dummy multifactor dimensionality reduction algorithm (더미 다중인자 차원축소법에 의한 검증력과 주요 유전자 규명)

  • Yeo, Jungsou;La, Boomi;Lee, Ho-Guen;Lee, Seong-Won;Lee, Jea-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.2
    • /
    • pp.277-287
    • /
    • 2013
  • It is important to detect the gene-gene interaction in GWAS (genome-wide association study). There have been many studies on detecting gene-gene interaction. The one is D-MDR (dummy multifoactor dimensionality reduction) method. The goal of this study is to evaluate the power of D-MDR for identifying gene-gene interaction by simulation. Also we applied the method on the identify interaction effects of single nucleotide polymorphisms (SNPs) responsible for economic traits in a Korean cattle population (real data).

Topological Consistency for Collapse Operator on Multi-Scale Databases (다중축척 공간 데이터베이스에서 축소연산자를 위한 위상 일관성)

  • 권오제;강혜경;이기준
    • Proceedings of the Korean Association of Geographic Inforamtion Studies Conference
    • /
    • 2004.10a
    • /
    • pp.27-40
    • /
    • 2004
  • When we derive multi-scale databases from a source spatial database, thegeometries and topological relations in the source database are transformed according to a predefined set of constraints. This means that the derived databases should be checked to see if the constraints are respected during the construction or updates of databases and to maintain the consistency of multi-scale databases. In this paper, we focus on the topological consistency between the source and derived databases, which is one of the important constraints to respect. In particular, we deal with the method of assessment of topological consistency, when 2-dimensional objects are collapsed to 1-dimensional ones. We introduce eight types of topological relations between 2-dimensional objects and 19 topological ones between 1-dimensional objects and propose four different strategies to convert 2-dimensional topological relations in the source database to 1-dimensional ones objects in the target database. With these strategies, we guarantee the topological consistency between multi-scale databases.

  • PDF

Multifactor Dimensionality Reduction(MDR) Analysis by Dummy Variables (더미(dummy) 변수를 활용한 다중인자 차원 축소(MDR) 방법)

  • Lee, Jea-Young;Lee, Ho-Guen
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.435-442
    • /
    • 2009
  • Multiple genes interacting is a difficult due to the limitations of parametric statistical method like as logistic regression for detection of gene effects that are dependent solely on interactions with other genes and with environmental exposures. Multifactor dimensionality reduction(MDR) statistical method by dummy variables was applied to identify interaction effects of single nucleotide polymorphisms(SNPs) responsible for longissimus mulcle dorsi area(LMA), carcass cold weight(CWT) and average daily gain(ADG) in a Hanwoo beef cattle population.

A dimensional reduction method in cluster analysis for multidimensional data: principal component analysis and factor analysis comparison (다차원 데이터의 군집분석을 위한 차원축소 방법: 주성분분석 및 요인분석 비교)

  • Hong, Jun-Ho;Oh, Min-Ji;Cho, Yong-Been;Lee, Kyung-Hee;Cho, Wan-Sup
    • The Journal of Bigdata
    • /
    • v.5 no.2
    • /
    • pp.135-143
    • /
    • 2020
  • This paper proposes a pre-processing method and a dimensional reduction method in the analysis of shopping carts where there are many correlations between variables when dividing the types of consumers in the agri-food consumer panel data. Cluster analysis is a widely used method for dividing observational objects into several clusters in multivariate data. However, cluster analysis through dimensional reduction may be more effective when several variables are related. In this paper, the food consumption data surveyed of 1,987 households was clustered using the K-means method, and 17 variables were re-selected to divide it into the clusters. Principal component analysis and factor analysis were compared as the solution for multicollinearity problems and as the way to reduce dimensions for clustering. In this study, both principal component analysis and factor analysis reduced the dataset into two dimensions. Although the principal component analysis divided the dataset into three clusters, it did not seem that the difference among the characteristics of the cluster appeared well. However, the characteristics of the clusters in the consumption pattern were well distinguished under the factor analysis method.

The Consistency Assessment of Topological Relationships For a Collapse Operator in Multi-Scale Spatial Databases (다중축척 공간 데이터베이스의 축소연산자를 위한 위상관계 일관성 평가)

  • Kang Hae-Kyong;Li Ki-Joune
    • The KIPS Transactions:PartD
    • /
    • v.12D no.6 s.102
    • /
    • pp.837-848
    • /
    • 2005
  • A multi-scale database is a set of spatial database, covering same geographic area with different scales and it can be derived from pre-existing databases. In the derivation processes of a new multi-scale spatial database, the geometries and topological relations on the source database can be transformed and the transformation can be the cause of the lack of integrity Therefore, it is necessary to assess the transformation whether it is consistent or not after the derivation process of a new multi-scale database. Thus, we propose assessment methods for the topological consistency between a source database and a derived multi-scale database in this paper. In particular, we focus on the case that 2-dimensional objects are collapsed to 1-dimensional ones in the derivation process of a multi-scale database. We also describe implementation of the assessment methods and show the results of the implementation with experimental data.

Major genotype identification affecting economic traits in FABP4, SCD, FASN and SREBPs genes of Korean cattle (한우의 FABP4, SCD, FASN, SREBPs 유전자에서 경제형질에 영향을 미치는 우수 유전자형 선별)

  • Lee, Jea-Young;Park, Jae-Cheol
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1247-1255
    • /
    • 2016
  • Kim and Lee (2015) identified a superior FABP4 gene that improves the grade and fatty acid of Korean cattle. This study selects a superior genotype by expanding genes that influence the economic traits of Korean cattle. Expanded genes are FABP4, SCD, FASN and SREBPs that are related to grade and fatty acid (Oh, 2014). We use the adjusted economic-trait values with environmental factors excluded. We also applied multifactor dimensionality reduction(MDR) method to data of the adjusted economic-trait values. As a result, we identified superior genes and genotypes which improved the grade and fatty acid of Korean cattle.