• 제목/요약/키워드: gene-time interaction

검색결과 99건 처리시간 0.03초

진화연산에 기반한 유전자 발현 데이터로부터의 유전자 상호작용 네트워크 구성 (Construction of Gene Interaction Networks from Gene Expression Data Based on Evolutionary Computation)

  • 정성훈;조광현
    • 제어로봇시스템학회논문지
    • /
    • 제10권12호
    • /
    • pp.1189-1195
    • /
    • 2004
  • This paper investigates construction of gene (interaction) networks from gene expression time-series data based on evolutionary computation. To illustrate the proposed approach in a comprehensive way, we first assume an artificial gene network and then compare it with the reconstructed network from the gene expression time-series data generated by the artificial network. Next, we employ real gene expression time-series data (Spellman's yeast data) to construct a gene network by applying the proposed approach. From these experiments, we find that the proposed approach can be used as a useful tool for discovering the structure of a gene network as well as the corresponding relations among genes. The constructed gene network can further provide biologists with information to generate/test new hypotheses and ultimately to unravel the gene functions.

Gene-Gene Interaction Analysis for the Accelerated Failure Time Model Using a Unified Model-Based Multifactor Dimensionality Reduction Method

  • Lee, Seungyeoun;Son, Donghee;Yu, Wenbao;Park, Taesung
    • Genomics & Informatics
    • /
    • 제14권4호
    • /
    • pp.166-172
    • /
    • 2016
  • Although a large number of genetic variants have been identified to be associated with common diseases through genome-wide association studies, there still exits limitations in explaining the missing heritability. One approach to solving this missing heritability problem is to investigate gene-gene interactions, rather than a single-locus approach. For gene-gene interaction analysis, the multifactor dimensionality reduction (MDR) method has been widely applied, since the constructive induction algorithm of MDR efficiently reduces high-order dimensions into one dimension by classifying multi-level genotypes into high- and low-risk groups. The MDR method has been extended to various phenotypes and has been improved to provide a significance test for gene-gene interactions. In this paper, we propose a simple method, called accelerated failure time (AFT) UM-MDR, in which the idea of a unified model-based MDR is extended to the survival phenotype by incorporating AFT-MDR into the classification step. The proposed AFT UM-MDR method is compared with AFT-MDR through simulation studies, and a short discussion is given.

Boosting Multifactor Dimensionality Reduction Using Pre-evaluation

  • Hong, Yingfu;Lee, Sangbum;Oh, Sejong
    • ETRI Journal
    • /
    • 제38권1호
    • /
    • pp.206-215
    • /
    • 2016
  • The detection of gene-gene interactions during genetic studies of common human diseases is important, and the technique of multifactor dimensionality reduction (MDR) has been widely applied to this end. However, this technique is not free from the "curse of dimensionality" -that is, it works well for two- or three-way interactions but requires a long execution time and extensive computing resources to detect, for example, a 10-way interaction. Here, we propose a boosting method to reduce MDR execution time. With the use of pre-evaluation measurements, gene sets with low levels of interaction can be removed prior to the application of MDR. Thus, the problem space is decreased and considerable time can be saved in the execution of MDR.

Sequence-specific interaction between ABD-B homeodomain and castor gene in Drosophila

  • Kim, Keon-Hee;Yoo, Siuk
    • BMB Reports
    • /
    • 제47권2호
    • /
    • pp.92-97
    • /
    • 2014
  • We have examined the effect of bithorax complex genes on the expression of castor gene. During the embryonic stages 12-15, both Ultrabithorax and abdominal-A regulated the castor gene expression negatively, whereas Abdominal-B showed a positive correlation with the castor gene expression according to real-time PCR. To investigate whether ABD-B protein directly interacts with the castor gene, electrophoretic mobility shift assays were performed using the recombinant ABD-B homeodomain and oligonucleotides, which are located within the region 10 kb upstream of the castor gene. The results show that ABD-B protein directly binds to the castor gene specifically. ABD-B binds more strongly to oligonucleotides containing two 5'-TTAT-3' canonical core motifs than the probe containing the 5'-TTAC-3' motif. In addition, the sequences flanking the core motif are also involved in the protein-DNA interaction. The results demonstrate the importance of HD for direct binding to target sequences to regulate the expression level of the target genes.

Grid-based Gaussian process models for longitudinal genetic data

  • Chung, Wonil
    • Communications for Statistical Applications and Methods
    • /
    • 제29권1호
    • /
    • pp.65-83
    • /
    • 2022
  • Although various statistical methods have been developed to map time-dependent genetic factors, most identified genetic variants can explain only a small portion of the estimated genetic variation in longitudinal traits. Gene-gene and gene-time/environment interactions are known to be important putative sources of the missing heritability. However, mapping epistatic gene-gene interactions is extremely difficult due to the very large parameter spaces for models containing such interactions. In this paper, we develop a Gaussian process (GP) based nonparametric Bayesian variable selection method for longitudinal data. It maps multiple genetic markers without restricting to pairwise interactions. Rather than modeling each main and interaction term explicitly, the GP model measures the importance of each marker, regardless of whether it is mostly due to a main effect or some interaction effect(s), via an unspecified function. To improve the flexibility of the GP model, we propose a novel grid-based method for the within-subject dependence structure. The proposed method can accurately approximate complex covariance structures. The dimension of the covariance matrix depends only on the number of fixed grid points although each subject may have different numbers of measurements at different time points. The deviance information criterion (DIC) and the Bayesian predictive information criterion (BPIC) are proposed for selecting an optimal number of grid points. To efficiently draw posterior samples, we combine a hybrid Monte Carlo method with a partially collapsed Gibbs (PCG) sampler. We apply the proposed GP model to a mouse dataset on age-related body weight.

유전체 코호트 연구의 주요 통계학적 과제 (Statistical Issues in Genomic Cohort Studies)

  • 박소희
    • Journal of Preventive Medicine and Public Health
    • /
    • 제40권2호
    • /
    • pp.108-113
    • /
    • 2007
  • When conducting large-scale cohort studies, numerous statistical issues arise from the range of study design, data collection, data analysis and interpretation. In genomic cohort studies, these statistical problems become more complicated, which need to be carefully dealt with. Rapid technical advances in genomic studies produce enormous amount of data to be analyzed and traditional statistical methods are no longer sufficient to handle these data. In this paper, we reviewed several important statistical issues that occur frequently in large-scale genomic cohort studies, including measurement error and its relevant correction methods, cost-efficient design strategy for main cohort and validation studies, inflated Type I error, gene-gene and gene-environment interaction and time-varying hazard ratios. It is very important to employ appropriate statistical methods in order to make the best use of valuable cohort data and produce valid and reliable study results.

hOGG1, p53 Genes, and Smoking Interactions are Associated with the Development of Lung Cancer

  • Cheng, Zhe;Wang, Wei;Song, Yong-Na;Kang, Yan;Xia, Jie
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제13권5호
    • /
    • pp.1803-1808
    • /
    • 2012
  • This study aimed to investigate the effects of Ser/Cys polymorphism in hOGG1 gene, Arg/Pro polymorphism in p53 gene, smoking and their interactions on the development of lung cancer. Ser/Cys polymorphism in hOGG1 and Arg/Pro polymorphism in p53 among 124 patients with lung cancer and 128 normal people were detected using PCR-RFLP. At the same time, smoking status was investigated between the two groups. Logistic regression was used to estimate the effects of Ser/Cys polymorphism and Arg/Pro polymorphisms, smoking and their interactions on the development of lung cancer. ORs (95% CI) of smoking, hOGG1 Cys/Cys and p53 Pro/Pro genotypes were 2.34 (1.41-3.88), 2.12 (1.03-4.39), and 2.12 (1.15-3.94), respectively. The interaction model of smoking and Cys/Cys was super-multiplicative or multiplicative, and the OR (95% CI) for their interaction item was 1.67 (0.36 -7.78). The interaction model of smoking and Pro/Pro was super-multiplicative with an OR (95%CI) of their interaction item of 5.03 (1.26-20.1). The interaction model of Pro/Pro and Cys/Cys was multiplicative and the OR (95%CI) of their interaction item was 0.99 (0.19-5.28). Smoking, hOGG1 Cys/Cys, p53 Pro/Pro and their interactions may be the important factors leading to the development of lung cancer.

Interaction of Heliothis armigera Nuclear Polyhedrosis Viral Capsid Protein with its Host Actin

  • Lu, Song-Ya;Qi, Yi-Peng;Ge, Guo-Qiong
    • BMB Reports
    • /
    • 제35권6호
    • /
    • pp.562-567
    • /
    • 2002
  • In order to find the cellular interaction factors of the Heliothis armigera nuclear polyhedrosis virus capsid protein VP39, a Heliothis armigera cell cDNA library was constructed. Then VP39 was used as bait. The host actin gene was isolated from the cDNA library with the yeast two-hybrid system. This demonstrated that VP39 could interact with its host actin in yeast. In order to corroborate this interaction in vivo, the vp39 gene was fused with the green fluorescent protein gene in plasmid pEGFP39. The fusion protein was expressed in the Hz-AM1 cells under the control of the Autographa californica multiple nucleopolyhedrovirus immediate early gene promoter. The host actin was labeled specifically by the red fluorescence substance, tetramethy rhodamine isothicyanete-phalloidin. Observation under a fluorescence microscopy showed that VP39, which was indicated by green fluorescence, began to appear in the cells 6 h after being transfected with pEGFP39. Red actin cables were also formed in the cytoplasm at the same time. Actin was aggregated in the nucleus 9 h after the transfection. The green and red fluorescence always appeared in the same location of the cells, which demonstrated that VP39 could combine with the host actin. Such a combination would result in the actin skeleton rearrangement.

Bayesian mixed models for longitudinal genetic data: theory, concepts, and simulation studies

  • Chung, Wonil;Cho, Youngkwang
    • Genomics & Informatics
    • /
    • 제20권1호
    • /
    • pp.8.1-8.14
    • /
    • 2022
  • Despite the success of recent genome-wide association studies investigating longitudinal traits, a large fraction of overall heritability remains unexplained. This suggests that some of the missing heritability may be accounted for by gene-gene and gene-time/environment interactions. In this paper, we develop a Bayesian variable selection method for longitudinal genetic data based on mixed models. The method jointly models the main effects and interactions of all candidate genetic variants and non-genetic factors and has higher statistical power than previous approaches. To account for the within-subject dependence structure, we propose a grid-based approach that models only one fixed-dimensional covariance matrix, which is thus applicable to data where subjects have different numbers of time points. We provide the theoretical basis of our Bayesian method and then illustrate its performance using data from the 1000 Genome Project with various simulation settings. Several simulation studies show that our multivariate method increases the statistical power compared to the corresponding univariate method and can detect gene-time/ environment interactions well. We further evaluate our method with different numbers of individuals, variants, and causal variants, as well as different trait-heritability, and conclude that our method performs reasonably well with various simulation settings.

Inference of Gene Regulatory Networks via Boolean Networks Using Regression Coefficients

  • Kim, Ha-Seong;Choi, Ho-Sik;Lee, Jae-K.;Park, Tae-Sung
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.339-343
    • /
    • 2005
  • Boolean networks(BN) construction is one of the commonly used methods for building gene networks from time series microarray data. However, BN has two major drawbacks. First, it requires heavy computing times. Second, the binary transformation of the microarray data may cause a loss of information. This paper propose two methods using liner regression to construct gene regulatory networks. The first proposed method uses regression based BN variable selection method, which reduces the computing time significantly in the BN construction. The second method is the regression based network method that can flexibly incorporate the interaction of the genes using continuous gene expression data. We construct the network structure from the simulated data to compare the computing times between Boolean networks and the proposed method. The regression based network method is evaluated using a microarray data of cell cycle in Caulobacter crescentus.

  • PDF