• 제목/요약/키워드: Genome Analysis

검색결과 2,364건 처리시간 0.029초

PDGFC, MARK3 and BCL2 Polymorphisms are Associated with Left Ventricular Hypertrophy in Korean Population

  • Jeon, Tae-Eun;Jin, Hyun-Soek
    • 대한의생명과학회지
    • /
    • 제25권3호
    • /
    • pp.237-246
    • /
    • 2019
  • Left ventricular hypertrophy (LVH) refers to the expansion and the enlarged myocardium due to the increased resistance to ejection from the left ventricle to the aorta and/or the periphery, or the long-term burden imposed by the blood increase. Hypertension is a major risk factor that accounts for more than 50% of the causes of cardiovascular disease. If hypertension endure in the long term, the myocardium responds to abnormal heartbeat in the heart. Therefore, the prevalence of left ventricular hypertrophy also increases. As a result of genome-wide association study (GWAS) analysis for European people, PDGFC, MARK3, and BCL2 were related to blood pressures. In this study, the genetic polymorphisms of PDGFC, MARK3, and BCL2 were extracted and selected based on Korean genomic and epidemiologic data, and then logistic regression analysis was performed on LVH. As a result, one SNP (rs9307953) in PDGFC gene, four SNPs (rs6575983, rs17679475, rs2273703 and rs10141388) in MARK3 gene and two SNPs (rs17756073 and rs17070739) in BCL2 gene were statistically significant. The rs6575983 of the MARK3 gene showed the highest significance level ($P=7.2{\times}10^{-3}$) among the SNPs and the relative risk of 1.08 (95% confidence interval: 1.06 to 1.45). These results suggest that the polymorphisms of PDGFC, MARK3, and BCL2 not only affect European blood pressures but also correlate with LVH in Korean. These results suggest that increased understanding of the genetic correlations of the pathogenesis of LVH.

Metabolic Syndrome Prediction Using Machine Learning Models with Genetic and Clinical Information from a Nonobese Healthy Population

  • Choe, Eun Kyung;Rhee, Hwanseok;Lee, Seungjae;Shin, Eunsoon;Oh, Seung-Won;Lee, Jong-Eun;Choi, Seung Ho
    • Genomics & Informatics
    • /
    • 제16권4호
    • /
    • pp.31.1-31.7
    • /
    • 2018
  • The prevalence of metabolic syndrome (MS) in the nonobese population is not low. However, the identification and risk mitigation of MS are not easy in this population. We aimed to develop an MS prediction model using genetic and clinical factors of nonobese Koreans through machine learning methods. A prediction model for MS was designed for a nonobese population using clinical and genetic polymorphism information with five machine learning algorithms, including naïve Bayes classification (NB). The analysis was performed in two stages (training and test sets). Model A was designed with only clinical information (age, sex, body mass index, smoking status, alcohol consumption status, and exercise status), and for model B, genetic information (for 10 polymorphisms) was added to model A. Of the 7,502 nonobese participants, 647 (8.6%) had MS. In the test set analysis, for the maximum sensitivity criterion, NB showed the highest sensitivity: 0.38 for model A and 0.42 for model B. The specificity of NB was 0.79 for model A and 0.80 for model B. In a comparison of the performances of models A and B by NB, model B (area under the receiver operating characteristic curve [AUC] = 0.69, clinical and genetic information input) showed better performance than model A (AUC = 0.65, clinical information only input). We designed a prediction model for MS in a nonobese population using clinical and genetic information. With this model, we might convince nonobese MS individuals to undergo health checks and adopt behaviors associated with a preventive lifestyle.

RNA-Seq De Novo Assembly and Differential Transcriptome Analysis of Korean Medicinal Herb Cirsium japonicum var. spinossimum

  • Roy, Neha Samir;Kim, Jung-A;Choi, Ah-Young;Ban, Yong-Wook;Park, Nam-Il;Park, Kyong-Cheul;Yang, Hee-sun;Choi, Ik-Young;Kim, Soonok
    • Genomics & Informatics
    • /
    • 제16권4호
    • /
    • pp.34.1-34.9
    • /
    • 2018
  • Cirsium japonicum belongs to the Asteraceae or Compositae family and is a medicinal plant in Asia that has a variety of effects, including tumour inhibition, improved immunity with flavones, and antidiabetic and hepatoprotective effects. Silymarin is synthesized by 4-coumaroyl-CoA via both the flavonoid and phenylpropanoid pathways to produce the immediate precursors taxifolin and coniferyl alcohol. Then, the oxidative radicalization of taxifolin and coniferyl alcohol produces silymarin. We identified the expression of genes related to the synthesis of silymarin in C. japonicum in three different tissues, namely, flowers, leaves, and roots, through RNA sequencing. We obtained 51,133 unigenes from transcriptome sequencing by de novo assembly using Trinity v2.1.1, TransDecoder v2.0.1, and CD-HIT v4.6 software. The differentially expressed gene analysis revealed that the expression of genes related to the flavonoid pathway was higher in the flowers, whereas the phenylpropanoid pathway was more highly expressed in the roots. In this study, we established a global transcriptome dataset for C. japonicum. The data shall not only be useful to focus more deeply on the genes related to product medicinal metabolite including flavolignan but also to study the functional genomics for genetic engineering of C. japonicum.

Characterization of Gel16 as a Cytochrome P450 in Geldanamycin Biosynthesis and in-silico Analysis for an Endogenous Electron Transport System

  • Rimal, Hemraj;Yu, Sang-Cheol;Lee, Byeongsan;Hong, Young-Soo;Oh, Tae-Jin
    • Journal of Microbiology and Biotechnology
    • /
    • 제29권1호
    • /
    • pp.44-54
    • /
    • 2019
  • Geldanamycin and its derivatives, inhibitors of heat shock protein 90, are considered potent anticancer drugs, although their biosynthetic pathways have not yet been fully elucidated. The key step of conversion of 4,5-dihydrogeldanamycin to geldanamycin was expected to catalyze by a P450 monooxygenase, Gel16. The adequate bioconversions by cytochrome P450 mostly rely upon its interaction with redox partners. Several ferredoxin and ferredoxin reductases are available in the genome of certain organisms, but only a few suitable partners can operate in full efficiency. In this study, we have expressed cytochrome P450 gel16 in Escherichia coli and performed an in vitro assay using 4,5-dihydrogeldanamycin as a substrate. We demonstrated that the in silico method can be applicable for the efficient mining of convenient endogenous redox partners (9 ferredoxins and 6 ferredoxin reductases) against CYP Gel16 from Streptomyces hygroscopicus. The distances for ligand FDX4-FDR6 were found to be $9.384{\AA}$. Similarly, the binding energy between Gel16-FDX4 and FDX4-FDR6 were -611.88 kcal/mol and -834.48 kcal/mol, respectively, suggesting the lowest distance and binding energy rather than other redox partners. These findings suggest that the best redox partners of Gel16 could be NADPH ${\rightarrow}$ FDR6 ${\rightarrow}$ FDX4 ${\rightarrow}$ Gel16.

Stage specific transcriptome profiles at cardiac lineage commitment during cardiomyocyte differentiation from mouse and human pluripotent stem cells

  • Cho, Sung Woo;Kim, Hyoung Kyu;Sung, Ji Hee;Han, Jin
    • BMB Reports
    • /
    • 제54권9호
    • /
    • pp.464-469
    • /
    • 2021
  • Cardiomyocyte differentiation occurs through complex and finely regulated processes including cardiac lineage commitment and maturation from pluripotent stem cells (PSCs). To gain some insight into the genome-wide characteristics of cardiac lineage commitment, we performed transcriptome analysis on both mouse embryonic stem cells (mESCs) and human induced PSCs (hiPSCs) at specific stages of cardiomyocyte differentiation. Specifically, the gene expression profiles and the protein-protein interaction networks of the mESC-derived platelet-derived growth factor receptor-alpha (PDGFRα)+ cardiac lineage-committed cells (CLCs) and hiPSC-derived kinase insert domain receptor (KDR)+ and PDGFRα+ cardiac progenitor cells (CPCs) at cardiac lineage commitment were compared with those of mesodermal cells and differentiated cardiomyocytes. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses revealed that the genes significantly upregulated at cardiac lineage commitment were associated with responses to organic substances and external stimuli, extracellular and myocardial contractile components, receptor binding, gated channel activity, PI3K-AKT signaling, and cardiac hypertrophy and dilation pathways. Protein-protein interaction network analysis revealed that the expression levels of genes that regulate cardiac maturation, heart contraction, and calcium handling showed a consistent increase during cardiac differentiation; however, the expression levels of genes that regulate cell differentiation and multicellular organism development decreased at the cardiac maturation stage following lineage commitment. Additionally, we identified for the first time the protein-protein interaction network connecting cardiac development, the immune system, and metabolism during cardiac lineage commitment in both mESC-derived PDGFRα+ CLCs and hiPSC-derived KDR+PDGFRα+ CPCs. These findings shed light on the regulation of cardiac lineage commitment and the pathogenesis of cardiometabolic diseases.

Study on Microbial Community Succession and Protein Hydrolysis of Donkey Meat during Refrigerated Storage Based on Illumina NOVA Sequencing Technology

  • Wei, Zixiang;Chu, Ruidong;Li, Lanjie;Zhang, Jingjing;Zhang, Huachen;Pan, Xiaohong;Dong, Yifan;Liu, Guiqin
    • 한국축산식품학회지
    • /
    • 제41권4호
    • /
    • pp.701-714
    • /
    • 2021
  • In this study, the microbial community succession and the protein hydrolysis of donkey meat during refrigerated (4℃) storage were investigated. 16S rDNA sequencing method was used to analyze the bacteria community structure and succession in the level of genome. Meanwhile, the volatile base nitrogen (TVB-N) was measured to evaluate the degradation level of protein. After sorting out the sequencing results, 1,274,604 clean data were obtained, which were clustered into 2,064 into operational taxonomic units (OTUs), annotated to 32 phyla and 527 genus. With the prolonging of storage time, the composition of microorganism changed greatly. At the same time, the diversity and richness of microorganism decreased and then increased. During the whole storage period, Proteobacteria was the dominant phyla, and the Photobacterium, Pseudompnas, and Acinetobacter were the dominant genus. According to correlation analysis, it was found that the abundance of these dominant bacteria was significantly positively correlated with the variation of TVB-N. And Pseudomonas might play an important role in the production of TVB-N during refrigerated storage of donkey meat. The predicted metabolic pathways, based on PICRUSt analysis, indicated that amino metabolism in refrigerated donkey meat was the main metabolic pathways. This study provides insight into the process involved in refrigerated donkey meat spoilage, which provides a foundation for the development of antibacterial preservative for donkey meat.

The Relationship between Parkinson's Disease and Acute Myocardial Infarction in Korea : A Nationwide Longitudinal Cohort Study

  • Sheen, Seung Hun;Hong, Je Beom;Kim, Hakyung;Kim, Jimin;Han, In-bo;Sohn, Seil
    • Journal of Korean Neurosurgical Society
    • /
    • 제65권4호
    • /
    • pp.507-513
    • /
    • 2022
  • Objective : The goal of the following statewide age and gender-coordinated cohort study in Korea is to find out if there is a link between acute myocardial infarction (AMI) and Parkinson's disease (PD). Methods : Utilizing the National Health Insurance Sharing Service cohort, patient data were collected. Six thousand four hundred seventy-five individuals with PD were distinguished by utilizing the International Classification of Diseases 10 code G20 and have enrolled in the PD group. The number of participants decreased to 5259 after excluding 1039 patients who were hospitalized less than one time or who visited an outpatient clinic less than twice. Then, 26295 individuals were selected as part of the control group after case control matching was conducted through 1 : 5 age- and gender-coordinated matching. The Cox proportional hazard regression analysis and Kaplan-Meier method were utilized to analyze the likelihood of AMI in PD. Results : After controlling for age and gender, the hazard ratio of AMI in the PD group was 3.603 (95% confidence interval [CI], 2.837-4.577). After that, the following hazard ratio of AMI in the PD group was modified against for co-morbid medical disorders, resulting in 3.551 (95% CI, 2.795-4.511). According to a subgroup analysis, in males and females aged <65 and aged ≥65 and in the non-diabetes and diabetes, hypertension and non-hypertension, dyslipidemia and non-dyslipidemia subgroups, the AMI incidence rates were dramatically higher in the PD group compared to that of the control. Conclusion : Individuals with PD have a greater chance of AMI, according to this cross-national study.

Modification of ginsenoside saponin composition via the CRISPR/Cas9-mediated knockout of protopanaxadiol 6-hydroxylase gene in Panax ginseng

  • Choi, Han Suk;Koo, Hyo Bin;Jeon, Sung Won;Han, Jung Yeon;Kim, Joung Sug;Jun, Kyong Mi;Choi, Yong Eui
    • Journal of Ginseng Research
    • /
    • 제46권4호
    • /
    • pp.505-514
    • /
    • 2022
  • Background: The roots of Panax ginseng contain two types of tetracyclic triterpenoid saponins, namely, protopanaxadiol (PPD)-type saponins and protopanaxatiol (PPT)-type saponins. In P. ginseng, the protopanaxadiol 6-hydroxylase (PPT synthase) enzyme catalyses protopanaxatriol (PPT) production from protopanaxadiol (PPD). In this study, we constructed homozygous mutant lines of ginseng by CRISPR/Cas9-mediated mutagenesis of the PPT synthase gene and obtained the mutant ginseng root lines having complete depletion of the PPT-type ginsenosides. Methods: Two sgRNAs (single guide RNAs) were designed for target mutations in the exon sequences of the two PPT synthase genes (both PPTa and PPTg sequences) with the CRISPR/Cas9 system. Transgenic ginseng roots were generated through Agrobacterium-mediated transformation. The mutant lines were screened by ginsenoside analysis and DNA sequencing. Result: Ginsenoside analysis revealed the complete depletion of PPT-type ginsenosides in three putative mutant lines (Cr4, Cr7, and Cr14). The reduction of PPT-type ginsenosides in mutant lines led to increased accumulation of PPD-type ginsenosides. The gene editing in the selected mutant lines was confirmed by targeted deep sequencing. Conclusion: We have established the genome editing protocol by CRISPR/Cas9 system in P. ginseng and demonstrated the mutated roots producing only PPD-type ginsenosides by depleting PPT-type ginsenosides. Because the pharmacological activity of PPD-group ginsenosides is significantly different from that of PPT-group ginsenosides, the new type of ginseng mutant producing only PPD-group ginsenosides may have new pharmacological characteristics compared to wild-type ginseng. This is the first report to generate target-induced mutations for the modification of saponin biosynthesis in Panax species using CRISPR-Cas9 system.

High-performance computing for SARS-CoV-2 RNAs clustering: a data science-based genomics approach

  • Oujja, Anas;Abid, Mohamed Riduan;Boumhidi, Jaouad;Bourhnane, Safae;Mourhir, Asmaa;Merchant, Fatima;Benhaddou, Driss
    • Genomics & Informatics
    • /
    • 제19권4호
    • /
    • pp.49.1-49.11
    • /
    • 2021
  • Nowadays, Genomic data constitutes one of the fastest growing datasets in the world. As of 2025, it is supposed to become the fourth largest source of Big Data, and thus mandating adequate high-performance computing (HPC) platform for processing. With the latest unprecedented and unpredictable mutations in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the research community is in crucial need for ICT tools to process SARS-CoV-2 RNA data, e.g., by classifying it (i.e., clustering) and thus assisting in tracking virus mutations and predict future ones. In this paper, we are presenting an HPC-based SARS-CoV-2 RNAs clustering tool. We are adopting a data science approach, from data collection, through analysis, to visualization. In the analysis step, we present how our clustering approach leverages on HPC and the longest common subsequence (LCS) algorithm. The approach uses the Hadoop MapReduce programming paradigm and adapts the LCS algorithm in order to efficiently compute the length of the LCS for each pair of SARS-CoV-2 RNA sequences. The latter are extracted from the U.S. National Center for Biotechnology Information (NCBI) Virus repository. The computed LCS lengths are used to measure the dissimilarities between RNA sequences in order to work out existing clusters. In addition to that, we present a comparative study of the LCS algorithm performance based on variable workloads and different numbers of Hadoop worker nodes.

Genetic analysis of the postsynaptic transmembrane X-linked neuroligin 3 gene in autism

  • Hegde, Rajat;Hegde, Smita;Kulkarni, Suyamindra S.;Pandurangi, Aditya;Gai, Pramod B.;Das, Kusal K.
    • Genomics & Informatics
    • /
    • 제19권4호
    • /
    • pp.44.1-44.9
    • /
    • 2021
  • Autism is a complex neurodevelopmental disorder, the prevalence of which has increased drastically in India in recent years. Neuroligin is a type I transmembrane protein that plays a crucial role in synaptogenesis. Alterations in synaptic genes are most commonly implicated in autism and other cognitive disorders. The present study investigated the neuroligin 3 gene in the Indian autistic population by sequencing and in silico pathogenicity prediction of molecular changes. In total, 108 clinically described individuals with autism were included from the North Karnataka region of India, along with 150 age-, sex-, and ethnicity-matched healthy controls. Genomic DNA was extracted from peripheral blood, and exonic regions were sequenced. The functional and structural effects of variants of the neuroligin 3 protein were predicted. One coding sequence variant (a missense variant) and four non-coding variants (two 5'-untranslated region [UTR] variants and two 3'-UTR variants) were recorded. The novel missense variant was found in 25% of the autistic population. The C/C genotype of c.551T>C was significantly more common in autistic children than in controls (p = 0.001), and a significantly increased risk of autism (24.7-fold) was associated with this genotype (p = 0.001). The missense variant showed pathogenic effects and high evolutionary conservation over the functions of the neuroligin 3 protein. In the present study, we reported a novel missense variant, V184A, which causes abnormal neuroligin 3 and was found with high frequency in the Indian autistic population. Therefore, neuroligin is a candidate gene for future molecular investigations and functional analysis in the Indian autistic population.