• 제목/요약/키워드: genome database

검색결과 355건 처리시간 0.025초

Loss of Heterozygosity at the Calcium Regulation Gene Locus on Chromosome 10q in Human Pancreatic Cancer

  • Long, Jin;Zhang, Zhong-Bo;Liu, Zhe;Xu, Yuan-Hong;Ge, Chun-Lin
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제16권6호
    • /
    • pp.2489-2493
    • /
    • 2015
  • Background: Loss of heterozygosity (LOH) on chromosomal regions is crucial in tumor progression and this study aimed to identify genome-wide LOH in pancreatic cancer. Materials and Methods: Single-nucleotide polymorphism (SNP) profiling data GSE32682 of human pancreatic samples snap-frozen during surgery were downloaded from Gene Expression Omnibus database. Genotype console software was used to perform data processing. Candidate genes with LOH were screened based on the genotype calls, SNP loci of LOH and dbSNP database. Gene annotation was performed to identify the functions of candidate genes using NCBI (the National Center for Biotechnology Information) database, followed by Gene Ontology, INTERPRO, PFAM and SMART annotation and UCSC Genome Browser track to the unannotated genes using DAVID (the Database for Annotation, Visualization and Integration Discovery). Results: The candidate genes with LOH identified in this study were MCU, MICU1 and OIT3 on chromosome 10. MCU was found to encode a calcium transporter and MICU1 could encode an essential regulator of mitochondrial $Ca^{2+}$ uptake. OIT3 possibly correlated with calcium binding revealed by the annotation analyses and was regulated by a large number of transcription factors including STAT, SOX9, CREB, NF-kB, PPARG and p53. Conclusions: Global genomic analysis of SNPs identified MICU1, MCU and OIT3 with LOH on chromosome 10, implying involvement of these genes in progression of pancreatic cancer.

StrokeBase: A Database of Cerebrovascular Disease-related Candidate Genes

  • Kim, Young-Uk;Kim, Il-Hyun;Bang, Ok-Sun;Kim, Young-Joo
    • Genomics & Informatics
    • /
    • 제6권3호
    • /
    • pp.153-156
    • /
    • 2008
  • Complex diseases such as stroke and cancer have two or more genetic loci and are affected by environmental factors that contribute to the diseases. Due to the complex characteristics of these diseases, identifying candidate genes requires a system-level analysis of the following: gene ontology, pathway, and interactions. A database and user interface, termed StrokeBase, was developed; StrokeBase provides queries that search for pathways, candidate genes, candidate SNPs, and gene networks. The database was developed by using in silico data mining of HGNC, ENSEMBL, STRING, RefSeq, UCSC, GO, HPRD, KEGG, GAD, and OMIM. Forty candidate genes that are associated with cerebrovascular disease were selected by human experts and public databases. The networked cerebrovascular disease gene maps also were developed; these maps describe genegene interactions and biological pathways. We identified 1127 genes, related indirectly to cerebrovascular disease but directly to the etiology of cerebrovascular disease. We found that a protein-protein interaction (PPI) network that was associated with cerebrovascular disease follows the power-law degree distribution that is evident in other biological networks. Not only was in silico data mining utilized, but also 250K Affymetrix SNP chips were utilized in the 320 control/disease association study to generate associated markers that were pertinent to the cerebrovascular disease as a genome-wide search. The associated genes and the genes that were retrieved from the in silico data mining system were compared and analyzed. We developed a well-curated cerebrovascular disease-associated gene network and provided bioinformatic resources to cerebrovascular disease researchers. This cerebrovascular disease network can be used as a frame of systematic genomic research, applicable to other complex diseases. Therefore, the ongoing database efficiently supports medical and genetic research in order to overcome cerebrovascular disease.

Utilization of whole genome treasure for the library construction of industrial enzymes

  • Kim, Won-Ho;Cho, Kyoung-Won;Jung, In-Su;Choi, Keum-Hwa;Hur, Byung-Ki;Kim, Geun-Joong
    • 한국생물공학회:학술대회논문집
    • /
    • 한국생물공학회 2003년도 생물공학의 동향(XIII)
    • /
    • pp.815-820
    • /
    • 2003
  • A huge database resulted from whole genome sequencing has provided a possibility of new information that is likely to extent the scope and thus changes the way of approach for the functional assigning of putative open reading frames annotated by whole genome sequence analyses. These are mainly realized by ease, one-step identification of putative genes using genomics or proteomics tools. A major challenge remained in biotechnology may translate these informations into better ways to screen or select a gene as a representative sequence. Further attempts to mine the related whole genes or partial DNA fragment from whole genome treasure, and then the incorporation of these sequences into a representative template, will result in the use of putative genes that can be translated into functional proteins or allowed the generation of new lineages as a valuable pool. Such screens enable rapid biochemical analysis and easy isolation of the target activity, thereby accelerating the screening of novel enzymes from the expanded library with related sequences. Information-based PCR amplification of whole genes and reconstitution of functional DNA fragments will provide a platform for expanding the functional spaces of potential enzymes, especially when used mixed- or metagenome as gene resources.

  • PDF

Functional Annotation and Analysis of Korean Patented Biological Sequences Using Bioinformatics

  • Lee, Byung Wook;Kim, Tae Hyung;Kim, Seon Kyu;Kim, Sang Soo;Ryu, Gee Chan;Bhak, Jong
    • Molecules and Cells
    • /
    • 제21권2호
    • /
    • pp.269-275
    • /
    • 2006
  • A recent report of the Korean Intellectual Property Office(KIPO) showed that the number of biological sequence-based patents is rapidly increasing in Korea. We present biological features of Korean patented sequences though bioinformatic analysis. The analysis is divided into two steps. The first is an annotation step in which the patented sequences were annotated with the Reference Sequence (RefSeq) database. The second is an association step in which the patented sequences were linked to genes, diseases, pathway, and biological functions. We used Entrez Gene, Online Mendelian Inheritance in Man (OMIM), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Gene Ontology (GO) databases. Through the association analysis, we found that nearly 2.6% of human genes were associated with Korean patenting, compared to 20% of human genes in the U.S. patent. The association between the biological functions and the patented sequences indicated that genes whose products act as hormones on defense responses in the extra-cellular environments were the most highly targeted for patenting. The analysis data are available at http://www.patome.net

Perspectives on Clinical Informatics: Integrating Large-Scale Clinical, Genomic, and Health Information for Clinical Care

  • Choi, In Young;Kim, Tae-Min;Kim, Myung Shin;Mun, Seong K.;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • 제11권4호
    • /
    • pp.186-190
    • /
    • 2013
  • The advances in electronic medical records (EMRs) and bioinformatics (BI) represent two significant trends in healthcare. The widespread adoption of EMR systems and the completion of the Human Genome Project developed the technologies for data acquisition, analysis, and visualization in two different domains. The massive amount of data from both clinical and biology domains is expected to provide personalized, preventive, and predictive healthcare services in the near future. The integrated use of EMR and BI data needs to consider four key informatics areas: data modeling, analytics, standardization, and privacy. Bioclinical data warehouses integrating heterogeneous patient-related clinical or omics data should be considered. The representative standardization effort by the Clinical Bioinformatics Ontology (CBO) aims to provide uniquely identified concepts to include molecular pathology terminologies. Since individual genome data are easily used to predict current and future health status, different safeguards to ensure confidentiality should be considered. In this paper, we focused on the informatics aspects of integrating the EMR community and BI community by identifying opportunities, challenges, and approaches to provide the best possible care service for our patients and the population.

Gene Expression Profiling of 6-MP (6-mercaptopurine) in Liver

  • Kim Hyung-Lae;Kim Han-Na;Lee Eun-Ju
    • Genomics & Informatics
    • /
    • 제4권1호
    • /
    • pp.16-22
    • /
    • 2006
  • The KFDA (Korea Food & Drug Administration) has performed a collaborative toxicogenomics project since 2003. Its aim is to construct a toxicology database of 12 compounds administered to mice at initial phase. We chose 6-MP (6-mercaptopurine) which has been used in the treatment of childhood leukemia. It was administered at low (0.224 mg/kg) and at high (2.24 mg/kg) dose (5 mice per group) intraperitonealy to the postnatal 6 weeks mice, then the serum and liver were collected at the indicated time (6, 24 and 72 h) after scarification. Serum biochemical markers for liver toxicity were measured and histopathologic studies also were carried out. The gene expression profiling was carried out by using Applied Biosystems 1700 Full Genome Expression Mouse. By self-organization maps (SOM), we identified groups with unique gene expression patterns, some of them are supposed to be related to 6-MP induced toxicity, including lipid metabolism abnormality, inflammatory response, oxidative stress, ATP depletion and cell death. The potential toxic effects appearing as gene expression changes are dependent of the time of 6-MP but independent of the dosage of it. This study would contribute to establishment of international database as well as national one about hepatotoxicity.

Genes expression monitoring using cDNA microarray: Protocol and Application

  • Muramatsu Masa-aki
    • 한국독성학회:학술대회논문집
    • /
    • 한국독성학회 2000년도 국제심포지움 및 추계학술대회
    • /
    • pp.31-41
    • /
    • 2000
  • The major issue in the post genome sequencing era is determination of gene expression patterns in variety of biological systems. A microarray system is a powerful technology for analyzing the expression profile of thousands of genes at one experiment. In this study, we constructed cDNA microarray which carries 2,304 cDNAS derived from oligo-capped mouse cDNA library. Using this hand-made microarray we determined gene expression in various biological systems. To determine tissue specific genes, we compared Nine genes were highly-expressed in adult mouse brain compared to kidney, liver, and skeletal muscle. Tissue distribution analysis using DNA microarray extracted 9 genes that were predominantly expressed in the brain. A database search showed that five of the 9 genes, MBP, SC1, HiAT3, S100 protein-beta, and SNAP25, were previously known to be expressed at high level in the brain and in the nervous system. One gene was highly sequence similar to rat S-Rex-s/human NSP-C, suggesting that the gene is a mouse homologue. The remaining three genes did not match to known genes in the GenBank/EMBL database, indicating that these are novel genes highly-expressed in the brain. Our DNA microarray was also used to detect differentiation specific genes, hormone dependent genes, and transcription-factor-induced genes. We conclude that DNA microarray is an excellent tool for identifying differentially expressed genes.

  • PDF

Identification of Viral Taxon-Specific Genes (VTSG): Application to Caliciviridae

  • Kang, Shinduck;Kim, Young-Chang
    • Genomics & Informatics
    • /
    • 제16권4호
    • /
    • pp.23.1-23.5
    • /
    • 2018
  • Virus taxonomy was initially determined by clinical experiments based on phenotype. However, with the development of sequence analysis methods, genotype-based classification was also applied. With the development of genome sequence analysis technology, there is an increasing demand for virus taxonomy to be extended from in vivo and in vitro to in silico. In this study, we verified the consistency of the current International Committee on Taxonomy of Viruses taxonomy using an in silico approach, aiming to identify the specific sequence for each virus. We applied this approach to norovirus in Caliciviridae, which causes 90% of gastroenteritis cases worldwide. First, based on the dogma "protein structure determines its function," we hypothesized that the specific sequence can be identified by the specific structure. Firstly, we extracted the coding region (CDS). Secondly, the CDS protein sequences of each genus were annotated by the conserved domain database (CDD) search. Finally, the conserved domains of each genus in Caliciviridae are classified by RPS-BLAST with CDD. The analysis result is that Caliciviridae has sequences including RNA helicase in common. In case of Norovirus, Calicivirus coat protein C terminal and viral polyprotein N-terminal appears as a specific domain in Caliciviridae. It does not include in the other genera in Caliciviridae. If this method is utilized to detect specific conserved domains, it can be used as classification keywords based on protein functional structure. After determining the specific protein domains, the specific protein domain sequences would be converted to gene sequences. This sequences would be re-used one of viral bio-marks.