• Title/Summary/Keyword: Human genome

Search Result 910, Processing Time 0.027 seconds

Codon Usage Bias and Determining Forces in Taenia solium Genome

  • Yang, Xing;Ma, Xusheng;Luo, Xuenong;Ling, Houjun;Zhang, Xichen;Cai, Xuepeng
    • Parasites, Hosts and Diseases
    • /
    • v.53 no.6
    • /
    • pp.689-697
    • /
    • 2015
  • The tapeworm Taenia solium is an important human zoonotic parasite that causes great economic loss and also endangers public health. At present, an effective vaccine that will prevent infection and chemotherapy without any side effect remains to be developed. In this study, codon usage patterns in the T. solium genome were examined through 8,484 protein-coding genes. Neutrality analysis showed that T. solium had a narrow GC distribution, and a significant correlation was observed between GC12 and GC3. Examination of an NC (ENC vs GC3s)-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENC (the effective number of codons) values were detected below the expected curve, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified 26 optimal codons in the T. solium genome, all of which ended with either a G or C residue. These optimal codons in the T. solium genome are likely consistent with tRNAs that are highly expressed in the cell, suggesting that mutational and translational selection forces are probably driving factors of codon usage bias in the T. solium genome.

The Algorithm of implementation for genome analysis ecosystems : Mitochondria's case (유전체 생태계 분석을 위한 알고리즘 구현: 미토콘드리아 사례)

  • Choi, Sung-Ja;Cho, Han-Wook
    • Journal of Digital Convergence
    • /
    • v.14 no.4
    • /
    • pp.349-353
    • /
    • 2016
  • The studies on the human environment and ecosystem analysis is being actively researched. In recent years, The service of genome analysis has been offering the customized service to prevent the disease as reading an individual's genome information. The genome information by analyzing technology is being required accurate and fast analyses of ecosystem-dielectrics due to the spread of the disease, the use of genetically modified organism and the influx of exotic. In this paper the algorithm of K-Mean clustering for a new classification system was utilized. It will provide new dielectrics information as quickly and accurately for many biologists.

Effect of Combining Multiple CNV Defining Algorithms on the Reliability of CNV Calls from SNP Genotyping Data

  • Kim, Soon-Young;Kim, Ji-Hong;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • v.10 no.3
    • /
    • pp.194-199
    • /
    • 2012
  • In addition to single-nucleotide polymorphisms (SNP), copy number variation (CNV) is a major component of human genetic diversity. Among many whole-genome analysis platforms, SNP arrays have been commonly used for genomewide CNV discovery. Recently, a number of CNV defining algorithms from SNP genotyping data have been developed; however, due to the fundamental limitation of SNP genotyping data for the measurement of signal intensity, there are still concerns regarding the possibility of false discovery or low sensitivity for detecting CNVs. In this study, we aimed to verify the effect of combining multiple CNV calling algorithms and set up the most reliable pipeline for CNV calling with Affymetrix Genomewide SNP 5.0 data. For this purpose, we selected the 3 most commonly used algorithms for CNV segmentation from SNP genotyping data, PennCNV, QuantiSNP; and BirdSuite. After defining the CNV loci using the 3 different algorithms, we assessed how many of them overlapped with each other, and we also validated the CNVs by genomic quantitative PCR. Through this analysis, we proposed that for reliable CNV-based genomewide association study using SNP array data, CNV calls must be performed with at least 3 different algorithms and that the CNVs consistently called from more than 2 algorithms must be used for association analysis, because they are more reliable than the CNVs called from a single algorithm. Our result will be helpful to set up the CNV analysis protocols for Affymetrix Genomewide SNP 5.0 genotyping data.

Cell-Free miR-27a, a Potential Diagnostic and Prognostic Biomarker for Gastric Cancer

  • Park, Jong-Lyul;Kim, Mirang;Song, Kyu-Sang;Kim, Seon-Young;Kim, Yong Sung
    • Genomics & Informatics
    • /
    • v.13 no.3
    • /
    • pp.70-75
    • /
    • 2015
  • MicroRNAs (miRNAs) have been demonstrated to play an important role in carcinogenesis. Previous studies revealed that miRNAs are present in human plasma in a remarkably stable form that is protected from endogenous RNase activity. In this study, we measured the plasma expression levels of three miRNAs (miR-21, miR-27a, and miR-155) to investigate the usefulness of miRNAs for gastric cancer detection. We initially examined plasma miRNA expression levels in a screening cohort consisting of 15 patients with gastric cancer and 15 healthy controls from Korean population, using TaqMan quantitative real-time polymerase chain reaction. We observed that the expression level of miR-27a was significantly higher in patients with gastric cancer than in healthy controls, whereas the miR-21 and miR-155a expression levels were not significantly higher in the patients with gastric cancer. Therefore, we further validated the miR-27a expression level in 73 paired gastric cancer tissues and in a validation plasma cohort from 35 patients with gastric cancer and 35 healthy controls. In both the gastric cancer tissues and the validation plasma cohort, the miR-27a expression levels were significantly higher in patients with gastric cancer. Receiver-operator characteristic (ROC) analysis of the validation cohort, revealed an area under the ROC curve value of 0.70 with 75% sensitivity and 56% specificity in discriminating gastric cancer. Thus, the miR-27a expression level in plasma could be a useful biomarker for the diagnosis and/or prognosis of gastric cancer.

Draft genome sequence of lytic bacteriophage KP1 infecting bacterial pathogen Klebsiella pneumoniae (병원균 Klebsiella pneumoniae를 감염시키는 용균 박테리오파지 KP1의 유전체 염기서열 초안)

  • Kim, Youngju;Bang, Ina;Yeon, Young Eun;Park, Joon Young;Han, Beom Ku;Kim, Hyunil;Kim, Donghyuk
    • Korean Journal of Microbiology
    • /
    • v.54 no.2
    • /
    • pp.152-154
    • /
    • 2018
  • Klebsiella pneumoniae is a Gram-negative, rod-shape bacterium causing disease in human and animal lungs. K. pneumoniae has been often found to gain antimicrobial resistance, thus it has been difficult to treat K. pneumoniae infection with antibiotics. For such infection, bacteriophage can provide an alternative approach for pathogenic bacterial infection with antimicrobial resistance, because of its sensitivity and specificity to the host bacteria. Bacteriophage KP1 was isolated in sewage and showed specific infectivity to K. pneumoniae. Here, we report the draft genome sequence of Klebsiella pneumoniae phage KP1. The draft genome of KP1 is 167,989 bp long, and the G + C content is 39.6%. The genome has 295 predicted ORFs and 14 tRNA genes. In addition, it encodes various enzymes which involve in lysis of the host cell such as lysozyme and holin.

Genome sequence of Veillonella atypica KHUD-V1 isolated from a human subgingival dental plaque of periodontitis lesion (사람 치주염 병소의 치은 연하 치태에서 분리된 Veillonella atypica KHUD-V1의 유전체 염기서열 해독)

  • Lee, Jae-Hyung;Shin, Seung-Yun;Lee, Han;Yang, Seok Bin;Jang, Eun-Young;Ryu, Jae-In;Lee, Jin-Yong;Moon, Ji-Hoi
    • Korean Journal of Microbiology
    • /
    • v.55 no.1
    • /
    • pp.77-79
    • /
    • 2019
  • Here we report the genome sequence of Veillonella atypica strain KHUD-V1 isolated from subgingival dental plaque of Korean chronic periodontitis patients. Unlike other V. atypica strains, KHUD-V1 carries two prophage regions and prophage remnants, as well as several genes homologous to prophage-associated virulence factors, such as virulence-associated protein E, a Clp protease, and a toxin-antitoxin system. The isolate and its genome sequence obtained here will aid to understand the diversity of the genome architecture of Veillonella within an evolutionary framework and the role of prophages that contribute to the genetic diversity as well as the virulence of V. atypica.

Genomic DNA Chip: Genome-wide profiling in Cancer

  • 이종호
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2001.10a
    • /
    • pp.61-86
    • /
    • 2001
  • All cancers are caused by abnormalities in DNA sequence. Throughout life, the DNA in human cells is exposed to mutagens and suffers mistakes in replication, resulting in progressive, subtle changes in the DNA sequence in each cell. Since the development of conventional and molecular cytogenetic methods to the analysis of chromosomal aberrations in cancers, more than 1,800 recurring chromosomal breakpoints have been identified. These breakpoints and regions of nonrandom copy number changes typically point to the location of genes involved in cancer initiation and progression. With the introduction of molecular cytogenetic methodologies based on fluorescence in situ hybridization (FISH), namely, comparative genomic hybridization (CGH) and multicolor FISH (m-FISH) in carcinomas become susceptible to analysis. Conventional CGH has been widely applied for the detection of genomic imbalances in tumor cells, and used normal metaphase chromosomes as targets for the mapping of copy number changes. However, this limits the mapping of such imbalances to the resolution limit of metaphase chromosomes (usually 10 to 20 Mb). Efforts to increase this resolution have led to the "new"concept of genomic DNA chip (1 to 2 Mb), whereby the chromosomal target is replaced with cloned DNA immobilized on such as glass slides. The resulting resolution then depends on the size of the immobilized DNA fragments. We have completed the first draft of its Korean Genome Project. The project proceeded by end sequencing inserts from a library of 96,768 bacterial artificial chromosomes (BACs) containing genomic DNA fragments from Korean ethnicity. The sequenced BAC ends were then compared to the Human Genome Project′s publicly available sequence database and aligned according to known cancer gene sequences. These BAC clones were biotinylated by nick translation, hybridized to cytogenetic preparations of metaphase cells, and detected with fluorescein-conjugated avidin. Only locations of unique or low-copy Portions of the clone are identified, because high-copy interspersed repetitive sequences in the probe were suppressed by the addition of unlabelled Cotl DNA. Banding patterns were produced using DAPI. By this means, every BAC fragment has been matched to its appropriate chromosomal location. We have placed 86 (156 BAC clones) cytogenetically defined landmarks to help with the characterization of known cancer genes. Microarray techniques would be applied in CGH by replacement of metaphase chromosome to arrayed BAC confirming in oncogene and tumor suppressor gene: and an array BAC clones from the collection is used to perform a genome-wide scan for segmental aneuploidy by array-CGH. Therefore, the genomic DNA chip (arrayed BAC) will be undoubtedly provide accurate diagnosis of deletions, duplication, insertions and rearrangements of genomic material related to various human phenotypes, including neoplasias. And our tumor markers based on genetic abnormalities of cancer would be identified and contribute to the screening of the stage of cancers and/or hereditary diseases

  • PDF

EST Analysis system for panning gene

  • Hur, Cheol-Goo;Lim, So-Hyung;Goh, Sung-Ho;Shin, Min-Su;Cho, Hwan-Gue
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2000.11a
    • /
    • pp.21-22
    • /
    • 2000
  • Expressed sequence tags (EFTs) are the partial segments of cDNA produced from 5 or 3 single-pass sequencing of cDNA clones, error-prone and generated in highly redundant sets. Advancement and expansion of Genomics made biologists to generate huge amount of ESTs from variety of organisms-human, microorganisms as well as plants, and the cumulated number of ESTs is over 5.3 million, As the EST data being accumulate more rapidly, it becomes bigger that the needs of the EST analysis tools for extraction of biological meaning from EST data. Among the several needs of EST analyses, the extraction of protein sequence or functional motifs from ESTs are important for the identification of their function in vivo. To accomplish that purpose the precise and accurate identification of the region where the coding sequences (CDSs) is a crucial problem to solve primarily, and it will be helpful to extract and detect of genuine CD5s and protein motifs from EST collections. Although several public tools are available for EST analysis, there is not any one to accomplish the object. Furthermore, they are not targeted to the plant ESTs but human or microorganism. Thus, to correspond the urgent needs of collaborators deals with plant ESTs and to establish the analysis system to be used as general-purpose public software we constructed the pipelined-EST analysis system by integration of public software components. The software we used are as follows - Phred/Cross-match for the quality control and vector screening, NCBI Blast for the similarity searching, ICATools for the EST clustering, Phrap for EST contig assembly, and BLOCKS/Prosite for protein motif searching. The sample data set used for the construction and verification of this system was 1,386 ESTs from human intrathymic T-cells that verified using UniGene and Nr database of NCBI. The approach for the extraction of CDSs from sample data set was carried out by comparison between sample data and protein sequences/motif database, determining matched protein sequences/motifs that agree with our defined parameters, and extracting the regions that shows similarities. In recent future, in addition to these components, it is supposed to be also integrated into our system and served that the software for the peptide mass spectrometry fingerprint analysis, one of the proteomics fields. This pipelined-EST analysis system will extend our knowledge on the plant ESTs and proteins by identification of unknown-genes.

  • PDF

Genome-wide hepatic DNA methylation changes in high-fat diet-induced obese mice

  • Yoon, AhRam;Tammen, Stephanie A.;Park, Soyoung;Han, Sung Nim;Choi, Sang-Woon
    • Nutrition Research and Practice
    • /
    • v.11 no.2
    • /
    • pp.105-113
    • /
    • 2017
  • BACKGROUND/OBJECTIVES: A high-fat diet (HFD) induces obesity, which is a major risk factor for cardiovascular disease and cancer, while a calorie-restricted diet can extend life span by reducing the risk of these diseases. It is known that health effects of diet are partially conveyed through epigenetic mechanism including DNA methylation. In this study, we investigated the genome-wide hepatic DNA methylation to identify the epigenetic effects of HFD-induced obesity. MATERIALS AND METHODS: Seven-week-old male C57BL/6 mice were fed control diet (CD), calorie-restricted control diet (CRCD), or HFD for 16 weeks (after one week of acclimation to the control diet). Food intake, body weight, and liver weight were measured. Hepatic triacylglycerol and cholesterol levels were determined using enzymatic colorimetric methods. Changes in genome-wide DNA methylation were determined by a DNA methylation microarray method combined with methylated DNA immunoprecipitation. The level of transcription of individual genes was measured by real-time PCR. RESULTS: The DNA methylation statuses of genes in biological networks related to lipid metabolism and hepatic steatosis were influenced by HFD-induced obesity. In HFD group, a proinflammatory Casp1 (Caspase 1) gene had hypomethylated CpG sites at the 1.5-kb upstream region of its transcription start site (TSS), and its mRNA level was higher compared with that in CD group. Additionally, an energy metabolism-associated gene Ndufb9 (NADH dehydrogenase 1 beta subcomplex 9) in HFD group had hypermethylated CpG sites at the 2.6-kb downstream region of its TSS, and its mRNA level was lower compared with that in CRCD group. CONCLUSIONS: HFD alters DNA methylation profiles in genes associated with liver lipid metabolism and hepatic steatosis. The methylation statuses of Casp1 and Ndufb9 were particularly influenced by the HFD. The expression of these genes in HFD differed significantly compared with CD and CRCD, respectively, suggesting that the expressions of Casp1 and Ndufb9 in liver were regulated by their methylation statuses.