• Title/Summary/Keyword: Genomic Structure

Search Result 286, Processing Time 0.026 seconds

Genomic Organization of ancop Gene for ${\alpha}-COP$ Homolog from Aspergillus nidulans

  • Lee, Hwan-Hee;Chae, Shun-Kee;Kim, Jeong-Yoon;Maeng, Pil-Jae;Park, Hee-Moon
    • Mycobiology
    • /
    • v.28 no.4
    • /
    • pp.171-176
    • /
    • 2000
  • We have cloned a ${\alpha}-COP$ homolog, ancop, from Aspergillus nidulans by colony hybridization of chromosome specific library using ${\alpha}-COP$ homologous fragment as a probe. The probe DNA was amplified with degenerated primers designed by comparison of conserved region of the amino acid sequences of Saccharomyces cerevisiae ${\alpha}-COP$, Homo sapiens HEP-COP, and Drosophila melanogaster ${\alpha}-COP$. Full length cDNA clone was also amplified by RT-PCR. Comparison of genomic DNA sequence with cDNA sequence obtained by RT-PCR revealed 7 introns. Amino acid sequence similarity search of the anCop with other ${\alpha}-COPs$ gave an overall identity of 52% with S. cerevisiae, 47% with human and bovine, 45% with Drosophila and Arabidopsis. In upstream region from the transcription start site, a putative TATA and CAAT motif were also identified.

  • PDF

Comparison Architecture for Large Number of Genomic Sequences

  • Choi, Hae-won;Ryoo, Myung-Chun;Park, Joon-Ho
    • Journal of Information Technology and Architecture
    • /
    • v.9 no.1
    • /
    • pp.11-19
    • /
    • 2012
  • Generally, a suffix tree is an efficient data structure since it reveals the detailed internal structures of given sequences within linear time. However, it is difficult to implement a suffix tree for a large number of sequences because of memory size constraints. Therefore, in order to compare multi-mega base genomic sequence sets using suffix trees, there is a need to re-construct the suffix tree algorithms. We introduce a new method for constructing a suffix tree on secondary storage of a large number of sequences. Our algorithm divides three files, in a designated sequence, into parts, storing references to the locations of edges in hash tables. To execute experiments, we used 1,300,000 sequences around 300Mbyte in EST to generate a suffix tree on disk.

Full Length cDNA, Genomic Organizations and Expression Profiles of the Porcine Proteasomal ATPases PSMC5 Gene

  • Wang, Y.F.;Yu, M.;Liu, B.;Fan, B.;Wang, H.;Zhu, M.J.;Li, K.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.17 no.7
    • /
    • pp.897-902
    • /
    • 2004
  • PSMC5 subunit, which belongs to the 26S proteasomal subunit family, plays an important role in the antigen presentation mediated by MHC class I molecular. Full-length cDNA of porcine PSMC5 was isolated using the in silico cloning and rapid amplification of cDNA ends (RACE). Amino acid was deduced and the primary structure was analyzed. Results revealed that the porcine PSMC5 gene shares the high degree of sequence similarity with its mammalian counterparts at both the nucleotide level and the amino acid level. The RT-PCR was performed to detect the porcine PSMC5 expression pattern in seven tissues and the result showed that high express level was observed in spleen, lung, marrow and liver while the low express level was in muscle. The full-length genomic DNA sequence of porcine PSMC5 gene was amplified by PCR and the genomic structure revealed that this gene was comprised by 12 exons and 11 introns. Best alignment of the cDNA and genomic exon DNA sequence presents 4 mismatches and this information potentially bears further study in gene polymorphisms.

Genomic Diversity of Helicobacter pylori

  • Lee, Woo-Kon;Choi, Sang-Haeng;Park, Seong-Gyu;Choi, Yeo-Jeong;Choe, Mi-Young;Park, Jeong-Won;Jung, Sun-Ae;Byun, Eun-Young;Song, Jae-Young;Jung, Tae-Sung;Lee, Byung-Sang;Baik, Seung-Chul;Cho, Myung-Je
    • The Journal of the Korean Society for Microbiology
    • /
    • v.34 no.6
    • /
    • pp.519-532
    • /
    • 1999
  • Helicobacter pylori is a causative agent of type B gastritis and plays a central role in the pathogenesis of gastroduodenal ulcer and gastric cancer. To elucidate the host-parasite relationship of the H. pylori infection on the basis of molecular biology, we tried to evaluate the genomic diversity of H. pylori. An ordered overlapping bacterial artificial chromosome (BAC) library of a Korean isolate, H. pylori 51 was constructed to set up a genomic map. A circular physical map was constructed by aligning ApaI, NotI and SfiI-digested chromosomal DNA. When the physical map of H. pylori 51 was compared to that of unrelated strain, H. pylori 26695, completely different restriction patterns were shown. Fifteen known genes were mapped on the chromosome of H. pylori 51 and the genetic map was compared with those of strain 26695 and J99, of which the entire genomic sequences were reported. There were some variability in the gene location as well as gene order among three strains. For further analysis on the genomic diversity of H. pylori, when comparing the genomic structure of 150 H. pylori Korean isolates with one another, genomic macrodiversity of H. pylori was characterized by several features: whether or not susceptible to restriction digestion of the chromsome, variation in chromosomal restriction fingerprint and/or high frequency of gene rearrangement. We also examined the extent of allelic variation in nucleotide or deduced amino acid sequences at the individual gene level. fucT, cagA and vacA were confirmed to carry regions of high variation in nucleotide sequence among strains. The plasticity zone and strain-specific genes of H. pylori 51 were analyzed and compared with the former two genomic sequences. It should be noted that the H. pylori 51-specific sequences were dispersed on the chromosome, not congregated in the plasticity zone unlike J99- or 26695-specific genes, suggesting the high frequency of gene rearrangement in H. pylori genome. The genome of H. pylori 51 shows differences in the overall genomic organization, gene order, and even in the nucleotide sequences among the H. pylori strains, which are far greater than the differences reported on the genomic diversity of H. pylori.

  • PDF

Genomic Organization of Penicillium chrysogenum chs4, a Class III Chitin Synthase Gene

  • Park, Yoon-Dong;Lee, Myung-Sook;Kim, Ji-Hoon;Jun Namgung;Park, Bum-Chan;Bae, Kyung-Sook;Park, Hee-Moon
    • Journal of Microbiology
    • /
    • v.38 no.4
    • /
    • pp.230-238
    • /
    • 2000
  • Class III chitin synthases in filamentous fungi are important for hyphal growth and differentiation of several filamentous fungi. A genomic clone containing the full gene encoding Chs4, a class III chitin synthase in Penicillium chrysogenum, was cloned by PCR screening and colony hybridization from the genomic library. Nucleotide sequence analysis and transcript mapping of chs4 revealed an open reading frame (ORF) that consisted of 5 exons and 4 introns and encoded a putative protein of 915 amino acids. Nucleotide sequence analysis of the 5'flanking region of the ORF revealed a potential TATA box and several binding sites for transcription activators. The putative transcription initiation site at -716 position was identified by primer extension and the expression of the chs4 during the vegetative growth was confirmed by Northern blot analysis. Amino acid sequence analysis of the Chs4 revealed at least 5 transmembrane helices and several sites for past-transnational modifications. Comparison of the amino acid sequence of Chs4 with those of other fungi showed a close relationship between P chrysogenum and genus Aspergillus.

  • PDF

Network-based regularization for analysis of high-dimensional genomic data with group structure (그룹 구조를 갖는 고차원 유전체 자료 분석을 위한 네트워크 기반의 규제화 방법)

  • Kim, Kipoong;Choi, Jiyun;Sun, Hokeun
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.6
    • /
    • pp.1117-1128
    • /
    • 2016
  • In genetic association studies with high-dimensional genomic data, regularization procedures based on penalized likelihood are often applied to identify genes or genetic regions associated with diseases or traits. A network-based regularization procedure can utilize biological network information (such as genetic pathways and signaling pathways in genetic association studies) with an outstanding selection performance over other regularization procedures such as lasso and elastic-net. However, network-based regularization has a limitation because cannot be applied to high-dimension genomic data with a group structure. In this article, we propose to combine data dimension reduction techniques such as principal component analysis and a partial least square into network-based regularization for the analysis of high-dimensional genomic data with a group structure. The selection performance of the proposed method was evaluated by extensive simulation studies. The proposed method was also applied to real DNA methylation data generated from Illumina Innium HumanMethylation27K BeadChip, where methylation beta values of around 20,000 CpG sites over 12,770 genes were compared between 123 ovarian cancer patients and 152 healthy controls. This analysis was also able to indicate a few cancer-related genes.

An Analysis System for Whole Genomic Sequence Using String B-Tree (스트링 B-트리를 이용한 게놈 서열 분석 시스템)

  • Choe, Jeong-Hyeon;Jo, Hwan-Gyu
    • The KIPS Transactions:PartA
    • /
    • v.8A no.4
    • /
    • pp.509-516
    • /
    • 2001
  • As results of many genome projects, genomic sequences of many organisms are revealed. Various methods such as global alignment, local alignment are used to analyze the sequences of the organisms, and k -mer analysis is one of the methods for analyzing the genomic sequences. The k -mer analysis explores the frequencies of all k-mers or the symmetry of them where the k -mer is the sequenced base with the length of k. However, existing on-memory algorithms are not applicable to the k -mer analysis because a whole genomic sequence is usually a large text. Therefore, efficient data structures and algorithms are needed. String B-tree is a good data structure that supports external memory and fits into pattern matching. In this paper, we improve the string B-tree in order to efficiently apply the data structure to k -mer analysis, and the results of k -mer analysis for C. elegans and other 30 genomic sequences are shown. We present a visualization system which enables users to investigate the distribution and symmetry of the frequencies of all k -mers using CGR (Chaotic Game Representation). We also describe the method to find the signature which is the part of the sequence that is similar to the whole genomic sequence.

  • PDF

Analysis of Genetic Diversity and Population Structure of Wild Strains and Cultivars Using Genomic SSR Markers in Lentinula edodes

  • Lee, Hwa-Yong;Moon, Suyun;Ro, Hyeon-Su;Chung, Jong-Wook;Ryu, Hojin
    • Mycobiology
    • /
    • v.48 no.2
    • /
    • pp.115-121
    • /
    • 2020
  • In this study, the genetic diversity and the population structure of 77 wild strains and 23 cultivars of Lentinula edodes from Korea were analyzed using 20 genomic SSRs, and their genetic relationship was investigated. The tested strains of L. edodes were divided into three sub-groups consisting of only wild strains, mainly wild strains and several cultivars, and mainly cultivars and several wild strains by distance-based analysis. Using model-based analysis, L. edodes strains were divided into two subpopulations; the first one consisting of only wild strains and the second one with mainly cultivars and several wild strains. Moreover, AMOVA analysis revealed that the genetic variation in the cultivars was higher than that in the wild strains. The expected and observed heterozygosity and values indicating the polymorphic information content of L. edodes cultivars from Korea were also higher than that of the wild strains. Based on these results, we presume that the cultivars in Korea have developed by using numerous strains from other countries. In conclusion, the usage of wild strains for the development of new cultivars could improve the adaptability of L. edodes to biotic and abiotic stress.

Identification of genomic diversity and selection signatures in Luxi cattle using whole-genome sequencing data

  • Mingyue Hu;Lulu Shi;Wenfeng Yi;Feng Li;Shouqing Yan
    • Animal Bioscience
    • /
    • v.37 no.3
    • /
    • pp.461-470
    • /
    • 2024
  • Objective: The objective of this study was to investigate the genetic diversity, population structure and whole-genome selection signatures of Luxi cattle to reveal its genomic characteristics in terms of meat and carcass traits, skeletal muscle development, body size, and other traits. Methods: To further analyze the genomic characteristics of Luxi cattle, this study sequenced the whole-genome of 16 individuals from the core conservation farm in Shandong region, and collected 174 published genomes of cattle for conjoint analysis. Furthermore, three different statistics (pi, Fst, and XP-EHH) were used to detect potential positive selection signatures related to selection in Luxi cattle. Moreover, gene ontology and Kyoto encyclopedia of genes and genomes pathway enrichment analyses were performed to reveal the potential biological function of candidate genes harbored in selected regions. Results: The results showed that Luxi cattle had high genomic diversity and low inbreeding levels. Using three complementary methods (pi, Fst, and XP-EHH) to detect the signatures of selection in the Luxi cattle genome, there were 2,941, 2,221 and 1,304 potentially selected genes identified, respectively. Furthermore, there were 45 genes annotated in common overlapping genomic regions covered 0.723 Mb, including PLAG1 zinc finger (PLAG1), dedicator of cytokinesis 3 (DOCK3), ephrin A2 (EFNA2), DAZ associated protein 1 (DAZAP1), Ral GTPase activating protein catalytic subunit alpha 1 (RALGAPA1), mediator complex subunit 13 (MED13), and decaprenyl diphosphate synthase subunit 2 (PDSS2), most of which were enriched in pathways related to muscle growth and differentiation and immunity. Conclusion: In this study, we provided a series of genes associated with important economic traits were found in positive selection regions, and a scientific basis for the scientific conservation and genetic improvement of Luxi cattle.