• Title/Abstract/Keywords: Genome Structure

440 search results, processing time 0.031 s

A Survey of the Brassica rapa Genome by BAC-End Sequence Analysis and Comparison with Arabidopsis thaliana

  • Hong, Chang Pyo;Plaha, Prikshit;Koo, Dal-Hoe;Yang, Tae-Jin;Choi, Su Ryun;Lee, Young Ki;Uhm, Taesik;Bang, Jae-Wook;Edwards, David;Bancroft, Ian;Park, Beom-Seok;Lee, Jungho;Lim, Yong Pyo
    • Molecules and Cells
    • /
    • Vol. 22, No. 3
    • /
    • pp.300-307
    • /
    • 2006
  • Brassica rapa ssp. pekinensis (Chinese cabbage) is an economically important crop and a model plant for studies on polyploidization and phenotypic evolution. To gain an insight into the structure of the B. rapa genome we analyzed 12,017 BAC-end sequences for the presence of transposable elements (TEs), SSRs, centromeric satellite repeats and genes, and similarity to the closely related genome of Arabidopsis thaliana. TEs were estimated to occupy 14% of the genome, with 12.3% of the genome represented by retrotransposons. It was estimated that the B. rapa genome contains 43,000 genes, 1.6 times greater than the genome of A. thaliana. A number of centromeric satellite sequences, representing variations of a 176-bp consensus sequence, were identified. This sequence has undergone rapid evolution within the B. rapa genome and has diverged among the related species of Brassicaceae. A study of SSRs demonstrated a non-random distribution with a greater abundance within predicted intergenic regions. Our results provide an initial characterization of the genome of B. rapa and provide the basis for detailed analysis through whole-genome sequencing.

A truncated form of human alpha 1-acid glycoprotein is useful as a molecular tool for insect glycobiology

  • Morokuma, Daisuke;Hino, Masato;Tsuchioka, Miho;Masuda, Akitsu;Mon, Hiroaki;Fujiyama, Kazuhito;Kajiura, Hiroyuki;Kusakabe, Takahiro;Lee, Jae Man
    • International Journal of Industrial Entomology and Biomaterials
    • /
    • Vol. 36, No. 1
    • /
    • pp.15-24
    • /
    • 2018
  • N-glycosylation is an important posttranslational modification that contributes to a variety of biological activities, structural stability, and protein-protein interactions. Many aspects of the structure and function of N-glycans remain unclear, and detailed elucidation is necessary. The baculovirus expression system (BES) is widely used to produce recombinant glycoproteins, but it is not suitable for clinical use because N-glycan structures differ between insects and mammals. Adequate model glycoproteins are needed for analysis in order to efficiently alter the insect-type N-glycosylation pathway to the human type. Previous research shows that recombinant alpha 1-acid glycoprotein (α1AGP) secreted from silkworm cultured cells or larvae is highly glycosylated and is expected to be an excellent candidate for analyzing glycoproteins expressed by BES. Therefore, we modified α1AGP to make it a better model for studying glycosylation. The modified α1AGP (α1AGPΔ) recombinant protein was successfully expressed and purified using BES; however, its expression levels in silkworm cultured cells and larvae were lower than those of α1AGP. Subsequently, we confirmed the detailed N-glycan profile of α1AGPΔ by LC/MS analysis of the N-glycan structure at each glycosylation site. These results indicate that recombinant α1AGPΔ could be used as a better model glycoprotein for N-glycosylation research in BES.

Phylogenomics and its Growing Impact on Algal Phylogeny and Evolution

  • ;윤환수
    • ALGAE
    • /
    • Vol. 21, No. 1
    • /
    • pp.1-10
    • /
    • 2006
  • Genomic data are accumulating in public databases at an unprecedented rate. Although presently dominated by the sequences of metazoan, plant, parasitic, and picoeukaryotic taxa, both expressed sequence tag (EST) data and complete genomes of free-living algae are also slowly appearing. This wealth of information offers the opportunity to clarify many long-standing issues in algal and plant evolution, such as the contribution of the plastid endosymbiont to nuclear genome evolution, using the tools of comparative genomics and multi-gene phylogenetics. A particularly powerful approach for the automated analysis of genome data from multiple taxa is termed phylogenomics. Phylogenomics is the convergence of genomics (the study of the function and structure of genes and genomes) and molecular phylogenetics (the study of the hierarchical evolutionary relationships among organisms, their genes, and genomes). The use of phylogenetics to drive comparative genome analyses has facilitated the reconstruction of the evolutionary history of genes, gene families, and organisms. Here we survey the available genome data, introduce phylogenomic pipelines, and review some initial results of phylogenomic analyses of algal genome data.

Complete genome sequence analysis Hosta virus X and comparison to other potexviruses

  • Park, M.H.;K.H. Ryu
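    • Korean Society of Plant Pathology: Conference Proceedings (한국식물병리학회:학술대회논문집)
    • /
    • 2003 Annual Meeting and Fall Conference of the Korean Society of Plant Pathology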
    • /
    • pp.113.1-113
    • /
    • 2003
  • A potexvirus, Hosta virus X (HVX-Kr), causing mosaic and mottle symptoms was isolated from hosta plants (Hosta spp.) in Korea, and its entire genome RNA sequence was determined using cDNA library and RACE methods. The genome of HVX encodes five open reading frames coding for viral replicase, triple gene block (TGB), and viral coat protein (CP) from the 5' to 3' ends, which is a typical genome structure of potexviruses. The 3'-terminal region of the virus includes TGB1 (26 kDa), TGB2 (13 kDa), TGB3 (8 kDa), the 23 kDa coat protein (CP), and the 3'-nontranslated region (NTR). The CP gene of the type isolate of HVX (HVX-U) was amplified by RT-PCR and its nucleotide sequence was determined. The CPs of HVX-Kr and HVX-U had 100% and 98.9% identical amino acids and nucleotides, respectively. Most regions of the HVX genome shared over 50% nucleotide identity with other sequenced potexviruses. This is the first report of complete genome sequence information for HVX and of molecular evidence supporting the virus as a distinct species of the genus Potexvirus.

Comparative analysis of core and pan-genomes of order Nitrosomonadales

  • 이진환;김경호
    • Korean Journal of Microbiology (미생물학회지)
    • /
    • Vol. 51, No. 4
    • /
    • pp.329-337
    • /
    • 2015
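  • A pan-genome and core-genome analysis was performed using all strains of the order Nitrosomonadales whose genome sequences are currently known (N=10); the pan-genome and core genome contained 9,808 and 908 gene clusters, respectively. Comparison with reference groups from other orders of Betaproteobacteria confirmed that the sizes of the pan-genome and core genome are affected by the number of genomes and by the differences among the genomes within a group. The pan-genomes of the genera Nitrosomonas and Nitrosospira were measured at 7,180 and 4,586 gene clusters, and their core genomes at 1,092 and 1,600, respectively, indicating that the genus Nitrosospira is more homogeneous. The genus Nitrosomonas was found to account for most of the influence on the pan-genome and core-genome sizes of the order Nitrosomonadales. COG analysis showed that category J (translation, ribosomal structure and biogenesis) makes up the largest proportion (9.7-21.0%) of the core genome, and that this proportion rises as the genetic distance among the genomes in a group increases. The "-" (unclassified) category accounted for a high proportion (34-51%) of the pan-genome and therefore had a large influence on its size. A total of 97 gene clusters were found to be present only in Nitrosomonadales and absent from the reference groups. These clusters include amoA and amoB, the ammonia monooxygenase genes that characterize Nitrosomonadales, and the related amoE and amoD, but they also include a considerable share (16-45%) of unclassified genes. These gene clusters will play an important role in revealing the genetic specificity of Nitrosomonadales.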
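
The pan-genome and core-genome counts reported in this abstract can be read as a set union and a set intersection over per-genome gene clusters. The following is a minimal sketch under that reading, not the authors' pipeline: the function name `pan_and_core` and the toy cluster IDs are hypothetical, and the clustering step that assigns genes to clusters is assumed to have been done already.

```python
from typing import Dict, Set, Tuple


def pan_and_core(genomes: Dict[str, Set[str]]) -> Tuple[Set[str], Set[str]]:
    """Return (pan-genome, core genome) as sets of gene-cluster IDs.

    pan-genome  = clusters present in at least one genome (union)
    core genome = clusters present in every genome (intersection)
    """
    cluster_sets = list(genomes.values())
    if not cluster_sets:
        return set(), set()
    pan = set().union(*cluster_sets)
    core = set.intersection(*cluster_sets)
    return pan, core


# Hypothetical toy input: three genomes, each mapped to its gene clusters.
toy_genomes = {
    "genome_A": {"c1", "c2", "c3"},
    "genome_B": {"c1", "c2", "c4"},
    "genome_C": {"c1", "c5"},
}

pan, core = pan_and_core(toy_genomes)
print(len(pan), len(core))
```

Adding a divergent genome can only grow the union and shrink the intersection, which is consistent with the abstract's observation that both the number of genomes and their differences affect the two sizes.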

Development of Workbench for Analysis and Visualization of Whole Genome Sequence

  • 최정현;진희정;김철민;장철훈;조환규
    • The KIPS Transactions: Part A (정보처리학회논문지A)
    • /
    • Vol. 9A, No. 3
    • /
    • pp.387-398
    • /
    • 2002
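  • As many small-scale genome projects have recently determined the complete genome sequences of numerous organisms, research on individual genes and their associated functions that takes the whole genome as the basic unit has become very active. A whole-genome sequence is a large body of text data, ranging from several million bp (base pairs) to tens of billions of bp, so analyzing it with a simple on-line string matching algorithm is very inefficient. This paper describes the development of a workbench for the analysis and visualization of genome sequences that uses the String B-tree, a data structure suited to analyzing large genome sequences. The system consists of a query part and a visualization part. The query part offers basic pattern searches, such as finding the positions and number of occurrences of a specific sequence in a genome or finding sequences that occur k times, along with a variety of queries for k-mer analysis. The visualization part displays whole-genome sequences and annotations, and helps biologists grasp the overall structure and characteristics of a genome through several visualization methods, including CGR (Chaos Game Representation), k-mer graphs, and RWP (Random Walk Plot). The proposed analysis system can be used to clarify evolutionary relationships among organisms and to study new, not-yet-identified genes within chromosomes and the roles of junk DNA whose functions are unknown.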
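
Two of the operations named in this abstract, finding sequences that occur k times and computing a Chaos Game Representation, can be sketched in a few lines. This is an illustration only: it uses a plain in-memory `Counter` rather than the String B-tree the paper describes, the function names are hypothetical, and the corner assignment for CGR (A, C, G, T on the corners of the unit square) is one common convention assumed here.

```python
from collections import Counter
from typing import Dict, List, Tuple


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping k-mer in seq."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmers_occurring(seq: str, k: int, times: int) -> List[str]:
    """Return the k-mers that occur exactly `times` times, sorted."""
    return sorted(kmer for kmer, n in kmer_counts(seq, k).items() if n == times)


# Assumed corner convention for the Chaos Game Representation.
CORNERS: Dict[str, Tuple[float, float]] = {
    "A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0),
}


def cgr_points(seq: str) -> List[Tuple[float, float]]:
    """Start at the center and move halfway toward each base's corner."""
    x, y = 0.5, 0.5
    points = []
    for base in seq.upper():
        if base not in CORNERS:  # skip ambiguous symbols such as N
            continue
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


print(kmers_occurring("ACGTACGT", 2, 2))
print(cgr_points("AG"))
```

A dictionary of counts works for short sequences held in memory; the paper's point is that genome-scale text calls for an index structure such as the String B-tree instead.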

Trend and Technology of Gene and Genome Research

  • 이진성;김기환;서동상;강석우;황재삼
    • Korean Journal of Sericultural Science (한국잠사곤충학회지)
    • /
    • Vol. 42, No. 2
    • /
    • pp.126-141
    • /
    • 2000
  • A major step towards understanding the genetic basis of an organism is the complete sequence determination of all genes in the target genome. The nucleotide sequence encoded in the genome contains the information that specifies the amino acid sequence of every protein and functional RNA molecule. In principle, it will be possible to identify every protein responsible for the structure and function of the body of the target organism. The pattern of expression in different cell types will specify where and when each protein is used. The amino acid sequence of the protein encoded by each gene will be derived from the conceptual translation of the nucleotide sequence. Comparison of these sequences with those of known proteins, whose sequences are stored in databases, will suggest an approximate function for many proteins. This mini review describes the development of new sequencing methods and the optimization of sequencing strategies for whole-genome, cDNA, and genomic analysis.

The complete chloroplast genome sequence of Rhododendron caucasicum (Ericaceae)

  • Myounghai KWAK;Rainer W. BUSSMANN
    • Korean Journal of Plant Taxonomy (식물분류학회지)
    • /
    • Vol. 53, No. 3
    • /
    • pp.230-236
    • /
    • 2023
  • Rhododendron caucasicum Pall. is a shrub distributed in the mountainous areas of the Caucasus from northeastern Türkiye towards the Caspian Sea. This study reports the first complete chloroplast genome sequence of R. caucasicum. The plastome is 199,487 base pairs (bp) long and exhibits a typical quadripartite structure comprising a large single-copy region of 107,645 bp, a small single-copy region of 2,598 bp, and a pair of identical inverted repeat regions of 44,622 bp each. It contains 143 genes, comprising 93 protein-coding genes, 42 tRNA genes, and eight rRNA genes. The large chloroplast genome size is likely due to the expansion of the inverted repeats. A phylogenetic analysis of chloroplast genomes with other Rhododendron species supports previously recognized infrageneric relationships.
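
The quadripartite figures in this abstract can be checked with simple arithmetic: the total length should equal the large single-copy region plus the small single-copy region plus two copies of the inverted repeat. The short sketch below does that check with the numbers quoted above; the function name `plastome_length` is hypothetical.

```python
def plastome_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region lengths (bp) as reported for Rhododendron caucasicum.
LSC, SSC, IR = 107_645, 2_598, 44_622

total = plastome_length(LSC, SSC, IR)
ir_fraction = 2 * IR / total  # share of the plastome occupied by both IR copies

print(total)
print(round(ir_fraction, 3))
```

The two inverted repeats together make up a little under half of the plastome, which fits the abstract's remark that the large genome size is likely due to inverted repeat expansion.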

The complete chloroplast genome of Polygonatum falcatum (Asparagaceae)

  • CHOI, Tae-Young;YUN, Se-Hyun;LEE, Soo-Rang
    • Korean Journal of Plant Taxonomy (식물분류학회지)
    • /
    • Vol. 52, No. 1
    • /
    • pp.80-83
    • /
    • 2022
  • Polygonatum falcatum is a perennial herb distributed in East Asia. We determined the characteristics of the complete chloroplast genome of P. falcatum for the first time using a de novo assembly strategy. The chloroplast genome is 154,579 bp in length and harbors 87 protein-coding genes, 38 tRNA genes, and eight rRNA genes. It exhibits a typical quadripartite structure comprising a large single-copy (LSC) region (83,528 bp), a small single-copy (SSC) region (18,457 bp), and a pair of inverted repeats (IRs) (26,297 bp each). Phylogenetic analysis of 16 chloroplast genomes from Asparagaceae reveals that the genus Polygonatum is a monophyletic group and that P. falcatum clusters with its congener, P. odoratum.

Structural Characterization of the Genome of BERV γ4 the Most Abundant Endogenous Retrovirus Family in Cattle

  • Xiao, Rui;Park, Kwangha;Oh, Younshin;Kim, Jinhoi;Park, Chankyu
    • Molecules and Cells
    • /
    • Vol. 26, No. 4
    • /
    • pp.404-408
    • /
    • 2008
  • The genome of the replication-competent BERV γ4 provirus, a member of the most abundant ERV family in the bovine genome, was characterized in detail. The BERV γ4 genome comprises 8576 nucleotides and has the typical 5'-long terminal repeat (LTR)-gag-pro-pol-env-LTR-3' retroviral organization, with a long leader region positioned before the gag open reading frame. Multiple sequence analysis showed that the nucleotide difference between the 5' and 3' LTRs was 4.2% on average (mean value 0.042), suggesting that the provirus formed at most 13.3 million years ago. Gag is separated from pro-pol by a stop codon in the same reading frame, while env resides in another reading frame and lacks a functional surface domain. According to the current bovine genome sequence assembly, full-length BERV γ4 provirus sequences were found only on chromosomes 1, 2, 6, 10, 15, 23, 26, 28, and X and in unassigned sequence, although partial sequences were distributed almost evenly across the entire bovine genome. This is the first detailed study describing the genome structure of BERV γ4, the most abundant ERV family present in the bovine genome. Combined with our recent reports on the characterization of ERVs in cattle, this study will help illuminate cattle ERVs, about which no information was previously available.
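
The age estimate in this abstract follows the usual logic of LTR dating: the two LTRs of a provirus are identical at integration and then diverge independently, so the age is roughly T = K / (2r), where K is the LTR divergence and r is a substitution rate per site per year. The abstract does not state the formula or the rate the authors used, so the sketch below is an assumption; it only back-calculates the rate that this formula would need in order to reproduce the reported 13.3 million years from K = 0.042.

```python
def ltr_insertion_age(divergence: float, rate_per_site_per_year: float) -> float:
    """Estimated provirus age in years, T = K / (2r)."""
    return divergence / (2 * rate_per_site_per_year)


def implied_rate(divergence: float, age_years: float) -> float:
    """Substitution rate per site per year implied by K and T under T = K / (2r)."""
    return divergence / (2 * age_years)


K = 0.042            # mean 5'/3' LTR divergence reported in the abstract
T_REPORTED = 13.3e6  # upper age bound reported in the abstract, in years

rate = implied_rate(K, T_REPORTED)  # assumption: derived here, not stated by the authors
print(f"{rate:.3e}")
print(ltr_insertion_age(K, rate))
```

The factor of 2 reflects that both LTR copies accumulate substitutions after integration; a different rate assumption would change the age estimate in inverse proportion.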