• Title/Abstract/Keywords: whole-genome DNA

Search results: 178 items (processing time 0.023 s)

Flanking Sequence and Copy-Number Analysis of Transformation Events by Integrating Next-Generation Sequencing Technology with Southern Blot Hybridization

  • Qin, Yang;Woo, Hee-Jong;Shin, Kong-Sik;Lim, Myung-Ho;Cho, Hyun-Suk;Lee, Seong-Kon
    • Plant Breeding and Biotechnology
    • /
    • Vol. 5, No. 4
    • /
    • pp.269-281
    • /
    • 2017
  • With the continual development of genetically modified (GM) crops, it has become necessary to develop detailed and effective molecular characterization methods to select candidate events from a large pool of transformation events. Relative to traditional molecular analysis methods such as the polymerase chain reaction (PCR) and Southern blot hybridization, next-generation sequencing (NGS) technology for whole-genome sequencing of complex crop genomes has proven comparatively useful for in-depth molecular characterization. In this study, four transformation events, including one in Bacillus thuringiensis (Bt)-resistant rice, one in resveratrol-producing rice, and two in beta-carotene-enhanced soybeans, were selected for molecular characterization. By merging NGS analysis and Southern blot hybridization results, we confirmed the transgene insertion sites, insert structures, and insertion numbers of these four transformation events. In addition, the read-coverage depth assessed by NGS analysis for inserted genes might provide consistent results for inserted T-DNA numbers in cases of complex insertion structures and highly duplicated donor genomes, whereas PCR-based methods can produce incorrect conclusions. Our combined method provides an effective and complete analytical approach for whole-genome visual inspection of transformation events that require biosafety assessment.
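
The idea of reading T-DNA copy number from read-coverage depth can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name, the depth values, and the assumption that copy number is roughly the ratio of mean depth over the insert to mean depth over single-copy host regions are all illustrative simplifications.

```python
from statistics import mean

def estimate_copy_number(insert_depths, background_depths):
    """Ratio of mean read depth over the inserted sequence to mean read
    depth over single-copy host regions (a simplifying assumption)."""
    if not insert_depths or not background_depths:
        raise ValueError("depth lists must be non-empty")
    background = mean(background_depths)
    if background == 0:
        raise ValueError("background depth is zero")
    return mean(insert_depths) / background

# Hypothetical per-base depths, invented for illustration only.
insert = [58, 61, 60, 62, 59]        # positions inside the T-DNA
background = [30, 29, 31, 30, 30]    # single-copy host positions

ratio = estimate_copy_number(insert, background)
copies = round(ratio)
print(f"depth ratio = {ratio:.2f}, estimated copies = {copies}")
```

In practice the ratio would need correction for zygosity, GC bias, and mapping ambiguity, which is one reason the abstract pairs NGS with Southern blot hybridization.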

Perspectives of International Human Epigenome Consortium

  • Bae, Jae-Bum
    • Genomics & Informatics
    • /
    • Vol. 11, No. 1
    • /
    • pp.7-14
    • /
    • 2013
  • As the International Human Epigenome Consortium (IHEC) launched officially at the 2010 Washington meeting, a giant step toward the conquest of unexplored regions of the human genome has begun. IHEC aims to produce 1,000 reference epigenomes for the international scientific community over the next 7-10 years. Seven member institutions, including the Korea National Institute of Health (KNIH) of South Korea, will each produce 25-200 reference epigenomes, and the data will be made publicly available through a data center. Epigenome data will range from whole-genome bisulfite sequencing, histone modification, and chromatin accessibility information to miRNA-seq. The final goal of IHEC is the production of reference maps of human epigenomes for key cellular states relevant to health and disease.

Trend and Technology of Gene and Genome Research

  • 이진성;김기환;서동상;강석우;황재삼
    • 한국잠사곤충학회지
    • /
    • Vol. 42, No. 2
    • /
    • pp.126-141
    • /
    • 2000
  • A major step towards understanding the genetic basis of an organism is the complete sequence determination of all genes in the target genome. The nucleotide sequence encoded in the genome contains the information that specifies the amino acid sequence of every protein and functional RNA molecule. In principle, it will be possible to identify every protein responsible for the structure and function of the body of the target organism. The pattern of expression in different cell types will specify where and when each protein is used. The amino acid sequence of the proteins encoded by each gene will be derived from the conceptual translation of the nucleotide sequence. Comparison of these sequences with those of known proteins, whose sequences are stored in databases, will suggest an approximate function for many proteins. This mini review describes the development of new sequencing methods and the optimization of sequencing strategies for whole-genome, cDNA, and genomic analysis.

Identification of the Marker Genes Related With Chronic Mitral Valve Disease in Dogs

  • Yoon, Byung-Gook;Lee, Dong-Soo;Seo, Kyoung-Won;Song, Kun-Ho
    • 한국임상수의학회지
    • /
    • Vol. 36, No. 4
    • /
    • pp.190-195
    • /
    • 2019
  • We aimed to identify genomic variations as well as the marker genes related with chronic mitral valve disease (CMVD) in Canis lupus familiaris using whole genome resequencing, which provides valuable resources for further study. Two ten-year-old female English cocker spaniels (Canis lupus familiaris) were used for this study, one control and one that had been diagnosed with CMVD. For the whole genome resequencing, muscles from the left ventricular wall were collected from each dog. Whole genome resequencing was performed with the HiSeq DNA Shotgun library and the HiSeq™ 2000 platform. From the results, we identified 5 million and 6 million variants in gene expression in the control and CMVD-diagnosed subject, respectively. We then selected the top 1,000 genes for each of the SNP, INS, and DEL mutation types, and 675 of these genes overlapped across every mutation type between the control and the CMVD-diagnosed patient. Interestingly, in both groups, the intron variant (91.16 and 91.18%) and upstream variant (3.10 and 3.08%) are most highly related. Among the 675 overlapping genes, the gene ontology term for intracellular signal transduction is highly counted in INS, DEL, and SNPs (35, 33, and 31, respectively). In this study, we found that the COL and CDH gene families could be key molecules in identifying the difference in gene expression between control and CMVD-diagnosed dogs. We believe further studies will prove the importance of variants in key molecule expression and that these data will serve as a valuable foundation for the study of canine CMVD.

Korea Brassica Genome Project: Current Status and Prospective

  • 최수련;박지영;박범석;김호일;임용표
    • Journal of Plant Biotechnology
    • /
    • Vol. 33, No. 3
    • /
    • pp.153-160
    • /
    • 2006
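  • Genome research aims to elucidate the structure of a target genome and to understand comprehensively the functions and evolution of all the genes it contains. Brassica rapa, which is phylogenetically related to Arabidopsis, is an important crop used as a vegetable, oilseed, and fodder. Launching genome research on Brassica rapa requires suitable genetic and genomic materials. Using Chinese cabbage (Brassica rapa ssp. pekinensis), we developed reference mapping populations: a DH population of 78 lines and an RI population of about 250 lines. Bacterial artificial chromosome (BAC) libraries (KBrH and KBrB) were constructed with two restriction enzymes (HindIII and BamHI) and consist of 56,592 and 50,688 clones, respectively. In addition, 104,914 ESTs with an average length of 575 bp were analyzed from 22 cDNA libraries prepared from different tissues of Chinese cabbage. The Multinational Brassica Genome Project (MBGP) was organized internationally, and in 2003 it was decided to sequence the entire Chinese cabbage genome. As a first step, BAC-end sequencing of 104,914 BAC clones was proposed, and it was completed in September 2006 as a joint project of five countries. The BAC-end sequencing results gave clues to the genome structure of Chinese cabbage through the interpretation of gene sequences and the analysis of abundant repetitive DNA. Full sequencing of BAC clones showed that Chinese cabbage DNA segments are extensively triplicated in the genome, although gene loss within the segments is highly variable. By comparing the BAC-end sequences with the Arabidopsis sequence, 629 seed BACs were selected and their sequencing was completed. As its second phase, the MBGP is pursuing whole-genome sequencing of Chinese cabbage with a BAC-to-BAC approach, in which seed BACs anchored on the genetic map are used to find and sequence adjacent BAC clones; eight countries are participating and sequencing is in progress. Recently there has been much discussion among the participating countries about bioinformatics-based infrastructure for sequence analysis. As diverse genomic information accumulates, it will contribute to understanding the genome structure of Chinese cabbage and to applying that knowledge in agriculture.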

Complete genome sequence of Lactobacillus koreensis 26-25, a ginsenoside converting bacterium, isolated from Korean kimchi

  • 김주현;류청매;스리니바산 사티야라지;김명겸;김상용;위지향;임완택
    • 미생물학회지
    • /
    • Vol. 54, No. 4
    • /
    • pp.477-479
    • /
    • 2018
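  • The genome of Lactobacillus koreensis strain 26-25, isolated from kimchi, was sequenced. The genome of strain 26-25 consists of a single circular chromosome of 3,006,812 bp with a G + C content of 49.23%, containing 2,720 genes, 2,556 protein-coding genes, 85 pseudogenes, and 78 RNA genes. Strain 26-25 carries several types of glycosidase genes involved in the deglycosylation of ginseng saponins. This genome analysis contributes greatly to understanding the genes involved in converting major ginsenosides into minor ginsenosides with superior pharmacological activity.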

Comparative Genomic Analysis of Staphylococcus aureus FORC_001 and S. aureus MRSA252 Reveals the Characteristics of Antibiotic Resistance and Virulence Factors for Human Infection

  • Lim, Sooyeon;Lee, Dong-Hoon;Kwak, Woori;Shin, Hakdong;Ku, Hye-Jin;Lee, Jong-eun;Lee, Gun Eui;Kim, Heebal;Choi, Sang-Ho;Ryu, Sangryeol;Lee, Ju-Hoon
    • Journal of Microbiology and Biotechnology
    • /
    • Vol. 25, No. 1
    • /
    • pp.98-108
    • /
    • 2015
  • Staphylococcus aureus is an important foodborne pathogen that causes diverse diseases ranging from minor infections to life-threatening conditions in humans and animals. To further understand its pathogenesis, the genome of S. aureus strain FORC_001, isolated from a contaminated food, was sequenced. Its genome consists of 2,886,017 bp of double-stranded DNA with a GC content of 32.8%. It is predicted to contain 2,728 open reading frames, 57 tRNAs, and 6 rRNA operons, including 1 additional 5S rRNA gene. Comparative phylogenetic tree analysis of 40 complete S. aureus genome sequences using average nucleotide identity (ANI) revealed that strain FORC_001 belonged to Group I. The closest phylogenetic match was S. aureus MRSA252, according to a whole-genome ANI (99.87%), suggesting that they might share a common ancestor. Comparative genome analysis of FORC_001 and MRSA252 revealed two non-homologous regions: Regions I and II. The presence of various antibiotic resistance genes, including the SCCmec cluster in Region I of MRSA252, suggests that this strain might have acquired the SCCmec cluster to adapt to specific environments containing methicillin. Region II of both genomes contains prophage regions, but their DNA sequence identity is very low, suggesting that the prophages might differ. This is the first report of the complete genome sequence of S. aureus isolated from a real foodborne outbreak in South Korea. This report should help extend our understanding of the genome, general characteristics, and virulence factors of S. aureus for further studies of pathogenesis, rapid detection, and epidemiological investigation in foodborne outbreaks.

Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline

  • Umang, Umang;Bharti, Pawan Kumar;Husain, Akhtar
    • Genomics & Informatics
    • /
    • Vol. 20, No. 3
    • /
    • pp.35.1-35.9
    • /
    • 2022
  • Microsatellites, or simple sequence repeats, are motifs of 1 to 6 nucleotides in length present in both coding and non-coding regions of DNA. They are widely distributed in the whole genomes of prokaryotes, eukaryotes, bacteria, and viruses and are used as molecular markers in studying DNA variations, gene regulation, genetic diversity, and evolution. However, in vitro microsatellite identification proves to be time-consuming and expensive. Therefore, the present research focused on using an in-house built Java pipeline to identify, analyse, design primers for, and compute related statistics of perfect and compound microsatellites in the seven complete genome sequences of coronavirus, including the genome of coronavirus disease 2019, where the host is Homo sapiens. Based on the search criteria, the total number of perfect simple sequence repeats (SSRs) among the seven genomic sequences ranged from 76 to 118 and that of compound SSRs from 1 to 10, reflecting the low conversion of perfect simple sequence repeats to compound repeats. Furthermore, the incidence of SSRs was insignificantly but positively correlated with genome size (R² = 0.45, p > 0.05), with SSR relative abundance (R² = 0.18, p > 0.05), and with relative density (R² = 0.23, p > 0.05). Dinucleotide repeats were the most abundant in the coding region of the genome, followed by tri-, mono-, and tetranucleotide repeats. This comparative study would help us understand the evolutionary relationship, genetic diversity, and hypervariability in minimal time and cost.
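
The search for perfect microsatellites described in this abstract can be sketched in a few lines. The authors used an in-house Java pipeline that is not shown here; the Python below is an independent illustration, and the function name and the minimum repeat thresholds in `MIN_REPEATS` are assumptions chosen for the example, not the study's search criteria.

```python
import re

# Hypothetical minimum number of tandem copies per motif length (1..6 nt).
MIN_REPEATS = {1: 6, 2: 3, 3: 3, 4: 3, 5: 3, 6: 3}

def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, copies) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-nt motif followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit,
            # e.g. "AA" or "ATAT"; those are reported under the shorter motif.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)

# Toy sequence invented for illustration.
example = "GGCATATATATCCAAAAAAAGTT"
for start, motif, copies in find_perfect_ssrs(example):
    print(start, motif, copies)
```

Compound SSRs, which the abstract also counts, would additionally require merging perfect repeats separated by a short gap; that step is omitted here.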

Complete Chloroplast Genome Sequence of Dumortiera hirsuta

  • Kwon, Woochan;Kim, Yongsung;Park, Jongsun
    • 한국자원식물학회:학술대회논문집
    • /
    • 한국자원식물학회 2018 Spring Conference
    • /
    • pp.43-43
    • /
    • 2018
  • Dumortiera hirsuta (Sw.) Nees (Dumortieraceae) is a thallose liverwort distributed in the tropics and subtropics. It is the only species in the family Dumortieraceae, which is the second basal family in the order Marchantiales. D. hirsuta is characterized by hairy receptacles and the lack of air chambers. The complete chloroplast genome of D. hirsuta was successfully rescued from raw reads generated by HiSeq4000. Its total length is 122,050 bp, consisting of four regions: a large single copy (LSC) region (81,697 bp), a small single copy (SSC) region (20,061 bp), and two inverted repeats (IRs; 10,146 bp each). It contains 129 genes (84 coding DNA sequences (CDS), eight rRNAs, and 37 tRNAs); 18 genes, including four rRNAs and five tRNAs, are duplicated in the IR regions. The overall GC content of D. hirsuta is 28.7%, which is almost the same as that of Marchantia paleacea. A phylogenetic tree based on all genes from whole chloroplast genomes will provide the phylogenetic position of D. hirsuta. This sequence will be a fundamental resource for further research on the order Marchantiales.
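
The region lengths and gene counts reported in this abstract can be checked against the stated totals with simple arithmetic. The sketch below uses only the figures given in the abstract; the variable names are illustrative.

```python
# Region lengths in bp, as reported in the abstract.
LSC = 81_697   # large single copy region
SSC = 20_061   # small single copy region
IR = 10_146    # one inverted repeat; the genome has two copies

total_length = LSC + SSC + 2 * IR
ir_fraction = 2 * IR / total_length

# Gene counts as reported: CDS, rRNAs, tRNAs.
gene_total = 84 + 8 + 37

print(f"total length = {total_length} bp")
print(f"share of genome in IRs = {ir_fraction:.1%}")
print(f"total genes = {gene_total}")
```

Both sums agree with the totals in the abstract (122,050 bp and 129 genes).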
