• 제목/요약/키워드: high-throughput nucleotide sequencing

검색결과 48건 처리시간 0.02초

Bioinformatics for the Korean Functional Genomics Project

  • Kim, Sang-Soo
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2000년도 International Symposium on Bioinformatics
    • /
    • pp.45-52
    • /
    • 2000
  • Genomic approach produces massive amount of data within a short time period, New high-throughput automatic sequencers can generate over a million nucleotide sequence information overnight. A typical DNA chip experiment produces tens of thousands expression information, not to mention the tens of megabyte image files, These data must be handled automatically by computer and stored in electronic database, Thus there is a need for systematic approach of data collection, processing, and analysis. DNA sequence information is translated into amino acid sequence and is analyzed for key motif related to its biological and/or biochemical function. Functional genomics will play a significant role in identifying novel drug targets and diagnostic markers for serious diseases. As an enabling technology for functional genomics, bioinformatics is in great need worldwide, In Korea, a new functional genomics project has been recently launched and it focuses on identi☞ing genes associated with cancers prevalent in Korea, namely gastric and hepatic cancers, This involves gene discovery by high throughput sequencing of cancer cDNA libraries, gene expression profiling by DNA microarray and proteomics, and SNP profiling in Korea patient population, Our bioinformatics team will support all these activities by collecting, processing and analyzing these data.

  • PDF

Genomic Tools and Their Implications for Vegetable Breeding

  • Phan, Ngan Thi;Sim, Sung-Chur
    • 원예과학기술지
    • /
    • 제35권2호
    • /
    • pp.149-164
    • /
    • 2017
  • Next generation sequencing (NGS) technologies have led to the rapid accumulation of genome sequences through whole-genome sequencing and re-sequencing of crop species. Genomic resources provide the opportunity for a new revolution in plant breeding by facilitating the dissection of complex traits. Among vegetable crops, reference genomes have been sequenced and assembled for several species in the Solanaceae and Cucurbitaceae families, including tomato, pepper, cucumber, watermelon, and melon. These reference genomes have been leveraged for re-sequencing of diverse germplasm collections to explore genome-wide sequence variations, especially single nucleotide polymorphisms (SNPs). The use of genome-wide SNPs and high-throughput genotyping methods has led to the development of new strategies for dissecting complex quantitative traits, such as genome-wide association study (GWAS). In addition, the use of multi-parent populations, including nested association mapping (NAM) and multiparent advanced generation intercross (MAGIC) populations, has helped increase the accuracy of quantitative trait loci (QTL) detection. Consequently, a number of QTL have been discovered for agronomically important traits, such as disease resistance and fruit traits, with high mapping resolution. The molecular markers for these QTL represent a useful resource for enhancing selection efficiency via marker-assisted selection (MAS) in vegetable breeding programs. In this review, we discuss current genomic resources and marker-trait association analysis to facilitate genome-assisted breeding in vegetable species in the Solanaceae and Cucurbitaceae families.

Early-onset epileptic encephalopathies and the diagnostic approach to underlying causes

  • Hwang, Su-Kyeong;Kwon, Soonhak
    • Clinical and Experimental Pediatrics
    • /
    • 제58권11호
    • /
    • pp.407-414
    • /
    • 2015
  • Early-onset epileptic encephalopathies are one of the most severe early onset epilepsies that can lead to progressive psychomotor impairment. These syndromes result from identifiable primary causes, such as structural, neurodegenerative, metabolic, or genetic defects, and an increasing number of novel genetic causes continue to be uncovered. A typical diagnostic approach includes documentation of anamnesis, determination of seizure semiology, electroencephalography, and neuroimaging. If primary biochemical investigations exclude precipitating conditions, a trial with the administration of a vitaminic compound (pyridoxine, pyridoxal-5-phosphate, or folinic acid) can then be initiated regardless of presumptive seizure causes. Patients with unclear etiologies should be considered for a further workup, which should include an evaluation for inherited metabolic defects and genetic analyses. Targeted next-generation sequencing panels showed a high diagnostic yield in patients with epileptic encephalopathy. Mutations associated with the emergence of epileptic encephalopathies can be identified in a targeted fashion by sequencing the most likely candidate genes. Next-generation sequencing technologies offer hope to a large number of patients with cryptogenic encephalopathies and will eventually lead to new therapeutic strategies and more favorable long-term outcomes.

Analysis of allele-specific expression using RNA-seq of the Korean native pig and Landrace reciprocal cross

  • Ahn, Byeongyong;Choi, Min-Kyeung;Yum, Joori;Cho, In-Cheol;Kim, Jin-Hoi;Park, Chankyu
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제32권12호
    • /
    • pp.1816-1825
    • /
    • 2019
  • Objective: We tried to analyze allele-specific expression in the pig neocortex using bioinformatic analysis of high-throughput sequencing results from the parental genomes and offspring transcriptomes from reciprocal crosses between Korean Native and Landrace pigs. Methods: We carried out sequencing of parental genomes and offspring transcriptomes using next generation sequencing. We subsequently carried out genome scale identification of single nucleotide polymorphisms (SNPs) in two different ways using either individual genome mapping or joint genome mapping of the same breed parents that were used for the reciprocal crosses. Using parent-specific SNPs, allele-specifically expressed genes were analyzed. Results: Because of the low genome coverage (${\sim}4{\times}$) of the sequencing results, most SNPs were non-informative for parental lineage determination of the expressed alleles in the offspring and were thus excluded from our analysis. Consequently, 436 SNPs covering 336 genes were applicable to measure the imbalanced expression of paternal alleles in the offspring. By calculating the read ratios of parental alleles in the offspring, we identified seven genes showing allele-biased expression (p<0.05) including three previously reported and four newly identified genes in this study. Conclusion: The newly identified allele-specifically expressing genes in the neocortex of pigs should contribute to improving our knowledge on genomic imprinting in pigs. To our knowledge, this is the first study of allelic imbalance using high throughput analysis of both parental genomes and offspring transcriptomes of the reciprocal cross in outbred animals. Our study also showed the effect of the number of informative animals on the genome level investigation of allele-specific expression using RNA-seq analysis in livestock species.

Effect of Next-Generation Exome Sequencing Depth for Discovery of Diagnostic Variants

  • Kim, Kyung;Seong, Moon-Woo;Chung, Won-Hyong;Park, Sung Sup;Leem, Sangseob;Park, Won;Kim, Jihyun;Lee, KiYoung;Park, Rae Woong;Kim, Namshin
    • Genomics & Informatics
    • /
    • 제13권2호
    • /
    • pp.31-39
    • /
    • 2015
  • Sequencing depth, which is directly related to the cost and time required for the generation, processing, and maintenance of next-generation sequencing data, is an important factor in the practical utilization of such data in clinical fields. Unfortunately, identifying an exome sequencing depth adequate for clinical use is a challenge that has not been addressed extensively. Here, we investigate the effect of exome sequencing depth on the discovery of sequence variants for clinical use. Toward this, we sequenced ten germ-line blood samples from breast cancer patients on the Illumina platform GAII(x) at a high depth of ${\sim}200{\times}$. We observed that most function-related diverse variants in the human exonic regions could be detected at a sequencing depth of $120{\times}$. Furthermore, investigation using a diagnostic gene set showed that the number of clinical variants identified using exome sequencing reached a plateau at an average sequencing depth of about $120{\times}$. Moreover, the phenomena were consistent across the breast cancer samples.

A Study on Transcriptome Analysis Using de novo RNA-sequencing to Compare Ginseng Roots Cultivated in Different Environments

  • Yang, Byung Wook
    • 한국자원식물학회:학술대회논문집
    • /
    • 한국자원식물학회 2018년도 춘계학술발표회
    • /
    • pp.5-5
    • /
    • 2018
  • Ginseng (Panax ginseng C.A. Meyer), one of the most widely used medicinal plants in traditional oriental medicine, is used for the treatment of various diseases. It has been classified according to its cultivation environment, such as field cultivated ginseng (FCG) and mountain cultivated ginseng (MCG). However, little is known about differences in gene expression in ginseng roots between field cultivated and mountain cultivated ginseng. In order to investigate the whole transcriptome landscape of ginseng, we employed High-Throughput sequencing technologies using the Illumina HiSeqTM2500 system, and generated a large amount of sequenced transcriptome from ginseng roots. Approximately 77 million and 87 million high-quality reads were produced in the FCG and MCG roots transcriptome analyses, respectively, and we obtained 256,032 assembled unigenes with an average length of 1,171 bp by de novo assembly methods. Functional annotations of the unigenes were performed using sequence similarity comparisons against the following databases: the non-redundant nucleotide database, the InterPro domains database, the Gene Ontology Consortium database, and the Kyoto Encyclopedia of Genes and Genomes pathway database. A total of 4,207 unigenes were assigned to specific metabolic pathways, and all of the known enzymes involved in starch and sucrose metabolism pathways were also identified in the KEGG library. This study indicated that alpha-glucan phosphorylase 1, putative pectinesterase/pectinesterase inhibitor 17, beta-amylase, and alpha-glucan phosphorylase isozyme H might be important factors involved in starch and sucrose metabolism between FCG and MCG in different environments.

  • PDF

Analytical Tools and Databases for Metagenomics in the Next-Generation Sequencing Era

  • Kim, Mincheol;Lee, Ki-Hyun;Yoon, Seok-Whan;Kim, Bong-Soo;Chun, Jongsik;Yi, Hana
    • Genomics & Informatics
    • /
    • 제11권3호
    • /
    • pp.102-113
    • /
    • 2013
  • Metagenomics has become one of the indispensable tools in microbial ecology for the last few decades, and a new revolution in metagenomic studies is now about to begin, with the help of recent advances of sequencing techniques. The massive data production and substantial cost reduction in next-generation sequencing have led to the rapid growth of metagenomic research both quantitatively and qualitatively. It is evident that metagenomics will be a standard tool for studying the diversity and function of microbes in the near future, as fingerprinting methods did previously. As the speed of data accumulation is accelerating, bioinformatic tools and associated databases for handling those datasets have become more urgent and necessary. To facilitate the bioinformatics analysis of metagenomic data, we review some recent tools and databases that are used widely in this field and give insights into the current challenges and future of metagenomics from a bioinformatics perspective.

Novel compound heterozygous mutations of ATM in ataxia-telangiectasia: A case report and calculated prevalence in the Republic of Korea

  • Jang, Min Jeong;Lee, Cha Gon;Kim, Hyun Jung
    • Journal of Genetic Medicine
    • /
    • 제15권2호
    • /
    • pp.110-114
    • /
    • 2018
  • Ataxia-telangiectasia (AT; OMIM 208900) is a rare autosomal recessive inherited progressive neurodegenerative disorder, with onset in early childhood. AT is caused by homozygous or compound heterozygous mutations in ATM (OMIM 607585) on chromosome 11q22. The average prevalence of the disease is estimated at 1 of 100,000 children worldwide. The prevalence of AT in the Republic of Korea is suggested to be extremely low, with only a few cases genetically confirmed thus far. Herein, we report a 5-year-old Korean boy with clinical features such as progressive gait and truncal ataxia, both ankle spasticity, dysarthria, and mild intellectual disability. The patient was identified as a compound heterozygote with two novel genetic variants: a paternally derived c.5288_5289insGA p.(Tyr1763*) nonsense variant and a maternally derived c.8363A>C p.(His2788Pro) missense variant, as revealed by next-generation sequencing and confirmed by Sanger sequencing. Based on claims data from the Health Insurance Review and Assessment Service Republic of Korea, we calculated the prevalence of AT in the Republic of Korea to be about 0.9 per million individuals, which is similar to the worldwide average. Therefore, we suggest that multi-gene panel sequencing including ATM should be considered early diagnosis.

Effects of Somatic Mutations Are Associated with SNP in the Progression of Individual Acute Myeloid Leukemia Patient: The Two-Hit Theory Explains Inherited Predisposition to Pathogenesis

  • Park, Soyoung;Koh, Youngil;Yoon, Sung-Soo
    • Genomics & Informatics
    • /
    • 제11권1호
    • /
    • pp.34-37
    • /
    • 2013
  • This study evaluated the effects of somatic mutations and single nucleotide polymorphisms (SNPs) on disease progression and tried to verify the two-hit theory in cancer pathogenesis. To address this issue, SNP analysis was performed using the UCSC hg19 program in 10 acute myeloid leukemia patients (samples, G1 to G10), and somatic mutations were identified in the same tumor sample using SomaticSniper and VarScan2. SNPs in KRAS were detected in 4 out of 10 different individuals, and those of DNMT3A were detected in 5 of the same patient cohort. In 2 patients, both KRAS and DNMT3A were detected simultaneously. A somatic mutation in IDH2 was detected in these 2 patients. One of the patients had an additional mutation in FLT3, while the other patient had an NPM1 mutation. The patient with an FLT3 mutation relapsed shortly after attaining remission, while the other patient with the NPM1 mutation did not suffer a relapse. Our results indicate that SNPs with additional somatic mutations affect the prognosis of AML.

Transferability of Cupped Oyster EST (Expressed Sequence Tag)-Derived SNP (Single Nucleotide Polymorphism) Markers to Related Crassostrea and Ostrea Species

  • Kim, Woo-Jin;Jung, Hyungtaek;Shin, Eun-Ha;Baek, Ilseon
    • 한국패류학회지
    • /
    • 제30권3호
    • /
    • pp.197-210
    • /
    • 2014
  • Single nucleotide polymorphisms (SNPs) are widely acknowledged as the marker of choice for many genetic and genomic applications because they show co-dominant inheritance, are highly abundant across genomes and are suitable for high-throughput genotyping. Here we evaluated the applicability of SNP markers developed from Crassostrea gigas and C. virginica expressed sequence tags (ESTs) in closely related Crassostrea and Ostrea species. A total of 213 putative interspecific level SNPs were identified from re-sequencing data in six amplicons, yielding on average of one interspecific level SNP per seven bp. High polymorphism levels were observed and the high success rate of transferability show that genic EST-derived SNP markers provide an efficient method for rapid marker development and SNP discovery in closely related oyster species. The six EST-SNP markers identified here will provide useful molecular tools for addressing questions in molecular ecology and evolution studies including for stock analysis (pedigree monitoring) in related oyster taxa.