• 제목/요약/키워드: reference genome

검색결과 188건 처리시간 0.029초

Two-Dimensional Reference Map of Schizosaccharomyces pombe Proteins (Update)

  • Kim, Sun-Kyung;Won, Mi-Sun;Sun, Nam-Kyu;Jang, Jae-Won;Lee, Seung-Hee;Shin, Hee-Young;Song, Kyung-Bin
    • Journal of Microbiology and Biotechnology
    • /
    • 제16권10호
    • /
    • pp.1499-1512
    • /
    • 2006
  • Based on the first 2D reference map of the fission yeast Schizosaccharomyces pombe protein reported previously, we expanded and updated the map using narrower pI ranges. In this paper, 240 protein spots were identified on our reference map. In the pI 4-7 range, 144 spots corresponding to 86 different proteins were identified. In the pI 6-9 range, 43 spots corresponding to 35 different proteins were identified. Fifty-three new spots corresponding to 39 different proteins were further identified in the pI 5-6 range.

Whole-Genome Resequencing Analysis of Hanwoo and Yanbian Cattle to Identify Genome-Wide SNPs and Signatures of Selection

  • Choi, Jung-Woo;Choi, Bong-Hwan;Lee, Seung-Hwan;Lee, Seung-Soo;Kim, Hyeong-Cheol;Yu, Dayeong;Chung, Won-Hyong;Lee, Kyung-Tai;Chai, Han-Ha;Cho, Yong-Min;Lim, Dajeong
    • Molecules and Cells
    • /
    • 제38권5호
    • /
    • pp.466-473
    • /
    • 2015
  • Over the last 30 years, Hanwoo has been selectively bred to improve economically important traits. Hanwoo is currently the representative Korean native beef cattle breed, and it is believed that it shared an ancestor with a Chinese breed, Yanbian cattle, until the last century. However, these two breeds have experienced different selection pressures during recent decades. Here, we whole-genome sequenced 10 animals each of Hanwoo and Yanbian cattle (20 total) using the Illumina HiSeq 2000 sequencer. A total of approximately 3.12 and 3.07 billion sequence reads were mapped to the bovine reference sequence assembly (UMD 3.1) at an average of approximately 10.71- and 10.53-fold coverage for Hanwoo and Yanbian cattle, respectively. A total of 17,936,399 single nucleotide polymorphisms (SNPs) were yielded, of which 22.3% were found to be novel. By annotating the SNPs, we further retrieved numerous nonsynonymous SNPs that may be associated with traits of interest in cattle. Furthermore, we performed whole-genome screening to detect signatures of selection throughout the genome. We located several promising selective sweeps that are potentially responsible for economically important traits in cattle; the PPP1R12A gene is an example of a gene that potentially affects intramuscular fat content. These discoveries provide valuable genomic information regarding potential genomic markers that could predict traits of interest for breeding programs of these cattle breeds.

Single Nucleotide Polymorphism Marker Discovery from Transcriptome Sequencing for Marker-assisted Backcrossing in Capsicum

  • Kang, Jin-Ho;Yang, Hee-Bum;Jeong, Hyeon-Seok;Choe, Phillip;Kwon, Jin-Kyung;Kang, Byoung-Cheorl
    • 원예과학기술지
    • /
    • 제32권4호
    • /
    • pp.535-543
    • /
    • 2014
  • Backcross breeding is the method most commonly used to introgress new traits into elite lines. Conventional backcross breeding requires at least 4-5 generations to recover the genomic background of the recurrent parent. Marker-assisted backcrossing (MABC) represents a new breeding approach that can substantially reduce breeding time and cost. For successful MABC, highly polymorphic markers with known positions in each chromosome are essential. Single nucleotide polymorphism (SNP) markers have many advantages over other marker systems for MABC due to their high abundance and amenability to genotyping automation. To facilitate MABC in hot pepper (Capsicum annuum), we utilized expressed sequence tags (ESTs) to develop SNP markers in this study. For SNP identification, we used Bukang $F_1$-hybrid pepper ESTs to prepare a reference sequence through de novo assembly. We performed large-scale transcriptome sequencing of eight accessions using the Illumina Genome Analyzer (IGA) IIx platform by Solexa, which generated small sequence fragments of about 90-100 bp. By aligning each contig to the reference sequence, 58,151 SNPs were identified. After filtering for polymorphism, segregation ratio, and lack of proximity to other SNPS or exon/intron boundaries, a total of 1,910 putative SNPs were chosen and positioned to a pepper linkage map. We further selected 412 SNPs evenly distributed on each chromosome and primers were designed for high throughput SNP assays and tested using a genetic diversity panel of 27 Capsicum accessions. The SNP markers clearly distinguished each accession. These results suggest that the SNP marker set developed in this study will be valuable for MABC, genetic mapping, and comparative genome analysis.

Evaluation of selection program by assessing the genetic diversity and inbreeding effects on Nellore sheep growth through pedigree analysis

  • Illa, Satish Kumar;Gollamoori, Gangaraju;Nath, Sapna
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제33권9호
    • /
    • pp.1369-1377
    • /
    • 2020
  • Objective: The main objectives of the present study were to assess the genetic diversity, population structure and to appraise the efficiency of ongoing selective breeding program in the closed nucleus herd of Nellore sheep through pedigree analysis. Methods: Information utilized in the study was collected from the pedigree records of Livestock Research Station, Palamaner during the period from 1989 to 2016. Genealogical parameters like generation interval, pedigree completeness, inbreeding level, average relatedness among the animals and genetic conservation index were estimated based on gene origin probabilities. Lambs born during 2012 and 2016 were considered as reference population. Two animal models either with the use of Fi or ΔFi as linear co-variables were evaluated to know the effects of inbreeding on the growth traits of Nellore sheep. Results: Average generation interval and realized effective population size for the reference cohort were estimated as 3.38±0.10 and 91.56±1.58, respectively and the average inbreeding coefficient for reference population was 3.32%. Similarly, the effective number of founders, ancestors and founder genome equivalent of the reference population were observed as 47, 37, and 22.48, respectively. Fifty per cent of the genetic variability was explained by 14 influential ancestors in the reference cohort. The ratio fe/fa obtained in the study was 1.21, which is an indicator of bottlenecks in the population. The number of equivalent generations obtained in the study was 4.23 and this estimate suggested the fair depth of the pedigree. Conclusion: Study suggested that the population had decent levels of genetic diversity and a non-significant influence of inbreeding coefficient on growth traits of Nellore lambs. However, small portion of genetic diversity was lost due to a disproportionate contribution of founders and bottlenecks. Hence, breeding strategies which improve the genetic gain, widens the selection process and with optimum levels of inbreeding are recommended for the herd.

Genetic Linkage Mapping of RAPD Markers Segregating in Korean Ogol Chicken - White Leghorn Backcross Population

  • Hwang, K.C.;Song, K.D.;Kim, T.H.;Jeong, D.K.;Sohn, S.H.;Lillehoj, H.S.;Han, J.Y.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제14권3호
    • /
    • pp.302-306
    • /
    • 2001
  • This study was carried out to construct mapping population and to evaluate the methods involved, including polymorphic DNA marker system and appropriate statistical analysis. As an initial step to establish chicken genome mapping project, White Leghorn (WL) and Korean Ogol chicken (KOC) were used for generating backcross population. From 8 initial parents, total 280 backcross progenies were obtained and 40 were used for genotyping and linkage analysis. For development of novel polymorphic markers for KOC, Random Amplified Polymorphic DNA (RAPD) markers specific for this chicken line were generated. Also included in this study were six microsatellite markers from East Lansing map as reference loci. For segregation analysis, 15 RAPD markers and 6 microsatellites were used to genotype the backcross population. Among the RAPD markers that we developed, 2 pairs of markers were identified to be linked and another 4 RAPD markers showed linkage with microsatellites of known map. In summary, this study showed that our backcross population generated from the mating of KOC to WL serves as a valuable genetic resource for genotyping. Furthermore, RAPD markers are proved to be valuable in linkage mapping analysis.

Determining differentially expressed genes in a microarray expression dataset based on the global connectivity structure of pathway information

  • Chung, Tae-Su;Kim, Kee-Won;Lee, Hye-Won;Kim, Ju-Han
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2004년도 The 3rd Annual Conference for The Korean Society for Bioinformatics Association of Asian Societies for Bioinformatics 2004 Symposium
    • /
    • pp.124-130
    • /
    • 2004
  • Microarray expression datasets are incessantly cumulated with the aid of recent technological advances. One of the first steps for analyzing these data under various experimental conditions is determining differentially expressed genes (DEGs) in each condition. Reasonable choices of thresholds for determining differentially expressed genes are used for the next -step-analysis with suitable statistical significances. We present a model for identifying DEGs using pathway information based on the global connectivity structure. Pathway information can be regarded as a collection of biological knowledge, thus we are tying to determine the optimal threshold so that the consequential connectivity structure can be the most compatible with the existing pathway information. The significant feature of our model is that it uses established knowledge as a reference to determine the direction of analyzing microarray dataset. In the most of previous work, only intrinsic information in the miroarray is used for the identifying DEGs. We hope that our proposed method could contribute to construct biologically meaningful network structure from microarray datasets.

  • PDF

AU-rich elements (ARE) found in the U-rich region of Alu repeats at 3' untranslated regions

  • An, Hyeong-Jun;Lee, Kwang-Hyung;Bhak, Jong-Hwa;Lee, Do-Heon
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2004년도 The 3rd Annual Conference for The Korean Society for Bioinformatics Association of Asian Societies for Bioinformatics 2004 Symposium
    • /
    • pp.77-85
    • /
    • 2004
  • A significant portion (about 8% in human genome) of mammalian mRNA sequences contains AU(Adenine and Uracil) rich elements or AREs at their 3' untranslated regions (UTR). These mRNA sequences are usually stable. ARE motifs are assorted into three classes. The importance of AREs in biology is that they make certain mRNA unstable. We analyzed the occurrences of AREs and Alu, and propose a possible mechanism on how human mRNA could acquire and keep A REs at its 3' UTR originated from Alu repeats. Interspersed in the human genome, Alu repeats occupy 5% of the 3' UTR of mRNA sequences. Alu has poly-adenine (poly-A) regions at the end that lead to poly -thymine (poly-T) regions at the end of its complementary Alu. It has been discovered that AREs are present at the poly -T regions. In the all ARE's classes, 27-40% of ARE repeats were found in the poly -T region of Alu with mismatch allowed within 10% of ARE's length from the 3' UTRs of the NCBI's reference m RNA sequence database. We report that Alu, which has been reported as a junk DNA element, is a source of AREs. We found that one third of AREs were derived from the poly -T regions of the complementary Alu.

  • PDF

A ChIP-Seq Data Analysis Pipeline Based on Bioconductor Packages

  • Park, Seung-Jin;Kim, Jong-Hwan;Yoon, Byung-Ha;Kim, Seon-Young
    • Genomics & Informatics
    • /
    • 제15권1호
    • /
    • pp.11-18
    • /
    • 2017
  • Nowadays, huge volumes of chromatin immunoprecipitation-sequencing (ChIP-Seq) data are generated to increase the knowledge on DNA-protein interactions in the cell, and accordingly, many tools have been developed for ChIP-Seq analysis. Here, we provide an example of a streamlined workflow for ChIP-Seq data analysis composed of only four packages in Bioconductor: dada2, QuasR, mosaics, and ChIPseeker. 'dada2' performs trimming of the high-throughput sequencing data. 'QuasR' and 'mosaics' perform quality control and mapping of the input reads to the reference genome and peak calling, respectively. Finally, 'ChIPseeker' performs annotation and visualization of the called peaks. This workflow runs well independently of operating systems (e.g., Windows, Mac, or Linux) and processes the input fastq files into various results in one run. R code is available at github: https://github.com/ddhb/Workflow_of_Chipseq.git.

Application of DNA Microarray Technology to Molecular Microbial Ecology

  • Cho Jae-Chang
    • 한국미생물학회:학술대회논문집
    • /
    • 한국미생물학회 2002년도 추계학술대회
    • /
    • pp.22-26
    • /
    • 2002
  • There are a number of ways in which environmental microbiology and microbial ecology will benefit from DNA micro array technology. These include community genome arrays, SSU rDNA arrays, environmental functional gene arrays, population biology arrays, and there are clearly more different applications of microarray technology that can be applied to relevant problems in environmental microbiology. Two types of the applications, bacterial identification chip and functional gene detection chip, will be presented. For the bacterial identification chip, a new approach employing random genome fragments that eliminates the disadvantages of traditional DNA-DNA hybridization is proposed to identify and type bacteria based on genomic DNA-DNA similarity. Bacterial genomes are fragmented randomly, and representative fragments are spotted on a glass slide and then hybridized to test genomes. Resulting hybridization profiles are used in statistical procedures to identify test strains. Second, the direct binding version of microarray with a different array design and hybridization scheme is proposed to quantify target genes in environmental samples. Reference DNA was employed to normalize variations in spot size and hybridization. The approach for designing quantitative microarrays and the inferred equation from this study provide a simple and convenient way to estimate the target gene concentration from the hybridization signal ratio.

  • PDF