• Title/Summary/Keyword: Contig

Search Result 64, Processing Time 0.025 seconds

Confirming Single Nucleotide Polymorphisms from Expressed Sequence Tag Datasets Derived from Three Cattle cDNA Libraries

  • Lee, Seung-Hwan;Park, Eung-Woo;Cho, Yong-Min;Lee, Ji-Woong;Kim, Hyoung-Yong;Lee, Jun-Heon;Oh, Sung-Jong;Cheong, Il-Cheong;Yoon, Du-Hak
    • BMB Reports
    • /
    • v.39 no.2
    • /
    • pp.183-188
    • /
    • 2006
  • Using the Phred/Phrap/Polyphred/Consed pipeline established in the National Livestock Research Institute of Korea, we predicted candidate coding single nucleotide polymorphisms (cSNPs) from 7,600 expressed sequence tags (ESTs) derived from three cDNA libraries (liver, M. longissimus dorsi, and intermuscular fat) of Hanwoo (Korean native cattle) steers. From the 7,600 ESTs, 829 contigs comprising more than two EST reads were assembled using the Phrap assembler. Based on the contig analysis, 201 candidate cSNPs were identified in 129 contigs, in which transitions (69%) outnumbered transversions (31%). To verify whether the predicted cSNPs are real, 17 SNPs involved in lipid and energy metabolism were selected from the ESTs. Twelve of these were confirmed to be real while five were identified as artifacts, possibly due to expressed sequence tag sequence error. Further analysis of the 12 verified cSNPs was performed using the program BLASTX. Five were identified as nonsynonymous cSNPs, five were synonymous cSNPs, and two SNPs were located in 3'-UTRs. Our data indicated that a relatively high SNP prediction rate (71%) from a large EST database could produce abundant cSNPs rapidly, which can be used as valuable genetic markers in cattle.

Characterization of a Strain of Malva Vein Clearing Virus in Alcea rosea via Deep Sequencing

  • Wang, Defu;Cui, Liyan;Pei, Yanni;Ma, Zhennan;Shen, Shaofei;Long, Dandan;Li, Lingyu;Niu, Yanbing
    • The Plant Pathology Journal
    • /
    • v.36 no.5
    • /
    • pp.468-475
    • /
    • 2020
  • Malva vein clearing virus (MVCV) is a member of the Potyvirus species, and has a negative impact on the aesthetic development of Alcea rosea. It was first reported in Germany in 1957, but its complete genome sequence data are still scarce. In the present work, A. rosea leaves with vein-clearing and mosaic symptoms were sampled and analyzed with small RNA deep sequencing. By denovo assembly the raw sequences of virus-derived small interfering RNAs (vsiRs) and whole genome amplification of malva vein cleaning virus SX strain (MVCV-SX) by specific primers targeting identified contig gaps, the full-length genome sequences (9,645 nucleotides) of MVCV-SX were characterized, constituting of an open reading frame that is long enough to encode 3,096 amino acids. Phylogenetic analysis showed that MVCV-SX was clustered with euphorbia ringspot virus and yam mosaic virus. Further analyses of the vsiR profiles revealed that the most abundant MVCV-vsiRs were between 21 and 22 nucleotides in length and a strong bias was found for "A" and "U" at the 5′-terminal residue. The results of polarity assessment indicated that the amount of sense strand was almost equal to that of the antisense strand in MVCV-vsiRs, and the main hot-spot region in MVCV-SX genome was found at cylindrical inclusion. In conclusion, our findings could provide new insights into the RNA silencing-mediated host defence mechanism in A. rosea infected with MVCV-SX, and offer a basis for the prevention and treatment of this virus disease.

Genetical and Pathological Studies on the Mutant Mice as an Animal Model for Deafness Disease

  • Lee, Jeong-Woong;Lee, Eun-Ju;Lee, Hoon-Taek;Chung, Kil-Saeng;Ryoo, Zae-Young
    • Proceedings of the KSAR Conference
    • /
    • 2001.03a
    • /
    • pp.48-48
    • /
    • 2001
  • A new neurological mutant has been found in the ICR outbred strain mouse. Affected mice display profound deafness and a head-tossing and bidirectional circling behavior, showing an autosomal recessive mode of inheritance. It was, therefore, named cir/Kr with the gene symbol cir. The auditory tests identified clearly the hearing loss of the cir mice when compared to wild type mice. Pathological studies confirmed the developmental defects in the middle ear, cochlea, cochlear nerve, and semicircular canal areas, which were correlated to the abnormal behavior observed in the cir mice. Thus, cir mice may be useful as a model for studying inner ear abnormalities and deafness. We have constructed a genetic linkage map by positioning 14 microsatellite markers across the (cir) region and intraspecific backcross between cir and C57BL/6J mice. The cir mouse harbors an autosomal recessive mutation on mouse chromosome 9. The cir gene was mapped to a region between D9Mit116 and D9Mit38 Estimated distances between cir and D9Mit116, and between cir and D9Mit38 are 0.7 and 0.2 cM, respectively. The gene in order was defines : centromere-D9Mit182-D9Mit51/D9Mit79/D9Mit310-D9Mit212/D9Mit184-D9Mit116-cir-D9Mit38-D9Mit20-D9Mit243-D9Mit16-D9Mit55/D9Mit125-D9Mit281. The mouse map location of the cir locus appears to be in a region homologous to human 3q21. Our present date suggest that the nearest flanking marker D9Mit38 provides a useful anchor for the isolation of the cir gene in a yeast artificial chromosome contig.

  • PDF

Construction of Deletion Map of 16q by LOH Analysis from HCC Patients and Physical Map on 16q 23.3 - 24.1 Region

  • Chung, Jiyeol;Choi, Nae Yun;Shim, Myoung Sup;Choi, Dong Wook;Kang, Hyen Sam;Kim, Chang Min;Kim, Ung Jin;Park, Sun Hwa;Kim, Hyeon;Lee, Byeong Jae
    • Genomics & Informatics
    • /
    • v.1 no.2
    • /
    • pp.101-107
    • /
    • 2003
  • Loss of heterozygosity (LOH) has been used to detect deleted regions of a specific chromosome in cancer cells. LOH on chromosome 16q has been reported to occur frequently in progressed hepatocellular carcinoma (HCC). Liver tissues from 37 Korean HCC patients were analyzed for LOH by using 25 polymorphic microsatellite markers distributed along 16q. Out of the 37 HCC patients studied, 21 patients (56.8%) showed LOH in various regions of 16q with at least one polymorphic marker. Puring the analysis of these 21 LOH cases, 6 patients showed interstitial LOHs in which the boundary of the LOH region was defined. With two rounds of LOH analysis, five commonly occurring interstitial LOH regions were identified; 16q21-22.1, 16q22.2 - 22.3, 16q22.3, 16q23.2 and 16q23.3 - 24.1. Among the five LOH regions the 16q23.3 - 24.1 region has been reported to be related with chromosome instability. A complete physical map, which covers the 3.2 Mb region of 16q23.3 - 24.1 (D16S402 and D16S486), was constructed to identify novel candidate tumor suppressor genes. We provide the minimally tiling path map consisting of 28 BAC clones. There was one gap between NT_10422.11 and NT_019609.9 of the human genome sequence contig (NCBI sequence build 33, April 29, 2003). This gap can be filled by sequencing the R-1425M20 clone which bridges these sequence contigs.

Single Nucleotide Polymorphism Marker Discovery from Transcriptome Sequencing for Marker-assisted Backcrossing in Capsicum

  • Kang, Jin-Ho;Yang, Hee-Bum;Jeong, Hyeon-Seok;Choe, Phillip;Kwon, Jin-Kyung;Kang, Byoung-Cheorl
    • Horticultural Science & Technology
    • /
    • v.32 no.4
    • /
    • pp.535-543
    • /
    • 2014
  • Backcross breeding is the method most commonly used to introgress new traits into elite lines. Conventional backcross breeding requires at least 4-5 generations to recover the genomic background of the recurrent parent. Marker-assisted backcrossing (MABC) represents a new breeding approach that can substantially reduce breeding time and cost. For successful MABC, highly polymorphic markers with known positions in each chromosome are essential. Single nucleotide polymorphism (SNP) markers have many advantages over other marker systems for MABC due to their high abundance and amenability to genotyping automation. To facilitate MABC in hot pepper (Capsicum annuum), we utilized expressed sequence tags (ESTs) to develop SNP markers in this study. For SNP identification, we used Bukang $F_1$-hybrid pepper ESTs to prepare a reference sequence through de novo assembly. We performed large-scale transcriptome sequencing of eight accessions using the Illumina Genome Analyzer (IGA) IIx platform by Solexa, which generated small sequence fragments of about 90-100 bp. By aligning each contig to the reference sequence, 58,151 SNPs were identified. After filtering for polymorphism, segregation ratio, and lack of proximity to other SNPS or exon/intron boundaries, a total of 1,910 putative SNPs were chosen and positioned to a pepper linkage map. We further selected 412 SNPs evenly distributed on each chromosome and primers were designed for high throughput SNP assays and tested using a genetic diversity panel of 27 Capsicum accessions. The SNP markers clearly distinguished each accession. These results suggest that the SNP marker set developed in this study will be valuable for MABC, genetic mapping, and comparative genome analysis.

Simple sequence repeat marker development from Codonopsis lanceolata and genetic relation analysis

  • Kim, Serim;Jeong, Ji Hee;Chung, Hee;Kim, Ji Hyeon;Gil, Jinsu;Yoo, Jemin;Um, Yurry;Kim, Ok Tae;Kim, Tae Dong;Kim, Yong-Yul;Lee, Dong Hoon;Kim, Ho Bang;Lee, Yi
    • Journal of Plant Biotechnology
    • /
    • v.43 no.2
    • /
    • pp.181-188
    • /
    • 2016
  • In this study, we developed 15 novel polymorphic simple sequence repeat (SSR) markers by SSR-enriched genomic library construction from Codonopsis lanceolata. We obtained a total of 226 non-redundant contig sequences from the assembly process and designed primer sets. These markers were applied to 53 accessions representing the cultivated C. lanceolata in South Korea. Fifteen markers were sufficiently polymorphic, and were used to analyze the genetic relationships between the cultivated C. lanceolata. One hundred three alleles of the 15 SSR markers ranged from 3 to 19 alleles at each locus, with an average of 6.87. By cluster analysis, we detected clear genetic differences in most of the accessions, with genetic distance varying from 0.73 to 0.93. Phylogenic analysis indicated that the accessions that were collected from the same area were distributed evenly in the phylogenetic tree. These results indicate that there is no correlative genetic relationship between geographic areas. These markers will be useful in differentiating C. lanceolata genetic resources and in selecting suitable lines for a systemic breeding program.

Survey of Expressed Sequence Tags from Tissue-Specific cDNA Libraries in Hemibarbus mylodon, an Endangered Fish Species (멸종위기 어류 어름치 Hemibarbus mylodon (Cypriniformes)로부터 조직별 EST library 제작 및 발현 유전자 탐색)

  • Bang, In-Chul;Lim, Yoon-Hee;Cho, Young-Sun;Lee, Sang-Yoon;Nam, Yoon-Kwon
    • Journal of Aquaculture
    • /
    • v.20 no.4
    • /
    • pp.248-254
    • /
    • 2007
  • Representative cDNA libraries were constructed from various tissue sources of Hemibarbus mylodon, an endangered freshwater fish species in Korea, for the mining of expressed sequence tags (ESTs). Randomized and non-normalized EST analysis was performed with 7 unidirectional cDNA libraries generated from brain, intestine, kidney, liver, muscle, ovary or testis. Of 3,383 ESTs in total, the number of singleton was 2,029, and 333 contigs containing 1,354 ESTs were assembled (percent of unigene = 70.0%). Abundantly expressed gene transcripts and broad clustering of putative gene function were tissue-specific in general, and the redundancy was also variable among those libraries. Over half of H. mylodon ESTs were matched with orthologues from other teleosts among which zebrafish gene sequences were the most frequent in those matches. This initial setting of EST libraries achieved in the present study would be a fundamental basis for the banking of gene resources from this endangered fish species.

A Direct Approach for Finding Functional Lipolytic Enzymes from the Paenibacillus polymyxa Genome

  • JUNG, YEO-JIN;KIM, HYUNG-KWOUN;KIM, JIHYUN F.;PARK, SEUNG-HWAN;OH, TAE-KWANG;LEE, JUNG-KEE
    • Journal of Microbiology and Biotechnology
    • /
    • v.15 no.1
    • /
    • pp.155-160
    • /
    • 2005
  • Abstract A direct approach was used to retrieve active lipases from Paenibacillus polymyxa genome databases. Twelve putative lipase genes were tested using a typical lipase sequence rule built on the basis of a consensus sequence of a catalytic triad and oxyanion hole. Among them, six genes satisfied the sequence rule and had similarity (about 25%) with known bacterial lipases. To obtain the six lipase proteins, lipase genes were expressed in E. coli cells and lipolytic activities were measured by using tributyrin plate and pnitrophenyl caproate. One of them, contig 160-26, was expressed as a soluble and active form in E. coli cell. After purifying on Ni-NTA column, its detailed biochemical properties were characterized. It had a maximum hydrolytic activity at $30^{\circ}C$ and pH 7- 8, and was stable up to $40^{\circ}C$ and in the range of pH 5- 8. It most rapidly hydrolyzed pNPC$_6$ among various PNPesters. The other contigs were expressed more or less as soluble forms, although no lipolytic activities were detected. As they have many conserved regions with lipase 160-26 as well as other bacterial lipases throughout their equence, they are suggested as true lipase genes.

The Heavy Metal Tolerant Soil Bacterium Achromobacter sp. AO22 Contains a Unique Copper Homeostasis Locus and Two mer Operons

  • Ng, Shee Ping;Palombo, Enzo A.;Bhave, Mrinal
    • Journal of Microbiology and Biotechnology
    • /
    • v.22 no.6
    • /
    • pp.742-753
    • /
    • 2012
  • Copper-containing compounds are introduced into the environment through agricultural chemicals, mining, and metal industries and cause severe detrimental effects on ecosystems. Certain microorganisms exposed to these stressors exhibit molecular mechanisms to maintain intracellular copper homeostasis and avoid toxicity. We have previously reported that the soil bacterial isolate Achromobacter sp. AO22 is multi-heavy metal tolerant and exhibits a mer operon associated with a Tn21 type transposon. The present study reports that AO22 also hosts a unique cop locus encoding copper homeostasis determinants. The putative cop genes were amplified from the strain AO22 using degenerate primers based on reported cop and pco sequences, and a constructed 10,552 base pair contig (GenBank Accession No. GU929214). BLAST analyses of the sequence revealed a unique cop locus of 10 complete open reading frames, designated copSRABGOFCDK, with unusual separation of copCD from copAB. The promoter areas exhibit two putative cop boxes, and copRS appear to be transcribed divergently from other genes. The putative protein CopA may be a copper oxidase involved in export to the periplasm, CopB is likely extracytoplasmic, CopC may be periplasmic, CopD is cytoplasmic/inner membrane, CopF is a P-type ATPase, and CopG, CopO, and CopK are likely copper chaperones. CopA, B, C, and D exhibit several potential copper ligands and CopS and CopR exhibit features of two-component regulatory systems. Sequences flanking indicate the AO22 cop locus may be present within a genomic island. Achromobacter sp. strain AO22 is thus an ideal candidate for understanding copper homeostasis mechanisms and exploiting them for copper biosensor or biosorption systems.

Deep Sequencing Analysis of Apple Infecting Viruses in Korea

  • Cho, In-Sook;Igori, Davaajargal;Lim, Seungmo;Choi, Gug-Seoun;Hammond, John;Lim, Hyoun-Sub;Moon, Jae Sun
    • The Plant Pathology Journal
    • /
    • v.32 no.5
    • /
    • pp.441-451
    • /
    • 2016
  • Deep sequencing has generated 52 contigs derived from five viruses; Apple chlorotic leaf spot virus (ACLSV), Apple stem grooving virus (ASGV), Apple stem pitting virus (ASPV), Apple green crinkle associated virus (AGCaV), and Apricot latent virus (ApLV) were identified from eight apple samples showing small leaves and/or growth retardation. Nucleotide (nt) sequence identity of the assembled contigs was from 68% to 99% compared to the reference sequences of the five respective viral genomes. Sequences of ASPV and ASGV were the most abundantly represented by the 52 contigs assembled. The presence of the five viruses in the samples was confirmed by RT-PCR using specific primers based on the sequences of each assembled contig. All five viruses were detected in three of the samples, whereas all samples had mixed infections with at least two viruses. The most frequently detected virus was ASPV, followed by ASGV, ApLV, ACLSV, and AGCaV which were withal found in mixed infections in the tested samples. AGCaV was identified in assembled contigs ID 1012480 and 93549, which showed 82% and 78% nt sequence identity with ORF1 of AGCaV isolate Aurora-1. ApLV was identified in three assembled contigs, ID 65587, 1802365, and 116777, which showed 77%, 78%, and 76% nt sequence identity respectively with ORF1 of ApLV isolate LA2. Deep sequencing assay was shown to be a valuable and powerful tool for detection and identification of known and unknown virome in infected apple trees, here identifying ApLV and AGCaV in commercial orchards in Korea for the first time.