• 제목/요약/키워드: Contig

검색결과 64건 처리시간 0.027초

Confirming Single Nucleotide Polymorphisms from Expressed Sequence Tag Datasets Derived from Three Cattle cDNA Libraries

  • Lee, Seung-Hwan;Park, Eung-Woo;Cho, Yong-Min;Lee, Ji-Woong;Kim, Hyoung-Yong;Lee, Jun-Heon;Oh, Sung-Jong;Cheong, Il-Cheong;Yoon, Du-Hak
    • BMB Reports
    • /
    • 제39권2호
    • /
    • pp.183-188
    • /
    • 2006
  • Using the Phred/Phrap/Polyphred/Consed pipeline established in the National Livestock Research Institute of Korea, we predicted candidate coding single nucleotide polymorphisms (cSNPs) from 7,600 expressed sequence tags (ESTs) derived from three cDNA libraries (liver, M. longissimus dorsi, and intermuscular fat) of Hanwoo (Korean native cattle) steers. From the 7,600 ESTs, 829 contigs comprising more than two EST reads were assembled using the Phrap assembler. Based on the contig analysis, 201 candidate cSNPs were identified in 129 contigs, in which transitions (69%) outnumbered transversions (31%). To verify whether the predicted cSNPs are real, 17 SNPs involved in lipid and energy metabolism were selected from the ESTs. Twelve of these were confirmed to be real while five were identified as artifacts, possibly due to expressed sequence tag sequence error. Further analysis of the 12 verified cSNPs was performed using the program BLASTX. Five were identified as nonsynonymous cSNPs, five were synonymous cSNPs, and two SNPs were located in 3'-UTRs. Our data indicated that a relatively high SNP prediction rate (71%) from a large EST database could produce abundant cSNPs rapidly, which can be used as valuable genetic markers in cattle.

Characterization of a Strain of Malva Vein Clearing Virus in Alcea rosea via Deep Sequencing

  • Wang, Defu;Cui, Liyan;Pei, Yanni;Ma, Zhennan;Shen, Shaofei;Long, Dandan;Li, Lingyu;Niu, Yanbing
    • The Plant Pathology Journal
    • /
    • 제36권5호
    • /
    • pp.468-475
    • /
    • 2020
  • Malva vein clearing virus (MVCV) is a member of the Potyvirus species, and has a negative impact on the aesthetic development of Alcea rosea. It was first reported in Germany in 1957, but its complete genome sequence data are still scarce. In the present work, A. rosea leaves with vein-clearing and mosaic symptoms were sampled and analyzed with small RNA deep sequencing. By denovo assembly the raw sequences of virus-derived small interfering RNAs (vsiRs) and whole genome amplification of malva vein cleaning virus SX strain (MVCV-SX) by specific primers targeting identified contig gaps, the full-length genome sequences (9,645 nucleotides) of MVCV-SX were characterized, constituting of an open reading frame that is long enough to encode 3,096 amino acids. Phylogenetic analysis showed that MVCV-SX was clustered with euphorbia ringspot virus and yam mosaic virus. Further analyses of the vsiR profiles revealed that the most abundant MVCV-vsiRs were between 21 and 22 nucleotides in length and a strong bias was found for "A" and "U" at the 5′-terminal residue. The results of polarity assessment indicated that the amount of sense strand was almost equal to that of the antisense strand in MVCV-vsiRs, and the main hot-spot region in MVCV-SX genome was found at cylindrical inclusion. In conclusion, our findings could provide new insights into the RNA silencing-mediated host defence mechanism in A. rosea infected with MVCV-SX, and offer a basis for the prevention and treatment of this virus disease.

Genetical and Pathological Studies on the Mutant Mice as an Animal Model for Deafness Disease

  • Lee, Jeong-Woong;Lee, Eun-Ju;Lee, Hoon-Taek;Chung, Kil-Saeng;Ryoo, Zae-Young
    • 한국동물번식학회:학술대회논문집
    • /
    • 한국동물번식학회 2001년도 춘계학술발표대회
    • /
    • pp.48-48
    • /
    • 2001
  • A new neurological mutant has been found in the ICR outbred strain mouse. Affected mice display profound deafness and a head-tossing and bidirectional circling behavior, showing an autosomal recessive mode of inheritance. It was, therefore, named cir/Kr with the gene symbol cir. The auditory tests identified clearly the hearing loss of the cir mice when compared to wild type mice. Pathological studies confirmed the developmental defects in the middle ear, cochlea, cochlear nerve, and semicircular canal areas, which were correlated to the abnormal behavior observed in the cir mice. Thus, cir mice may be useful as a model for studying inner ear abnormalities and deafness. We have constructed a genetic linkage map by positioning 14 microsatellite markers across the (cir) region and intraspecific backcross between cir and C57BL/6J mice. The cir mouse harbors an autosomal recessive mutation on mouse chromosome 9. The cir gene was mapped to a region between D9Mit116 and D9Mit38 Estimated distances between cir and D9Mit116, and between cir and D9Mit38 are 0.7 and 0.2 cM, respectively. The gene in order was defines : centromere-D9Mit182-D9Mit51/D9Mit79/D9Mit310-D9Mit212/D9Mit184-D9Mit116-cir-D9Mit38-D9Mit20-D9Mit243-D9Mit16-D9Mit55/D9Mit125-D9Mit281. The mouse map location of the cir locus appears to be in a region homologous to human 3q21. Our present date suggest that the nearest flanking marker D9Mit38 provides a useful anchor for the isolation of the cir gene in a yeast artificial chromosome contig.

  • PDF

Construction of Deletion Map of 16q by LOH Analysis from HCC Patients and Physical Map on 16q 23.3 - 24.1 Region

  • Chung, Jiyeol;Choi, Nae Yun;Shim, Myoung Sup;Choi, Dong Wook;Kang, Hyen Sam;Kim, Chang Min;Kim, Ung Jin;Park, Sun Hwa;Kim, Hyeon;Lee, Byeong Jae
    • Genomics & Informatics
    • /
    • 제1권2호
    • /
    • pp.101-107
    • /
    • 2003
  • Loss of heterozygosity (LOH) has been used to detect deleted regions of a specific chromosome in cancer cells. LOH on chromosome 16q has been reported to occur frequently in progressed hepatocellular carcinoma (HCC). Liver tissues from 37 Korean HCC patients were analyzed for LOH by using 25 polymorphic microsatellite markers distributed along 16q. Out of the 37 HCC patients studied, 21 patients (56.8%) showed LOH in various regions of 16q with at least one polymorphic marker. Puring the analysis of these 21 LOH cases, 6 patients showed interstitial LOHs in which the boundary of the LOH region was defined. With two rounds of LOH analysis, five commonly occurring interstitial LOH regions were identified; 16q21-22.1, 16q22.2 - 22.3, 16q22.3, 16q23.2 and 16q23.3 - 24.1. Among the five LOH regions the 16q23.3 - 24.1 region has been reported to be related with chromosome instability. A complete physical map, which covers the 3.2 Mb region of 16q23.3 - 24.1 (D16S402 and D16S486), was constructed to identify novel candidate tumor suppressor genes. We provide the minimally tiling path map consisting of 28 BAC clones. There was one gap between NT_10422.11 and NT_019609.9 of the human genome sequence contig (NCBI sequence build 33, April 29, 2003). This gap can be filled by sequencing the R-1425M20 clone which bridges these sequence contigs.

Single Nucleotide Polymorphism Marker Discovery from Transcriptome Sequencing for Marker-assisted Backcrossing in Capsicum

  • Kang, Jin-Ho;Yang, Hee-Bum;Jeong, Hyeon-Seok;Choe, Phillip;Kwon, Jin-Kyung;Kang, Byoung-Cheorl
    • 원예과학기술지
    • /
    • 제32권4호
    • /
    • pp.535-543
    • /
    • 2014
  • Backcross breeding is the method most commonly used to introgress new traits into elite lines. Conventional backcross breeding requires at least 4-5 generations to recover the genomic background of the recurrent parent. Marker-assisted backcrossing (MABC) represents a new breeding approach that can substantially reduce breeding time and cost. For successful MABC, highly polymorphic markers with known positions in each chromosome are essential. Single nucleotide polymorphism (SNP) markers have many advantages over other marker systems for MABC due to their high abundance and amenability to genotyping automation. To facilitate MABC in hot pepper (Capsicum annuum), we utilized expressed sequence tags (ESTs) to develop SNP markers in this study. For SNP identification, we used Bukang $F_1$-hybrid pepper ESTs to prepare a reference sequence through de novo assembly. We performed large-scale transcriptome sequencing of eight accessions using the Illumina Genome Analyzer (IGA) IIx platform by Solexa, which generated small sequence fragments of about 90-100 bp. By aligning each contig to the reference sequence, 58,151 SNPs were identified. After filtering for polymorphism, segregation ratio, and lack of proximity to other SNPS or exon/intron boundaries, a total of 1,910 putative SNPs were chosen and positioned to a pepper linkage map. We further selected 412 SNPs evenly distributed on each chromosome and primers were designed for high throughput SNP assays and tested using a genetic diversity panel of 27 Capsicum accessions. The SNP markers clearly distinguished each accession. These results suggest that the SNP marker set developed in this study will be valuable for MABC, genetic mapping, and comparative genome analysis.

Simple sequence repeat marker development from Codonopsis lanceolata and genetic relation analysis

  • Kim, Serim;Jeong, Ji Hee;Chung, Hee;Kim, Ji Hyeon;Gil, Jinsu;Yoo, Jemin;Um, Yurry;Kim, Ok Tae;Kim, Tae Dong;Kim, Yong-Yul;Lee, Dong Hoon;Kim, Ho Bang;Lee, Yi
    • Journal of Plant Biotechnology
    • /
    • 제43권2호
    • /
    • pp.181-188
    • /
    • 2016
  • In this study, we developed 15 novel polymorphic simple sequence repeat (SSR) markers by SSR-enriched genomic library construction from Codonopsis lanceolata. We obtained a total of 226 non-redundant contig sequences from the assembly process and designed primer sets. These markers were applied to 53 accessions representing the cultivated C. lanceolata in South Korea. Fifteen markers were sufficiently polymorphic, and were used to analyze the genetic relationships between the cultivated C. lanceolata. One hundred three alleles of the 15 SSR markers ranged from 3 to 19 alleles at each locus, with an average of 6.87. By cluster analysis, we detected clear genetic differences in most of the accessions, with genetic distance varying from 0.73 to 0.93. Phylogenic analysis indicated that the accessions that were collected from the same area were distributed evenly in the phylogenetic tree. These results indicate that there is no correlative genetic relationship between geographic areas. These markers will be useful in differentiating C. lanceolata genetic resources and in selecting suitable lines for a systemic breeding program.

멸종위기 어류 어름치 Hemibarbus mylodon (Cypriniformes)로부터 조직별 EST library 제작 및 발현 유전자 탐색 (Survey of Expressed Sequence Tags from Tissue-Specific cDNA Libraries in Hemibarbus mylodon, an Endangered Fish Species)

  • 방인철;임윤희;조영선;이상윤;남윤권
    • 한국양식학회지
    • /
    • 제20권4호
    • /
    • pp.248-254
    • /
    • 2007
  • 멸종위기 천연기념물 어류 어름치(Hemibarbus mylodon)를 대상으로한 어름치 유전자 은행 구축 연구의 일환으로 뇌, 소화관, 근육, 간, 신장, 난소 및 정소 조직으로부터 expressed sequence tag (EST) library들을 구축하고 발현 유전자의 탐색을 실시하였다. EST 탐색을 통해 총 3,383개의 발현 유전자 염기서열 단편을 확보하였고 이들로부터 1,354개의 EST를 포함하는 총 333개의 contig들이 형성됨으로써 비교적 높은 빈도(69.8%)의 unigene 확보율을 나타내었다. EST의 조직 별 출현 양상은 orthologue들과의 상동성 정도 및 유추 기능의 대분류를 기준으로 분석할 때 각 조직들은 서로 다른 특징을 나타내었다. 어름치에서 발굴된 EST들은 zebrafish의 유전자들과 가장 높은 match 빈도를 나타내었다. 본 연구를 통해 확보된 EST library들과 염기서열 정보는 본 종의 장외 복원을 위한 효율적인 인공증식 기술 개발에 유용한 기초 정보로 활용될 수 있을 것이다.

A Direct Approach for Finding Functional Lipolytic Enzymes from the Paenibacillus polymyxa Genome

  • JUNG, YEO-JIN;KIM, HYUNG-KWOUN;KIM, JIHYUN F.;PARK, SEUNG-HWAN;OH, TAE-KWANG;LEE, JUNG-KEE
    • Journal of Microbiology and Biotechnology
    • /
    • 제15권1호
    • /
    • pp.155-160
    • /
    • 2005
  • Abstract A direct approach was used to retrieve active lipases from Paenibacillus polymyxa genome databases. Twelve putative lipase genes were tested using a typical lipase sequence rule built on the basis of a consensus sequence of a catalytic triad and oxyanion hole. Among them, six genes satisfied the sequence rule and had similarity (about 25%) with known bacterial lipases. To obtain the six lipase proteins, lipase genes were expressed in E. coli cells and lipolytic activities were measured by using tributyrin plate and pnitrophenyl caproate. One of them, contig 160-26, was expressed as a soluble and active form in E. coli cell. After purifying on Ni-NTA column, its detailed biochemical properties were characterized. It had a maximum hydrolytic activity at $30^{\circ}C$ and pH 7- 8, and was stable up to $40^{\circ}C$ and in the range of pH 5- 8. It most rapidly hydrolyzed pNPC$_6$ among various PNPesters. The other contigs were expressed more or less as soluble forms, although no lipolytic activities were detected. As they have many conserved regions with lipase 160-26 as well as other bacterial lipases throughout their equence, they are suggested as true lipase genes.

The Heavy Metal Tolerant Soil Bacterium Achromobacter sp. AO22 Contains a Unique Copper Homeostasis Locus and Two mer Operons

  • Ng, Shee Ping;Palombo, Enzo A.;Bhave, Mrinal
    • Journal of Microbiology and Biotechnology
    • /
    • 제22권6호
    • /
    • pp.742-753
    • /
    • 2012
  • Copper-containing compounds are introduced into the environment through agricultural chemicals, mining, and metal industries and cause severe detrimental effects on ecosystems. Certain microorganisms exposed to these stressors exhibit molecular mechanisms to maintain intracellular copper homeostasis and avoid toxicity. We have previously reported that the soil bacterial isolate Achromobacter sp. AO22 is multi-heavy metal tolerant and exhibits a mer operon associated with a Tn21 type transposon. The present study reports that AO22 also hosts a unique cop locus encoding copper homeostasis determinants. The putative cop genes were amplified from the strain AO22 using degenerate primers based on reported cop and pco sequences, and a constructed 10,552 base pair contig (GenBank Accession No. GU929214). BLAST analyses of the sequence revealed a unique cop locus of 10 complete open reading frames, designated copSRABGOFCDK, with unusual separation of copCD from copAB. The promoter areas exhibit two putative cop boxes, and copRS appear to be transcribed divergently from other genes. The putative protein CopA may be a copper oxidase involved in export to the periplasm, CopB is likely extracytoplasmic, CopC may be periplasmic, CopD is cytoplasmic/inner membrane, CopF is a P-type ATPase, and CopG, CopO, and CopK are likely copper chaperones. CopA, B, C, and D exhibit several potential copper ligands and CopS and CopR exhibit features of two-component regulatory systems. Sequences flanking indicate the AO22 cop locus may be present within a genomic island. Achromobacter sp. strain AO22 is thus an ideal candidate for understanding copper homeostasis mechanisms and exploiting them for copper biosensor or biosorption systems.

Deep Sequencing Analysis of Apple Infecting Viruses in Korea

  • Cho, In-Sook;Igori, Davaajargal;Lim, Seungmo;Choi, Gug-Seoun;Hammond, John;Lim, Hyoun-Sub;Moon, Jae Sun
    • The Plant Pathology Journal
    • /
    • 제32권5호
    • /
    • pp.441-451
    • /
    • 2016
  • Deep sequencing has generated 52 contigs derived from five viruses; Apple chlorotic leaf spot virus (ACLSV), Apple stem grooving virus (ASGV), Apple stem pitting virus (ASPV), Apple green crinkle associated virus (AGCaV), and Apricot latent virus (ApLV) were identified from eight apple samples showing small leaves and/or growth retardation. Nucleotide (nt) sequence identity of the assembled contigs was from 68% to 99% compared to the reference sequences of the five respective viral genomes. Sequences of ASPV and ASGV were the most abundantly represented by the 52 contigs assembled. The presence of the five viruses in the samples was confirmed by RT-PCR using specific primers based on the sequences of each assembled contig. All five viruses were detected in three of the samples, whereas all samples had mixed infections with at least two viruses. The most frequently detected virus was ASPV, followed by ASGV, ApLV, ACLSV, and AGCaV which were withal found in mixed infections in the tested samples. AGCaV was identified in assembled contigs ID 1012480 and 93549, which showed 82% and 78% nt sequence identity with ORF1 of AGCaV isolate Aurora-1. ApLV was identified in three assembled contigs, ID 65587, 1802365, and 116777, which showed 77%, 78%, and 76% nt sequence identity respectively with ORF1 of ApLV isolate LA2. Deep sequencing assay was shown to be a valuable and powerful tool for detection and identification of known and unknown virome in infected apple trees, here identifying ApLV and AGCaV in commercial orchards in Korea for the first time.