• Title/Summary/Keyword: NCBI nucleotide database

Search Result 33, Processing Time 0.028 seconds

Genomic Organization of Heat Shock Protein Genes of Silkworm Bombyx mori

  • Velu, Dhanikachalam;Ponnuvel, Kangayam M.;Qadri, Sayed M. Hussaini
    • International Journal of Industrial Entomology and Biomaterials
    • /
    • v.15 no.2
    • /
    • pp.123-130
    • /
    • 2007
  • The Hsp 20.8 and Hsp 90 cDNA sequence retrieved from NCBI database and consists of 764 bp and 2582 bp lengths respectively. The corresponding cDNA homologus sequences were BLAST searched in Bombyx mori genomic DNA database and two genomic contigs viz., BAAB01120347 and AADK01011786 showed maximum homology. In B. mori Hsp 20.8 and Hsp 90 is encoded by single gene without intron. Specific primers were used to amplify the Hsp 20.8 gene and Hsp 90 variable region from genomic DNA by using the PCR. Obtained products were 216 bp in Hsp 20.8 and 437 bp in Hsp 90. There was no variation found in the six silkworm races PCR products size of contrasting response to thermal tolerance. The comparison of the sequenced nucleotide variations through multiple sequence alignment analysis of Hsp 90 variable region products of three races not showed any differences respect to their thermotolerance and formed the clusters among the voltinism. The comparison of aminoacid sequences of B. mori Hsps with dipteran and other insect taxa revealed high percentage of identity growing with phylogenetic relatedness of species. The conserved domains of B. mori Hsps predicted, in which the Hsp 20.8 possesses ${\alpha}-crystallin$ domain and Hsp 90 holds HATPase and Hsp 90 domains.

Development and Application of Weonhyeong Strain-specific SCAR Marker in Pleurotus ostreatus (느타리 버섯에서 원형 품종 특이 SCAR marker 개발)

  • Seo, Kyoung-In;Jang, Kab-Yeul;Yoo, Young-Bok;Park, Soon-Young;Kim, Kwang-Ho;Kong, Won-Sik
    • The Korean Journal of Mycology
    • /
    • v.39 no.1
    • /
    • pp.22-30
    • /
    • 2011
  • Weonhyeong is one of important commercial strains. It has good characteristics of bundle formation, grey colored pilei and high productivity. We previously reported grouping of 70 strains of Pleurotus ostreatus in which one group contained 35 strains including Weonhyeong. Four strains in that group showed same profiles implicating no variety distinction for mushroom cultivation. Now we developed a specific marker for identification of Weonhyeong. Sequence Characterized Amplified Region (SCAR) marker was developed from the RAPD amplicon. SCAR marker 'S-OPO5' produced only one band specific to 2183, 2240, 2595 and 2725 strains showing similar banding patterns to Weonhyeong in RAPD-PCR results. The sequence of 'S-OPO5' marker was unknown when compared with the data in the Genbank using BLASTN. BLASTX results indicated that the marker showed significant alignment with the protein sequences in Tricholoma bakamatsutake reverse transcriptase. The results indicate that this new SCAR marker ('S-OPO5') will be valuable to distinguish the Weonhyeong similar strains from Pleurotus spp.

Investigation of Conserved Regions in Lipase Genes (Lipase 유전자의 보존적 영역 탐색)

  • 이동근;김철민;김상진;이상현;이재화
    • Journal of Life Science
    • /
    • v.13 no.5
    • /
    • pp.723-731
    • /
    • 2003
  • For the investigation of conserved regions in lipase genes, 132 and 24 sequences were obtained from LED (Lipase Engineering Database) and COG (Clusters of Orthologous Groups of proteins), respectively. There was high diversity in lipase genes and peculiar amino acid sequences were conserved for each homologous family of LED. Similar conserved amino acid sequences were detected from COG0657 and Moraxella lipase 1 homologous group of LED. Although many studies have attempted to detect new lipase genes in procaryotes, they have been limited culturable bacteria. The importance of metagenome, including DNA from non-culturable bacteria, is known. Due to the high diversity, we assumed it might be possible to detect new lipase gene from metagenome. Due to the high diversity of nucleotide sequences in lipase genes, 10 primer sets were designed. Designed primer sets were inspected in BLAST of NCBI and they could amplify a part of the lipase gene from 222 to 713 bp. They can amplify 16.7%∼60.0% of each lipase homologous group which was 3.6 fold higher than each sets. They might offer a high probability of detecting new lipase genes, owing to high efficiency and the diversity of lipase genes.

Analyses of Expressed Sequence Tags from Chironomus riparius Using Pyrosequencing : Molecular Ecotoxicology Perspective

  • Nair, Prakash M. Gopalakrishnan;Park, Sun-Young;Choi, Jin-Hee
    • Environmental Analysis Health and Toxicology
    • /
    • v.26
    • /
    • pp.10.1-10.7
    • /
    • 2011
  • Objects: Chironomus riparius, a non-biting midge (Chironomidae, Diptera), is extensively used as a model organism in aquatic ecotoxicological studies, and considering the potential of C. riparius larvae as a bio-monitoring species, little is known about its genome sequences. This study reports the results of an Expressed Sequence Tags (ESTs) sequencing project conducted on C. riparius larvae using 454 pyrosequencing. Method: To gain a better understanding of C. riparius transcriptome, we generated ESTs database of C.ripairus using pyrosequencing method. Results: Sequencing runs, using normalized cDNA collections from fourth instar larvae, yielded 20,020 expressed sequence tags, which were assembled into 8,565 contigs and 11,455 singletons. Sequence analysis was performed by BlastX search against the National Center for Biotechnology Information (NCBI) nucleotide (nr) and uniprot protein database. Based on the gene ontology classifications, 24% (E-value${\leq}1^{-5}$) of the sequences had known gene functions, 24% had unknown functions and 52% of sequences did not match any known sequences in the existing database. Sequence comparison revealed 81% of the genes have homologous genes among other insects belonging to the order Diptera providing tools for comparative genome analyses. Targeted searches using these annotations identified genes associated with essential metabolic pathways, signaling pathways, detoxification of toxic metabolites and stress response genes of ecotoxicological interest. Conclusions: The results obtained from this study would eventually make ecotoxicogenomics possible in a truly environmentally relevant species, such as, C. riparius.

Development of SNP marker set for marker-assisted backcrossing (MABC) in cultivating tomato varieties

  • Park, GiRim;Jang, Hyun A;Jo, Sung-Hwan;Park, Younghoon;Oh, Sang-Keun;Nam, Moon
    • Korean Journal of Agricultural Science
    • /
    • v.45 no.3
    • /
    • pp.385-400
    • /
    • 2018
  • Marker-assisted backcrossing (MABC) is useful for selecting offspring with a highly recovered genetic background for a recurrent parent at early generation unlike rice and other field crops. Molecular marker sets applicable to practical MABC are scarce in vegetable crops including tomatoes. In this study, we used the National Center for Biotechnology Information- short read archive (NCBI-SRA) database that provided the whole genome sequences of 234 tomato accessions and selected 27,680 tag-single nucleotide polymorphisms (tag-SNPs) that can identify haplotypes in the tomato genome. From this SNP dataset, a total of 143 tag-SNPs that have a high polymorphism information content (PIC) value (> 0.3) and are physically evenly distributed on each chromosome were selected as a MABC marker set. This marker set was tested for its polymorphism in each pairwise cross combination constructed with 124 of the 234 tomato accessions, and a relatively high number of SNP markers polymorphic for the cross combination was observed. The reliability of the MABC SNP set was assessed by converting 18 SNPs into Luna probe-based high-resolution melting (HRM) markers and genotyping nine tomato accessions. The results show that the SNP information and HRM marker genotype matched in 98.6% of the experiment data points, indicating that our sequence analysis pipeline for SNP mining worked successfully. The tag-SNP set for the MABC developed in this study can be useful for not only a practical backcrossing program but also for cultivar identification and F1 seed purity test in tomatoes.

Functional analysis of expressed sequence tags from the liver and brain of Korean Jindo dogs

  • Kim, Jae-Young;Park, Hye-Sun;Lim, Da-Jeong;Jang, Hong-Chul;Park, Hae-Suk;Lee, Kyung-Tai;Kim, Jong-Seok;Oh, Seok-Il;Kweon, Mu-Sik;Kim, Tae-Hun;Choi, Bong-Hwan
    • BMB Reports
    • /
    • v.44 no.4
    • /
    • pp.238-243
    • /
    • 2011
  • We generated 16,993 expressed sequence tags (ESTs) from two libraries containing full-length cDNAs from the brain and liver of the Korean Jindo dog. An additional 365,909 ESTs from other dog breeds were identified from the NCBI dbEST database, and all ESTs were clustered into 28,514 consensus sequences using StackPack. We selected the 7,305 consensus sequences that could be assembled from at least five ESTs and estimated that 12,533 high-quality single nucleotide polymorphisms (SNPs) were present in 97,835 putative SNPs from the 7,305 consensus sequences. We identified 58 Jindo dog-specific SNPs in comparison to other breeds and predicted seven synonymous SNPs and ten non-synonymous SNPs. Using PolyPhen, a program that predicts changes in protein structure and potential effects on protein function caused by amino acid substitutions, three of the non-synonymous SNPs were predicted to result in changes in protein function for proteins expressed by three different genes (TUSC3, ITIH2, and NAT2).

A comparison of five sets of overlapping and non-overlapping sliding windows for semen production traits in the Thai multibreed dairy population

  • Mattaneeya Sarakul;Mauricio A. Elzo;Skorn Koonawootrittriron;Thanathip Suwanasopee;Danai Jattawa;Thawee Laodim
    • Animal Bioscience
    • /
    • v.37 no.3
    • /
    • pp.428-436
    • /
    • 2024
  • Objective: This study compared five distinct sets of biological pathways and associated genes related to semen volume (VOL), number of sperm (NS), and sperm motility (MOT) in the Thai multibreed dairy population. Methods: The phenotypic data included 13,533 VOL records, 12,773 NS records, and 12,660 MOT records from 131 bulls. The genotypic data consisted of 76,519 imputed and actual single nucleotide polymorphisms (SNPs) from 72 animals. The SNP additive genetic variances for VOL, NS, and MOT were estimated for SNP windows of one SNP (SW1), ten SNP (SW10), 30 SNP (SW30), 50 SNP (SW50), and 100 SNP (SW100) using a single-step genomic best linear unbiased prediction approach. The fixed effects in the model were contemporary group, ejaculate order, bull age, ambient temperature, and heterosis. The random effects accounted for animal additive genetic effects, permanent environment effects, and residual. The SNPs explaining at least 0.001% of the additive genetic variance in SW1, 0.01% in SW10, 0.03% in SW30, 0.05% in SW50, and 0.1% in SW100 were selected for gene identification through the NCBI database. The pathway analysis utilized genes associated with the identified SNP windows. Results: Comparison of overlapping and non-overlapping SNP windows revealed notable differences among the identified pathways and genes associated with the studied traits. Overlapping windows consistently yielded a larger number of shared biological pathways and genes than non-overlapping windows. In particular, overlapping SW30 and SW50 identified the largest number of shared pathways and genes in the Thai multibreed dairy population. Conclusion: This study yielded valuable insights into the genetic architecture of VOL, NS, and MOT. It also highlighted the importance of assessing overlapping and non-overlapping SNP windows of various sizes for their effectiveness to identify shared pathways and genes influencing multiple traits.

Construction of BLAST Server for Mollusks (연체동물 전용 서열 블라스트 서버구축)

  • Lee, Yong-Seok;Jo, Yong-Hun;Kim, Dae-Soo;Kim, Dae-Won;Kim, Min-Young;Choi, Sang-Haeng;Yon, Jei-Oh;Byun, In-Sun;Kang, Bo-Ra;Jeong, Kye-Heon;Park, Hong-Seog
    • The Korean Journal of Malacology
    • /
    • v.20 no.2
    • /
    • pp.165-169
    • /
    • 2004
  • The BLAST server for the mollusk was constructed on the basis of the Intel Server Platform SC-5250 dual Xeon 2.8 GHz cpu and Linux operating system. After establishing the operating system, we installed NCBI (National Center for Biotechnology Information) WebBLAST package after web server configuration for cgi (common gate interface) (http://chimp.kribb.re.kr/mollusks). To build up the stand alone blast, we conducted as follows: First, we downloaded the genome information (mitochondria genome information), DNA sequences, amino acid sequences related with mollusk available at NCBI. Second, it was translated into the multifasta format that was stored as database by using the formatdb program provided by NCBI. Finally, the cgi was used for the Stand Alone Blast server. In addition, we have added the vector, Escherichia coli, and repeat sequences into the server to confirm a potential contamination. Finally, primer3 program is also installed for the users to design the primer. The stand alone BLAST gave us several advantages: (1) we can get only the data that agree with the nucleotide sequence directly related with the mollusks when we are searching BLAST; (2) it will be very convenient to confirm contamination when we made the cDNA or genomic library from mollusks; (3) Compared to the current NSBI, we can quickly get the BLAST results on the mollusks sequence information.

  • PDF

Construction of a full-length cDNA library from Pinus koraiensis and analysis of EST dataset (잣나무(Pinus koraiensis)의 cDNA library 제작 및 EST 분석)

  • Kim, Joon-Ki;Im, Su-Bin;Choi, Sun-Hee;Lee, Jong-Suk;Roh, Mark S.;Lim, Yong-Pyo
    • Korean Journal of Agricultural Science
    • /
    • v.38 no.1
    • /
    • pp.11-16
    • /
    • 2011
  • In this study, we report the generation and analysis of a total of 1,211 expressed sequence tags (ESTs) from Pinus koraiensis. A cDNA library was generated from the young leaf tissue and a total of 1,211 cDNA were partially sequenced. EST and unigene sequence quality were determined by computational filtering, manual review, and BLAST analyses. In all, 857 ESTs were acquired after the removal of the vector sequence and filtering over a minimum length 50 nucleotides. A total of 411 unigene, consisting of 89 contigs and 322 singletons, was identified after assembling. Also, we identified 77 new microsatellite-containing sequences from the unigenes and classified the structure according to their repeat unit. According to homology search with BLASTX against the NCBI database, 63.1% of ESTs were homologous with known function and 22.2% of ESTs were matched with putative or unknown function. The remaining 14.6% of ESTs showed no significant similarity to any protein sequences found in the public database. Gene ontology (GO) classification showed that the most abundant GO terms were transport, nucleotide binding, plastid, in terms biological process, molecular function and cellular component, respectively. The sequence data will be used to characterize potential roles of new genes in Pinus and provided for the useful tools as a genetic resource.

Sequencing analysis of the OFC1 gene on the nonsyndromic cleft lip and palate patient in Korean (한국인 비증후군성 구순구개열 환자의 OFC1 유전자의 서열 분석)

  • Kim, Sung-Sik;Son, Woo-Sung
    • The korean journal of orthodontics
    • /
    • v.33 no.3 s.98
    • /
    • pp.185-197
    • /
    • 2003
  • This study was performed to identify the characteristics of the OFC1 gene (locus: chromosome 6p24.3) in Korean patients, which is assumed to be the major gene behind the nonsyndromic cleft lip and palate. The sample consisted of 80 subjects: 40 nonsyndromic cleft lip and palate patients (proband, 20 males and females, mean age 14.2 years); and 40 normal adults (20 males and 20 females, mean age 25.6 years). Using PCR-based assay, the OFC1 gene was amplified, sequenced, and then searched for similar protein structures. Results were as follows: 1. The OFC1 gene contains the microsatellite marker 'CA' repeats. The number of the reference 'CA' repeats was 21 times, and formed as TA(CA)11TA(CA)10. But, in Koreans, the number of tandem 'CA' repeats was varied from 17 to 26 except 18, and 'CA' repeats consisted of TA(CA)n. 2. Nine allelic variants were found. Distribution of the OFC1 allele was similar between the patients and control group. 3. There was a replacement of the base 'T' to 'C' after 11 tandem 'CA' repeats in Koreans compared with Weissenbach's report. However, the difference did not seem to be the ORF prediction results between Koreans and Weissenbach's report. 4. The BLAST search results showed the Telomerase reverse transcriptase (TERT) and the Nucleotide binding protein 2 (NBP2) as similar proteins. The TERT was a protein product by the hTERT gene in the locus 5p15.33 (NCBI Genome Annotation; NT023089) The NBP2 was a protein product by the ABCC3 (ATP-binding cassette, sub-family C) gene in the locus 17q22 (NCBI Genome Annotation; NT010783). 5. In the Pedant-Pro database analysis, the predictable protein structure of the OFC1 gene had at least one transmembrane region and one non-globular region.