• Title/Summary/Keyword: sequence databases

Search Result 226, Processing Time 0.024 seconds

Downregulation of PyHRG1, encoding a novel secretory protein in the red alga Pyropia yezoensis, enhances heat tolerance

  • Han, Narae;Wi, Jiwoong;Im, Sungoh;Lim, Ka-Min;Lee, Hun-Dong;Jeong, Won-Joong;Kim, Geun-Joong;Kim, Chan Song;Park, Eun-Jeong;Hwang, Mi Sook;Choi, Dong-Woog
    • ALGAE
    • /
    • v.36 no.3
    • /
    • pp.207-217
    • /
    • 2021
  • An increase in seawater temperature owing to global warming is expected to substantially limit the growth of marine algae, including Pyropia yezoensis, a commercially valuable red alga. To improve our knowledge of the genes involved in the acquisition of heat tolerance in P. yezoensis, transcriptomes sequences were obtained from both the wild-type SG104 P. yezoensis and heat-tolerant mutant Gy500. We selected 1,251 differentially expressed genes that were up- or downregulated in response to the heat stress condition and in the heat-tolerant mutant Gy500, based on fragment per million reads expression values. Among them, PyHRG1 was downregulated under heat stress in SG104 and expressed at a low level in Gy500. PyHRG1 encodes a secretory protein of 26.5 kDa. PyHRG1 shows no significant sequence homology with any known genes deposited in public databases to date. However, PyHRG1 homologs were found in other red algae, including other Pyropia species. When PyHRG1 was introduced into the single-cell green alga Chlamydomonas reinhardtii, transformed cells overexpressing PyHRG1 showed severely retarded growth. These results demonstrate that PyHRG1 encodes a novel red algae-specific protein and plays a role in heat tolerance in algae. The transcriptome sequences obtained in this study, which include PyHRG1, will facilitate future studies to understand the molecular mechanisms involved in heat tolerance in red algae.

Blood transcriptome resources of chinstrap (Pygoscelis antarcticus) and gentoo (Pygoscelis papua) penguins from the South Shetland Islands, Antarctica

  • Kim, Bo-Mi;Jeong, Jihye;Jo, Euna;Ahn, Do-Hwan;Kim, Jeong-Hoon;Rhee, Jae-Sung;Park, Hyun
    • Genomics & Informatics
    • /
    • v.17 no.1
    • /
    • pp.5.1-5.9
    • /
    • 2019
  • The chinstrap (Pygoscelis antarcticus) and gentoo (P. papua) penguins are distributed throughout Antarctica and the sub-Antarctic islands. In this study, high-quality de novo assemblies of blood transcriptomes from these penguins were generated using the Illumina MiSeq platform. A total of 22.2 and 21.8 raw reads were obtained from chinstrap and gentoo penguins, respectively. These reads were assembled using the Oases assembly platform and resulted in 26,036 and 21,854 contigs with N50 values of 929 and 933 base pairs, respectively. Functional gene annotations through pathway analyses of the Gene Ontology, EuKaryotic Orthologous Groups, and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases were performed for each blood transcriptome, resulting in a similar compositional order between the two transcriptomes. Ortholog comparisons with previously published transcriptomes from the $Ad{\acute{e}}lie$ (P. adeliae) and emperor (Aptenodytes forsteri) penguins revealed that a high proportion of the four penguins' transcriptomes had significant sequence homology. Because blood and tissues of penguins have been used to monitor pollution in Antarctica, immune parameters in blood could be important indicators for understanding the health status of penguins and other Antarctic animals. In the blood transcriptomes, KEGG analyses detected many essential genes involved in the major innate immunity pathways, which are key metabolic pathways for maintaining homeostasis against exogenous infections or toxins. Blood transcriptome studies such as this may be useful for checking the immune and health status of penguins without sacrifice.

Molecular characterization and functional annotation of a hypothetical protein (SCO0618) of Streptomyces coelicolor A3(2)

  • Ferdous, Nadim;Reza, Mahjerin Nasrin;Emon, Md. Tabassum Hossain;Islam, Md. Shariful;Mohiuddin, A.K.M.;Hossain, Mohammad Uzzal
    • Genomics & Informatics
    • /
    • v.18 no.3
    • /
    • pp.28.1-28.9
    • /
    • 2020
  • Streptomyces coelicolor is a gram-positive soil bacterium which is well known for the production of several antibiotics used in various biotechnological applications. But numerous proteins from its genome are considered hypothetical proteins. Therefore, the present study aimed to reveal the functions of a hypothetical protein from the genome of S. coelicolor. Several bioinformatics tools were employed to predict the structure and function of this protein. Sequence similarity was searched through the available bioinformatics databases to find out the homologous protein. The secondary and tertiary structure were predicted and further validated with quality assessment tools. Furthermore, the active site and the interacting proteins were also explored with the utilization of CASTp and STRING server. The hypothetical protein showed the important biological activity having with two functional domain including POD-like_MBL-fold and rhodanese homology domain. The functional annotation exposed that the selected hypothetical protein could show the hydrolase activity. Furthermore, protein-protein interactions of selected hypothetical protein revealed several functional partners those have the significant role for the bacterial survival. At last, the current study depicts that the annotated hypothetical protein is linked with hydrolase activity which might be of great interest to the further research in bacterial genetics.

A Study on the Detection of Similarity GPCRs by using protein Secondary structure (단백질 2차 구조를 이용한 유사 GPCR 검출에 관한 연구)

  • Ku, Ja-Hyo;Han, Chan-Myung;Yoon, Young-Woo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.1
    • /
    • pp.73-80
    • /
    • 2009
  • G protein-coupled receptors(GPCRs) family is a cell membrane protein, and plays an important role in a signaling mechanism which transmits external signals through cell membranes into cells. But, GPCRs each are known to have various complex control mechanisms and very unique signaling mechanisms. Structural features, and family and subfamily of GPCRs are well known by function. and accordingly, the most fundamental work in studies identifying the previous GPCRs is to classify the GPCRs with given protein sequences. Studies for classifying previously identified GPCRs more easily with mathematical models have been mainly going on. In this paper Considering that functions of proteins are determined by their stereoscopic structures, the present paper proposes a method to compare secondary structures of two GPCRs having different amino acid sequences, and then detect an unknown GPCRs assumed to have a same function in databases of previously identified GPCRs.

A Physical Data Design and Query Routing Technique of High Performance BLAST on E-Cluster (고성능 BLAST구현을 위한 E-Cluster 기반 데이터 분할 및 질의 라우팅 기법)

  • Kim, Tae-Kyung;Cho, Wan-Sup
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.2
    • /
    • pp.139-147
    • /
    • 2009
  • BLAST (Basic Local Alignment Search Tool) is a best well-known tool in a bioinformatics area. BLAST quickly compares input sequences with annotated huge sequence databases and predicts their functions. It helps biologists to make it easy to annotate newly found sequences with reduced experimental time, scope, and cost. However, as the amount of sequences is increasing remarkably with the advance of sequencing machines, performance of BLAST has been a critical issue and tried to solve it with several alternatives. In this paper, we propose a new PC-Based Cluster system (E-Cluster), a new physical data design methodology (logical partitioning technique) and a query routing technique (intra-query routing). To verify our system, we measure response time, speedup, and efficiency for various sizes of sequences in NR (Non-Redundancy) database. Experimental result shows that proposed system has better speedup and efficiency (maximum 600%) than those o( conventional approaches such as SMF machines, clusters, and grids.

The Brassica rapa Tissue-specific EST Database (배추의 조직 특이적 발현유전자 데이터베이스)

  • Yu, Hee-Ju;Park, Sin-Gi;Oh, Mi-Jin;Hwang, Hyun-Ju;Kim, Nam-Shin;Chung, Hee;Sohn, Seong-Han;Park, Beom-Seok;Mun, Jeong-Hwan
    • Horticultural Science & Technology
    • /
    • v.29 no.6
    • /
    • pp.633-640
    • /
    • 2011
  • Brassica rapa is an A genome model species for Brassica crop genetics, genomics, and breeding. With the completion of sequencing the B. rapa genome, functional analysis of the genome is forthcoming issue. The expressed sequence tags are fundamental resources supporting annotation and functional analysis of the genome including identification of tissue-specific genes and promoters. As of July 2011, 147,217 ESTs from 39 cDNA libraries of B. rapa are reported in the public database. However, little information can be retrieved from the sequences due to lack of organized databases. To leverage the sequence information and to maximize the use of publicly-available EST collections, the Brassica rapa tissue-specific EST database (BrTED) is developed. BrTED includes sequence information of 23,962 unigenes assembled by StackPack program. The unigene set is used as a query unit for various analyses such as BLAST against TAIR gene model, functional annotation using MIPS and UniProt, gene ontology analysis, and prediction of tissue-specific unigene sets based on statistics test. The database is composed of two main units, EST sequence processing and information retrieving unit and tissue-specific expression profile analysis unit. Information and data in both units are tightly inter-connected to each other using a web based browsing system. RT-PCR evaluation of 29 selected unigene sets successfully amplified amplicons from the target tissues of B. rapa. BrTED provided here allows the user to identify and analyze the expression of genes of interest and aid efforts to interpret the B. rapa genome through functional genomics. In addition, it can be used as a public resource in providing reference information to study the genus Brassica and other closely related crop crucifer plants.

Identification of Lettuce Germplasms and Commercial Cultivars Using SSR Markers Developed from EST (EST로부터 개발된 SSR 마커를 이용한 상추 유전자원 및 유통품종의 식별)

  • Hong, Jee-Hwa;Kwon, Yong-Sham;Choi, Keun-Jin;Mishra, Raghvendra Kumar;Kim, Doo Hwan
    • Horticultural Science & Technology
    • /
    • v.31 no.6
    • /
    • pp.772-781
    • /
    • 2013
  • The objective of this study was to develop simple sequence repeat (SSR) markers from expressed sequence tags (EST) of lettuce (Lactuca sativa) and identify 9 germplasms from 3 wild species of lettuce and 61 commercial cultivars using the developed EST-SSR markers. A total of 81,330 lettuce ESTs from NCBI databases were used to search for SSR and 4,229 SSR loci were identified. The highest proportion (59.12%, 2500) was represented by trinucleotide, followed by dinucleotide (29.70%, 1256) and hexanucleotide (6.62%, 280) among SSR repeat motifs. Totally 474 EST-SSR primers were developed from EST and a random set of 267 primers was used to assess the genetic diversity among 9 germplasms and 61 cultivars. Out of 267 primers, 47 EST-SSR markers showed polymorphism between 7 cultivars. Twenty-six EST-SSR markers among 47 EST-SSR markers showed high polymorphism, reproducibility, and band clearance. The relationship between 26 markers genotypes and 70 accessions was analyzed. Totally 127 polymorphic amplified fragments were obtained by 26 EST-SSR markers and two to nine SSR alleles were detected for each locus with an average of 4.88 alleles per locus. Average polymorphism information content was 0.542, ranging from 0.269 to 0.768. Genetic distance of clusters ranged from 0.05 to 0.94 between 70 accessions and dendrogram at a similarity of 0.34 gave 7 main clusters. Analysis of genetic diversity revealed by these 26 EST-SSR markers showed that the 9 germplasms and 61 commercial cultivars were discriminated by marker genotypes. These newly developed EST-SSR markers will be useful for cultivar identification and distinctness, uniformity and stability test of lettuce.

Species-specific Expression of Rpia Transcript in Cumulus-oocyte-complex (난자-난구세포 복합체에서 발현하는 Rpia 유전자의 종 특이적 발현)

  • Kim, Yun-Sun;Yoon, Se-Jin;Kim, Eun-Young;Lee, Kyung-Ah
    • Clinical and Experimental Reproductive Medicine
    • /
    • v.34 no.2
    • /
    • pp.95-106
    • /
    • 2007
  • Objective: We previously identified differentially expressed genes (DEGs) between germinal vesicle (GV) and metaphase II (MII) mouse oocyte. The present study was accomplished as a preliminary study to elucidate the role of ribose 5-phosphate isomerase A (Rpia), the essential enzyme of the pentose phosphate pathway (PPP), in oocyte maturation. We observed expression of Rpia in the mouse and porcine oocytes. Methods: Expression pattern of the 11 MII-selective DEGs in various tissues was evaluated using RT-PCR and selected 4 genes highly expressed in the ovary. According to the oocyte-selective expression profile, we selected Rpia as a target for this study. We identified the porcine Rpia sequence using EST clustering technique, since it is not yet registered in public databases. Results: The extended porcine Rpia nucleotide sequence was submitted and registered to GenBank (accession number EF213106). We prepared primers for porcine Rpia according to this sequence. In contrast to the oocyte-specific expression in the mouse, Rpia was expressed in porcine cumulus and granulosa cells as well as in oocytes. Conclusion: This is the first report on the characterization of the Rpia gene in the mouse and porcine ovarian cells. Results of the present study suggest that the mouse and porcine COCs employ different mechanism of glucose metabolism. Therefore, the different metabolic pathways during in vitro oocyte maturation (IVM) in different species may lead different maturation rates. It is required to study further regarding the role of Rpia in glucose metabolism of oocytes and follicular cell fore exploring the regulatory mechanism of oocyte maturation as well as for finding the finest culture conditions for in vitro maturation.

Dominance of Endospore-forming Bacteria on a Rotating Activated Bacillus Contactor Biofilm for Advanced Wastewater Treatment

  • Park, Seong-Joo;Yoon, Jerng-Chang;Shin, Kwang-Soo;Kim, Eung-Ho;Yim, Soo-Bin;Cho, Yeon-Je;Sung, Gi-Moon;Lee, Dong-Geun;Kim, Seung-Bum;Lee, Dong-Uk;Woo, Sung-Hoon;Koopman, Ben
    • Journal of Microbiology
    • /
    • v.45 no.2
    • /
    • pp.113-121
    • /
    • 2007
  • The bacterial diversity inherent to the biofilm community structure of a modified rotating biological contactor wastewater treatment process, referred to as the Rotating Activated Bacillus Contactor (RABC) process, was characterized in this study, via both culture-dependent and culture-independent methods. On the basis of culture-dependent methods, Bacillus sp. were found to exist in large numbers on the biofilm (6.5% of the heterotrophic bacteria) and the microbial composition of the biofilms was quite simple. Only three phyla were identified-namely, the Proteobacteria, the Actinobacteria (High G+C Gram-positive bacteria), and the Firmicutes (Low G+C Gram-positive bacteria). The culture-independent partial 16S rDNA sequence analysis revealed a considerably more diverse microbial composition within the biofilms. A total of eight phyla were recovered in this case, three of which were major groups: the Firmicutes (43.9%), the Proteobacteria (28.6%), and the Bacteroidetes (17.6%). The remaining five phyla were minor groups: the Planctomycetes (4.4%), the Chlorobi (2.2%), the Actinobacteria (1.1%), the Nitrospirae (1.1%), and the Verrucomicrobia (1.1%). The two most abundant genera detected were the endospore-forming bacteria (31.8%), Clostridium and Bacillus, both of which are members of the Firmicutes phylum. This finding indicates that these endospore-forming bacteria successfully colonized and dominated the RABC process biofilms. Many of the colonies or clones recovered from the biofilms evidenced significantly high homology in the 16S rDNA sequences of bacteria stored in databases associated with advanced wastewater treatment capabilities, including nitrification and denitrification, phosphorus accumulation, the removal of volatile odors, and the removal of chlorohydrocarbons or heavy metals. The microbial community structures observed in the biofilms were found to correlate nicely with the enhanced performance of advanced wastewater treatment protocols.

Analysis, Detection and Prediction of some of the Structural Motifs in Proteins

  • Guruprasad, Kunchur
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.325-330
    • /
    • 2005
  • We are generally interested in the analysis, detection and prediction of structural motifs in proteins, in order to infer compatibility of amino acid sequence to structure in proteins of known three-dimensional structure available in the Protein Data Bank. In this context, we are analyzing some of the well-characterized structural motifs in proteins. We have analyzed simple structural motifs, such as, ${\beta}$-turns and ${\gamma}$-turns by evaluating the statistically significant type-dependent amino acid positional preferences in enlarged representative protein datasets and revised the amino acid preferences. In doing so, we identified a number of ‘unexpected’ isolated ${\beta}$-turns with a proline amino acid residue at the (i+2) position. We extended our study to the identification of multiple turns, continuous turns and to peptides that correspond to the combinations of individual ${\beta}$ and ${\gamma}$-turns in proteins and examined the hydrogen-bond interactions likely to stabilize these peptides. This led us to develop a database of structural motifs in proteins (DSMP) that would primarily allow us to make queries based on the various fields in the database for some well-characterized structural motifs, such as, helices, ${\beta}$-strands, turns, ${\beta}$-hairpins, ${\beta}$-${\alpha}$-${\beta}$, ${\psi}$-loops, ${\beta}$-sheets, disulphide bridges. We have recently implemented this information for all entries in the current PDB in a relational database called ODSMP using Oracle9i that is easy to update and maintain and added few additional structural motifs. We have also developed another relational database corresponding to amino acid sequences and their associated secondary structure for representative proteins in the PDB called PSSARD. This database allows flexible queries to be made on the compatibility of amino acid sequences in the PDB to ‘user-defined’ super-secondary structure conformation and vice-versa. Currently, we have extended this database to include nearly 23,000 protein crystal structures available in the PDB. Further, we have analyzed the ‘structural plasticity’ associated with the ${\beta}$-propeller structural motif We have developed a method to automatically detect ${\beta}$-propellers from the PDB codes. We evaluated the accuracy and consistency of predicting ${\beta}$ and ${\gamma}$-turns in proteins using the residue-coupled model. I will discuss results of our work and describe databases and software applications that have been developed.

  • PDF