• 제목/요약/키워드: Prediction of Binding Sites

검색결과 24건 처리시간 0.044초

Prediction of Metal Ion Binding Sites in Proteins from Amino Acid Sequences by Using Simplified Amino Acid Alphabets and Random Forest Model

  • Kumar, Suresh
    • Genomics & Informatics
    • /
    • 제15권4호
    • /
    • pp.162-169
    • /
    • 2017
  • Metal binding proteins or metallo-proteins are important for the stability of the protein and also serve as co-factors in various functions like controlling metabolism, regulating signal transport, and metal homeostasis. In structural genomics, prediction of metal binding proteins help in the selection of suitable growth medium for overexpression's studies and also help in obtaining the functional protein. Computational prediction using machine learning approach has been widely used in various fields of bioinformatics based on the fact all the information contains in amino acid sequence. In this study, random forest machine learning prediction systems were deployed with simplified amino acid for prediction of individual major metal ion binding sites like copper, calcium, cobalt, iron, magnesium, manganese, nickel, and zinc.

Bioinformatic Prediction of SNPs within miRNA Binding Sites of Inflammatory Genes Associated with Gastric Cancer

  • Song, Chuan-Qing;Zhang, Jun-Hui;Shi, Jia-Chen;Cao, Xiao-Qin;Song, Chun-Hua;Hassan, Adil;Wang, Peng;Dai, Li-Ping;Zhang, Jian-Ying;Wang, Kai-Juan
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권2호
    • /
    • pp.937-943
    • /
    • 2014
  • Polymorphisms in miRNA binding sites have been shown to affect miRNA binding to target genes, resulting in differential mRNA and protein expression and susceptibility to common diseases. Our purpose was to predict SNPs (single nucleotide polymorphisms) within miRNA binding sites of inflammatory genes in relation to gastric cancer. A complete list of SNPs in the 3'UTR regions of all inflammatory genes associated with gastric cancer was obtained from Pubmed. miRNA target prediction databases (MirSNP, Targetscan Human 6.2, PolymiRTS 3.0, miRNASNP 2.0, and Patrocles) were used to predict miRNA target sites. There were 99 SNPs with MAF>0.05 within the miRNA binding sites of 41 genes among 72 inflammation-related genes associated with gastric cancer. NF-${\kappa}B$ and JAK-STAT are the two most important signaling pathways. 47 SNPs of 25 genes with 95 miRNAs were predicted. CCL2 and IL1F5 were found to be the shared target genes of hsa-miRNA-624-3p. Bioinformatic methods could identify a set of SNPs within miRNA binding sites of inflammatory genes, and provide data and direction for subsequent functional verification research.

Determinants of Functional MicroRNA Targeting

  • Hyeonseo Hwang;Hee Ryung Chang;Daehyun Baek
    • Molecules and Cells
    • /
    • 제46권1호
    • /
    • pp.21-32
    • /
    • 2023
  • MicroRNAs (miRNAs) play cardinal roles in regulating biological pathways and processes, resulting in significant physiological effects. To understand the complex regulatory network of miRNAs, previous studies have utilized massivescale datasets of miRNA targeting and attempted to computationally predict the functional targets of miRNAs. Many miRNA target prediction tools have been developed and are widely used by scientists from various fields of biology and medicine. Most of these tools consider seed pairing between miRNAs and their mRNA targets and additionally consider other determinants to improve prediction accuracy. However, these tools exhibit limited prediction accuracy and high false positive rates. The utilization of additional determinants, such as RNA modifications and RNA-binding protein binding sites, may further improve miRNA target prediction. In this review, we discuss the determinants of functional miRNA targeting that are currently used in miRNA target prediction and the potentially predictive but unappreciated determinants that may improve prediction accuracy.

Prediction of Mammalian MicroRNA Targets - Comparative Genomics Approach with Longer 3' UTR Databases

  • Nam, Seungyoon;Kim, Young-Kook;Kim, Pora;Kim, V. Narry;Shin, Seokmin;Lee, Sanghyuk
    • Genomics & Informatics
    • /
    • 제3권3호
    • /
    • pp.53-62
    • /
    • 2005
  • MicroRNAs play an important role in regulating gene expression, but their target identification is a difficult task due to their short length and imperfect complementarity. Burge and coworkers developed a program called TargetScan that allowed imperfect complementarity and established a procedure favoring targets with multiple binding sites conserved in multiple organisms. We improved their algorithm in two major aspects - (i) using well-defined UTR (untranslated region) database, (ii) examining the extent of conservation inside the 3' UTR specifically. Average length in our UTR database, based on the ECgene annotation, is more than twice longer than the Ensembl. Then, TargetScan was used to identify putative binding sites. The extent of conservation varies significantly inside the 3' UTR. We used the 'tight' tracks in the UCSC genome browser to select the conserved binding sites in multiple species. By combining the longer 3' UTR data, TargetScan, and tightly conserved blocks of genomic DNA, we identified 107 putative target genes with multiple binding sites conserved in multiple species, of which 85 putative targets are novel.

Inferring Single Nucleotide Polymorphisms in MicroRNA Binding Sites of Lung Cancer-related Inflammatory Genes

  • He, Fei;Zheng, Ling-Ling;Luo, Wen-Ting;Yang, Rong;Xu, Xiao-Qin;Cai, Lin
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권8호
    • /
    • pp.3601-3606
    • /
    • 2014
  • Single nucleotide polymorphisms located at microRNA (miRNA)-binding sites are likely to affect the expression of miRNA targets and may contribute to the susceptibility of humans to common diseases. Here 335 candidate lung cancer-related inflammatory genes were selected according to the existing literature and database. We identified putative miRNA-binding sites of 149 genes by specialised algorithms and screened SNPs in the 3'UTRs of these genes. By calculating binding free energy, we sorted 269 SNPs on the basis of the possibility of prediction. The proposed approach could help to easy the identification of functionally relevant SNPs and minimize the workflow and the costs.

고래의 게놈에서 hypoxia-inducible factor binding site의 예측과 target gene에 대한 분석 (Prediction of Hypoxia-inducible Factor Binding Site in Whale Genome and Analysis of Target Genes Regulated by Predicted Sites)

  • 임형순;이재학
    • 한국해양바이오학회지
    • /
    • 제7권2호
    • /
    • pp.35-41
    • /
    • 2015
  • Whales are marine mammals that are fully adapted to aquatic environment. Whales breathe by lungs so they require adaptive system to low oxygen concentration (hypoxia) while deep and prolonged diving. However, the study for the molecular mechanism underlying cetacean adaptation to hypoxia has been limited. Hypoxia-inducible factor (HIF) is the central transcription factor that regulates hypoxia-related gene expression. Here we identified HIF-binding sites in whale genome by phylogenetic footprinting and analyzed HIF-target genes to understand how whales cope with hypoxia. By comparison with the HIF-target genes of terrestrial mammals, it was suggested that whales may retain unique adaptation mechanisms to hypoxia.

Theoretical Investigations on Structure and Function of Human Homologue hABH4 of E.coli ALKB4

  • Shankaracharya, Shankaracharya;Das, Saibal;Prasad, Dinesh;Vidyarthi, Ambarish Sharan
    • Interdisciplinary Bio Central
    • /
    • 제2권3호
    • /
    • pp.8.1-8.5
    • /
    • 2010
  • Introduction: Recently identified human homologues of ALKB protein have shown the activity of DNA damaging drugs, used for cancer therapy. Bioinformatics study of hABH2 and hABH3 had led to the discovery of a novel DNA repair mechanism. Very little is known about structure and function of hABH4, one of the members of this superfamily. Therefore, in present study we are intended to predict its structure and function through various bioinformatics tools. Materials and Methods: Modeling was done with modeler 9v7 to predict the 3D structure of the hABH4 protein. This model was validated with the program Procheck using Ramachandran plot statistics and was submitted to PMDB with ID PM0076284. The 3d2GO server was used to predict the functions. Residues at protein ligand and protein RNA binding sites were predicted with 3dLigandSite and KYG programs respectively. Results and Discussion: 3-D model of hABH4, ALKBH4.B99990003.pdb was predicted and evaluated. Validation result showed that 96.4 % residues lies in favored and additional allowed region of Ramachandran plot. Ligand binding residues prediction showed four Ligand clusters, having 24 ligands in cluster 1. Importantly, conserved pattern of Glu196-X-Pro198- Xn-His254 in the functional domain was detected. DNA and RNA binding sites were also predicted in the model. Conclusion and Prospects: The predicted and validated model of human homologue hABH4 resulted from this study may unveil the mechanism of DNA damage repair in human and accelerate the research on designing of appropriate inhibitors aiding in chemotherapy and cancer related diseases.

Importance of Accurate Charges in Binding Affinity Calculations: A Case of Neuraminidase Series

  • Park, Kichul;Sung, Nack Kyun;Cho, Art E.
    • Bulletin of the Korean Chemical Society
    • /
    • 제34권2호
    • /
    • pp.545-548
    • /
    • 2013
  • It has been shown that calculating atomic charges using quantum mechanical level theory greatly improves the accuracy of docking. A protocol was developed and shown to be effective. That this protocol works is just a manifestation of the fact that electrostatic interactions are important in protein-ligand binding. In order to investigate how the same protocol helps in prediction of binding affinities, we took a series of known cocrystal structures of influenza neuraminidase inhibitors with the receptor and performed docking with Glide SP, Glide XP, and QPLD, the last being a workflow that incorporates QM/MM calculations to replace the fixed atomic charges of force fields with quantum mechanically recalculated ones at a given docking pose, and predicted the binding affinities of each cocrystal. The correlation with experimental binding affinities considerably improved with QPLD compared to Glide SP/XP yielding $r^2$ = 0.83. The results suggest that for binding sites, such as that of neuraminidase, which are laden with hydrophilic residues, protocols such as QPLD which utilizes QM-based atomic charges can better predict the binding affinities.

SVM 모델을 이용한 3차원 패치 기반 단백질 상호작용 사이트 예측기법 (Prediction of Protein-Protein Interaction Sites Based on 3D Surface Patches Using SVM)

  • 박성희
    • 정보처리학회논문지D
    • /
    • 제19D권1호
    • /
    • pp.21-28
    • /
    • 2012
  • 모노머 단백질의 상호작용 사이트 예측은 기능을 알지 못하는 단백질에 대해서 이것과 상호작용하는 단백질로부터 기능을 예측하거나 단백질 도킹을 위한 검색 공간의 감소에 중요한 역할을 한다. 그러나 상호작용사이트 예측은 대부분 단백질 상호작용이 세포 내에서 순간적 반응에 일어나는 약한 상호작용으로 실험에 의한 3차원 결정 구조 식별의 어려움이 따르며 이로 인해 3차원의 복합체 데이터가 제한적으로 양산된다. 이 논문에서는 모노머 단백질의 3차원 패치 계산을 통하여 구조가 알려진 복합체의 상호작용사이트와 비상호작용사이트에 대한 패치 속성을 추출하고 이를 기반으로 Support Vector Machine (SVM) 분류기법을 이용한 예측 모델 개발을 제시한다. 타겟 클래스의 데이터 불균형 문제 해결을 위해 under-sampling 기법을 이용한다. 사용된 패치속성은 2차 구조 요소와 아미노산 구성으로부터 총 9개가 추출된다. 147개의 단백질 복합체에 대해서 10 fold cross validation을 통해서 다양한 분류모델의 성능 평가를 하였다. 평가한 분류 모델 중 SVM은 92.7%의 높은 정확성을 보이고 이를 이용하여 분류 모델을 개발하였다.

Study of protein loop conformational changes by free energy estimation using colony energy

  • Kang, Beom Chang;Lee, Gyu Rie;Seok, Chaok
    • EDISON SW 활용 경진대회 논문집
    • /
    • 제3회(2014년)
    • /
    • pp.63-74
    • /
    • 2014
  • Predicting protein loop structures is an important modeling problem since protein loops are often involved in diverse biological functions by participating in enzyme active sites, ligand binding sites, etc. However, loop structure prediction is difficult even when structures of homologous proteins are known due to large sequence and structure variability among loops of homologous proteins. Therefore, an ab initio approach is necessary to solve loop modeling problems. One of the difficulties in the development of ab initio loop modeling method is to derive an accurate scoring function that closely approximates the true free energy function. In particular, entropy as well as energy contribution have to be considered adequately for loops because loops tend to be flexible compared to other parts of protein. In this study, the contribution of conformational entropy is considered in scoring loop conformations by employing "colony energy" which was previously proposed to estimate the free energy for an ensemble of conformations. Loop conformations were generated by using two EDISON_Chem programs GalaxyFill and GalaxySC, and colony energy was designed for this sampling by tuning relevant parameters. On a test set of 40 loops, the accuracy of predicted loop structure improved on average by scoring with the colony energy compared to scoring by energy alone. In addition, high correlation between colony energy and deviation from the native structure suggested that more extensive sampling can further improve the prediction accuracy. In another test on 6 ligand-binding loops that show conformational changes by ligand binding, both ligand-free and ligand-bound states could be identified by using colony energy when no information on the ligand-bound conformation is used.

  • PDF