• 제목/요약/키워드: protein structure analysis

검색결과 535건 처리시간 0.023초

Protein Backbone Torsion Angle-Based Structure Comparison and Secondary Structure Database Web Server

  • Jung, Sunghoon;Bae, Se-Eun;Ahn, Insung;Son, Hyeon S.
    • Genomics & Informatics
    • /
    • 제11권3호
    • /
    • pp.155-160
    • /
    • 2013
  • Structural information has been a major concern for biological and pharmaceutical studies for its intimate relationship to the function of a protein. Three-dimensional representation of the positions of protein atoms is utilized among many structural information repositories that have been published. The reliability of the torsional system, which represents the native processes of structural change in the structural analysis, was partially proven with previous structural alignment studies. Here, a web server providing structural information and analysis based on the backbone torsional representation of a protein structure is newly introduced. The web server offers functions of secondary structure database search, secondary structure calculation, and pair-wise protein structure comparison, based on a backbone torsion angle representation system. Application of the implementation in pair-wise structural alignment showed highly accurate results. The information derived from this web server might be further utilized in the field of ab initio protein structure modeling or protein homology-related analyses.

Enhanced Chemical Shift Analysis for Secondary Structure prediction of protein

  • Kim, Won-Je;Rhee, Jin-Kyu;Yi, Jong-Jae;Lee, Bong-Jin;Son, Woo Sung
    • 한국자기공명학회논문지
    • /
    • 제18권1호
    • /
    • pp.36-40
    • /
    • 2014
  • Predicting secondary structure of protein through assigned backbone chemical shifts has been used widely because of its convenience and flexibility. In spite of its usefulness, chemical shift based analysis has some defects including isotopic shifts and solvent interaction. Here, it is shown that corrected chemical shift analysis for secondary structure of protein. It is included chemical shift correction through consideration of deuterium isotopic effect and calculate chemical shift index using probability-based methods. Enhanced method was applied successfully to one of the proteins from Mycobacterium tuberculosis. It is suggested that correction of chemical shift analysis could increase accuracy of secondary structure prediction of protein and small molecule in solution.

Computational Approaches for Structural and Functional Genomics

  • Brenner, Steven-E.
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2000년도 International Symposium on Bioinformatics
    • /
    • pp.17-20
    • /
    • 2000
  • Structural genomics aims to provide a good experimental structure or computational model of every tractable protein in a complete genome. Underlying this goal is the immense value of protein structure, especially in permitting recognition of distant evolutionary relationships for proteins whose sequence analysis has failed to find any significant homolog. A considerable fraction of the genes in all sequenced genomes have no known function, and structure determination provides a direct means of revealing homology that may be used to infer their putative molecular function. The solved structures will be similarly useful for elucidating the biochemical or biophysical role of proteins that have been previously ascribed only phenotypic functions. More generally, knowledge of an increasingly complete repertoire of protein structures will aid structure prediction methods, improve understanding of protein structure, and ultimately lend insight into molecular interactions and pathways. We use computational methods to select families whose structures cannot be predicted and which are likely to be amenable to experimental characterization. Methods to be employed included modern sequence analysis and clustering algorithms. A critical component is consultation of the presage database for structural genomics, which records the community's experimental work underway and computational predictions. The protein families are ranked according to several criteria including taxonomic diversity and known functional information. Individual proteins, often homologs from hyperthermophiles, are selected from these families as targets for structure determination. The solved structures are examined for structural similarity to other proteins of known structure. Homologous proteins in sequence databases are computationally modeled, to provide a resource of protein structure models complementing the experimentally solved protein structures.

  • PDF

In silico annotation of a hypothetical protein from Listeria monocytogenes EGD-e unfolds a toxin protein of the type II secretion system

  • Maisha Tasneem;Shipan Das Gupta;Monira Binte Momin;Kazi Modasser Hossain;Tasnim Binta Osman;Fazley Rabbi
    • Genomics & Informatics
    • /
    • 제21권1호
    • /
    • pp.7.1-7.11
    • /
    • 2023
  • The gram-positive bacterium Listeria monocytogenes is an important foodborne intracellular pathogen that is widespread in the environment. The functions of hypothetical proteins (HP) from various pathogenic bacteria have been successfully annotated using a variety of bioinformatics strategies. In this study, a HP Imo0888 (NP_464414.1) from the Listeria monocytogenes EGD-e strain was annotated using several bioinformatics tools. Various techniques, including CELLO, PSORTb, and SOSUIGramN, identified the candidate protein as cytoplasmic. Domain and motif analysis revealed that the target protein is a PemK/MazF-like toxin protein of the type II toxin-antitoxin system (TAS) which was consistent with BLASTp analysis. Through secondary structure analysis, we found the random coil to be the most frequent. The Alpha Fold 2 Protein Structure Prediction Database was used to determine the three-dimensional (3D) structure of the HP using the template structure of a type II TAS PemK/MazF family toxin protein (DB ID_AFDB: A0A4B9HQB9) with 99.1% sequence identity. Various quality evaluation tools, such as PROCHECK, ERRAT, Verify 3D, and QMEAN were used to validate the 3D structure. Following the YASARA energy minimization method, the target protein's 3D structure became more stable. The active site of the developed 3D structure was determined by the CASTp server. Most pathogens that harbor TAS create a crucial risk to human health. Our aim to annotate the HP Imo088 found in Listeria could offer a chance to understand bacterial pathogenicity and identify a number of potential targets for drug development.

Analysis of the Globular Nature of Proteins

  • Jung, Sung-Hoon;Son, Hyeon-Seok
    • Genomics & Informatics
    • /
    • 제9권2호
    • /
    • pp.74-78
    • /
    • 2011
  • Numerous restraints and simplifications have been developed for methods that anticipate protein structure to reduce the colossal magnitude of possible conformational states. In this study, we investigated if globularity is a general characteristic of proteins and whether they can be applied as a valid constraint in protein structure simulations with approximated measurements (Gb-index). Unexpectedly, most of the proteins showed strong structural globularity (i.e., mode of approximately 76% similarity to the perfect globe) with only a few percent of proteins being outliers. Small proteins tended to be significantly non-globular ($R^2$=0.79) and the minimum Gb-index showed a logarithmic increase with the increase in protein size ($R^2$=0.62), strongly implying that the non-globular characteristics might be more acceptable for smaller proteins than larger ones. The strong perfect globe-like character and the relationship between small size and the loss of globular structure of a protein may imply that living organisms have mechanisms to aid folding into the globular structure to reduce irreversible aggregation. This also implies the possible mechanisms of diseases caused by protein aggregation, including some forms of trinucleotide repeat expansion-mediated diseases.

기하 인스턴싱 기법을 이용한 단백질 구조 가시화 및 속도 향상에 관한 연구 (The Study of Protein Structure Visualization and Rendering Speed Using the Geometry Instancing)

  • 박찬용;황치정
    • 정보처리학회논문지A
    • /
    • 제16A권3호
    • /
    • pp.153-158
    • /
    • 2009
  • 구조적 생물 정보학 분야는 단백질의 3차원 구조를 대상으로 단백질을 연구하는 분야이며, 구조적생물 정보학의 중요한 분야 중의 하나는 단백질 3차원 구조 가시화이다. 단백질의 3차원 구조를 규명하는 장비의 발달로, 규명되는 단백질의 크기와 개수가 증가함에 따라, 고성능의 단백질 가시화 시스템의 필요성도 크게 증가하였으나, 기존의 단백질 구조 가시화 시스템은 3차원 그래픽 하드웨어에 최적화 되지 못하여, 거대 단백질의 가시화에 충분한 성능을 가지지 못하였다. 본 논문에서 제안하는 단백질 3차원 구조 가시화 시스템은 거대 단백질의 가시화 하기 위하여, 3차원 그래픽 하드웨어의 최적화 기법중의 하나인 기하 인스턴싱 기법을 사용하여 빠르게 거대 단백질을 렌더링 한다. 성능 실험에서 7종의 다른 크기의 단백질을 대상으로, 4가지 가시화 방법에 대하여, 제안하는 시스템과 기존의 시스템과의 단백질 렌더링 성능 비교 실험을 하여, 대부분의 경우 우수한 성능을 보였다.

Recent advances of 17O NMR spectroscopy

  • Lin, Yuxi;Kim, Hak Nam;Lee, Young-Ho
    • 한국자기공명학회논문지
    • /
    • 제23권2호
    • /
    • pp.56-60
    • /
    • 2019
  • Study on the structure and dynamics of molecules at the atomic level is of great significance for understanding their function and stability as well as roles for various chemico-physical and biological processes. $^{17}O$ NMR spectroscopy has appeared as an elegant technique for investigating of the physicochemical and structural properties of oxygen-containing compounds such as metal organic frameworks and nanosized oxides. This method has drawn much attention as it provides unique insights into the properties of targets based on atomistic information of local oxygen environments which is otherwise difficult to obtain using other methods. In this mini review, we introduce and discuss the recent study and developments of $^{17}O$ NMR techniques which are tailored for the investigation on the structure and dynamics of water and inorganic materials.

Structural and Biochemical Characterization of the Two Drosophila Low Molecular Weight-Protein Tyrosine Phosphatases DARP and Primo-1

  • Lee, Hye Seon;Mo, Yeajin;Shin, Ho-Chul;Kim, Seung Jun;Ku, Bonsu
    • Molecules and Cells
    • /
    • 제43권12호
    • /
    • pp.1035-1045
    • /
    • 2020
  • The Drosophila genome contains four low molecular weight-protein tyrosine phosphatase (LMW-PTP) members: Primo-1, Primo-2, CG14297, and CG31469. The lack of intensive biochemical analysis has limited our understanding of these proteins. Primo-1 and CG31469 were previously classified as pseudophosphatases, but CG31469 was also suggested to be a putative protein arginine phosphatase. Herein, we present the crystal structures of CG31469 and Primo-1, which are the first Drosophila LMW-PTP structures. Structural analysis showed that the two proteins adopt the typical LMW-PTP fold and have a canonically arranged P-loop. Intriguingly, while Primo-1 is presumed to be a canonical LMW-PTP, CG31469 is unique as it contains a threonine residue at the fifth position of the P-loop motif instead of highly conserved isoleucine and a characteristically narrow active site pocket, which should facilitate the accommodation of phosphoarginine. Subsequent biochemical analysis revealed that Primo-1 and CG31469 are enzymatically active on phosphotyrosine and phosphoarginine, respectively, refuting their classification as pseudophosphatases. Collectively, we provide structural and biochemical data on two Drosophila proteins: Primo-1, the canonical LMW-PTP protein, and CG31469, the first investigated eukaryotic protein arginine phosphatase. We named CG31469 as DARP, which stands for Drosophila ARginine Phosphatase.

단백질 분자 내 α-헬릭스의 재구성 (Reconstruction of α-helices in a Protein Molecule)

  • 강범식;김구진;서우덕
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제3권4호
    • /
    • pp.163-168
    • /
    • 2014
  • 단백질 분자 내에서 ${\alpha}$-헬릭스는 단백질의 구조나 기능, 그리고 다른 단백질과의 결합, 활성 조절 등에 있어 중요한 역할을 하며, 이에 따라 헬릭스에 대한 구조적인 분석이 연구되어 왔다. ${\alpha}$-헬릭스는 그 중심축을 기준으로 다른 ${\alpha}$-헬릭스와의 상호위치를 평가하기 때문에 길게 휘어지거나 꺾인 ${\alpha}$-헬릭스들을 한 개의 헬릭스로 해석할 경우에는 단백질의 구조 분석에 있어서 상당한 오차가 발생할 수 있다. 본 논문에서는 PDB 파일 내에 표시된 단백질 분자의 ${\alpha}$-헬릭스를 주어진 오차 범위 내에서 여러 개의 곧은 형태의 헬릭스로 재구성하는 알고리즘을 제안한다.

Identification of Viral Taxon-Specific Genes (VTSG): Application to Caliciviridae

  • Kang, Shinduck;Kim, Young-Chang
    • Genomics & Informatics
    • /
    • 제16권4호
    • /
    • pp.23.1-23.5
    • /
    • 2018
  • Virus taxonomy was initially determined by clinical experiments based on phenotype. However, with the development of sequence analysis methods, genotype-based classification was also applied. With the development of genome sequence analysis technology, there is an increasing demand for virus taxonomy to be extended from in vivo and in vitro to in silico. In this study, we verified the consistency of the current International Committee on Taxonomy of Viruses taxonomy using an in silico approach, aiming to identify the specific sequence for each virus. We applied this approach to norovirus in Caliciviridae, which causes 90% of gastroenteritis cases worldwide. First, based on the dogma "protein structure determines its function," we hypothesized that the specific sequence can be identified by the specific structure. Firstly, we extracted the coding region (CDS). Secondly, the CDS protein sequences of each genus were annotated by the conserved domain database (CDD) search. Finally, the conserved domains of each genus in Caliciviridae are classified by RPS-BLAST with CDD. The analysis result is that Caliciviridae has sequences including RNA helicase in common. In case of Norovirus, Calicivirus coat protein C terminal and viral polyprotein N-terminal appears as a specific domain in Caliciviridae. It does not include in the other genera in Caliciviridae. If this method is utilized to detect specific conserved domains, it can be used as classification keywords based on protein functional structure. After determining the specific protein domains, the specific protein domain sequences would be converted to gene sequences. This sequences would be re-used one of viral bio-marks.