• 제목/요약/키워드: Protein Structure Prediction

검색결과 104건 처리시간 0.02초

Duration HMM을 이용한 진핵생물 유전자 예측 프로그램 개발 (A Eukaryotic Gene Structure Prediction Program Using Duration HMM)

  • 태홍석;박기정
    • 미생물학회지
    • /
    • 제39권4호
    • /
    • pp.207-215
    • /
    • 2003
  • 주어진 염기서열에서 단백질로 코딩되는 영역을 예측하는 유전자 구조 예측은 유전자 annotation의 가장 핵심적인 부분으로 유전자 분석 및 유전체 프로젝트 전체에 큰 영향을 준다. 진핵생물의 유전자가 원핵생물의 유전자에 비해 더 복잡한 구조를 가지기 때문에 진핵생물의 유전자 구조 예측 모델 역시 원핵생물에 비해 다양하고 복잡한 모델로 구성되어 있다. 본 연구팀은 duration hidden markov model을 기본형태로 하여 진핵생물의 유전자 구조 예측 프로그램인 EGSP를 개발하였다. 이 프로그램은 각 생명체의 유전자 구조 예측에 필요한 파라메터를 생성하는 학습기능과, 이를 기반으로 핵산 서열을 입력으로 해서 단백질을 코딩하는 부위를 예측하여 출력하는 기능으로 구성되며, 최근의 프로그램들의 추세대로 복수 개 유전자 예측의 기능을 갖추고 있다. EGSP의 학습과 예측에 사용되는 각 파라메터의 전체 성능에 대한 효과 분석 등을 위해 여러 개 signal에 대한 개별 모델이 주는 효과 등을 분석하였다. 진핵생물의 유전자 구조 예측에 가장 많이 연구되는 human dataset을 이용하여 현재 개발된 유전자 구조 예측 프로그램인 GenScan과 GeneID, Morgan 등 보편적으로 사용되는 프로그램들과의 성능을 여러 가지 기준에서 비교한 결과, 본 프로그램이 실용성 있는 수준을 보여주는 것을 확인하였다. 그리고 진핵 미생물인 Saccharomyces cerevisiae로 성능을 테스트한 결과 만족할 만한 수준의 성능을 나타내는 것을 알 수 있었다.

Quantitation of relationship and development of nutrient prediction with vibrational molecular structure spectral profiles of feedstocks and co-products from canola bio-oil processing

  • Alessandra M.R.C.B. de Oliveira;Peiqiang Yu
    • Animal Bioscience
    • /
    • 제36권3호
    • /
    • pp.451-460
    • /
    • 2023
  • Objective: This program aimed to reveal the association of feed intrinsic molecular structure with nutrient supply to animals from canola feedstocks and co-products from bio-oil processing. The special objective of this study was to quantify the relationship between molecular spectral feature and nutrient availability and develop nutrient prediction equation with vibrational molecular structure spectral profiles. Methods: The samples of feedstock (canola oil seeds) and co-products (meals and pellets) from different bio-oil processing plants in Canada (CA) and China (CH) were submitted to this molecular spectroscopic technique and their protein and carbohydrate related molecular spectral features were associated with the nutritional results obtained through the conventional methods of analyses for chemical and nutrient profiles, rumen degradable and intestinal digestible parameters. Results: The results showed that the spectral structural carbohydrates spectral peak area (ca. 1,487.8 to 1,190.8 cm-1) was the carbohydrate structure that was most significant when related to various carbohydrate parameters of canola meals (p<0.05, r>0.50). And spectral total carbohydrate area (ca. 1,198.5 to 934.3 cm-1) was most significant when studying the various carbohydrate parameters of canola seeds (p<0.05, r>0.50). The spectral amide structures (ca. 1,721.2 to 1,480.1 cm-1) were related to a few chemical and nutrient profiles, Cornell Net Carbohydrate and Protein System (CNCPS) fractions, truly absorbable nutrient supply based on the Dutch protein system (DVE/OEB), and NRC systems, and intestinal in vitro protein-related parameters in co-products (canola meals). Besides the spectral amide structures, α-helix height (ca. 1,650.8 to 1,643.1 cm-1) and β-sheet height (ca. 1,633.4 to 1,625.7 cm-1), and the ratio between them have shown to be related to many protein-related parameters in feedstock (canola oil seeds). Multi-regression analysis resulted in moderate to high R2 values for some protein related equations for feedstock (canola seeds). Protein related equations for canola meals and carbohydrate related equations for canola meals and seeds resulted in weak R2 and low p values (p<0.05). Conclusion: In conclusion, the attenuated total reflectance Fourier transform infrared spectroscopy vibrational molecular spectroscopy can be a useful resource to predict carbohydrate and protein-relates nutritional aspects of canola seeds and meals.

Mining Structure Elements from RNA Structure Data, and Visualizing Structure Elements

  • Lim, Dae-Ho;Han, Kyung-Sook
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2003년도 제2차 연례학술대회 발표논문집
    • /
    • pp.268-274
    • /
    • 2003
  • Most currently known molecular structures were determined by X-ray crystallography or Nuclear Magnetic Resonance (NMR). These methods generate a large amount of structure data, even far small molecules, and consist mainly of three-dimensional atomic coordinates. These are useful for analyzing molecular structure, but structure elements at higher level are also needed for a complete understanding of structure, and especially for structure prediction. Computational approaches exist for identifying secondary structural elements in proteins from atomic coordinates. However, similar methods have not been developed for RNA due in part to the very small amount of structure data so far available, and extracting the structural elements of RNA requires substantial manual work. Since the number of three-dimensional RNA structures is increasing, a more systematic and automated method is needed. We have developed a set of algorithms for recognizing secondary and tertiary structural elements in RNA molecules and in the protein-RNA structures in protein data banks (PDB). The present work represents the first attempt at extracting RNA structure elements from atomic coordinates in structure databases. The regularities in the structure elements revealed by the algorithms should provide useful information for predicting the structure of RNA molecules bound to proteins.

  • PDF

Characterization of Lipid Binding Region of Lipoprotein Lipase

  • Lee, Jae-Bok;Kim, Tae-Woong
    • Preventive Nutrition and Food Science
    • /
    • 제4권2호
    • /
    • pp.139-144
    • /
    • 1999
  • Lipoprotein lipase (LPL) I san enzyme that catalyzed the hydrolysis of triacylglycerols of chylomicrons and VLDL to produce 20acylglycerols and fatty acids. The enzyme, LPL, is localized on the surface of the capillary endothelium and is widely distributed in extrahepatic tissues including heart, skeletal muscle and adipose tissue. LPL has been isolated from boving milk by affinity chromatography on heparin-separose in 2 M NaCL, 5mM barbital buffer, pH 7.4. To elucidate the lipid-binding regin, LPL was digested with trypsin and then separated by gel filtration. Lipid binding region of LPL has been investigated by recombining LPL peptides with DMPC vesicles. Proteolytic LPL fragments with DMPC were reassembled and stabilized by cholate. Lipid-binding region of LPL was identified by a PTH-automated protein sequencer, as AQQHYPVSAGYTK. The analysis of the secondary structure of the lipid-binding peptides revealed a higher probability of $\alpha$-helix structure compared to the whole LPL protein. The prediction of hydrophobicity of lipid -binding region was highly hydrophobic (-1.1) compared to LPL polypetide(-0.4).

  • PDF

Expression, Purification and Characterization of the BLM binding region of human Fanconi Anemia Group J Protein

  • Yeom, Kyuho;Park, Chin-Ju
    • 한국자기공명학회논문지
    • /
    • 제20권1호
    • /
    • pp.22-26
    • /
    • 2016
  • FANCJ is a DNA helicase which contributes genome stability by resolving G-quadruplex DNA from 5' to 3' direction. In addition to main ATPase helicase core, FANCJ has the protein binding region at its C-terminal part. BRCA1 and BLM are the binding partner of FANCJ and these protein-protein interactions contribute genomic stability and the proper response to replication stress. As the first attempt for studying FANCJ-BLM interaction, we prepared BLM binding region of FANCJ and characterized with CD and NMR spectroscopy. FANCJ (881-941) with N-ter 6xHis was purified as the oligomer. Secondary structure prediction based on CD data revealed that FANCJ (881-941) composed with ${\beta}$ sheet, turn and coils.$^1H-^{15}N$ HSQC spectra showed nonhomogeneous peak intensities with less number of peaks comparing than the number of amino acids in the construct. It indicated that optimization should be necessary for detailed further structural studies.

Purification and Backbone Assignment of the Hypothetical Protein MTH1821 from Methanobacterium Thermoautotrophicum H

  • Kwak, Soo-Young;Lee, Woong-Hee;Shin, Joon;Ko, Sung-Geon;Lee, Weon-Tae
    • 한국자기공명학회논문지
    • /
    • 제11권2호
    • /
    • pp.73-84
    • /
    • 2007
  • MTH1821 (UniProtKB/TrEMBL ID O27849) is a 96-residue hypothetical protein from the open reading frame of Methanobacterium thermoautotrophicum H one of the target organisms of structural genomics pilot project. Proteins which contain conserved sequence compared with MTH1821 have not been discovered yet and the functional and structural information for MTH1821 is not available. Here, we present the sequence-specific backbone resonance using multidimensional heteronuc1ear NMR spectroscopy and propose the secondary structure using GetSBY software. The backbone resonances of N, HN, $C_{\alpha}$, $C_{\beta}$, CO and $H_{\alpha}$ which are necessary for a prediction of secondary structure by GetSBY were assigned about 98% (557/568). The secondary structure of MTH1821 confirmed that it is comprised of four strand regions and two helical regions. This report will provide a valuable resource for the calculation solution structure of MTH1821 and for the other hypothetical protein that is targeted for structural-based functional discovery.

  • PDF

Association of MC4R Gene Polymorphisms with Growth and Body Composition Traits in Chicken

  • Li, Chun-Yu;Li, Hui
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제19권6호
    • /
    • pp.763-768
    • /
    • 2006
  • Genetic and pharmacological studies in mice have demonstrated a complementary role for the melanocortin 4 receptor (MC4R) in the control of food intake, energy balance and body weight. This study was designed to investigate the associations of a MC4R gene polymorphism on chicken growth and body composition traits in broiler lines divergently selected for abdominal fat. A SNP (G54C) was found in CDS region of chicken MC4R gene. The analysis of the least squares and variance revealed a significant association between the G54C SNP and BW, CW and SL at 7 wk of age, and there were significant differences in different genotypes (p<0.05). The results from protein secondary structure prediction and tertiary structure prediction showed that it appeared a helix in $13^{th}$ amino acid and two strands at $14^{th}$ and $15^{th}$ amino acid in mutant protein, respectively. It maybe induce the change of the activity or function of MC4R gene in poultry.

Backbone 1H, 15N, and 13C Resonance Assignment and Secondary Structure Prediction of HP1298 from Helicobacter pylori

  • Kim, Won-Je;Lim, Jong-Soo;Son, Woo-Sung;Ahn, Hee-Chul;Lee, Bong-Jin
    • 한국자기공명학회논문지
    • /
    • 제12권2호
    • /
    • pp.65-73
    • /
    • 2008
  • HP1298 (Swiss-Prot ID ; P65108) is an 72-residue protein from Helicobacter pylori strain 26695. The function of HP1298 was identified as Translation initiation factor IF-l based on sequence homology, and HP1298 is included in IF-l family. Here, we report the sequence-specific backbone resonance assignments of HP1298. About 97% of all the $^{1}HN$, $^{15}N$, $^{13}C{\alpha}$, $^{13}C{\beta}$, and $^{13}CO$ resonances could be assigned unambiguously. We could predict the secondary structure of HP1298, by analyzing the deviation of the $^{13}C{\alpha}$ and $^{13}C{\beta}$ shemical shifts from their respective random coil values. Secondary structure prediction shows that HP1298 consists of six $\beta$-strands. This study is a prerequisite for determining the solution structure of HP1298 and investigating the structure-function relationship of HP1298. Assigned chemical shift can be used for the study on interaction between HP1298 and other Helicobacter pylori proteins.

단백질 가시화 형태에 따른 정보표현적합도 평가 (Evaluation of Information Representation Goodness-of-fit According to Protein Visualization Pattern)

  • 변재희;최유주;서정근
    • 인터넷정보학회논문지
    • /
    • 제16권2호
    • /
    • pp.117-125
    • /
    • 2015
  • 단백질 기능을 규명하는 단백질 구조 정보는 단백질 의약품의 약효를 증진시키고, 개발을 단축시키는데 큰 영향을 미친다. 따라서 단백질의 구조를 효과적으로 분석하기 위한 단백질 가시화에 대한 연구가 증가하고 있다. 하지만 단백질 가시화에 대한 연구가 단백질의 구조를 예측하거나 렌더링의 속도를 향상시키는 것을 중심으로 이뤄지고 있으며, 단백질 가시화 형태에 따른 정보 전달 효용성에 대한 연구는 미비한 실정이다. 본 연구는 단백질 의약품에 대한 효율적인 정보 서비스의 사전 연구로써 단백질 1, 2차구조 혼합가시화 형태별 정보표현적합도를 분석하였다. 단백질 1, 2차구조 혼합가시화 형태는 대표적 가시화 서비스인 Chimera, PDB, Cn3D와 기존 가시화 서비스의 문제점을 개선한 단백질 1, 2차구조 혼합가시화 형태를 대상으로 하였다. 정보표현적합도를 구하기 위한 정보요인은 피험자 분석 결과를 바탕으로 단백질 1차구조, 아미노산 위치, 단백질 2차구조, 단백질 2차구조 비율정보로 구분하였으며, 피험자는 단백질 의약품 업계종사기간이 5년 이상인 전문가 집단을 대상으로 하였다. 그 결과 단백질 1, 2차구조 혼합가시화형태별 정보표현적합도에는 유의미한 차이가 있었으며, 가시화 형태별 정보 전달 효용성에 차이가 있음을 입증할 수 있었다.

Comparative Genomics of T-complex protein 10 like in Humans and Chimpanzees

  • Kim, Il-Chul;Kim, Dae-Soo;Kim, Dae-Won;Choi, Sang-Haeng;Choi, Han-Ho;Chae, Sung-Hwa;Park, Hong-Seog
    • Genomics & Informatics
    • /
    • 제3권2호
    • /
    • pp.61-65
    • /
    • 2005
  • Comparing 231 genes on chimpanzee chromosome 22 with their orthologous on human chromosome 21, we have found that 15 orthologs have indels within their coding sequences. It was rather surprising that significant number of genes have changed by indel, despite the shorter time since their divergence and led us hypothesize that indels and structural changes may represent one of the major mechanism of proteome evolution in the higher primates. Human T-complex protein 10 like (TCP 10L) is a representative having indel within its coding sequence. Gene structure of human TCP10L compared with chimpanzee TCP10L gene showed 16 base pair difference in genomic DNA. As a result of the indel, frame shift mutation occurs in coding sequence (CDS) and human TCP10L express longer polypeptide of 21 amino acid residues than that of chimpanzee. Our prediction found that the indel may affect to dramatic change of secondary protein structure between human and chimpanzee TCP10L. Especially, the structural changes in the C-terminal region of TCP10L protein may affect on the interacting potential to other proteins rather than DNA binding function of the protein. Through these changes, TCP10L might influence gene expression profiles in liver and testis and subsequently influence the physiological changes required in primate evolution.