• Title/Summary/Keyword: sequence data

Search Result 3,115, Processing Time 0.033 seconds

Protein Ontology: Semantic Data Integration in Proteomics

  • Sidhu, Amandeep S.;Dillon, Tharam S.;Chang, Elizabeth;Sidhu, Baldev S.
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.388-391
    • /
    • 2005
  • The Protein Structural and Functional Conservation need a common language for data definition. With the help of common language provided by Protein Ontology the high level of sequence and functional conservation can be extended to all organisms with the likelihood that proteins that carry out core biological processes will again be probable orthologues. The structural and functional conservation in these proteins presents both opportunities and challenges. The main opportunity lies in the possibility of automated transfer of protein data annotations from experimentally traceable model organisms to a less traceable organism based on protein sequence similarity. Such information can be used to improve human health or agriculture. The challenge lies in using a common language to transfer protein data annotations among different species of organisms. First step in achieving this huge challenge is producing a structured, precisely defined common vocabulary using Protein Ontology. The Protein Ontology described in this paper covers the sequence, structure and biological roles of Protein Complexes in any organism.

  • PDF

An Approach for a Substitution Matrix Based on Protein Blocks and Physicochemical Properties of Amino Acids through PCA

  • You, Youngki;Jang, Inhwan;Lee, Kyungro;Kim, Heonjoo;Lee, Kwanhee
    • Interdisciplinary Bio Central
    • /
    • v.6 no.4
    • /
    • pp.3.1-3.10
    • /
    • 2014
  • Amino acid substitution matrices are essential tools for protein sequence analysis, homology sequence search in protein databases and multiple sequence alignment. The PAM matrix was the first widely used amino acid substitution matrix. The BLOSUM series then succeeded the PAM matrix. Most substitution matrixes were developed by using the statistical frequency of substitution between each amino acid at blocks representing groups of protein families or related proteins. However, substitution of amino acids is based on the similarity of physiochemical properties of each amino acid. In this study, a new approach was used to obtain major physiochemical properties in multiple sequence alignment. Frequency of amino acid substitution in multiple sequence alignment database and selected attributes of amino acids in physiochemical properties database were merged. This merged data showed the major physiochemical properties through principle components analysis. Using factor analysis, these four principle components were interpreted as flexibility of electronic movement, polarity, negative charge and structural flexibility. Applying these four components, BAPS was constructed and validated for accuracy. When comparing receiver operated characteristic ($ROC_{50}$) values, BAPS scored slightly lower than BLOSUM and PAM. However, when evaluating for accuracy by comparing results from multiple sequence alignment with the structural alignment results of two test data sets with known three-dimensional structure in the homologous structure alignment database, the result of the test for BAPS was comparatively equivalent or better than results for prior matrices including PAM, Gonnet, Identity and Genetic code matrix.

Study on the Generation of Inaudible Binary Random Number Using Canonical Signed Digit Coding (표준 부호 디지트 코딩을 이용한 비가청 이진 랜덤 신호 발생에 관한 연구)

  • Nam, MyungWoo;Lee, Young-Seok
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.8 no.4
    • /
    • pp.263-269
    • /
    • 2015
  • Digital watermarking is imperceptible and statistically undetectable information embeds into digital data. Most information in digital audio watermarking schemes have used binary random sequences. The embedded binary random sequence distorts and modifies the original data while it plays a vital role in security. In this paper, a binary random sequence to improve imperceptibility in perceptual region of the human auditory system is proposed. The basic idea of this work is a modification of a binary random sequence according to the frequency analysis of adjacent binary digits that have different signs in the sequence. The canonical signed digit code (CSDC) is also applied to modify a general binary random sequence and the pair-matching function between original and its modified version. In our experiment, frequency characteristics of the proposed binary random sequence was evaluated and analyzed by Bark scale representation of frequency and frequency gains.

Sequencing of panfacial fracture surgery: a literature review and personal preference

  • Jae Hee, Yoon;Dong Hee, Kang;Hyonsurk, Kim
    • Archives of Craniofacial Surgery
    • /
    • v.23 no.6
    • /
    • pp.256-261
    • /
    • 2022
  • Background: Treating panfacial fractures (PFFs) can be extremely difficult even for experienced surgeons. Although several authors have attempted to systemize the surgical approach, performing surgery by applying a unidirectional sequence is much more difficult in practice. The purpose of this study was to review the literature on PFF surgery sequence and to understand how different surgical specialists-plastic reconstructive surgery (PRS) and oral maxillofacial surgery (OMS)-chose sequence and review PFFs fixation sequence in clinical cases. Methods: The PubMed and Google Scholar databases were scoured for publications published up until May 2020. Data extracted from the studies using standard templates included fracture part, fixation sequence, originating specialist, and the countries. Bibliographic details like author and year of publication were also extracted. Also, we reviewed the data for PFFs patients in the Trauma Registry System of Dankook University Hospital from 2011 to 2021. Results: In total, 240 articles were identified. This study comprised 22 studies after screening and full-text analysis. Sixteen studies (12 OMS specialists and 4 PRS specialists) used a "bottom-top" approach, whereas three studies (1 OMS specialist and 2 PRS specialists) used a "top-bottom" method. However, three studies (only OMS specialists) reported on both sequences. In our hospital, there were a total of 124 patients with PFF who were treated during 2011 to 2021; 64 (51.6%) were in upper-middle parts, 52 (41.9%) were in mid-lower parts, and eight (6.5%) were in three parts. Conclusion: Bottom-top sequencing was mainly used in OMS specialists, and top-bottom sequencing was used at a similar rate by two specialists in literature review. In our experience, however, it was hard to consistently implement unidirectional sequence suggested by a literature review. We realigned the reliable and stable buttresses first with tailoring individually for each patient, rather than proceeding in the unidirectional sequence like bottom-top or top-bottom.

The Scene Analysis and Keyframe Extraction for Content-Based Indexing on Compressed Image Sequence (압축된 영상 시퀀스에서 내용 기반 색인을 위한 장면 분석 및 키 프레임 추출)

  • 오상헌;김상렬;김주도;이근영
    • Proceedings of the IEEK Conference
    • /
    • 1999.11a
    • /
    • pp.605-608
    • /
    • 1999
  • In this paper, we proposed several scene analysis algorithms. These algorithms using image difference and histogram operate on the sequence of DC coefficient which is extracted from Motion JPEG or MPEG without full-frame decompression. Since DC sequence has the most information of full frame while it has reduced data. Experimental results show less than 1/64 of full frame analysing complexity and exactly analyze scene changes and extract key frames.

  • PDF

A WISE/GALEX View of Red Sequence Galaxies

  • Ko, Jong-Wan;Hwang, Ho-Seong;Sohn, Young-Jong
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.37 no.1
    • /
    • pp.37.1-37.1
    • /
    • 2012
  • We present mid-IR (MIR) and near-UV (NUV) properties of red sequence galaxies defined by optical color-magnitude relation. We use the Wide-field Infrared Survey Explorer (WISE) preliminary released data matched with the SDSS DR7/GALEX GR6. The red sequence galaxies with little emission lines show a wide spread of MIR (3.4um-12um) colors, implying a variety of MIR excess emission. We focus on the properties of the red sequence galaxies with MIR excess, comparing the properties of post-starburst galaxies to trace how galaxies migrate to the red sequence.

  • PDF

A Study on the Occurrence Cause of the Zero Sequence Voltage at Tertiary Side of 345[kV] Main Transformer (345[kV] 주변압기 3차측 영상전압 발생원인 분석 및 대책에 관한 연구)

  • Woo, Jung-Wook;Kweon, Dong-Jin
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.24 no.9
    • /
    • pp.150-158
    • /
    • 2010
  • We had experienced the several system failures which were concerned about zero sequence voltages at the tertiary side of 345[kV] transformers. In this paper, we had considered the zero sequence voltage and its reduction methods at the 345[kV] auto-transformer tertiary. For analysis, we used EMTP(Electro-Magnetic Transients Program). The calculation results by EMTP were compared with the measured data of the field tests. From the calculation and the field tests, we had verified that it is due to pre-saturation characteristics of the potential transformer. So, we had improved its saturation characteristics and applied it at real site. After improvements, we could reduce the zero sequence voltage below the setting level of the relay.

Sequence Analysis of NS4 Region of HCV Isolated from Korean Patient

  • Paik, Sang-Hoon;Lee, Young-Ik;Kim, Won-Bae;Yang, Jai-Myung
    • Journal of Microbiology
    • /
    • v.33 no.3
    • /
    • pp.260-266
    • /
    • 1995
  • Hepatitis C virus (HCV) has been considered as a mojor causative agent of post-transfusion related non-A, non-B hepatitis. In this study, the cDNA sequence of NS4 region of HCV (HCV-S) obtained from a Korean patient's plasms was determined. Comparative nucleotide sequence analysis between to type II. 67.2% homology to type III, and 66.4% homology to type IV. The putative amino acid sequence homologies to types I, II, III, and IV were 82.8-84.7%, 92.5-95.1%. 72.5, and 71.1%, respectively. This data strongly suggests that HCV-S should be classified as type II. Significant similarities of hydrophobicity profiles and putative transmembranous domains were found in HCV-S and four major prototypes, indicating that the protein structure is similar in spite of the heterogeneities of intertype homologies at the level of the psrimary nucleotide and amino acid sequences.

  • PDF

Online Selective-Sample Learning of Hidden Markov Models for Sequence Classification

  • Kim, Minyoung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.15 no.3
    • /
    • pp.145-152
    • /
    • 2015
  • We consider an online selective-sample learning problem for sequence classification, where the goal is to learn a predictive model using a stream of data samples whose class labels can be selectively queried by the algorithm. Given that there is a limit to the total number of queries permitted, the key issue is choosing the most informative and salient samples for their class labels to be queried. Recently, several aggressive selective-sample algorithms have been proposed under a linear model for static (non-sequential) binary classification. We extend the idea to hidden Markov models for multi-class sequence classification by introducing reasonable measures for the novelty and prediction confidence of the incoming sample with respect to the current model, on which the query decision is based. For several sequence classification datasets/tasks in online learning setups, we demonstrate the effectiveness of the proposed approach.

Analysis for Diagnosis of Patients with Cerebral Infarction by Sequence Modeling (순차규칙 모델링을 활용한 뇌경색증 환자 진단 분석)

  • Shin, A.M.;Park, H.J.;Lee, I.H.;Kim, Y.N.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.2 no.1
    • /
    • pp.51-56
    • /
    • 2009
  • This study was tried to analyze the diagnosis of patients with cerebral infarction by sequence modeling that was one of data mining analysis method and find out previous disease or complication of patients with cerebral infarction. Mass data that the diagnosis code of cerebral infarction was 163 in 2000 to 2007 were extracted from A hospital's database and then the data mart was constructed for analysis. Total 2,267 patients illnesses were diagnosed as cerebral infarction and 32,692 cases related diagnosis were extracted. Sequence modeling in Clementine 12.0 program was used to analyze diagnosis of patients with cerebral infarction and 8 meaningful rules were found in this paper. This result could be used as a basic data to make secondary cerebral infarction prevention program and to prevent complication of cerebral infarction.

  • PDF