• Title/Summary/Keyword: sequence analysis

Search Result 6,369, Processing Time 0.036 seconds

Characterization of Korean Erwinia carotovora Strains from Potato and Chinese Cabbage

  • Seo, Sang-Tae;Koo, Jun-Hak;Hur, Jang-Hyun;Lim, Chun-Keun
    • The Plant Pathology Journal
    • /
    • v.20 no.4
    • /
    • pp.283-288
    • /
    • 2004
  • Four Erwinia carotovora strains isolated from potatoes showing blackleg symptoms and rotted Chinese cabbage were analysed by biochemical tests and sequence analysis of 16S rDNA and 16S-23S rRNA intergenic spacer (IGS) regions, and the data were compared to related E. carotovora strains. Based on the results of the biochemical tests and sequence analysis, 2 of the 4 strains were identified as E. carotovora subsp. carotovora (Ecc), whereas the rest strains were distinct from Ecc. The last two strains, HCC3 and JEJU, were biochemically similar to E, carotovora subsp. atroseptica (Eca). However, the results of sequence analysis and Eca-specific PCR assays showed that the strains were distinct from Eca. On the basis of 16S rDNA sequence analysis, HCC3 and JEJU strains were placed in E. carotovora subsp. odorifera and E. carotovora subsp. wasabiae, respectively. The results of sequence analysis and specific PCR assay for Eca indicated that Asian Eca strains were distinct from European Eca strains, although they were phenotycally homogeneous.

An Approach for a Substitution Matrix Based on Protein Blocks and Physicochemical Properties of Amino Acids through PCA

  • You, Youngki;Jang, Inhwan;Lee, Kyungro;Kim, Heonjoo;Lee, Kwanhee
    • Interdisciplinary Bio Central
    • /
    • v.6 no.4
    • /
    • pp.3.1-3.10
    • /
    • 2014
  • Amino acid substitution matrices are essential tools for protein sequence analysis, homology sequence search in protein databases and multiple sequence alignment. The PAM matrix was the first widely used amino acid substitution matrix. The BLOSUM series then succeeded the PAM matrix. Most substitution matrixes were developed by using the statistical frequency of substitution between each amino acid at blocks representing groups of protein families or related proteins. However, substitution of amino acids is based on the similarity of physiochemical properties of each amino acid. In this study, a new approach was used to obtain major physiochemical properties in multiple sequence alignment. Frequency of amino acid substitution in multiple sequence alignment database and selected attributes of amino acids in physiochemical properties database were merged. This merged data showed the major physiochemical properties through principle components analysis. Using factor analysis, these four principle components were interpreted as flexibility of electronic movement, polarity, negative charge and structural flexibility. Applying these four components, BAPS was constructed and validated for accuracy. When comparing receiver operated characteristic ($ROC_{50}$) values, BAPS scored slightly lower than BLOSUM and PAM. However, when evaluating for accuracy by comparing results from multiple sequence alignment with the structural alignment results of two test data sets with known three-dimensional structure in the homologous structure alignment database, the result of the test for BAPS was comparatively equivalent or better than results for prior matrices including PAM, Gonnet, Identity and Genetic code matrix.

Partial Sequence Analysis of Puumala Virus M Segment from Bats in Korea

  • Yun, Bo-Kyoung;Yoon, Jeong-Joong;Lee, Yun-Tai
    • The Journal of Korean Society of Virology
    • /
    • v.29 no.1
    • /
    • pp.23-31
    • /
    • 1999
  • Hantavirus is a genus of the Bunyaviridae family causing two serious diseases, hemorrhagic fever with renal syndrome (HFRS) and hantavirus pulmonary syndrome (HPS). Puumala virus is a member of hantavirus originally found in Europe, and its natural reservoir is Clethrionomys glareolus. It is also associated with the human disease nephropathia epidemica, a milder form of HFRS. To identify the hantaviruses in bats, bats were collected from Jeong-Sun, Won-Joo, Chung-Ju and Hwa-Cheon area in Korea, and nested RT-PCR was performed with serotype specific primer from M segment. Interestingly, Puumala virus was detected in bats (Rhinolophus ferrum-equinum) only from Won-Joo. The 327 bp nested RT-PCR product, was sequenced. The sequence database search indicates that the sequence is homologous to the published sequence of Puumala viruses. The sequence similarities were ranged from 71% to 97%. The highest sequence similarity was 97% with Puumala virus Vranicam strain, and the lowest was 71% with Puumala virus K27 isolate. Puumala virus Vranicam strain was isolated from a bank vole (Clethrionomys glareolus) in Bosnia-Hercegovina. Puumala virus K27 was isolated from human in Russia. This analysis confirms that bats (Rhinolophus ferrum-equinum) in Korea are natural reservoir of Puumala virus.

  • PDF

Korean morphological analysis and phrase structure parsing using multi-task sequence-to-sequence learning (Multi-task sequence-to-sequence learning을 이용한 한국어 형태소 분석과 구구조 구문 분석)

  • Hwang, Hyunsun;Lee, Changki
    • Annual Conference on Human and Language Technology
    • /
    • 2017.10a
    • /
    • pp.103-107
    • /
    • 2017
  • 한국어 형태소 분석 및 구구조 구문 분석은 한국어 자연어처리에서 난이도가 높은 작업들로서 최근에는 해당 문제들을 출력열 생성 문제로 바꾸어 sequence-to-sequence 모델을 이용한 end-to-end 방식의 접근법들이 연구되었다. 한국어 형태소 분석 및 구구조 구문 분석을 출력열 생성 문제로 바꿀 시 해당 출력 결과는 하나의 열로서 합쳐질 수가 있다. 본 논문에서는 sequence-to-sequence 모델을 이용하여 한국어 형태소 분석 및 구구조 구문 분석을 동시에 처리하는 모델을 제안한다. 실험 결과 한국어 형태소 분석과 구구조 구문 분석을 동시에 처리할 시 형태소 분석이 구구조 구문 분석에 영향을 주는 것을 확인 하였으며, 구구조 구문 분석 또한 형태소 분석에 영향을 주어 서로 영향을 줄 수 있음을 확인하였다.

  • PDF

Korean morphological analysis and phrase structure parsing using multi-task sequence-to-sequence learning (Multi-task sequence-to-sequence learning을 이용한 한국어 형태소 분석과 구구조 구문 분석)

  • Hwang, Hyunsun;Lee, Changki
    • 한국어정보학회:학술대회논문집
    • /
    • 2017.10a
    • /
    • pp.103-107
    • /
    • 2017
  • 한국어 형태소 분석 및 구구조 구문 분석은 한국어 자연어처리에서 난이도가 높은 작업들로서 최근에는 해당 문제들을 출력열 생성 문제로 바꾸어 sequence-to-sequence 모델을 이용한 end-to-end 방식의 접근법들이 연구되었다. 한국어 형태소 분석 및 구구조 구문 분석을 출력열 생성 문제로 바꿀 시 해당 출력 결과는 하나의 열로서 합쳐질 수가 있다. 본 논문에서는 sequence-to-sequence 모델을 이용하여 한국어 형태소 분석 및 구구조 구문 분석을 동시에 처리하는 모델을 제안한다. 실험 결과 한국어 형태소 분석과 구구조 구문 분석을 동시에 처리할 시 형태소 분석이 구구조 구문 분석에 영향을 주는 것을 확인 하였으며, 구구조 구문 분석 또한 형태소 분석에 영향을 주어 서로 영향을 줄 수 있음을 확인하였다.

  • PDF

Cloning and Sequence Analysis of a Levansucrase Gene from Rahnella aquatilis ATCC15552

  • Kim, Hyun-Jin;Yang, Ji-Young;Lee, Hyeon-Gye;Cha, Jae-Ho
    • Journal of Microbiology and Biotechnology
    • /
    • v.11 no.4
    • /
    • pp.693-699
    • /
    • 2001
  • An intracellular levansucrase gene, lscR from Rahnella aquatilis ATCC 15552, was cloned and its nucleotide sequence was determined. Nucleotide sequence analysis of this gene revealed a 1,238 bp open reading frame coding for a protein of 415 amino acids. The levansucrase was expressed by using a T7 promoter in Escherichia coli BL21 (DE3) and the enzyme activity was detected in the cytoplasmic fraction. The optimum pH and temperature of this enzyme for levan formation was pH 6 and $30^{\circ}C$, respectively. The deduced amino acid sequence of the lscR gene showed a high sequence similarity (59-89%) with Gram-negative levansucrses, while the level of similarity with Gram-positive enzymes was less than 42%. Multiple alignments of levansucrase sequences reported from Gram-negative and Gram-positive bacteria revealed seven conserved regions. A comparison of the catalytic properties and deduced amino acid sequence of lscR with those of other bacterial levansucrases strongly suggest that Gram-negative and Gram-positive levansucrases have an overall different structure, but they have a similar structure at the active site.

  • PDF

The preverified test sequence generation method satisfying the completeness criteria (완전표준성을 만족하는 선행검증 시험열 생성방법에 관한 연구)

  • 박진호;양대헌;송주석;임상용
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.23 no.9A
    • /
    • pp.2383-2390
    • /
    • 1998
  • As network provides diverse functionalities recently, many rpotocol standards have become complex and many implementations have appeared. Such trends require us to test th econformance of implementations, called the conformance testing. Many researches have been performed on generating test sequence and on fualt masking base don T,U,D,W methods. At this jpoint, te new problem is suggeste dwhich is calle dthe completenes s criteria. The test sequences for the conformance testing have come up with this problem as well as fault masking. In this paper, we suggest the method of generating the preverified test sequence which can avoid the completeness criteria problem. The preverified test sequence is much more reliable than others by using the preverified edge. For the reliability of conformance testing, we define the immunity of the test sequence and provide the clue for the analysis of the test results using the immunity. The analysis of the results makes it possible for us to test the implementation again with more reliability. Also, the preverified test sequence is flexible so that it is combined with the fault-tolerant sequence for fault masking.

  • PDF

A study of PES layer splicing for system layer editing on MPEG-2 based images(compare & analysis) (MPEG-2 기반영상의 시스템 영역에서의 편집을 위한 PES 영역에서의 스트림 splicing에 관한 연구(비교. 분석))

  • 김동준;최윤식
    • Proceedings of the IEEK Conference
    • /
    • 2002.06d
    • /
    • pp.77-80
    • /
    • 2002
  • In this paper, We have studied for guaranteeing the clear display of MPEG-2 video sequence when conduct splicing of MPEG-2 system streams. we focus on the PES domain splicing considering video sequence. And we wish to make a base on the TS or PS domain splicing considering video sequence. For that, first, we compared and analyzed problems that is raised when different two PES streams are spliced and effects that affect the video sequence. And based on this analysis, we have searched for methods that resolve the cause of problems that can be happened in the display of video sequence directly in PES domain.

  • PDF

The Analysis on Scope and Sequence of Physical Education Major Curriculum In Korea Universities (체육교육학과 전공교육과정의 스코프 및 시퀀스 분석)

  • LEE, Eun-Hwa;KIM, In-Hyung
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.21 no.3
    • /
    • pp.436-450
    • /
    • 2009
  • The purpose of this study is to analyze scope and sequence of undergraduate curricula in the department of physical education. For this purpose, this paper has used the types of undergraduate subjects, which are based on analysis tools on the scope and the sequence of the Department of Education major curriculum by Kim and Lee(2005). The major results of this study were as follows. First, the proportion of major content knowledge is far more pedagogical content knowledge. Second, the scope of Physical Education major curriculum is too much stressed on 'the subjects of major content' and on 'the subjects of specific area' than 'the subjects of major skills' and 'comprehensive problem solving'. Third, the Physical Education major curriculum has shown the specific sequence; introduction/foundation courses and theory courses, application courses orderly. Whileas, application course and synthesis course are slim to none.