• 제목/요약/키워드: sequence length

검색결과 1,234건 처리시간 0.024초

시계열 분석 딥러닝 알고리즘을 적용한 낙동강 하굿둑 염분 예측 (Prediction of Salinity of Nakdong River Estuary Using Deep Learning Algorithm (LSTM) for Time Series Analysis)

  • 우정운;김연중;윤종성
    • 한국해안·해양공학회논문집
    • /
    • 제34권4호
    • /
    • pp.128-134
    • /
    • 2022
  • 낙동강 하굿둑은 올해 2022년 해수 유입기간을 매월 대조기마다로 확대, 하굿둑 상류 15 km 이내로 기수역 조성을 목표로 운영되고 있다. 목표 기수역 조성구간 및 염수피해 방지를 위한 신속한 의사결정을 위해 본 연구에서는 딥러닝 알고리즘 Long Short-Term Memory(LSTM)을 적용하여 낙동대교(하굿둑 상류 약 5 km)지점의 염분 예측을 수행하였다. 창녕·함안보 방류량 등 낙동강 하구역의 시·공간적 특성을 반영하기 위한 입력데이터를 구축하였으며, Sequence length에 따른 정도 변화를 통해 낙동강 하구역의 수리학적 특성을 고려한 최적모델을 구축하였다. 예측 정확도는 결정계수(R-squred)와 RMSE(root mean square error) 이용하여 통계분석을 실시하였으며. Sequence length가 12일 때 R-squred 0.997, RMSE 0.122로 가장 정도가 높았으며, 선행 예측시간은 12시간 간격까지 R -squred 0.93 이상으로 높은 정도를 보였다.

황복(Takifugu obscurus♀)과 자주복(T. rubripes♂) 교잡종의 형태 비교 및 분자분석 (Morphological Characteristics and Molecular Analysis of the Hybrid Takifugu obscurus♀ × T. rubripes♂)

  • 양서경;김형선;이진;한경호
    • 한국수산과학회지
    • /
    • 제56권5호
    • /
    • pp.708-715
    • /
    • 2023
  • Hybridization is a major production method used to combine beneficial traits from two different species to obtain a potentially dominant trait. In China, Takifugu obscurus and T. rubripes were artificially crossed, and the resulting hybrids had an average body weight 38.06-8.93% higher than that of the parental species, which enabled the hybrids to be grown in freshwater. This study aimed to provide the basic data necessary for the classification of T. obscurus♀×T. rubripes♂ hybrids in terms of economic value and market potential. Morphological comparing the morphology of hybrids and parental species, we discovered that the hybrids had intermediate traits of the parental species. In morphometrics, the hybrid index (HI) value of head length against standard length was close to the trait of T. rubripes, and the HI values of preanal length and predorsal length were close to those of T. obscurus; however, the HI values of nasal length, snout length, length of anal fin, length of pectoral fin, caudal peduncle depth and caudal peduncle length were found to be unique characteristics of the hybrids. Regarding molecular analysis, a 99.8% nucleotide sequence similarity was found between the hybrid and T. obscurus.

잣나무(Pinus koraiensis)의 cDNA library 제작 및 EST 분석 (Construction of a full-length cDNA library from Pinus koraiensis and analysis of EST dataset)

  • 김준기;임수빈;최선희;이종석;노승문;임용표
    • 농업과학연구
    • /
    • 제38권1호
    • /
    • pp.11-16
    • /
    • 2011
  • In this study, we report the generation and analysis of a total of 1,211 expressed sequence tags (ESTs) from Pinus koraiensis. A cDNA library was generated from the young leaf tissue and a total of 1,211 cDNA were partially sequenced. EST and unigene sequence quality were determined by computational filtering, manual review, and BLAST analyses. In all, 857 ESTs were acquired after the removal of the vector sequence and filtering over a minimum length 50 nucleotides. A total of 411 unigene, consisting of 89 contigs and 322 singletons, was identified after assembling. Also, we identified 77 new microsatellite-containing sequences from the unigenes and classified the structure according to their repeat unit. According to homology search with BLASTX against the NCBI database, 63.1% of ESTs were homologous with known function and 22.2% of ESTs were matched with putative or unknown function. The remaining 14.6% of ESTs showed no significant similarity to any protein sequences found in the public database. Gene ontology (GO) classification showed that the most abundant GO terms were transport, nucleotide binding, plastid, in terms biological process, molecular function and cellular component, respectively. The sequence data will be used to characterize potential roles of new genes in Pinus and provided for the useful tools as a genetic resource.

Cloning and Characterization of a new tobamovirus infecting Hibiscus rosa-sinensis

  • Srinivasan, L.K.G.;Wong, S.M.
    • 한국식물병리학회:학술대회논문집
    • /
    • 한국식물병리학회 2003년도 정기총회 및 추계학술발표회
    • /
    • pp.125.3-126
    • /
    • 2003
  • A near full-length sequence of a new tobamovirus infecting Hibiscus rosa-sinensis L. was determined. The genome consists of 58 nucleotides (nt) 5' UTR, followed by a 4.9 kb ORF which methyl transferase helicase domain (128 kDa), readthrough protein RNA dependent RNA polymerase (RdRp) 185 kDa and a 52 kDa protein. The 128 kDa protein had a maximum homology of 51.4 % to TMGMV and amino acids (an) were 54.3 % identical to TMV- vulgare strain. The 185 kDa RdRp had a maximum homology of 53.5% to TMV-Ob and KGMMV-Y and a 59.6% homology at the an level to CGMMV-SH. The MP gene encodes 282 aa and its theoretical molecular weight is 30.4 kDa. The nt and an sequence identities of MP ranged from 38.8% to 43.9% and 30.9% to 37.9%, respectively. The CP gene encodes 163 residues and with a theoretical molecular weight of 18.2 kDa The (nt) and aa sequences of the CP were 46.9 % to 51.6% and 45.3% to 57.1% identical to other tobamoviruses, respectively. The predicted virion origin of assembly (OAS) was located in the CP gene. Phylogenetic trees generated based on the nt and as sequences of RdRp, MP and CP genes indicated that this new virus clustered with subgroup II tobamoviruses. Although the CP ORF of this virus shared a high nt and aa sequence identity with Sunn-hemp mosaic virus (SHMV), Western analysis showed that it is serologically unrelated to SHMV. We propose the name Hibiscus virus S (HVS) for this Singapore isolate. This is the first report on a near full-length sequence of a Tobamovirus that infects hibiscus.

  • PDF

Expression of a Cu-Zn Superoxide Dismutase Gene in Response to Stresses and Phytohormones in Rehmannia Glutinosa

  • Park, Myoung-Ryoul;Ryu, Sang-Soo;Yoo, Nam-Hee;Yu, Chang-Yeon;Yun, Song-Joong
    • 한국약용작물학회지
    • /
    • 제13권5호
    • /
    • pp.270-275
    • /
    • 2005
  • Superoxide dismutases (SOD) are metalloenzymes that convert $O_2^-\;to\;H_2O_2$. Rehmannia glutinosa is highly tolerant to paraquat-induced oxidative stress. The primary objective of this study was to characterize regulation of SOD gene expression in R. glutinosa in response to oxidative stresses and hormones. A full-length putative SOD clone (RgCu-ZnSOD1) was isolated from the leaf cDNA library of R. glutinosa using an expressed sequence tag clone as a probe. RgCu-ZnSOD1 cDNA is 777 bp in length and contains an open reading frame for a polypeptide consisted of 152 amino acid residues. The deduced amino acid sequence of the clone shows highest sequence similarity to the cytosolic Cu-ZnSODs. The two to three major bands with several minor ones on the Southern blots indicate that RgCu-ZnSOD1 is a member of a small multi-gene family. RgCuZnSOD1 mRNA was constitutively expressed in the leaf, flower and root. The expression of RgCu-ZnSOD1 mRNA was increased about 20% by wounding and paraquat, but decreased over 50% by ethylene and $GA_3$. This result indicates that the RgCu-ZnSOD1 expression is regulated differentially by different stresses and phytohormones at the transcription level. The RgCu-ZnSOD1 sequence and information on its regulation will be useful in investigating the role of SOD in the paraquat tolerance of R. glutinosa.

단백질 서열정렬 정확도 예측을 위한 새로운 방법 (A new method to predict the protein sequence alignment quality)

  • 이민호;정찬석;김동섭
    • Bioinformatics and Biosystems
    • /
    • 제1권1호
    • /
    • pp.82-87
    • /
    • 2006
  • 현재 가장 많이 사용되는 단백질 구조 예측 방법은 비교 모델링 (comparative modeling) 방법이다. 비교 모델링 방법에서의 정확도를 높이기 위해서는 alignment의 정확도 역시 매우 필수적으로 필요하다. 비교 모델링 과정 중의 fold-recognition 단계에서 alignment의 정확도에 의해 template을 고르는 방법은 단지 가장 비슷한 template을 선택하는 방법에 비해 주목을 받지 못하고 있다. 최근에는 두 가지의 alignment에 사이의 shift 정보를 바탕으로 한 shift score라는 수치가 alignment의 성능을 표현하기 위해서 개발되었다. 우리는 더 정확한 구조 예측의 첫걸음이 될 수 있는 shift score를 예측하는 방법을 개발하였다. Shift score를 예측하기 위해 support vector regression (SVR)이 사용되었다. 사전에 구축된 라이브러리 안의 길이가 n 인 template과 구조를 알고 싶은 query 단백질 사이의 alignment는 n+2 차원의 input 벡터로 변환된다. Structural alignment가 가장 좋은 alignment로 가정되었고 SVR은 query 단백질과 template 단백질의 structural alignment과 profile-profile alignment 사이의 shift score를 예측하도록 training 되었다. 예측 정확도는 Pearson 상관계수로 측정되었다. Training 된 SVR은 실제의 shift score와 예측된 shift score 사이에 0.80의 Pearson 상관계수를 갖는 정도로 예측하였다.

  • PDF

Sequence analysis of ORF4 gene of porcine reproductive and respiratory syndrome virus (PRRSV) Korean isolate CNV-1

  • Park, Jee-yong;Lim, Bae-keun;Kim, Hyun-soo
    • 대한수의학회지
    • /
    • 제39권2호
    • /
    • pp.294-300
    • /
    • 1999
  • In this study PRRSV was isolated from serum of an infected pig and designated as CNV-1, ORF4 gene was sequenced, and the nucleotide sequence, deduced amino acid sequence and the amino acid sequence of the neutralizing domain was compared with other PRRSV Strains. ORF4 gene of the Korean isolate PRRSV CNV-1 was shown to be 537bp in length, which is the same as US strain ISU55 but 21bp longer than another US strain MN1b, and 15bp shorter than European strain LV. The homologies of the nucleotide sequences between the Korean isolate CNV-1 and the US strains ISU55, MN1b and European strain LV were 91.8%, 88.1%, 67.6%, respectively, and the homologies of the deduced amino acid sequences were 94.4%, 84.4%, 68.5%, respectively. The neutralizing domain of the CNV-1 was shown to be 36 amino acids in length which is the same as ISU55, MN1b, but 4 amino acids shorter than that of the neutralizing domain reported in LV. The homologies of the amino acid sequences of the neutralizing domain between the Korean isolate CNV-1 and the US strains ISU55, MN1b and European strain LV were 92.5%, 85%, 57.5%, respectively. The molecular characteristics of ORF4 gene of the Korean isolate PRRSV CNV-1 shown in this study suggests that the CNV-1 is genetically closer to the US strains. Also the wide variation of the neutralizing domain between the CNV-1 and LV suggests that there is substantial immunogenic variation between the two strains.

  • PDF

Characterization of Expressed Sequence Tags (ESTs) Generated from the Bombyx mandarina Whole Larvae and Molecular Cloning of Serine Protease Homologue Gene

  • Hwang, Jae Sam;Yun, Eun Young;Goo, Tae Won;Kim, Iksoo;Choi, Kwang Ho;Seong, Su Il;Kim, Keun Young;Lee, Sang Mong;Kang, Seok Woo
    • International Journal of Industrial Entomology and Biomaterials
    • /
    • 제9권2호
    • /
    • pp.167-171
    • /
    • 2004
  • We constructed an oligo-d(T) primed directional cDNA library from the Bombyx mandarina whole larvae. In an effort to isolate genes expressed in the B. mandarina, 227 expressed sequence tags (ESTs) were generated by single-pass sequencing from the cDNA library. Sequence analysis showed that 107 clones (47.1%) were classified into known genes and 120 clones (52.9%) were novel transcripts, which are unknown for their function. Of the 107 known genes, the most abundant gene was found to be actin and followed by serine protease in the expression profile. Among these clones, a serine protease homolog (BmSP) which is a class of proteolytic enzymes isolated. Full-length sequence of the BmSP cDNA clone was 922 bp in length and has an open reading frame of 276 amino acids. The conserved histidine, aspatic acid and serine residues forming the catalytic center as well as cysteine residues contributing to three disulphide bonds also were found in Bmsp gene. mRNA expression analysis revealed a high and specific expression of the gene only in midgut tissue, suggesting that BmSP gene is closely associated with the expression of digestive enzyme.

5지 신호교차로에서의 안전을 고려한 신호현시 설계 (Safety Enhanced Signal Phase Sequence Design of a Rotary with Five Leg Intersection)

  • 박재완;김진태;장명순
    • 대한교통학회지
    • /
    • 제20권7호
    • /
    • pp.23-29
    • /
    • 2002
  • 일반적으로 4지 또는 3지의 교차로가 설계·운영되고 있으나 적지 않게 5지 또는 그 이상 의 원형 신호교차로가 실질적으로 이용되고 있다. 접근로 수에 따른 교차로 형태별 분류에 의하면 5지 이상의 교차로에서의 상충지 점의 수는 4지 교차로의 그것보다 월등히 높아 설계지침에서도 4지 이하의 교차로를 설계할 것을 권하고 있으며 마찬가지로 5지 원형 신호교차로에서도 그 상충지점 수가 많다. 이러한 이유로 5지 신호 교차로의 신호 설계는 교통소통이 아닌 교통안전측면에서 신호현시 순서 및 길이가 결정되어야 할 필요가 있으며 또 그러한 현시순서는 교차로 내 원활한 교통의 흐름을 심각한 수준으로 방해하지 않을 필요가 있다. 본 연구에서는 5지 신호교차로의 안전을 고려한 현시순서설계 방안을 제시한다. 울산광역시에서 운영중인 공업탑 5지 신호교차로를 대상으로 현장자료를 수집하였으며, 신호시간 설계모형은 TRANSYT-7F를 적용했다. TRANSYT-7F에서의 최적 신호현시의 길이를 토대로 기본적으로 "한 현시에 2개 교통류의 이동" 원칙에 따라 재배열하였다. 제안된 방법으로 보정된 신호현시 순서 및 길이를 사용하여 모의실험한 결과 TRANSYT-7F에서 제시한 최적 신호현시 순서 및 길이를 적용한 것에 비하여 평균 6.2%지체도 증가가 있었으나 교차로 내 상충수를 61.5% 줄이는 결과를 도출하였다.

이상 탐지를 위한 시스템콜 시퀀스 임베딩 접근 방식 비교 (Comparison of System Call Sequence Embedding Approaches for Anomaly Detection)

  • 이근섭;박경선;김강석
    • 융합정보논문지
    • /
    • 제12권2호
    • /
    • pp.47-53
    • /
    • 2022
  • 최근 지능화된 보안 패러다임의 변화에 따라, 다양한 정보보안 시스템에서 발생하는 각종 정보를 인공지능 기반 이상탐지에 적용하기 위한 연구가 증가하고 있다. 따라서 본 연구는 로그와 같은 시계열 데이터를 수치형 특성인 벡터로 변환하기 위하여 딥러닝 기반 Word2Vec 모델의 CBOW와 Skip-gram 추론 방식과 동시발생 빈도 기반 통계 방식을 사용하여 공개된 ADFA 시스템콜 데이터에 대하여, 벡터의 차원, 시퀀스 길이 및 윈도우 사이즈를 고려한 다양한 임베딩 벡터로의 변환에 대한 실험을 진행하였다. 또한 임베딩 모델로 생성된 벡터를 입력으로 하는 GRU 기반 이상 탐지 모델을 통해 탐지 성능뿐만 아니라 사용된 임베딩 방법들의 성능을 비교 평가하였다. 통계 모델에 비해 추론 기반 모델인 Skip-gram이 특정 윈도우 사이즈나 시퀀스 길이에 치우침 없이 좀 더 안정되게(stable) 성능을 유지하여, 시퀀스 데이터의 각 이벤트들을 임베딩 벡터로 만드는데 더 효과적임을 확인하였다.