• Title/Summary/Keyword: 서열 (sequence)

Search Result 3,677, Processing Time 0.025 seconds

A CNV detection algorithm based on statistical analysis of the aligned reads (정렬된 리드의 통계적 분석을 기반으로 하는 CNV 검색 알고리즘)

  • Hong, Sang-Kyoon;Hong, Dong-Wan;Yoon, Jee-Hee;Kim, Baek-Sop;Park, Sang-Hyun
    • The KIPS Transactions:PartD / v.16D no.5 / pp.661-672 / 2009
  • Recent studies have found that structural variations such as copy number variation (CNV) exist in the human genome and that they are closely related to disease susceptibility, response to treatment, and genetic characteristics. In this paper we propose a new CNV detection algorithm that uses millions of short DNA sequences generated by giga-sequencing technology. Our method maps the DNA sequences onto the reference sequence and obtains the occurrence frequency of each read in the reference sequence. It then analyzes the distribution of the occurrence frequency and reports statistically significant regions longer than 1 kbp as candidate CNV regions. To select a proper read alignment method, several methods are employed in our algorithm and their performance is compared. To verify the superiority of our approach, we performed extensive experiments. Simulation experiments using a reference sequence (NCBI build 35) showed that our approach successfully finds all the CNV regions, which have various shapes and arbitrary lengths (small, intermediate, or large).
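
The detection step described in this abstract can be illustrated with a minimal sketch. The abstract does not state which statistical test the authors use, so the windowed z-score below is an assumption made only for illustration, and the function name `candidate_cnv_regions` and its parameters (`window`, `z_cutoff`, `min_len`) are hypothetical. The input is a per-position read depth list, standing in for the occurrence frequency obtained after mapping.

```python
from statistics import mean, stdev


def candidate_cnv_regions(depth, window=100, z_cutoff=2.0, min_len=1000):
    """Return (start, end, kind) half-open regions with unusual read depth.

    depth: per-position read depth along the reference (hypothetical input).
    The z-score test is an assumed stand-in for the paper's statistics.
    """
    n_win = len(depth) // window
    if n_win < 2:
        return []
    # Mean depth in consecutive non-overlapping windows.
    win_means = [mean(depth[i * window:(i + 1) * window]) for i in range(n_win)]
    mu, sigma = mean(win_means), stdev(win_means)
    if sigma == 0:
        return []  # perfectly flat coverage: nothing stands out

    # Label each window as a gain, a loss, or unremarkable (None).
    labels = []
    for m in win_means:
        z = (m - mu) / sigma
        labels.append("gain" if z > z_cutoff else "loss" if z < -z_cutoff else None)

    # Merge runs of identically labelled windows; keep runs of at least min_len bp.
    regions, start, kind = [], None, None
    for i, lab in enumerate(labels + [None]):
        if lab != kind:
            if kind is not None and (i - start) * window >= min_len:
                regions.append((start * window, i * window, kind))
            start, kind = i, lab
    return regions
```

For example, a synthetic depth profile of 30 with a 2,000 bp stretch at depth 60 yields one "gain" region covering that stretch, while a 500 bp stretch is discarded by the 1 kbp length filter.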

Analysis and evaluation of morphological and molecular polymorphism in the hybridization of Elaeagnus ×maritima and E. ×submacrophylla (잡종 기원 녹보리똥나무와 큰보리장나무의 형태학적 및 분자적 다양성 분석 및 평가)

  • Young-Jong JANG;Dong Chan SON;Kang-Hyup LEE;Jung-Hyun LEE;Boem Kyun PARK
    • Korean Journal of Plant Taxonomy / v.53 no.2 / pp.126-147 / 2023
  • The taxonomic identity of Elaeagnus ×maritima and E. ×submacrophylla (Elaeagnaceae) in Korea is unclear, yet they are presumed to be hybrid taxa based on their morphology. To determine their hybrid origins, a morphological analysis (field surveys and specimen examinations) and a molecular analysis involving two nuclear ribosomal DNA (nrDNA) regions (internal transcribed spacer and 5S non-transcribed spacer) and one chloroplast DNA (cpDNA) region (matK) were conducted. The morphological analysis revealed that E. ×maritima showed certain morphological similarities to E. glabra, whereas E. ×submacrophylla showed certain morphological similarities to E. pungens. However, the molecular analysis indicated that E. ×maritima exhibited additive species-specific sites of E. glabra and E. macrophylla in the nrDNA regions. Notably, E. ×submacrophylla showed various aspects, with some individuals exhibiting additive species-specific sites of E. pungens and E. macrophylla in the nrDNA and E. macrophylla sequences in the cpDNA regions, some individuals exhibiting E. macrophylla sequences in the nrDNA and E. pungens sequences in the cpDNA regions, and some individuals displaying E. macrophylla sequences in both the nrDNA and cpDNA regions, despite an intermediate morphology between E. pungens and E. macrophylla. These results indicate that these two species are of hybrid origin and frequently cross between parental and hybrid individuals.

Molecular Phylogenetic Study of Nesiohelix samarangae Based on CO-I Gene (동양달팽이 (Nesiohelix samarangae)의 CO-I 유전자를 이용한 분자계통학적 연구)

  • Bang, In Seok;Lee, Yong Seok
    • The Korean Journal of Malacology / v.30 no.4 / pp.391-397 / 2014
  • Previously, we reported an expressed sequence tag (EST) analysis of the land snail Nesiohelix samarangae (Ns). Among these ESTs, we identified four partial fragments of the N. samarangae cytochrome oxidase I (NsCO-I) gene, which together yielded an 852 bp partial cDNA. Since NsCO-I is one of the best-known molecular phylogenetic markers, we conducted a comparative in silico analysis using the NsCO-I gene. The combined results of BLAST analyses, multiple sequence alignment, and a molecular phylogenetic study of the NsCO-I cDNA indicate that N. samarangae is similar to three land snails: Elona quimperiana, Euhadra herklotsi, and Euhadra idzumonis.

Nucleotide Sequence and Inducibility Analysis of Chloramphenicol Acetyltransferase Gene from Staphylococcus aureus R-plasmid pSBK203 (Staphylococcus aureus에서 분리된 R-plasmid pSBK203상의 chloramphenicol acetyltransferase 인자의 염기서열 및 유발성 분석)

  • 권동현;변우현
    • Korean Journal of Microbiology / v.27 no.3 / pp.194-200 / 1989
  • The nucleotide sequence of the inducible chloramphenicol acetyltransferase (CAT) gene isolated from pSBK203, a small plasmid of Staphylococcus aureus, was determined. The base sequence shows that the structural gene of pSBK203-CAT encodes a protein of 213 amino acids and has an upstream leader region that encodes a short polypeptide of 9 amino acids. The [³⁵S]-methionine-labelled CAT gene product in minicells showed almost the same mobility on polyacrylamide gel electrophoresis as pC194-CAT, whose molecular weight is 24 kDa. The predicted amino acid sequence of pSBK203-CAT showed a higher degree of homology with the CATs of pC194 and pC221 than with those of cat-86, Tn9, and Proteus mirabilis PM13.

Prediction of Core Promoter Region with Dependency-Reflecting Decomposition Model (의존성 반영 분해모델에 의한 유전자의 핵심 프로모터 영역 예측)

  • 김기봉;박기정;공은배
    • Journal of KIISE:Software and Applications / v.30 no.3_4 / pp.379-387 / 2003
  • Many microbial genome projects have been completed, producing an enormous amount of genomic sequence data. In this context, the problem of identifying promoters in genomic DNA sequences by computational methods has attracted considerable research attention in recent years. In this paper, we propose a new model of the prokaryotic core promoter region, including the -10 region and the transcription initiation site: the Dependency-Reflecting Decomposition Model (DRDM), which captures the most significant biological dependencies between positions (allowing for non-adjacent as well as adjacent dependencies). DRDM performed well in performance tests and can be employed effectively to predict promoters in long microbial genomic contigs.

Data Mining Techniques for Analyzing Promoter Sequences (프로모터 염기서열 분석을 위한 데이터 마이닝 기법)

  • 김정자;이도헌
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2000.10a / pp.328-332 / 2000
  • As DNA sequences have become known through the Genome Project, techniques for handling molecular-level gene information are being actively researched. Given the sheer volume of known sequence information, it is also urgent to develop new computer algorithms for building databases and analyzing them efficiently. In this respect, this paper studies association rule search algorithms for finding the characteristics revealed by associations between promoter sequences and genes, which is one of the important research areas in molecular biology. This paper treats biological data, whereas previous search algorithms used transaction data. We therefore design a transformed association rule algorithm that covers the data types and biological properties. These research results will contribute to reducing the time and cost of biological experiments by minimizing their candidates.

Searching Method for New Small RNA in Bacillus subtilis Using Bioinformation (생물정보를 이용하여 바실러스 서브틸리스에서 새로운 Small RNA를 예측하는 방법)

  • Lee, Sang-Soo
    • The Journal of Natural Sciences / v.18 no.1 / pp.47-53 / 2007
  • To find novel sRNAs in Bacillus subtilis that may be used to adapt to various conditions, we searched the whole genome of Bacillus subtilis using the following procedure. First, the locations of recognition sequences of transcription factors such as PerR, OhrR, Fur, and Zur were searched in the intergenic regions of the Bacillus subtilis genome, and the locations of rho-independent transcription terminator sites were also determined. Based on these locations, sRNA candidates were chosen where a transcription factor recognition site and a rho-independent transcription terminator site lay close together (less than 300 bp apart). Then transcription promoter sites were searched in the regions of these sRNA candidates, and good sRNA candidates regulated by PerR (5), OhrR (1), Fur (1), and Zur (1) were found.
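
The proximity filter in this procedure (a transcription factor recognition site followed closely by a rho-independent terminator) can be sketched as follows. This is only an illustration under stated assumptions: positions are taken to be forward-strand coordinates already restricted to intergenic regions, the terminator is assumed to lie downstream of the recognition site, and the function name `srna_candidates` and its arguments are hypothetical. Only the 300 bp threshold comes from the abstract.

```python
from bisect import bisect_left


def srna_candidates(binding_sites, terminators, max_gap=300):
    """Pair each recognition site with the nearest downstream terminator.

    binding_sites: list of (factor_name, position) tuples (hypothetical input).
    terminators:   list of terminator positions (hypothetical input).
    Returns (factor_name, site_position, terminator_position) for pairs
    that are less than max_gap bp apart.
    """
    terms = sorted(terminators)
    candidates = []
    for factor, pos in binding_sites:
        i = bisect_left(terms, pos)  # first terminator at or after the site
        if i < len(terms) and terms[i] - pos < max_gap:
            candidates.append((factor, pos, terms[i]))
    return candidates
```

A binary search over the sorted terminator positions keeps the pairing at O(n log m) for n sites and m terminators; a real pipeline would also need to handle the reverse strand, which this sketch leaves out.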

Species Diversity of Forest Vegetation in Mt.Jangan, Chollabuk-do (전라북도 장안산 삼림식생의 종다양성)

  • Kim, Chang-Hwan;Myung, Hyun;Shin, Byung-Chuel
    • Korean Journal of Environment and Ecology / v.13 no.3 / pp.271-279 / 1999
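  • Richness, heterogeneity, evenness, and dominance indices were calculated for ten communities classified by a phytosociological survey at 72 community sites on Mt. Jangan, Jeollabuk-do: the Quercus mongolica (신갈나무) community, Q. mongolica-Rhododendron schlippenbachii (철쭉꽃) community, Q. mongolica-Symplocos chinensis f. pilosa (노린재나무) community, Q. mongolica-Quercus serrata (졸참나무) community, Q. serrata community, Quercus variabilis (굴참나무) community, Carpinus laxiflora (서어나무) community, Fraxinus rhynchophylla (물푸레나무) community, Cornus controversa (층층나무) community, and Fraxinus mandshurica (들메나무) community. Changes in species diversity with altitude, soil characteristics, and dominant species group were analyzed, and species sequence-importance value curves were used to determine the dominance rank of each plant and how each species apportions resources within the plant community. Differences in altitude, soil factors (pH, base), and dominant species acted as important variables affecting forest species diversity, and the variation in diversity among dominant species groups was influenced by topography and disturbance. In the species sequence-importance value curves, the ten communities surveyed approached a lognormal distribution; thus, despite slight differences among communities, on the whole no particular species monopolized the resource space within a community, and resources were shared appropriately.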
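
The four kinds of index named in this abstract can be sketched for a single community as follows. The abstract does not say which formulas the authors used, so the choices here are assumptions: species count for richness, the Shannon index for heterogeneity, Pielou's index for evenness, and the Simpson index for dominance. The function name `diversity_indices` and the species keys are hypothetical. The ranked list corresponds to the ordering plotted in a species sequence-importance value curve.

```python
import math


def diversity_indices(importance):
    """Compute diversity indices from species importance values.

    importance: dict mapping species name -> importance value or abundance
    (hypothetical input). Index definitions are assumed, not the paper's.
    """
    items = [(sp, v) for sp, v in importance.items() if v > 0]
    if not items:
        raise ValueError("at least one species with a positive value is required")
    total = sum(v for _, v in items)
    p = [v / total for _, v in items]  # relative importance of each species

    richness = len(p)                               # species count S
    shannon = -sum(pi * math.log(pi) for pi in p)   # Shannon H'
    evenness = shannon / math.log(richness) if richness > 1 else 0.0  # Pielou J'
    dominance = sum(pi * pi for pi in p)            # Simpson's index
    ranked = sorted(items, key=lambda kv: kv[1], reverse=True)
    return {
        "richness": richness,
        "shannon": shannon,
        "evenness": evenness,
        "dominance": dominance,
        "ranked": ranked,
    }
```

An evenness near 1 together with a low dominance value matches the abstract's conclusion that no single species monopolizes the resource space of a community.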

Genetic Diversity of Hepatitis C Virus in Korea (한국내 C형 간염바이러스의 유전적 다양성)

  • Kim, Hyun-Sung;Choe, Joon-Ho;Lee, Hyo-Suk
    • The Journal of Korean Society of Virology / v.26 no.1 / pp.31-45 / 1996
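  • Hepatitis C virus (HCV) shows nucleotide sequence diversity among individuals, and this genetic diversity has been thought to be closely associated with clinicopathological symptoms. In this study, we analyzed the distribution and diversity of HCV in Korea by sequencing the HCV E1 and NS5B regions, and constructed phylogenetic trees to confirm the evolutionary distances among HCVs. Sequencing was performed by cloning the gene products obtained through RT-PCR and PCR from 56 HCV-positive sera obtained from Seoul National University Hospital and Chungnam National University Hospital. HCV RNA was detected in 53 of the 56 sera. Analysis of these 53 samples showed that genotypes 1a, 1b, 2a, 2b, and 7a were distributed at 5.7, 45.3, 45.3, 1.9, and 1.9%, respectively, and that types 1b and 2a are the major HCV genotypes in Korea. This study is the first report to show through sequence analysis that type 2a, like type 1b, is distributed at high frequency in Korea and that types 1a and 7a are also present, albeit at low frequency.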

Unification System for Analysis of DNA Sequence (DNA 서열 분석을 위한 통합 시스템)

  • Song, Young-Ohk;Chang, Duk-Jin
    • The Journal of the Korea Contents Association / v.11 no.3 / pp.65-72 / 2011
  • With the advent of advanced technology, practical applications of genetic information are appearing one after another. As much research and development builds on the analysis of biological data, and as bioinformatics sets many goals aimed at finding new relationships and information, tools that help interpret data correctly are increasingly needed. In this paper, we develop a system that compensates for the shortcomings of the various existing data analysis tools in order to offer users a more convenient research tool. We therefore designed it to provide, in a single integrated environment rather than in separate ones, the tasks required for biological data analysis, such as ORF extraction, biological information retrieval, and similarity comparison, and to supply the continuity between tasks that existing analysis systems lack.