• Title/Abstract/Keywords: Noncoding DNA sequence

34 search results (processing time: 0.021 s)

DNA Information Hiding Method for DNA Data Storage

  • 이석환;권기룡
    • 전자공학회논문지 / Vol. 51, No. 10 / pp. 118-127 / 2014
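  • DNA data storage, which stores large volumes of digital data in the base sequences of DNA, is regarded as a next-generation information storage medium. This paper proposes a DNA steganography-based method for storing information in noncoding DNA sequences. The proposed method converts encrypted data into a data base sequence using an integer conversion table and then hides it in the noncoding base sequence using a hiding key composed of seed information and a sector length. As a result, the genetic function of the protein is preserved, the information is detected without the original DNA sequence, and errors caused by mutation are detected. Comparative experiments with existing methods confirmed that the proposed method achieves high storage efficiency in bits per nucleotide (bpn) and that the parity bases make it possible to locate errors in the hidden information.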
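The general idea of turning data into bases and adding parity bases for error localization can be sketched as follows. This is only a minimal illustration: the 2-bits-per-base mapping, the sector length of 4, the mod-4 parity rule, and all function names are assumptions made here, since the abstract does not give the paper's integer conversion table, hiding key, or parity scheme.

```python
BASES = "ACGT"  # assumed mapping: 00->A, 01->C, 10->G, 11->T

def bytes_to_bases(data):
    """Convert each byte into four bases, two bits per base."""
    out = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            out.append(BASES[(byte >> shift) & 0b11])
    return "".join(out)

def add_parity(seq, sector_len=4):
    """Append one parity base (sum of base indices mod 4) to each sector."""
    out = []
    for i in range(0, len(seq), sector_len):
        sector = seq[i:i + sector_len]
        parity = sum(BASES.index(b) for b in sector) % 4
        out.append(sector + BASES[parity])
    return "".join(out)

def find_bad_sectors(coded, sector_len=4):
    """Return indices of sectors whose parity base no longer matches."""
    bad = []
    step = sector_len + 1
    for n, i in enumerate(range(0, len(coded), step)):
        block = coded[i:i + step]
        sector, parity = block[:-1], block[-1]
        if sum(BASES.index(b) for b in sector) % 4 != BASES.index(parity):
            bad.append(n)
    return bad

coded = add_parity(bytes_to_bases(b"\x1b\x1b"))  # two sectors of four bases
mutated = coded[:6] + "A" + coded[7:]            # simulate a point mutation
print(find_bad_sectors(coded), find_bad_sectors(mutated))
```

A single parity base per sector only localizes an error to a sector; it cannot correct it, and some multi-base mutations would go unnoticed.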

Cloning and Nucleotide Sequence of a cDNA Encoding the Rat Triosephosphate Isomerase

  • Lee, Kyunglim;Ryu, Jiwon
    • Archives of Pharmacal Research / Vol. 19, No. 6 / pp. 497-501 / 1996
  • A gene coding for triosephosphate isomerase (TPI) was cloned from a rat skeletal muscle cDNA library, and its nucleotide sequence was determined. The 1,348-bp cDNA clone contains a 24-bp 5' noncoding region, the entire 750-bp coding region corresponding to a protein of 249 amino acids, a 547-bp 3' noncoding region, and part of a poly(A) tail. It also contains a polyadenylation signal, AATAAA, starting 17 bp upstream of the poly(A) tail. The calculated molecular weight of rat TPI is 27.8 kDa and the net charge is +4. The amino acid sequence deduced from the rat TPI cDNA sequence has 93% and 94% homology with those of the mouse and human clones, respectively. The catalytic-site residues Asn12, Lys14, His96, Glu166, His96, His101, Ala177, Tyr165, Glu130, Tyr209, and Ser212 are completely identical, confirming that the functional residues in TPI proteins are highly conserved throughout evolution. The most notable characteristic of the rat TPI enzyme, compared with other TPIs, is that there are five cysteine substitutions, at residues 21, 27, 159, 195, and 204. A Glu123 instead of Gly was found in rabbit, rhesus, mouse and human sequences. RT-PCR showed that the mRNA transcription level of the TPI gene differed among tissues and was highest in muscle.
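The region lengths quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the figures in the abstract; the variable and function names are illustrative, and reading the leftover bases as the partial poly(A) tail assumes that the listed regions do not overlap.

```python
# Lengths quoted in the abstract, in base pairs
CLONE_LEN = 1348
UTR5_LEN, CDS_LEN, UTR3_LEN = 24, 750, 547
PROTEIN_AA = 249

def codon_count(cds_len):
    """Number of codons in a coding region; the length must be a multiple of 3."""
    if cds_len % 3:
        raise ValueError("coding region length is not a multiple of 3")
    return cds_len // 3

codons = codon_count(CDS_LEN)                               # 250 codons
stop_codons = codons - PROTEIN_AA                           # 1: the stop codon
remainder = CLONE_LEN - (UTR5_LEN + CDS_LEN + UTR3_LEN)     # 27 bp unaccounted for

print(codons, stop_codons, remainder)
```

The 750-bp coding region gives 250 codons, that is, 249 amino acids plus one stop codon, and the 27 bp left over would be consistent with the partial poly(A) tail.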


Nucleotide Sequence Analysis and Secondary Structure Modeling of the 3'-Noncoding Regions of Two Korean Strains of Turnip Mosaic Virus

  • 최장경;류기현;최국선;박원목
    • 한국식물병리학회지 / Vol. 11, No. 3 / pp. 271-277 / 1995
  • The RNA nucleotide sequences of the 3'-noncoding regions (3'-NCRs) of two Korean strains of turnip mosaic virus (TuMV), Ca and cqs, were determined from cDNA clones that encompassed the 3'-terminal regions of the viral genomic RNAs. The 3'-NCRs of both strains were 209 nucleotides long and terminated with GAC residues and poly(A) tails. The potential polyadenylation signal motif, UAUGU, was located 140 nucleotides upstream from the poly(A) tail in each strain. A highly conserved hexanucleotide sequence [A G U G A/U G/C], which is common in the 3'-NCRs of potyvirus RNAs, was also found 119 bases upstream from the 3' end. Comparison of the 3'-NCRs of the two Korean isolates with those of four strains from Canada, China, and Japan showed high sequence identity (94.3-99.5%). Sequence analysis predicted a secondary structure of three loops with long stems within the 3'-NCRs. The base substitutions in this region among the six TuMV strains did not alter the secondary structure. The lengths of the 3'-NCRs of the 11 known potyviral RNAs and the TuMV RNAs differed from one another, and their nucleotide sequences showed 24.0% to 55.7% homology. The 3'-NCR is therefore considered useful for phylogenetic studies in potyviruses.
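The conserved hexanucleotide [A G U G A/U G/C] can be written as a regular expression, which makes it easy to locate in a sequence. The sketch below is illustrative only: the toy sequence is made up (it is not TuMV data), the helper names are assumptions, and the distance is measured from the motif start to the end of the given string.

```python
import re

# Conserved hexanucleotide from the abstract: A G U G (A or U) (G or C)
MOTIF = re.compile(r"AGUG[AU][GC]")

def motif_positions(rna):
    """Return 0-based start positions of every non-overlapping motif match."""
    return [m.start() for m in MOTIF.finditer(rna.upper())]

def distance_from_3prime(rna, start):
    """Number of bases from the motif start to the 3' end of the sequence."""
    return len(rna) - start

toy_ncr = "CCAGUGAGCCAGUGUCAA"  # hypothetical sequence, not TuMV data
for pos in motif_positions(toy_ncr):
    print(pos, toy_ncr[pos:pos + 6], distance_from_3prime(toy_ncr, pos))
```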


Consecutive Difference Expansion Based Reversible DNA Watermarking

  • 이석환;권기룡
    • 전자공학회논문지 / Vol. 52, No. 7 / pp. 51-62 / 2015
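  • With growing interest in large-capacity DNA information storage, DNA watermarking for copyright protection of DNA sequences, and DNA steganography for covert communication, there is a need for reversible DNA watermarking that preserves the function of the original DNA sequence and allows it to be restored. This paper proposes a difference expansion (DE) based reversible DNA watermarking method that uses noncoding-region DNA sequences. Reversible DNA watermarking must leave biological function unchanged, hide a large amount of data within a character-based sequence, and allow the original DNA sequence to be restored. The proposed method converts the character sequence into decimal numerical coefficients and then embeds the watermark using DE-based multiple-bit embedding of adjacent coefficient pairs (DE-MBE) and consecutive DE-based multiple-bit embedding, which uses the previously embedded coefficient as the prediction (C-DE-MBE). During embedding, a comparison search is performed to prevent the watermarked sequence from producing false start codons that would indicate a coding region. Experimental results confirmed that the proposed method has a higher embedding capacity than existing methods, produces no false start codons, and restores the original DNA sequence without a reference sequence.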
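The basic difference expansion step that DE-based schemes build on can be sketched for a single pair of integer coefficients. This is the classic one-bit-per-pair form, shown here as an assumption-laden illustration rather than the paper's DE-MBE or C-DE-MBE: it ignores value-range limits and the false start codon check, and the function names are made up.

```python
def de_embed(x, y, bit):
    """Embed one bit into the integer pair (x, y) by difference expansion."""
    avg = (x + y) // 2          # integer average, unchanged by the embedding
    diff = x - y
    new_diff = 2 * diff + bit   # expand the difference; the bit becomes its LSB
    return avg + (new_diff + 1) // 2, avg - new_diff // 2

def de_extract(x2, y2):
    """Recover the original pair and the embedded bit from a marked pair."""
    avg = (x2 + y2) // 2
    new_diff = x2 - y2
    bit = new_diff & 1          # embedded bit is the LSB of the expanded difference
    diff = new_diff >> 1        # floor division by 2 restores the original difference
    return avg + (diff + 1) // 2, avg - diff // 2, bit

marked = de_embed(3, 1, 1)
print(marked, de_extract(*marked))  # (5, 0) (3, 1, 1)
```

The marked pair (5, 0) shows why range handling matters in practice: an expanded coefficient can leave the range that the original numeric coding of the bases allows.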

Molecular Characterization and Expression Analysis of S6K1 in Cashmere Goats (Capra hircus)

  • Wu, Manlin;Bao, Wenlei;Hao, Xiyan;Zheng, Xu;Wang, Yanfeng;Wang, Zhigang
    • Asian-Australasian Journal of Animal Sciences / Vol. 26, No. 8 / pp. 1057-1064 / 2013
  • p70 ribosomal S6 kinase (p70S6K) can integrate nutrient and growth factor signals to promote cell growth and survival. We report the molecular characterization of the complementary DNA (cDNA) that encodes the goat p70S6K gene, 40S ribosomal S6 kinase 1 (S6K1) (GenBank accession GU144017), and its 3' noncoding sequence in Inner Mongolia Cashmere goats (Capra hircus). The goat S6K1 cDNA was 2,272 bp and included an open reading frame (ORF) of 1,578 bp, corresponding to a polypeptide of 525 amino acids, and a 694-nucleotide 3' noncoding sequence with a polyadenylation signal at nucleotides 2,218 to 2,223. The relative abundance of S6K1 mRNA was measured by real-time PCR in 6 tissues, and p70S6K expression was examined by immunohistochemistry in heart and testis. The phosphorylation of p70S6K is regulated by mitogen-activated protein kinase (MAPK) signaling in fetal fibroblasts.

Reversible DNA Watermarking Technique Using Histogram Shifting for Bio-Security

  • 이석환;권성근;이응주;권기룡
    • 한국멀티미디어학회논문지 / Vol. 20, No. 2 / pp. 244-253 / 2017
  • Reversible DNA watermarking enables continuous DNA storage and forgery prevention, and its iterative process of concealment and restoration has the advantage of allowing biological mutation processes to be analyzed through external watermarking. In this paper, we propose a reversible DNA watermarking method based on multiple histogram shifting of noncoding DNA sequences that prevents false start codons, maintains the original sequence length, and maintains high watermark capacity without biological mutation. The proposed method transforms the noncoding-region DNA sequence into n-th code coefficients and embeds multiple bits in those coefficients by a non-recursive multiple histogram shifting method. The multi-bit embedding process prevents false start codon generation through a comparison search between adjacent concealed nucleotide sequences. The experimental results confirmed that the proposed method has a watermark capacity 0.004-0.382 bpn higher than that of the conventional method and has higher watermark capacity than the additional data. They also confirmed that, unlike the conventional method, no false start codons were generated.
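For orientation, the classic single-peak form of histogram shifting on a list of integer coefficients is sketched below. It is not the non-recursive multiple shifting method of the paper: it embeds one bit per peak-valued coefficient, ignores the allowed value range and the false start codon check, and its function names and sample values are assumptions.

```python
from collections import Counter

def hs_embed(values, bits):
    """Embed bits at the histogram peak; unused capacity is filled with 0 bits."""
    hist = Counter(values)
    peak = max(hist, key=hist.get)
    zero = peak + 1
    while hist.get(zero, 0):     # first empty bin above the peak
        zero += 1
    bit_iter = iter(bits)
    marked = []
    for v in values:
        if peak < v < zero:
            marked.append(v + 1)                   # shift to free the bin peak + 1
        elif v == peak:
            marked.append(v + next(bit_iter, 0))   # peak carries one bit
        else:
            marked.append(v)
    return marked, peak, zero

def hs_extract(marked, peak, zero):
    """Recover the original values and the embedded bits."""
    values, bits = [], []
    for v in marked:
        if v == peak:
            bits.append(0)
            values.append(peak)
        elif v == peak + 1:
            bits.append(1)
            values.append(peak)
        elif peak + 1 < v <= zero:
            values.append(v - 1)                   # undo the shift
        else:
            values.append(v)
    return values, bits

original = [2, 3, 2, 5, 2, 4, 1]                   # hypothetical coefficients
marked, peak, zero = hs_embed(original, [1, 0, 1])
print(marked, hs_extract(marked, peak, zero))
```

The embedding capacity of this simple form equals the number of coefficients at the peak value, and the pair (peak, zero) must be known to the extractor.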

A Short Report on the Markov Property of DNA Sequences on 200-bp Genomic Units of Roadmap Genomics ChromHMM Annotations: A Computational Perspective

  • Park, Hyun-Seok
    • Genomics & Informatics / Vol. 16, No. 4 / pp. 27.1-27.6 / 2018
  • The non-coding DNA in eukaryotic genomes encodes a language that programs chromatin accessibility, transcription factor binding, and various other activities. The objective of this study was to determine the effect of the primary DNA sequence on the epigenomic landscape across 200-base-pair genomic units by integrating 127 publicly available ChromHMM BED files from the Roadmap Genomics project. Nucleotide frequency profiles of 127 chromatin annotations stratified by chromatin variability were analyzed, and integrative hidden Markov models were built to detect Markov properties of chromatin regions. Our aim was to identify the relationship between DNA sequence units and their chromatin variability based on integrated ChromHMM datasets of different cell and tissue types.

Identification of a New 5'-Noncoding Exon Region and Promoter Activity in Human N-Acetylglucosaminyltransferase III Gene

  • Kang, Bong-Seok;Kim, Yeon-Jeong;Shim, Jae-Kyoung;Song, Eun-Young;Park, Young-Guk;Lee, Young-Choon;Nam, Kyung-Soo;Kim, June-Ki;Lee, Tae-Kyun;Chung, Tae-Wha;Kim, Cheorl-Ho
    • BMB Reports / Vol. 31, No. 6 / pp. 578-584 / 1998
  • In a previous paper (Kim et al., 1996a), the immediate 5'-flanking region and coding region of the human UDP-N-acetylglucosamine:β-D-mannoside-β-1,4-N-acetylglucosaminyltransferase III (N-acetylglucosaminyltransferase-III; GnT-III) gene were isolated, analyzed, and reported. Herein, we report the amplification of a new 5'-noncoding region of the GnT-III mRNA by single-strand ligation to single-stranded cDNA-PCR (5'-RACE PCR) using poly(A)+ RNA isolated from human fetal liver cells. A cDNA clone was obtained with 5' sequences (96 bp) that diverged seven nucleotides upstream from the ATG (+1) start codon. A consensus splice junction sequence, TCTCCCGCAG, was found immediately 5' to the position where the sequences of the cDNA diverged. The result suggested the presence of an intron in the 5'-noncoding region and that the cDNA was an incompletely reverse-transcribed cDNA product derived from an mRNA containing a new noncoding exon. When mRNA expression of GnT-III in various human tissues and cancer cell lines was examined, Northern blot analysis indicated high expression levels of GnT-III in human fetal kidney and brain tissues, as well as in a number of leukemia and lymphoma cancer cell lines. Promoter activities of the 5'-flanking regions of exon 1 and the new noncoding region were measured in a human hepatoma cell line, HepG2, by luciferase assays. The 5'-flanking region of exon 1 was the most active, whilst that of exon 2 was inactive.


A Short Report on the Markov Property of DNA Sequences on 200-bp Genomic Units of ENCODE/Broad ChromHMM Annotations: A Computational Perspective

  • Park, Hyun-Seok
    • Genomics & Informatics / Vol. 16, No. 3 / pp. 65-70 / 2018
  • The non-coding DNA in eukaryotic genomes encodes a language which programs chromatin accessibility, transcription factor binding, and various other activities. The objective of this short report was to determine the impact of primary DNA sequence on the epigenomic landscape across 200-base pair genomic units by integrating nine publicly available ChromHMM Browser Extensible Data files of the Encyclopedia of DNA Elements (ENCODE) project. The nucleotide frequency profiles of nine chromatin annotations with the units of 200 bp were analyzed and integrative Markov chains were built to detect the Markov properties of the DNA sequences in some of the active chromatin states of different ChromHMM regions. Our aim was to identify the possible relationship between DNA sequences and the newly built chromatin states based on the integrated ChromHMM datasets of different cells and tissue types.
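A first-order Markov chain over nucleotides is estimated by counting how often each base follows each other base. The sketch below shows that estimate on a toy string; it is a generic maximum-likelihood count, not the integrative models built in the report, and the function name and sample sequence are assumptions.

```python
from collections import Counter

def transition_matrix(seq, alphabet="ACGT"):
    """Estimate first-order transition probabilities P(next base | current base)."""
    counts = Counter(zip(seq, seq[1:]))   # counts of adjacent base pairs
    matrix = {}
    for a in alphabet:
        total = sum(counts[(a, b)] for b in alphabet)
        # Rows for bases that never occur as a predecessor are left at zero
        matrix[a] = {b: (counts[(a, b)] / total if total else 0.0) for b in alphabet}
    return matrix

toy_unit = "ACGTACGGTTAC"                 # hypothetical sequence, not ChromHMM data
for base, row in transition_matrix(toy_unit).items():
    print(base, row)
```

In a setting like the one described, one such matrix could be estimated per chromatin state from its 200-bp units and the matrices compared across states.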

Phylogenetic Analysis by RFLP and Sequencing of Mitochondrial DNA in a Korean Population

  • Lee, Jin-Young;Kim, Heui-Soo;Ha, Bae-Jin;Park, Yeong-Hong
    • Archives of Pharmacal Research / Vol. 29, No. 1 / pp. 88-95 / 2006
  • Analysis of the molecular nature of mitochondrial DNA (mtDNA) can provide a powerful marker for anthropological studies of modern populations. While population genetic studies on mtDNA have been reported for several ethnic groups, no such study has been documented for the Korean population. We surveyed mtDNA polymorphisms in the HVS-I of the noncoding D-loop region and its upstream region in 430 unrelated healthy Koreans by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and direct sequencing analysis. A 2,790-bp PCR product spanning the specific mtDNA region (mt13715-16504) was subjected to RFLP analysis using six restriction enzymes (Hinf I, Hae III, Alu I, Dde I, Mbo I, Rsa I). PAUP analysis of the PCR-RFLP results detected 38 mtDNA haplotypes (Hap 1-38) in the Korean population, which were classified into 11 haplogroups (Grp 1-11) of related haplotypes. In comparison of the sequencing data with Anderson's reference sequence, transitions were more prevalent than transversions. Insertions or deletions were not found. In addition, three of the polymorphic sites (A16240C, A16351G, G16384A) in the HVS-I region were newly identified. The polymorphic sites were distributed randomly in the region, though the frequency at each site was variable. Thus, this research may be useful for the genealogical study of East Asian populations.
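The transition/transversion distinction and the amplicon length quoted in this abstract can be checked with a short script. This is a minimal sketch: the helper names are assumptions, and the site labels are parsed in the "reference base, position, variant base" form used in the abstract.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def parse_site(label):
    """Split a label such as 'A16240C' into (reference base, position, variant base)."""
    return label[0], int(label[1:-1]), label[-1]

def classify(ref, alt):
    """Transition: purine<->purine or pyrimidine<->pyrimidine; otherwise transversion."""
    if ref == alt:
        raise ValueError("reference and variant bases are identical")
    pair = {ref, alt}
    same_class = pair <= PURINES or pair <= PYRIMIDINES
    return "transition" if same_class else "transversion"

def amplicon_length(start, end):
    """Length of a region given inclusive start and end coordinates."""
    return end - start + 1

for label in ("A16240C", "A16351G", "G16384A"):
    ref, pos, alt = parse_site(label)
    print(label, pos, classify(ref, alt))

print(amplicon_length(13715, 16504))  # 2790
```

Of the three newly reported sites, A16240C is a transversion and the other two are transitions, and the coordinates mt13715-16504 give the stated 2,790-bp product.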