• Title/Abstract/Keywords: DNA data storage

28 search results (processing time: 0.055 s)

DNA Information Hiding Method for DNA Data Storage

  • 이석환;권기룡
    • 전자공학회논문지 / Vol. 51, No. 10 / pp. 118-127 / 2014
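  • DNA data storage is a method of storing large volumes of digital data in DNA base sequences and is regarded as a next-generation information storage medium. This paper proposes a method, based on DNA steganography, for storing information in noncoding DNA sequences. The proposed method converts encrypted data into a data base sequence using an integer conversion table, then hides it in the noncoding base sequence using a hiding key composed of seed information and sector length. As a result, the genetic function of proteins is preserved, the information can be detected without the original DNA sequence, and errors caused by mutation are detected. Comparative experiments with existing methods confirmed that the proposed method achieves high storage efficiency in bits per nucleotide (bpn) and that parity bases make it possible to locate errors in the hidden information.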
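The idea of converting data to bases and locating errors with parity bases can be illustrated with a minimal Python sketch. It is not the paper's method: the 2-bit mapping table, the sum-modulo-4 parity rule, and the names `encode`, `decode`, and `find_bad_sectors` are all assumptions made for illustration, and the key-driven hiding step in noncoding regions is left out.

```python
# Hypothetical 2-bit table; the paper's integer conversion table is not given in the abstract.
BITS_TO_BASE = {"00": "A", "01": "C", "10": "G", "11": "T"}
BASE_TO_BITS = {base: bits for bits, base in BITS_TO_BASE.items()}
BASES = "ACGT"


def parity_base(sector: str) -> str:
    """Assumed parity rule: sum of base indices modulo 4."""
    return BASES[sum(BASES.index(b) for b in sector) % 4]


def encode(data: bytes, sector_len: int = 8) -> str:
    """Map bytes to bases and append one parity base after each sector."""
    bits = "".join(f"{byte:08b}" for byte in data)
    bases = "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))
    sectors = [bases[i:i + sector_len] for i in range(0, len(bases), sector_len)]
    return "".join(sector + parity_base(sector) for sector in sectors)


def find_bad_sectors(seq: str, sector_len: int = 8) -> list[int]:
    """Return the indices of sectors whose parity base does not match."""
    step = sector_len + 1
    chunks = [seq[i:i + step] for i in range(0, len(seq), step)]
    return [n for n, c in enumerate(chunks) if parity_base(c[:-1]) != c[-1]]


def decode(seq: str, sector_len: int = 8) -> bytes:
    """Drop the parity bases and map the remaining bases back to bytes."""
    step = sector_len + 1
    bases = "".join(seq[i:i + step][:-1] for i in range(0, len(seq), step))
    bits = "".join(BASE_TO_BITS[b] for b in bases)
    return bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
```

With this rule, any single-base substitution inside a sector changes the sum modulo 4, so the sector is flagged; the sketch locates the affected sector, not the exact base.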

A Pattern Matching Extended Compression Algorithm for DNA Sequences

  • Murugan, A.; Punitha, K.
    • International Journal of Computer Science & Network Security / Vol. 21, No. 8 / pp. 196-202 / 2021
  • DNA sequencing provides fundamental data for genomics, bioinformatics, biology, and many other research areas. As sequencing technology advances, a massive amount of genomic data, mainly DNA sequences, is produced every day and demands ever more storage and bandwidth. Managing, analyzing, and especially storing these data has become a major challenge for bioinformatics: large volumes require fast transmission, effective storage, and quick access to any record, and storage accounts for a considerable share of the total cost of producing and analyzing DNA sequences. Standard compression techniques fail to compress these sequences well, so several specialized techniques have been introduced, and lossless compression has become necessary. This paper describes a new DNA compression mechanism, the pattern matching extended compression algorithm, which reads the input sequence as segments, finds matching patterns, and stores them in a permanent or temporary table depending on the number of bases. The remaining unmatched sequence is converted to binary, grouped into seven-bit units, and each unit is then converted to an ASCII character. Finally, the algorithm dynamically calculates the compression ratio. The results show that the pattern matching extended compression algorithm outperforms cutting-edge compressors in compression ratio regardless of file size.
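The binary-to-ASCII step for the unmatched part of the sequence can be sketched as follows. This is a minimal illustration under stated assumptions: the 2-bit code per base, the zero padding, and the function names `pack` and `unpack` are hypothetical, and the pattern table described in the abstract is not modelled.

```python
# Assumed 2-bit code per base; the paper's actual binary code is not given in the abstract.
BASE_TO_BITS = {"A": "00", "C": "01", "G": "10", "T": "11"}
BITS_TO_BASE = {bits: base for base, bits in BASE_TO_BITS.items()}


def pack(seq: str) -> tuple[str, int]:
    """Convert bases to bits, group them in sevens, and emit one ASCII char per group."""
    bits = "".join(BASE_TO_BITS[b] for b in seq)
    bits += "0" * (-len(bits) % 7)  # pad the last group to seven bits
    chars = "".join(chr(int(bits[i:i + 7], 2)) for i in range(0, len(bits), 7))
    return chars, len(seq)  # the base count is kept so padding can be dropped


def unpack(chars: str, n_bases: int) -> str:
    """Invert pack(): expand each char to seven bits and read back two bits per base."""
    bits = "".join(f"{ord(c):07b}" for c in chars)
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, 2 * n_bases, 2))


def compression_ratio(seq: str, chars: str) -> float:
    """Packed size divided by original size, both counted in characters."""
    return len(chars) / len(seq)
```

Seven bits hold 3.5 bases, so under these assumptions the packed text is roughly 2/7 the length of the input before any pattern matching is applied.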

DNA Based Cloud Storage Security Framework Using Fuzzy Decision Making Technique

  • Majumdar, Abhishek;Biswas, Arpita;Baishnab, Krishna Lal;Sood, Sandeep K.
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 13, No. 7 / pp. 3794-3820 / 2019
  • In recent years, there has been a strong need for a cloud environment that can detect illegal behaviours and also store data securely. This study presents a cloud storage framework in which a 128-bit encryption key is generated by combining deoxyribonucleic acid (DNA) cryptography with the Hill cipher algorithm, with the aim of making the framework unbreakable and ensuring a more secure distributed cloud storage environment. The study also proposes a DNA-based encryption technique, followed by 256-bit secure socket layer (SSL) to secure data storage; the 256-bit SSL provides secured connections during data transmission. The data are classified according to qualitative security parameters obtained with a specialized fuzzy-based classification technique. The model can additionally select suitable storage servers from an existing pool. For this, a fuzzy-based technique for order of preference by similarity to ideal solution (TOPSIS) multi-criteria decision-making (MCDM) model is employed, which decides on the set of storage servers on which the data must be stored and reduces execution time while keeping security at an improved level.
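The server-selection step can be illustrated with classic (crisp) TOPSIS, which is a simpler stand-in for the fuzzy variant the abstract describes. The function name `topsis`, the criteria, and the example values are assumptions for illustration only.

```python
import math


def topsis(matrix, weights, benefit):
    """Score alternatives with classic TOPSIS (crisp values, not the fuzzy variant).

    matrix:  one row per alternative, one column per criterion
    weights: importance of each criterion
    benefit: True if larger is better for that criterion, False if smaller is better
    Assumes no criterion column is all zeros and the alternatives are not all identical.
    """
    n_crit = len(weights)
    # Vector-normalize each column, then apply the criterion weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n_crit)] for row in matrix]
    columns = list(zip(*v))
    ideal = [max(c) if benefit[j] else min(c) for j, c in enumerate(columns)]
    anti = [min(c) if benefit[j] else max(c) for j, c in enumerate(columns)]
    scores = []
    for row in v:
        d_pos = math.dist(row, ideal)  # distance to the ideal solution
        d_neg = math.dist(row, anti)   # distance to the anti-ideal solution
        scores.append(d_neg / (d_pos + d_neg))
    return scores


# Hypothetical servers: (security score, latency in ms).
servers = [[0.9, 20], [0.6, 50], [0.3, 80]]
scores = topsis(servers, weights=[0.5, 0.5], benefit=[True, False])
best = max(range(len(servers)), key=scores.__getitem__)
```

A score closer to 1 means the alternative is closer to the ideal solution, so storage servers would be chosen in descending score order.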

Storing Digital Information in Long-Read DNA

  • Ahn, TaeJin;Ban, Hamin;Park, Hyunsoo
    • Genomics & Informatics / Vol. 16, No. 4 / pp. 30.1-30.6 / 2018
  • There is an urgent need for effective and cost-efficient data storage, as the worldwide requirement for data storage is growing rapidly. DNA has emerged as a new medium for storing digital information. Recent studies have successfully stored digital information such as text and GIF animation. Previous studies tackled technical hurdles caused by errors in DNA synthesis and sequencing, and focused on a strategy that uses 100-150-bp read sizes in both synthesis and sequencing. In this paper, we suggest a novel data encoding/decoding scheme that makes use of long-read DNA (~1,000 bp). This enables accurate recovery of stored digital information with fewer reads than the previous approach and also reduces sequencing time.

An Efficient DNA Sequence Compression using Small Sequence Pattern Matching

  • Murugan, A.; Punitha, K.
    • International Journal of Computer Science & Network Security / Vol. 21, No. 8 / pp. 281-287 / 2021
  • Bioinformatics blends biology and informatics technologies and applies statistical methods to issues in nutrition, medical research, and the study of the living environment. The continual growth of DNA sequencing technologies has produced voluminous genomic data, especially DNA sequences, calling for increased storage and bandwidth. Bioinformatics currently faces the major hurdle of managing, interpreting, and accurately preserving this large body of information, and compression is a promising way to address it. This paper proposes a methodology for efficient storage, together with an algorithm for exact matching of small patterns. The DNA sequence representation is used to find matches of 2 to 6 bases against the remaining input sequence. In the first level, the DNA sequence is transformed into ASCII symbols; in the second level, it is compressed with the LZ77 method, after which grid variables of size 3 are formed to hold the 100 characters. In the third level of compression, the compressed output is held in the grid variables. The proposed algorithm, S_Pattern DNA, achieves an average compression ratio of 93%, better than the existing compression algorithms, on datasets from the UCI repository.

Error Correction of Holographic Data Storage System Using Artificial Intelligence

  • 김장현;박진배;양현석;박영필
    • 대한전기학회:학술대회논문집 / Proceedings of the KIEE 37th Summer Conference 2006, D / pp. 2142-2143 / 2006
  • No current data storage system satisfies all of these conditions; however, a holographic data storage system can achieve a faster data transfer rate because it is a page-oriented memory system that uses volume holograms to write and retrieve data. The system can be built without mechanically actuated parts, so a fast data transfer rate and a high storage capacity of about 1 Tb/cm³ can be realized. This research proposes a new bit-error reduction method to reduce errors in binary data stored in a holographic data storage system. First, fuzzy rules are found using a test-bed system for the elements of a holographic digital data system. Second, a fuzzy rule table is built using a DNA coding method. Finally, prior error elements are reduced and digital data are recorded. The recording ratio and reconstruction ratio show good performance.

Suffix Tree Constructing Algorithm for Large DNA Sequences Analysis

  • 최해원
    • 한국산업정보학회논문지 / Vol. 15, No. 1 / pp. 37-46 / 2010
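  • A suffix tree is an effective data structure that represents the internal structure of data in detail and supports linear-time search, which makes it useful for DNA sequence analysis. However, building a suffix tree from a sequence makes the tree at least 30 times larger than the original, so in-memory use is very difficult for terabyte-scale DNA sequences. This paper therefore presents a disk-based technique for applying suffix trees to large DNA sequences, while guaranteeing that the linear-time search property of the suffix tree is preserved in light of the structure of DNA sequences. To verify this, a 424 GB suffix tree was built on disk from 9 GB of gene fragment sequences, and a comparison with the KMP algorithm on arbitrary query sequences showed superior query response time.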
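The contrast between scanning with KMP and querying a prebuilt index can be sketched in Python. The KMP function follows the standard algorithm; the index is a naive in-memory suffix array, used here as a much simpler stand-in for the paper's disk-based suffix tree. The function names are hypothetical.

```python
def kmp_find_all(text: str, pattern: str) -> list[int]:
    """Return all start positions of pattern in text using Knuth-Morris-Pratt."""
    if not pattern:
        return []
    # Failure table: length of the longest proper prefix that is also a suffix.
    fail = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        while k and pattern[i] != pattern[k]:
            k = fail[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        fail[i] = k
    hits, k = [], 0
    for i, ch in enumerate(text):
        while k and ch != pattern[k]:
            k = fail[k - 1]
        if ch == pattern[k]:
            k += 1
        if k == len(pattern):
            hits.append(i - k + 1)
            k = fail[k - 1]
    return hits


def build_suffix_array(text: str) -> list[int]:
    """Naive suffix array: suffix start positions in lexicographic order."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def sa_find_all(text: str, sa: list[int], pattern: str) -> list[int]:
    """Binary-search the suffix array for every suffix that starts with pattern."""
    m = len(pattern)
    lo, hi = 0, len(sa)
    while lo < hi:  # first suffix whose m-char prefix is >= pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start, hi = lo, len(sa)
    while lo < hi:  # first suffix whose m-char prefix is > pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])
```

KMP rescans the whole text for every query, whereas the index is built once and each query then depends mainly on the pattern length and the logarithm of the text length. That is the kind of trade-off behind the query-time comparison in the abstract, at the cost of a much larger structure.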

Reversible DNA Watermarking Technique Using Histogram Shifting for Bio-Security

  • 이석환;권성근;이응주;권기룡
    • 한국멀티미디어학회논문지 / Vol. 20, No. 2 / pp. 244-253 / 2017
  • Reversible DNA watermarking enables continuous DNA storage and forgery prevention, and has the advantage that biological mutation processes can be analyzed through external watermarking by iterating concealment and restoration. In this paper, we propose a reversible DNA watermarking method based on multiple histogram shifting of noncoding DNA sequences that prevents false start codons, maintains the original sequence length, and maintains high watermark capacity without biological mutation. The proposed method transforms the noncoding-region DNA sequence into n-th code coefficients and embeds multiple bits in these coefficients using a non-recursive multiple histogram shifting method. The multi-bit embedding process prevents false start codon generation through a comparison search between adjacent concealed nucleotide sequences. Experimental results confirmed that the proposed method has a higher watermark capacity (0.004-0.382 bpn) than the conventional method and a watermark capacity higher than the additional data. They also confirmed that, unlike the conventional method, no false start codon was generated.
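The reversibility of histogram shifting can be shown with the classic single-shift scheme on a list of integer coefficients. This sketch is not the paper's non-recursive multiple-shifting method and does no false start codon checking; the function names and the choice of the empty bin (one above the maximum value) are assumptions for illustration.

```python
from collections import Counter


def embed(values: list[int], bits: list[int]) -> tuple[list[int], int, int]:
    """Embed bits at the histogram peak by shifting the bins above it up by one."""
    hist = Counter(values)
    peak = max(hist, key=hist.get)   # most frequent coefficient carries the payload
    zero = max(values) + 1           # assumed empty bin: one above the maximum
    if len(bits) > hist[peak]:
        raise ValueError("payload exceeds capacity (count of the peak value)")
    payload = iter(bits)
    marked = []
    for v in values:
        if peak < v < zero:
            marked.append(v + 1)                 # make room next to the peak
        elif v == peak:
            marked.append(v + next(payload, 0))  # bit 0 stays, bit 1 moves up
        else:
            marked.append(v)
    return marked, peak, zero


def extract(marked: list[int], peak: int, zero: int, n_bits: int) -> tuple[list[int], list[int]]:
    """Recover the embedded bits and restore the original coefficients exactly."""
    bits, restored = [], []
    for v in marked:
        if v == peak:
            bits.append(0)
            restored.append(peak)
        elif v == peak + 1:
            bits.append(1)
            restored.append(peak)
        elif peak + 1 < v <= zero:
            restored.append(v - 1)               # undo the shift
        else:
            restored.append(v)
    return bits[:n_bits], restored
```

The capacity in this simple form equals the number of coefficients at the peak value, which is why schemes of this kind use several peaks or repeated shifts to raise the bits-per-nucleotide figure.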

Korea Coast Guard's Human Biological Materials Storage Project for Identifying Bodies Recovered from the Sea: A Model Suggestion

  • 주현정;추민규;백윤기;김남율;최아진;임선영;이종남;김형규;이한성
    • 해양환경안전학회지 / Vol. 24, No. 2 / pp. 171-178 / 2018
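  • While carrying out duties in the extreme environment of the sea, Korea Coast Guard officers frequently face the risk of death or disappearance. To enable rapid identification when an accident occurs, the Korea Coast Guard Research Center built a DNA-based identification system for Coast Guard officers and also established a mechanism that allows burial in a national cemetery when a body is not found. Under the operating rules of the human biological materials storage project for identification in the event of death or disappearance in the line of duty, the management, storage, disposal, and quality inspection of human biological materials are carried out under the oversight of a management committee. About 700 cases of bodies recovered from the sea occur each year in the waters around the Korean Peninsula, and a method of identifying them is needed when a body is found late and is badly decomposed, or when only part of it is found so that identification by fingerprints or teeth is impossible. Extending the 'human biological materials storage project' operated by the Korea Coast Guard Research Center to marine and fisheries workers, related researchers, and marine leisure participants would greatly help identify bodies recovered from the sea.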

Public Perception of a Criminal DNA Database in Korea

  • Lee, Ji Hyun;Cho, Sohee;Kim, Moon Young;Lee, Seung Hwan;Lee, Hwan Young;Lee, Soong Deok;LoCascio, Sarah Prusoff;Jung, Kyu Won
    • Asian Journal for Public Opinion Research / Vol. 7, No. 2 / pp. 75-93 / 2019
  • Background: Since 2010, Korea has maintained a DNA database of those convicted of or awaiting trial for certain crimes. There have been proposals to expand the list of crimes included in this database, or conversely, omit certain crimes if they are committed during protests. An understanding of the feelings of the public as we consider the ethical, legal, and social aspects of a DNA database and as revisions to laws are made is required. Methodology: Questions related to the DNA database were included in the nationally representative Korean Academic Multimode Open Survey (KAMOS) panel (June-August 2016). Results: Of 2,000 randomly selected panel members, 1,013 respondents participated in this survey, including 89.2% who supported the existence of a criminal DNA database. The current system of storing DNA profiles until a suspect's acquittal or a convict's death was supported by 79.5% of respondents. In addition, 70.8% of respondents agreed with the expansion of crime categories included in the criminal database. Many (93.4%) respondents favored genetic testing and data storage to determine the identity and cause of death for people who die of unnatural causes. Some differences in attitude related to social class were noted, with those who self-identified as members of the upper class more likely to support the database and its expansion to include additional crimes than those who self-identified as middle or lower class. Conclusion: Our findings suggest that Koreans generally support the criminal DNA database.