• Title/Summary/Keyword: Sequence data

Search results: 3,115 items (processing time 0.031 s)

Converting Panax ginseng DNA and chemical fingerprints into two-dimensional barcode

  • Cai, Yong;Li, Peng;Li, Xi-Wen;Zhao, Jing;Chen, Hai;Yang, Qing;Hu, Hao
    • Journal of Ginseng Research
    • /
    • Vol. 41, No. 3
    • /
    • pp.339-346
    • /
    • 2017
  • Background: In this study, we investigated how to convert the Panax ginseng DNA sequence code and chemical fingerprints into a two-dimensional code. To improve the compression efficiency, the GATC2Bytes and digital merger compression algorithms are proposed. Methods: HPLC chemical fingerprint data of 10 groups of P. ginseng from Northeast China, together with the internal transcribed spacer 2 (ITS2) sequence code as the DNA sequence code, were prepared for conversion. To convert these data into a two-dimensional code, the following six steps were performed. First, the chemical fingerprint characteristic data sets were obtained through the inflection filtering algorithm. Second, precompression processing of these data sets was undertaken. Third, precompression processing was undertaken on the P. ginseng DNA (ITS2) sequence codes. Fourth, the precompressed chemical fingerprint data and the DNA (ITS2) sequence code were combined in accordance with the set data format. Fifth, the combined data were compressed with Zlib, an open source data compression library. Finally, the compressed data were used to generate a two-dimensional code called a quick response code (QR code). Results: Through this conversion process, the number of bytes needed to store the P. ginseng chemical fingerprints and DNA (ITS2) sequence code was greatly reduced. After GATC2Bytes processing, the ITS2 compression rate reaches 75%, and the chemical fingerprint compression rate exceeds 99.65% after filtration and digital merger compression. The overall compression ratio therefore exceeds 99.36%. The capacity of the resulting QR code is around 0.5k, so it can be read and identified easily by any smartphone. Conclusion: P. ginseng chemical fingerprints and the DNA (ITS2) sequence code can form a QR code after data processing, and the QR code can therefore serve as a carrier of information on the authenticity and quality of P. ginseng. This study provides a theoretical basis for the development of a quality traceability system for traditional Chinese medicine based on a two-dimensional code.
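
The abstract does not spell out how GATC2Bytes works, so the following is only a minimal sketch under an assumption: each base is packed into 2 bits (four bases per byte), which would be consistent with the reported 75% rate relative to one byte per base. The mapping table, the 2-byte length prefix, and the helper names `pack_dna`, `unpack_dna`, and `build_payload` are hypothetical and are not taken from the paper. The packed sequence is then combined with fingerprint bytes and compressed with Python's standard `zlib` module.

```python
import zlib

# Hypothetical base-to-bits table; the paper's actual GATC2Bytes mapping is not given in the abstract.
BASE_TO_BITS = {"G": 0b00, "A": 0b01, "T": 0b10, "C": 0b11}
BITS_TO_BASE = {bits: base for base, bits in BASE_TO_BITS.items()}


def pack_dna(seq: str) -> bytes:
    """Pack a DNA string into 2 bits per base, prefixed with a 2-byte base count."""
    out = bytearray(len(seq).to_bytes(2, "big"))
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | BASE_TO_BITS[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def unpack_dna(data: bytes) -> str:
    """Invert pack_dna using the stored base count to drop padding."""
    count = int.from_bytes(data[:2], "big")
    bases = []
    for byte in data[2:]:
        for shift in (6, 4, 2, 0):
            bases.append(BITS_TO_BASE[(byte >> shift) & 0b11])
    return "".join(bases[:count])


def build_payload(seq: str, fingerprint: bytes) -> bytes:
    """Combine packed DNA and (already precompressed) fingerprint bytes, then zlib-compress."""
    return zlib.compress(pack_dna(seq) + fingerprint)
```

The resulting bytes could then be handed to any QR code encoder; that step is omitted here because it requires a third-party library.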

영상의 지역성과 인접 픽셀 차분 시퀀스를 이용하는 가역 데이터 임베딩 기법 (Reversible Data Embedding Algorithm Using the Locality of Image and the Adjacent Pixel Difference Sequence)

  • 정수목
    • 한국정보전자통신기술학회논문지
    • /
    • Vol. 9, No. 6
    • /
    • pp.573-577
    • /
    • 2016
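  • In this paper, we propose a reversible data embedding technique that uses the locality of an image and the adjacent pixel difference sequence. Natural images generally exhibit locality. By applying a technique that predicts adjacent pixel values from image locality to the existing APD (Adjacent Pixel Difference) method, the proposed technique increases the amount of data that can be embedded and enables data embedding at various levels. Experimental results confirm the superiority of the proposed technique.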

A Primer for Disease Gene Prioritization Using Next-Generation Sequencing Data

  • Wang, Shuoguo;Xing, Jinchuan
    • Genomics & Informatics
    • /
    • Vol. 11, No. 4
    • /
    • pp.191-199
    • /
    • 2013
  • High-throughput next-generation sequencing (NGS) technology produces a tremendous amount of raw sequence data. The challenges for researchers are to process the raw data, to map the sequences to the genome, to discover variants that differ from the reference genome, and to prioritize or rank the variants for the question of interest. The recent development of many computational algorithms and programs has vastly improved the ability to translate sequence data into valuable information for disease gene identification. However, NGS data analysis is complex and can be overwhelming for researchers who are not familiar with the process. Here, we outline the analysis pipeline and describe some of the most commonly used principles and tools for analyzing NGS data for disease gene identification.

최소 DTW 거리 기반의 데이터 시퀀스 색인 기법 (Sequence Data Indexing Method based on Minimum DTW Distance)

  • 길기정;송석일;송재종;이석필;장세진;이종설
    • 한국콘텐츠학회논문지
    • /
    • Vol. 11, No. 12
    • /
    • pp.52-59
    • /
    • 2011
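  • This paper proposes an indexing method that supports effective similarity search in sequence databases. To obtain a filtering effect on data sequences, the proposed indexing method introduces a new minimum DTW distance. The minimum DTW distance measures the minimum distance between a group of similar data sequences and a query sequence. It allows similarity search to be performed while filtering the sequence database through a hierarchical index structure. Finally, experiments demonstrate the superiority of the proposed method.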
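
For readers unfamiliar with DTW, the sketch below shows the classic dynamic-programming DTW distance between two numeric sequences. The abstract does not define how the minimum DTW distance to a group is computed, so `min_group_distance` is only a naive, hypothetical stand-in that takes the smallest DTW distance over the group's members; it is not the paper's method, which presumably avoids comparing against every member.

```python
def dtw_distance(a, b):
    """Classic DTW distance with absolute difference as the local cost."""
    inf = float("inf")
    n, m = len(a), len(b)
    prev = [inf] * (m + 1)
    prev[0] = 0.0  # aligning two empty prefixes costs nothing
    for i in range(1, n + 1):
        cur = [inf] * (m + 1)
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # extend the cheapest of: insertion, deletion, match
            cur[j] = cost + min(prev[j], cur[j - 1], prev[j - 1])
        prev = cur
    return prev[m]


def min_group_distance(query, group):
    """Naive stand-in (assumption): smallest DTW distance from query to any group member."""
    return min(dtw_distance(query, seq) for seq in group)
```

A group whose minimum distance to the query already exceeds the search threshold could be skipped as a whole, which is the kind of filtering effect the abstract describes.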

병렬 구조에 의한 가변 논리제어장치의 기능적 설계 (A Functional Design of Programmable Logic Controller Based on Parallel Architecture)

  • 이정훈;신현식
    • 대한전기학회논문지
    • /
    • Vol. 40, No. 8
    • /
    • pp.836-844
    • /
    • 1991
  • Programmable logic controller (PLC) systems are widely used for factory control. A PLC system receives a ladder diagram drawn by the user to implement hardware logic, converts the ladder diagram into a sequence program executable on the PLC system, and executes the sequence program indefinitely unless the user interrupts it. The sequence program processes on/off signal data and tolerates a one-scan delay and the loss of pulse-type signals shorter than a scan time, so data dependency does not exist. By applying these characteristics to a multiprocessor architecture, we functionally design a parallel PLC and evaluate the performance improvement. The parallel PLC consists of a central processing module, N general processing units, and a shared memory in a master-slave arrangement. Each module executes its allocated sequence program under the control of the central processing module. We can expect improved performance from parallel processing, and improved reliability from relocating the sequence program when an error occurs in a processing module.

A Novel M-ary Code-Selected Direct Sequence BPAM UWB Communication System

  • Bai, Zhiquan;Kwak, Kyung-Sup
    • ETRI Journal
    • /
    • Vol. 28, No. 1
    • /
    • pp.95-98
    • /
    • 2006
  • In this letter, a novel M-ary code-selected direct sequence (DS) ultra-wideband (UWB) communication system is presented. Our purpose is to achieve a high data rate with an M-ary code-selected direct sequence bipolar pulse amplitude modulation (MCSDS-BPAM) scheme. In this system, a particular DS code sequence is selected from the DS Gold code set by the $\log_2 M$/2 bits. This scheme can both achieve a high data rate without increasing the system bandwidth or changing the pulse shape and improve the BER as the modulation level M increases, even at a lower signal-to-noise ratio (SNR). The receiver signal processing algorithm is given for an MCSDS-BPAM UWB system over an ideal AWGN channel with correlation receivers.

Correlation Analysis between Regulatory Sequence Motifs and Expression Profiles by Kernel CCA

  • Rhee, Je-Keun;Joung, Je-Gun;Chang, Jeong-Ho;Zhang, Byoung-Tak
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.63-68
    • /
    • 2005
  • Transcription factors regulate gene expression by binding to gene upstream regions. Each transcription factor has a specific binding site in the promoter region. The analysis of gene upstream sequences is therefore necessary for understanding the regulatory mechanisms of genes, under the plausible assumption that DNA sequence motif profiles are closely related to the expression behaviors of the corresponding genes. Here, we present an effective approach to analyzing the relation between gene expression profiles and gene upstream sequences on the basis of kernel canonical correlation analysis (kernel CCA). Kernel CCA is a useful method for finding underlying relationships between two different data sets. In an application to a yeast cell cycle data set, it is shown that the gene upstream sequence profile is closely related to gene expression patterns in terms of canonical correlation scores. Through further analysis of the contributing values, or weights, of sequence motifs in the construction of a pair of sequence motif profiles and expression profiles, we show that the proposed method can identify significant DNA sequence motifs involved in specific gene expression patterns in the yeast cell cycle, including some well-known motifs and some putative ones.

연관규칙과 순차패턴을 이용한 프로세스 마이닝 (A Process Mining using Association Rule and Sequence Pattern)

  • 정소영;권수태
    • 산업경영시스템학회지
    • /
    • Vol. 31, No. 2
    • /
    • pp.104-111
    • /
    • 2008
  • Process mining is considered as a means of supporting the discovery of business processes for unstructured process models. A process mining algorithm that uses the association rules and sequence patterns of data mining is developed to extract information about processes from event logs and to discover processes with alternative, concurrent, and hidden activities. Numerical examples are presented to show the effectiveness and efficiency of the algorithm.

케이블 모뎀 상향링크에 적합한 CAZAC sequence를 이용한 coarse timing recovery의 두 알고리즘 비교 (Comparison of Two Algorithms using CAZAC Sequence for Cable Modem Uplink)

  • 하현주;오왕록;김환우
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2007년도 하계종합학술대회 논문집
    • /
    • pp.53-54
    • /
    • 2007
  • As cable networks evolve toward two-way high-speed data service, they must transfer high-speed data over limited bandwidth. If QAM is used for this purpose, synchronization algorithms become important system parameters. In this paper, we present two methods of coarse timing recovery using a CAZAC sequence for the cable modem uplink.
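
The abstract does not describe its two timing-recovery methods, so the sketch below only illustrates the general idea under stated assumptions. It uses a Zadoff-Chu sequence as one well-known example of a CAZAC sequence, checks its constant-amplitude and zero-cyclic-autocorrelation properties, and locates a preamble with a plain sliding correlation. The function names, the root and length values, and the noise-free zero-padded signal are all illustrative choices, not details from the paper.

```python
import cmath


def zadoff_chu(root, length):
    """Zadoff-Chu sequence of odd `length`; `root` must be coprime to `length`."""
    return [cmath.exp(-1j * cmath.pi * root * n * (n + 1) / length)
            for n in range(length)]


def cyclic_autocorrelation(seq, lag):
    """Cyclic autocorrelation of `seq` at the given lag."""
    n = len(seq)
    return sum(seq[(i + lag) % n] * seq[i].conjugate() for i in range(n))


def coarse_timing(received, preamble):
    """Return the sample offset where |correlation with the preamble| peaks."""
    best_offset, best_mag = 0, -1.0
    for k in range(len(received) - len(preamble) + 1):
        corr = sum(received[k + i] * preamble[i].conjugate()
                   for i in range(len(preamble)))
        if abs(corr) > best_mag:
            best_offset, best_mag = k, abs(corr)
    return best_offset


# Usage: a noise-free burst with 12 idle samples before the preamble.
preamble = zadoff_chu(root=1, length=31)
received = [0j] * 12 + preamble + [0j] * 5
offset = coarse_timing(received, preamble)
```

The sharp autocorrelation peak is what makes CAZAC sequences attractive as preambles; a practical receiver would also have to cope with noise, frequency offset, and the QAM payload around the preamble.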

페이딩 환경에서의 효율적인 협력통신 시스템 동기 알고리즘 연구 (Efficient Synchronization Scheme for Cooperative Communication System over Fading Channel)

  • 김윤현;김진영
    • 한국위성정보통신학회논문지
    • /
    • Vol. 5, No. 2
    • /
    • pp.64-68
    • /
    • 2010
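  • This paper studies a new synchronization algorithm for cooperative communication systems by applying it to various fading channels. The approach inserts a spreading code into the existing data frame to control data synchronization efficiently. M-sequences and PN (pseudo noise) sequences were used as the spreading codes. A fixed bit sequence is inserted into each frame, and the spreading code is extracted from the received data and correlated to determine the data delay. In the simulations, the DF (decode-and-forward) cooperative communication scheme was used, and Rayleigh, Rician, and Gaussian channel environments were each applied, with performance analyzed separately for each spreading code. The results of this paper can be applied to future research on cooperative communication systems.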
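
The correlation step described above can be sketched as follows. This is an illustration under assumptions, not the paper's simulation: the m-sequence comes from a 5-stage LFSR with a tap choice assumed here, the frame layout and payload bits are invented, and no fading or noise is modeled. The helper names `m_sequence` and `estimate_delay` are hypothetical.

```python
def m_sequence(taps, nbits):
    """Generate one period (2**nbits - 1 bits) of a Fibonacci LFSR output."""
    state = [1] + [0] * (nbits - 1)
    out = []
    for _ in range(2 ** nbits - 1):
        out.append(state[-1])
        feedback = 0
        for t in taps:
            feedback ^= state[t - 1]  # XOR of the tapped stages (1-indexed)
        state = [feedback] + state[:-1]
    return out


def estimate_delay(received_bits, code_bits):
    """Return the offset where the sliding correlation with the code peaks."""
    rx = [1 - 2 * b for b in received_bits]  # map bits 0/1 to +1/-1
    ref = [1 - 2 * b for b in code_bits]
    best_offset, best_score = 0, float("-inf")
    for k in range(len(rx) - len(ref) + 1):
        score = sum(r * c for r, c in zip(rx[k:k + len(ref)], ref))
        if score > best_score:
            best_offset, best_score = k, score
    return best_offset


# Usage: a 31-bit code embedded after 20 payload bits (illustrative frame layout).
code = m_sequence(taps=(5, 3), nbits=5)
frame = [0, 1] * 10 + code + [1, 0] * 8
delay = estimate_delay(frame, code)
```

In a noise-free frame the correlation reaches its maximum (the code length) only where the received window matches the code exactly, so the peak offset gives the delay; over a fading channel the peak would be degraded and a detection threshold would be needed.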