• Title/Summary/Keyword: Data Sequence


Digital Processing for Multichannel Seismic Data(I) -Marine Reflection Data Processing- (다중채널 탄성파 탐사자료의 전산처리(I) - 해양반사파 자료처리 -)

  • 김기영;홍종국;주형태
    • The Journal of Engineering Geology / v.1 no.1 / pp.85-108 / 1991
  • Marine seismic processing involves large volumes of data, several specialized processing steps, and many parameters to be chosen at each step. In general, an adequate processing sequence and optimum parameters are obtained through test processing on a sample data set that represents the whole survey. The sequence and parameters are then applied to the entire data set. In this paper, the optimum processing sequence and parameters for data acquired on the Korean continental shelf are examined through test processing with real data. Finally, a good-quality migrated section is produced using the sequence and parameters chosen on the basis of the test results.


Efficient Implementation of a Pseudorandom Sequence Generator for High-Speed Data Communications

  • Hwang, Soo-Yun;Park, Gi-Yoon;Kim, Dae-Ho;Jhang, Kyoung-Son
    • ETRI Journal / v.32 no.2 / pp.222-229 / 2010
  • A conventional pseudorandom sequence generator produces only 1 bit of data per clock cycle, which may cause a delay in data communications. In this paper, we propose an efficient implementation method for a pseudorandom sequence generator with parallel outputs. Using simple matrix multiplications, we derive a well-organized recursive formula and realize a pseudorandom sequence generator with multiple outputs. Experimental results show that, although the total area of the proposed scheme is 3% to 13% larger than that of the existing scheme, our parallel architecture improves the throughput by 2, 4, and 6 times compared with the existing single-output scheme. In addition, we apply our approach to a $2 \times 2$ multiple-input multiple-output (MIMO) detector targeting the 3rd Generation Partnership Project Long Term Evolution (3GPP LTE) system. As a result, the throughput of the MIMO detector is significantly enhanced by parallel processing of data communications.
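
The idea of producing several bits per clock through matrix multiplication can be illustrated with a small linear feedback shift register (LFSR). The sketch below is not the authors' design: the 4-bit recurrence s(n+4) = s(n+1) + s(n) (mod 2), the function names, and the choice of output tap are all assumptions made for illustration. It shows that applying powers of the state-transition matrix over GF(2) yields k output bits per step and matches the bit-serial generator.

```python
# Illustrative sketch only: a toy 4-bit LFSR, not the generator from the paper.
# Assumed recurrence: s(n+4) = s(n+1) XOR s(n); state = [s(n), ..., s(n+3)].

# Companion (state-transition) matrix A over GF(2): next_state = A * state.
A = [
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
    [1, 1, 0, 0],
]


def mat_mul(a, b):
    """Multiply two binary matrices modulo 2."""
    return [[sum(a[i][t] & b[t][j] for t in range(len(b))) & 1
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_vec(a, v):
    """Multiply a binary matrix by a binary vector modulo 2."""
    return [sum(x & y for x, y in zip(row, v)) & 1 for row in a]


def serial_bits(state, n):
    """Reference generator: one output bit per step."""
    s, out = list(state), []
    for _ in range(n):
        out.append(s[0])
        s = s[1:] + [s[0] ^ s[1]]
    return out


def build_parallel(a, k):
    """Return (output matrix, jump matrix) for k bits per step.

    Row j of the output matrix is row 0 of A^j, which selects s(n+j);
    the jump matrix is A^k, which advances the state by k positions.
    """
    n = len(a)
    power = [[int(i == j) for j in range(n)] for i in range(n)]  # A^0
    out_rows = []
    for _ in range(k):
        out_rows.append(power[0])
        power = mat_mul(a, power)
    return out_rows, power


def parallel_bits(state, steps, out_rows, jump):
    """Generate len(out_rows) bits per step using the precomputed matrices."""
    s, out = list(state), []
    for _ in range(steps):
        out.extend(mat_vec(out_rows, s))
        s = mat_vec(jump, s)
    return out


if __name__ == "__main__":
    seed = [1, 0, 0, 0]
    out_rows, jump = build_parallel(A, 4)
    print(serial_bits(seed, 12))
    print(parallel_bits(seed, 3, out_rows, jump))
```

In hardware, each row of the two matrices would correspond to a small XOR network, which is consistent with the area increase the abstract reports for the parallel scheme.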

A Pattern Summary System Using BLAST for Sequence Analysis

  • Choi, Han-Suk;Kim, Dong-Wook;Ryu, Tae-W.
    • Genomics & Informatics / v.4 no.4 / pp.173-181 / 2006
  • Pattern finding is one of the important tasks in protein or DNA sequence analysis, and alignment is a widely used technique for it. BLAST (Basic Local Alignment Search Tool) is one of the most popular tools in bioinformatics for exploring available DNA or protein sequence databases. BLAST may generate a huge output for large sequence data that contain various sequence patterns. However, BLAST does not provide a tool to summarize and analyze the patterns or matched alignments in its output file, and it lacks general and robust parsing tools to extract the essential information from that output. This paper presents a pattern summary system, a comprehensive tool for discovering pattern structures in the large amounts of sequence data found in BLAST output. The pattern summary system can identify clusters of patterns, extract the cluster pattern sequences from the BLAST subject database, and display the clusters graphically to show their distribution in the subject database.

Efficient Stream Sequence Matching Algorithms for Handheld Devices over Time-Series Stream Data (시계열 스트림 데이터 상에서 핸드헬드 디바이스를 위한 효율적인 스트림 시퀀스 매칭 알고리즘)

  • Moon Yang-Sae;Loh Woong-Kee
    • The Journal of Korean Institute of Communications and Information Sciences / v.31 no.8B / pp.736-744 / 2006
  • For handheld devices, minimizing repetitive CPU operations such as multiplications is a major factor in performance. In this paper, we propose efficient algorithms for finding similar sequences in streaming time-series data such as stock prices, network traffic data, and sensor network data. First, we formally define the problem of similar subsequence matching over streaming time-series data, which we call stream sequence matching. Second, based on the window construction mechanism adopted by previous subsequence matching algorithms, we present an efficient window-based approach that minimizes the CPU operations required for stream sequence matching. Third, we propose the notion of a window MBR and present two stream sequence matching algorithms based on it. Fourth, we formally prove the correctness of the proposed algorithms. Finally, through a series of analyses and experiments, we show that our algorithms significantly outperform the naive algorithm. We believe that our window-based algorithms are excellent choices for embedded stream sequence matching in handheld devices.
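
For orientation, the naive baseline that the abstract compares against might look like the sketch below. It is an assumption-laden illustration rather than the paper's method: the function name, the Euclidean distance threshold `eps`, and the early-abandoning check are choices made here, and the window MBR technique itself is not reproduced.

```python
from collections import deque


def stream_matches(stream, query, eps):
    """Yield the end index of every window within distance eps of the query.

    Naive baseline: each arriving value triggers a full comparison of the
    latest len(query) values against the query (Euclidean distance).
    """
    m = len(query)
    window = deque(maxlen=m)  # keeps only the most recent m values
    limit = eps * eps         # compare squared distances, avoiding sqrt
    for i, value in enumerate(stream):
        window.append(value)
        if len(window) < m:
            continue
        total = 0.0
        for w, q in zip(window, query):
            diff = w - q
            total += diff * diff  # one multiplication per compared element
            if total > limit:     # early abandoning: stop once over the limit
                break
        else:
            yield i


if __name__ == "__main__":
    data = [0, 1, 2, 3, 2, 1, 2, 3, 9]
    print(list(stream_matches(data, [1, 2, 3], eps=0.5)))
```

Every new stream value costs up to len(query) multiplications here, which is the kind of repetitive CPU work the window-based algorithms aim to reduce.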

Title Generation Model for which Sequence-to-Sequence RNNs with Attention and Copying Mechanisms are used (주의집중 및 복사 작용을 가진 Sequence-to-Sequence 순환신경망을 이용한 제목 생성 모델)

  • Lee, Hyeon-gu;Kim, Harksoo
    • Journal of KIISE / v.44 no.7 / pp.674-679 / 2017
  • In big-data environments where large numbers of text documents are produced daily, titles are important clues that let readers quickly grasp the key ideas of documents; however, many document types, such as blog articles and social-media messages, have no titles. In this paper, we propose a title-generation model that uses sequence-to-sequence RNNs with attention and copying mechanisms. In the proposed model, input sentences are encoded by bi-directional GRU (gated recurrent unit) networks, and the title words are generated by decoding the encoded sentences together with keywords that are automatically selected from the input sentences. In experiments with 93,631 training documents and 500 test documents, the attention mechanism performed better (ROUGE-1: 0.1935, ROUGE-2: 0.0364, ROUGE-L: 0.1555) than the copying mechanism; it also performed better in the qualitative evaluation.
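
The ROUGE-1 and ROUGE-2 figures above are n-gram overlap scores between a generated title and a reference title. A minimal sketch of how such a score can be computed follows; whitespace tokenization and the function name `rouge_n` are assumptions, and the abstract does not say whether recall, precision, or F1 was reported, so all three are returned.

```python
from collections import Counter


def rouge_n(candidate, reference, n=1):
    """Return (recall, precision, f1) of clipped n-gram overlap.

    Assumes simple whitespace tokenization; real evaluations usually apply
    additional normalization that is not modeled here.
    """
    def ngrams(text):
        tokens = text.split()
        return Counter(tuple(tokens[i:i + n])
                       for i in range(len(tokens) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    overlap = sum((cand & ref).values())  # counts clipped to the smaller side
    recall = overlap / max(sum(ref.values()), 1)
    precision = overlap / max(sum(cand.values()), 1)
    f1 = 0.0 if overlap == 0 else 2 * precision * recall / (precision + recall)
    return recall, precision, f1


if __name__ == "__main__":
    print(rouge_n("the cat sat", "the cat sat on the mat", n=1))
    print(rouge_n("the cat sat", "the cat sat on the mat", n=2))
```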

A New Galaxy Classification Scheme in the WISE Color-Luminosity Diagram

  • Lee, Gwang-Ho;Sohn, Jubee;Lee, Myung Gyoon
    • The Bulletin of The Korean Astronomical Society / v.38 no.2 / pp.49.1-49.1 / 2013
  • We present a new galaxy classification scheme in the Wide-field Infrared Survey Explorer (WISE) [$3.4\,\mu$m]-[$12\,\mu$m] color versus $12\,\mu$m luminosity diagram. In this diagram, galaxies can be classified into three groups at different evolutionary stages. Late-type galaxies are distributed linearly along the 'MIR star-forming sequence' identified by Hwang et al. (2012). Some early-type galaxies form another sequence at [3.4]-[12] (AB) $\simeq -2.0$, which we call the 'MIR blue sequence'. They are quiescent systems with stellar populations older than 10 Gyr. Between the MIR star-forming sequence and the MIR blue sequence, some early- and late-type galaxies are sparsely distributed, and we call these 'MIR green cloud galaxies'. Interestingly, both MIR blue sequence galaxies and MIR green cloud galaxies lie on the red sequence in the optical color-magnitude diagram. However, MIR green cloud galaxies have lower stellar masses and younger stellar populations (smaller $D_n4000$) than MIR blue sequence galaxies, suggesting that they are in a transition stage from MIR star-forming sequence galaxies to MIR blue sequence galaxies. We present differences in various galaxy properties among the three MIR classes using multi-wavelength data, combining WISE with Sloan Digital Sky Survey Data Release 10, for local (0.03 < z < 0.07) galaxies.


DEVELOPMENT OF XML BASED PERSONALIZED DATABASE MANAGEMENT SYSTEM FOR BIOLOGISTS

  • Cho Kyung Hwan;Jung Kwang Su;Kim Sun Shin;Ryu Keun Ho
    • Proceedings of the KSRS Conference / 2005.10a / pp.770-773 / 2005
  • In most biological laboratories, sequences from sequencing machines are stored on disk as simple files. Storing and managing sequence data this way makes it hard to maintain consistency and integrity, for example because redundant files accumulate. A system that integrates and manages genome data with consistency and integrity is needed for accurate sequence analysis. Therefore, in this paper, we not only store gene and protein sequence data obtained through sequencing but also manage them. We also build an integrated schema for transforming file formats and design a database system that uses it. Because the integrated schema is designed in BSML, an XSL style language can be applied to it. This allows conversion among heterogeneous sequence formats.


An Efficient Algorithm for Mining Interactive Communication Sequence Patterns (대화형 통신 순서열 패턴의 마이닝을 위한 효율적인 알고리즘)

  • Haam, Deok-Min;Song, Ji-Hwan;Kim, Myoung-Ho
    • Journal of KIISE:Databases / v.36 no.3 / pp.169-179 / 2009
  • Communication log data consist of communication events such as sending and receiving e-mail or instant messages and visiting web sites. Many jurisdictions, including the USA and the EU, require communication service providers to retain these data for the purpose of investigating or detecting criminals on the Internet. Because the retained data are very large, Law Enforcement Authorities need an efficient method for extracting valuable information from them. This paper defines Interactive Communication Sequence Patterns (ICSPs), which are important information when each communication event in the log data consists of the sender, receiver, and timestamp of the event. We also define a mining problem for discovering such patterns and propose a method called Fast Discovering Interactive Communication Sequence Pattern (FDICSP) to solve it. FDICSP exploits the characteristics of ICS to reduce the search space when it finds longer sequences from shorter ones. Thus, FDICSP can find Interactive Communication Sequence Patterns efficiently.

Dummy Sequence Insertion for PAPR Reduction of OFDM Communication System (OFDM 통신시스템의 PAPR 저감을 위한 더미 시퀀스 삽입)

  • 이재은;유흥균;정영호;함영권
    • The Journal of Korean Institute of Electromagnetic Engineering and Science / v.14 no.12 / pp.1239-1247 / 2003
  • OFDM (orthogonal frequency division multiplexing) is very attractive for high-data-rate transmission over frequency-selective fading channels. Because OFDM has a high PAPR (peak-to-average power ratio), the OFDM signal may be distorted by a nonlinear HPA (high power amplifier). In this paper, we propose the DSI (dummy sequence insertion) method for OFDM communication systems. Some sub-carriers are inserted for PAPR reduction. They carry a specified dummy data sequence that is used only for PAPR reduction and does not act as side information, unlike in the conventional PTS (partial transmit sequence) or SLM (selected mapping) methods. We use the complementary sequence and a combination of correlation sequences as the dummy sequence. A flipping technique is used with the DSI method to obtain effective PAPR reduction. Importantly, the BER of the proposed method is independent of damage to the dummy data sequence. The DSI method also has better spectral efficiency than conventional block coding. In addition, a threshold PAPR technique is applied to cut down the processing time. However, the DSI method does not outperform the conventional PTS method in PAPR reduction. With a threshold PAPR set lower than the PAPR of the OFDM signal, the DSI method reduces the processing time and improves the BER performance.
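
To make the PAPR quantity concrete, the sketch below computes the PAPR of one OFDM symbol and then picks, by brute force, the dummy values that minimize it. This is a simplified stand-in, not the paper's method: the paper uses complementary and correlation sequences with a flipping technique, whereas the exhaustive search, the placement of dummy sub-carriers after the data sub-carriers, the ±1 dummy alphabet, and all function names here are assumptions. No oversampling is applied, so the value only approximates the continuous-time PAPR.

```python
import cmath
import itertools
import math


def idft(symbols):
    """Inverse DFT: map frequency-domain sub-carrier symbols to time samples."""
    n = len(symbols)
    return [sum(x * cmath.exp(2j * cmath.pi * k * t / n)
                for k, x in enumerate(symbols)) / n
            for t in range(n)]


def papr_db(symbols):
    """PAPR in dB: peak instantaneous power over mean power of the time signal.

    Assumes at least one non-zero symbol, so the mean power is non-zero.
    """
    powers = [abs(v) ** 2 for v in idft(symbols)]
    return 10 * math.log10(max(powers) / (sum(powers) / len(powers)))


def best_dummy(data, n_dummy):
    """Try every +/-1 dummy sequence and return (lowest PAPR in dB, dummy).

    Hypothetical layout: dummy sub-carriers are appended after the data.
    """
    best = None
    for dummy in itertools.product((1, -1), repeat=n_dummy):
        score = papr_db(list(data) + list(dummy))
        if best is None or score < best[0]:
            best = (score, dummy)
    return best


if __name__ == "__main__":
    data = [1, 1, 1, 1, 1, 1]            # worst case: identical data symbols
    print(papr_db(data + [1, 1]))        # PAPR with an arbitrary dummy pair
    print(best_dummy(data, 2))           # PAPR after choosing the best dummies
```

Because the receiver can simply discard the dummy sub-carriers, no side information about the chosen sequence has to be transmitted, which matches the contrast with PTS and SLM drawn in the abstract.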

Acoustic Analysis of Koreans' Production Errors in English - with reference to nasalization and lateralization (한국인 화자의 영어 발음 오류에 관한 음향적 분석 - 비음화와 설측음화를 중심으로 -)

  • Kim, Mi-Hye;Kang, Sun-Mi;Kim, Kee-Ho
    • Speech Sciences / v.15 no.3 / pp.53-63 / 2008
  • This paper examines the acoustic differences in English speech production between native English speakers and Korean learners. Korean speakers appear to produce errors by over-applying Korean phonological rules (nasalization and lateralization) to English speech in contexts comparable to Korean ones that contain nasal+lateral or lateral+nasal sequences. Based on this prediction, the experimental data are grouped into three sets: [n]+[l] sequences, [l]+[n] sequences, and [m]+[l] sequences. The results show that Korean speakers usually nasalize or lateralize the target words or phrases in all three categories, while native English speakers do not. In set A ([n]+[l] sequence), both nasalization and lateralization were found, matching the environment in which both can apply in Korean. In set B ([l]+[n] sequence), only lateralization was observed, because nasalization never occurs in the l-n sequence in Korean. There was no lateralization in set C ([m]+[l] sequence), because only nasalization occurs in the m-l sequence in Korean. These results reconfirm that the Korean nasalization and lateralization rules strongly influence English production. Korean speakers need to be taught not to over-apply Korean phonological rules to English for accurate pronunciation.
