• Title/Summary/Keyword: Sequence-dependency

Search Result 51, Processing Time 0.02 seconds

n-Gram/2L: A Space and Time Efficient Two-Level n-Gram Inverted Index Structure (n-gram/2L: 공간 및 시간 효율적인 2단계 n-gram 역색인 구조)

  • Kim Min-Soo;Whang Kyu-Young;Lee Jae-Gil;Lee Min-Jae
    • Journal of KIISE:Databases
    • /
    • v.33 no.1
    • /
    • pp.12-31
    • /
    • 2006
  • The n-gram inverted index has two major advantages: language-neutral and error-tolerant. Due to these advantages, it has been widely used in information retrieval or in similar sequence matching for DNA and Protein databases. Nevertheless, the n-gram inverted index also has drawbacks: the size tends to be very large, and the performance of queries tends to be bad. In this paper, we propose the two-level n-gram inverted index (simply, the n-gram/2L index) that significantly reduces the size and improves the query performance while preserving the advantages of the n-gram inverted index. The proposed index eliminates the redundancy of the position information that exists in the n-gram inverted index. The proposed index is constructed in two steps: 1) extracting subsequences of length m from documents and 2) extracting n-grams from those subsequences. We formally prove that this two-step construction is identical to the relational normalization process that removes the redundancy caused by a non-trivial multivalued dependency. The n-gram/2L index has excellent properties: 1) it significantly reduces the size and improves the Performance compared with the n-gram inverted index with these improvements becoming more marked as the database size gets larger; 2) the query processing time increases only very slightly as the query length gets longer. Experimental results using databases of 1 GBytes show that the size of the n-gram/2L index is reduced by up to 1.9${\~}$2.7 times and, at the same time, the query performance is improved by up to 13.1 times compared with those of the n-gram inverted index.

Korean Semantic Role Labeling using Stacked Bidirectional LSTM-CRFs (Stacked Bidirectional LSTM-CRFs를 이용한 한국어 의미역 결정)

  • Bae, Jangseong;Lee, Changki
    • Journal of KIISE
    • /
    • v.44 no.1
    • /
    • pp.36-43
    • /
    • 2017
  • Syntactic information represents the dependency relation between predicates and arguments, and it is helpful for improving the performance of Semantic Role Labeling systems. However, syntax analysis can cause computational overhead and inherit incorrect syntactic information. To solve this problem, we exclude syntactic information and use only morpheme information to construct Semantic Role Labeling systems. In this study, we propose an end-to-end SRL system that only uses morpheme information with Stacked Bidirectional LSTM-CRFs model by extending the LSTM RNN that is suitable for sequence labeling problem. Our experimental results show that our proposed model has better performance, as compare to other models.

Simultaneous Analysis of Concentration and Flow Fields in A Stirred Tank Using Large Eddy Simulation (대형 와 모사를 사용한 혼합 탱크 내의 농도장과 유동장의 동시 해석)

  • Yoon, Hyun-Sik;Chun, Ho-Hwan;Ha, Man-Yeong
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.27 no.9
    • /
    • pp.1282-1289
    • /
    • 2003
  • Transport of a scalar quantity, such as chemical concentration or temperature, is important in many engineering applications and environmental flows. Here we report on results obtained from the large eddy simulations of flow and concentration fields inside the tank performed using a spectral multi-domain technique. The computations were driven by specifying the impeller-induced flow at the blade tip radius (Yoon et al.). This study focused on the concentration development at different molecular diffusivities in a stirred tank operated under turbulent conditions. The main objective of the work presented here is to study the large-scale mixing structure at different molecular diffusivities in a stirred tank by using the large eddy simulation. The time sequence of concentration and flow fields shows the flow dependency of the concentration development. The presence of spatial inhomogenieties is detailed by observing the time variation oflocal concentration at different positions.

Adaptive Reconstruction of Harmonic Time Series Using Point-Jacobian Iteration MAP Estimation and Dynamic Compositing: Simulation Study

  • Lee, Sang-Hoon
    • Korean Journal of Remote Sensing
    • /
    • v.24 no.1
    • /
    • pp.79-89
    • /
    • 2008
  • Irregular temporal sampling is a common feature of geophysical and biological time series in remote sensing. This study proposes an on-line system for reconstructing observation image series contaminated by noises resulted from mechanical problems or sensing environmental condition. There is also a high likelihood that during the data acquisition periods the target site corresponding to any given pixel may be covered by fog or cloud, thereby resulting in bad or missing observation. The surface parameters associated with the land are usually dependent on the climate, and many physical processes that are displayed in the image sensed from the land then exhibit temporal variation with seasonal periodicity. A feedback system proposed in this study reconstructs a sequence of images remotely sensed from the land surface having the physical processes with seasonal periodicity. The harmonic model is used to track seasonal variation through time, and a Gibbs random field (GRF) is used to represent the spatial dependency of digital image processes. The experimental results of this simulation study show the potentiality of the proposed system to reconstruct the image series observed by imperfect sensing technology from the environment which are frequently influenced by bad weather. This study provides fundamental information on the elements of the proposed system for right usage in application.

Access efficiency of small sized files in Big Data using various Techniques on Hadoop Distributed File System platform

  • Alange, Neeta;Mathur, Anjali
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.7
    • /
    • pp.359-364
    • /
    • 2021
  • In recent years Hadoop usage has been increasing day by day. The need of development of the technology and its specified outcomes are eagerly waiting across globe to adopt speedy access of data. Need of computers and its dependency is increasing day by day. Big data is exponentially growing as the entire world is working in online mode. Large amount of data has been produced which is very difficult to handle and process within a short time. In present situation industries are widely using the Hadoop framework to store, process and produce at the specified time with huge amount of data that has been put on the server. Processing of this huge amount of data having small files & its storage optimization is a big problem. HDFS, Sequence files, HAR, NHAR various techniques have been already proposed. In this paper we have discussed about various existing techniques which are developed for accessing and storing small files efficiently. Out of the various techniques we have specifically tried to implement the HDFS- HAR, NHAR techniques.

On the use of time-dependent success criteria within risk-informed analyses. Application to LONF-ATWS sequences in PWR reactors

  • Jorge Sanchez-Torrijos;Cesar Queral;Carlos Paris;Maria Jose Rebollo;Miguel Sanchez-Perea;Jose Maria Posada
    • Nuclear Engineering and Technology
    • /
    • v.54 no.12
    • /
    • pp.4601-4619
    • /
    • 2022
  • The classical Probabilistic Safety Analysis (PSA) does not include any time dependence explicitly. However, the success criteria (SC) could evolve during the cycle for some initiating events. In that sense, there is a type of sequence in which this time-dependency is quite important, the family of Anticipated Transient without Scram (ATWS) sequences in Pressurized Water Reactors. Therefore, a new risk-informed approach is proposed in this paper, which makes it possible to obtain the time-dependent SC evolution of the safety functions affected by the Moderator Temperature Coefficient (MTC) value. Then, the evolution of the ATWS conditional core damage probability (CCDP) could be obtained using a PSA model. To quantify the CCDP, the average values of the time-dependent failure probabilities must be computed. Finally, the comparison between the CCDP obtained through the application of the classical PSA approach and the new one makes it possible to quantify the impact of time-dependence on the SC of the headers that this new risk-informed ATWS approach can provide.

Multi-hazard vulnerability modeling: an example of wind and rain vulnerability of mid/high-rise buildings during hurricane events

  • Zhuoxuan Wei;Jean-Paul Pinelli;Kurtis Gurley;Shahid Hamid
    • Wind and Structures
    • /
    • v.38 no.5
    • /
    • pp.355-366
    • /
    • 2024
  • Severe natural multi-hazard events can cause damage to infrastructure and economic losses of billions of dollars. The challenges of modeling these losses include dependency between hazards, cause and sequence of loss, and lack of available data. This paper presents and explores multi-hazard loss modeling in the context of the combined wind and rain vulnerability of mid/high-rise buildings during hurricane events. A component-based probabilistic vulnerability model provides the framework to test and contrast two different approaches to treat the multi-hazards: In one, the wind and rain hazard models are both decoupled from the vulnerability model. In the other, only the wind hazard is decoupled, while the rain hazard model is embedded into the vulnerability model. The paper presents the mathematical and conceptual development of each approach, example outputs from each for the same scenario, and a discussion of weaknesses and strengths of each approach.

Applicability Evaluation of Modified Overlay Model on the Cyclic Behavior of 316L Stainless Steel at Room Temperature (316L 스테인리스강의 상온 반복 거동에 대한 수정 다층 모델의 적용성 검토)

  • Lim Jae-Yong;Lee Soon-Bok
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.28 no.10
    • /
    • pp.1603-1611
    • /
    • 2004
  • The validity of 'modified overlay model' to describe the cyclic behavior of annealed 316L stainless steel at room temperature was investigated. Material parameters(~f$_{i}$, m$_{i}$b, η, E) fur the model were obtained through constant strain amplitude test. The strain amplitude dependency of elastic limit and cyclic hardening, which were the characteristics of this model, were considered. Eight subelements were used to describe the nonlinearity of the hysteresis loops. The calculated hysteresis curve in each condition (0.5%, 0.7%, 0.9% train amplitude test) was very close to the experimental one. Two tests, incremental step test and 5-step test, ere performed to check the validity of 'modified overlay model'. The elastic limit was saturated to the one of the highest strain amplitudes of the block in the incremental step test, so it seemed to be Masing material at the stabilized block. Cyclic hardening was successfully described in the increasing sequence of the strain amplitude in 5-step test. But, the slight cyclic softening followed by higher strain amplitude would not be able to simulate by'modified overlay model'. However, the discrepancy induced was very small between the calculated hystereses and the experimental ones. In conclusion,'Modified overlay model'was proved to be appropriate in strain range of 0.35%~ 1.0%..0%.

Influence of Stiffness Coefficients on Optical Performance in Composite Optical Substrate (강성계수가 복합재 광학판 성능에 미치는 영향성 연구)

  • Kim, Kyung-Pyo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.18 no.11
    • /
    • pp.762-769
    • /
    • 2017
  • The extensional stiffness in quasi-isotropic laminates is uniform in the radial direction, but the bending stiffness varies radially due to the stacking sequence. This paper addresses the directional dependency of the bending stiffness and its radial variation in three types of quasi-isotropic laminate reflectors consisting of unidirectional fiber composite materials (UDM) and randomly distributed composite materials (short fiber, RDM). The extensional stiffness and bending stiffness in optical reflectors using RDM are uniform, while the bending stiffness in those using UDM varies radially from 11% to 26%. Also, the stiffness sensitivity, such as the bend-twist or bend-torsion effect, due to the differences in the stiffness value in the composite, is large. These factors are problematic in the optical field requiring precision surfaces. Utilizing RDM might be one way to eliminate the presence of bending stiffness in composite mirror substrates.

A Two-Phase Component Identification Method using Static and Dynamic Relationship between Classes (클래스들 간의 정적ㆍ동적 관계에 의한 2단계 컴포넌트 식별방법)

  • Choi Mi-Sook;Cho Eun-Sook;Park Jai-Nyun;Ha Jong-Sung
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.11 no.1
    • /
    • pp.1-14
    • /
    • 2005
  • It is difficult to identify reusable and independent components in component-based development(CBD) process. Therefore existing methodologies have dealt the problem of component identification based on only developer's intuition and heuristics. As a result, it is difficult to identify the business components by common developers. Therefore, in this paper, we propose a new baseline and technique to identify the business components based on domain model such as use case diagrams, class diagrams, and sequence diagrams. proposed method identifies components through two phases; system component identification and business component identification. Especially, we consider structural characteristics as well as dependency characteristics according to methods call types and directions in identifying components. We also present a case study and comparative analysis and assessment to prove the practical use of our technique.