• 제목/요약/키워드: sequence of sets

검색결과 360건 처리시간 0.031초

INSTABILITY OF THE BETTI SEQUENCE FOR PERSISTENT HOMOLOGY AND A STABILIZED VERSION OF THE BETTI SEQUENCE

  • JOHNSON, MEGAN;JUNG, JAE-HUN
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • 제25권4호
    • /
    • pp.296-311
    • /
    • 2021
  • Topological Data Analysis (TDA), a relatively new field of data analysis, has proved very useful in a variety of applications. The main persistence tool from TDA is persistent homology in which data structure is examined at many scales. Representations of persistent homology include persistence barcodes and persistence diagrams, both of which are not straightforward to reconcile with traditional machine learning algorithms as they are sets of intervals or multisets. The problem of faithfully representing barcodes and persistent diagrams has been pursued along two main avenues: kernel methods and vectorizations. One vectorization is the Betti sequence, or Betti curve, derived from the persistence barcode. While the Betti sequence has been used in classification problems in various applications, to our knowledge, the stability of the sequence has never before been discussed. In this paper we show that the Betti sequence is unstable under the 1-Wasserstein metric with regards to small perturbations in the barcode from which it is calculated. In addition, we propose a novel stabilized version of the Betti sequence based on the Gaussian smoothing seen in the Stable Persistence Bag of Words for persistent homology. We then introduce the normalized cumulative Betti sequence and provide numerical examples that support the main statement of the paper.

한국인 화자의 영어 발음 오류에 관한 음향적 분석 - 비음화와 설측음화를 중심으로 - (Acoustic Analysis of Koreans' Production Errors in English - with reference to nasalization and lateralization)

  • 김미혜;강선미;김기호
    • 음성과학
    • /
    • 제15권3호
    • /
    • pp.53-63
    • /
    • 2008
  • This paper examined the acoustic differences in English speech production between English native speakers and Korean learners. Korean speakers seem to produce errors by over-applying the Korean phonological rules(nasalization and lateralization) to English speech under the conditions comparable to those of Korean which contain nasal+lateral or lateral+nasal sequences. Being based on this prediction, the experimental data is grouped into three sets, [n]+[l] sequence, [l]+[n]sequence, and [m]+[l] sequence. The result shows that, Korean speakers usually nasalize or lateralize the target words or phrases in every three categories while English natives don't. In set A([n]+[l] sequence), both nasalization and lateralization were found in [n]+[l] sequence, the same circumstances where both nasalization and lateralization can be placed as in Korean. In the case of set B([l]+[n] sequence), only lateralization is observed. It is because the nasalization never occurs in the sequence of l-n in Korean. There is no lateralization in set C([m]+[l] sequence), because only nasalization occurs in the sequence of m-l in Korean. This results reconfirmed that the nasalization and lateralization rules in Korean deeply influence on the English production data. Korean speakers need to be taught not to over-apply Korean phonological rule to English production for accurate pronunciation.

  • PDF

cDNA 마이크로어레이에서 유전자간 상관 관계에 대한 보고 (A Report on the Inter-Gene Correlations in cDNA Microarray Data Sets)

  • 김병수;장지선;김상철;임요한
    • 응용통계연구
    • /
    • 제22권3호
    • /
    • pp.617-626
    • /
    • 2009
  • 최근에 보고되는 일련의 연구는 Affymetrix 마이크로어레이 자료에서 유전자간 상관관계가 강하고 장범위(長範圍)(long-ranged)로 나타나고 있으며, 기존의 "편한" 가정, 즉 유전자간 상관관계가 매우 약하며, 따라서 유전자간 유사 독립성을 가정할 수 있다는 주장이 비현실적이라는 것을 보고하고 있다. Qui 등 (2005b)은 각 유전자의 검정통계량을 병합하여 통계적 추론을 하는 이른바 비모수적 경험적 베이즈 방법을 적용하면 검색된 특이발현 유전자수의 분산이 커진다는 것을 보고하고 있고, 이러한 분산의 불안전성 이유로서 유전자간 강한 상관관계를 지적하고 있다. 또한 Klebanov와 Yakovlev (2007)는 유전자간 상관관계가 통계적 분석을 어렵게 하는 요인이라기 보다는 유용한 정보의 원천이고 적정한 변환을 통하여 근사 독립을 유지할 수 있는 급수를 만들 수 있으며 이 급수를 ${\delta}$-급수라고 불렀다. 본 보고에서는 국내에서 생산된 2조의 cDNA 마이크로어레이 자료에서 유전자간 상관관계가 비교적 강하며, 장범위(長範圍)로 나타나는 것을 확인하며, 유사 독립성을 전제할 수 있는 ${\delta}$-급수가 cDNA 마이크로어레이에서도 발견되는 것을 보고하고자 한다, 동 보고는 추후 cDNA 마이크로어레이 자료의 분석에서도 유전자간 상관관계를 고려하여야 함을 강조하고 있다.

Operations of fuzzy bags

  • Kim, Kyung-Soo;Miyamoto, Sadaaki
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1996년도 Proceedings of the Korea Automatic Control Conference, 11th (KACC); Pohang, Korea; 24-26 Oct. 1996
    • /
    • pp.28-31
    • /
    • 1996
  • A bag is a set-like entity which can contain repeated elements. Fuzzy bags have been studied by Yager, who defined their basic relations and operations. However, his definitions of the basic relations and operations are inconsistent with the corresponding relations and operations for ordinary fuzzy sets. The present paper presents new basic relations and operations of fuzzy bags using a grade sequence for each element of the universal set. Moreover the .alpha.-cut, t-norms, the extension principle, and the composition of fuzzy bag relations are described.

  • PDF

SRLS을 이용한 파라미터 추정에 관한 연구 (A study on the parameter estimate using selective recursive least square)

  • 유치형;이재하;정찬수
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1989년도 한국자동제어학술회의논문집; Seoul, Korea; 27-28 Oct. 1989
    • /
    • pp.441-444
    • /
    • 1989
  • This correspondence presents a recursive estimation algorithm which, unlike conventional ones; updates the estimates only when a sufficient improvement can be obtained with a bounded noise assumption, the resulting sequence of estimates is a sequence of convex sets(ellipsoids) in the parameter space. For the cases studied, the algorithm use less than 20 percent of the. data to update, the estimate and still acquired good accuracy for spectral estimation.

  • PDF

Establishment of Quantitative Analysis Method for Genetically Modified Maize Using a Reference Plasmid and Novel Primers

  • Moon, Gi-Seong;Shin, Weon-Sun
    • Preventive Nutrition and Food Science
    • /
    • 제17권4호
    • /
    • pp.274-279
    • /
    • 2012
  • For the quantitative analysis of genetically modified (GM) maize in processed foods, primer sets and probes based on the 35S promoter (p35S), nopaline synthase terminator (tNOS), p35S-hsp70 intron, and zSSIIb gene encoding starch synthase II for intrinsic control were designed. Polymerase chain reaction (PCR) products (80~101 bp) were specifically amplified and the primer sets targeting the smaller regions (80 or 81 bp) were more sensitive than those targeting the larger regions (94 or 101 bp). Particularly, the primer set 35F1-R1 for p35S targeting 81 bp of sequence was even more sensitive than that targeting 101 bp of sequence by a 3-log scale. The target DNA fragments were also specifically amplified from all GM labeled food samples except for one item we tested when 35F1-R1 primer set was applied. A reference plasmid pGMmaize (3 kb) including the smaller PCR products for p35S, tNOS, p35S-hsp70 intron, and the zSSIIb gene was constructed for real-time PCR (RT-PCR). The linearity of standard curves was confirmed by using diluents ranging from $2{\times}10^1{\sim}10^5$ copies of pGMmaize and the $R^2$ values ranged from 0.999~1.000. In the RT-PCR, the detection limit using the novel primer/probe sets was 5 pg of genomic DNA from MON810 line indicating that the primer sets targeting the smaller regions (80 or 81 bp) could be used for highly sensitive detection of foreign DNA fragments from GM maize in processed foods.

다중변수 순차층서 모델 개발을 통한 울릉분지 남서부 대륙주변부의 층서연구 (Muti-variable Sequence Stratigraphic Model and its Application to Shelf-Slope System of the Southwestern Ulleung Basin Margin)

  • 윤석훈;박세진;조성권
    • 한국석유지질학회지
    • /
    • 제5권1_2호
    • /
    • pp.36-47
    • /
    • 1997
  • 본 연구에서는 기존 Exxon group의 순차층서 모델이 갖고 있는 적용의 한계성을 극복하기 위하여 다중변수 순차층서 모델을 제시하였다. 이 모델에서는 2~3차수의 주기성을 갖고 변화되는 범세계적 해수면 변동, 지구조운동, 퇴적물 공급 등의 영향요소를 모두 변수로 간주하며, 이들 요소에 의해 순차층의 내부구성과 경계면 타입이 결정된다. 순차층을 구성하는 기본 단위로 해석적 층서단위인 퇴적계 연합체(systems tract)대신, 내부 지층의 집적형태에 의해 구분되는 기술적 층서단위인 부순차층세트(parasequence set)를 채택하였으며, 부순차층세트의 타입은 상대적 해수면 변동과 퇴적물 공급속도의 변화를 반영한다. 따라서 본 순차층서 모델은, 네 가지 타입의 상대적 해수면 변동과 세 등급의 퇴적물 공급 변화 사이의 조합으로써 예상되는 단일 또는 $2{\~}4$개의 부순차층세트로 구성된 7가지 타입의 순차층을 보여준다. 다중변수 순차층서 모델을 울릉분지 남서부 대륙주변부에 적용한 결과, 순차층의 타입은 울릉분지의 형성과 닫힘에 따른 3단계의 분지주변부 지구조 운동의 변화를 잘 반영한다. 즉, 순차층을 구성하는 부순차층세트 조합의 변화와 울릉분지 지구조 운동의 타입과 속도가 시기적으로 일치 한다. 해저면의 급격한 침강을 동반한 울릉분지의 후열도 열개 동안에는 분지의 급격 한 침강으로 인해 상대적 인 해수면은 지속적으로 상승되었고, 많은 양의 퇴적물이 공급되어 1개의 부순차층세트로 구성된 순차층이 형성되었다. 이후, 압축성 변형을 동반한 울릉분지의 닫힘 동안에는, 분지 침강이 느려지고 국지적으로 융기가 일어나 하강 또는 상승우세의 상대적 해수면 변동이 야기되었고, 또한 지구조적 요인에 의해 통제되는 퇴적물 집적속도의 지역적인 차이로 인하여 변형대와 비변형대에서 상이한 순차층이 형성되었다. 분지 닫힘단계 이후에는 다시 해저면이 전반적으로 침강되면서, 상승이 우세한 상대적 해수면 변동과 보통 또는 빠른 퇴적물 집적속도가 결합하여 2-3개의 부순차층세트로 구성된 순차층을 형성하였다.

  • PDF

Centralized Kalman Filter with Adaptive Measurement Fusion: its Application to a GPS/SDINS Integration System with an Additional Sensor

  • Lee, Tae-Gyoo
    • International Journal of Control, Automation, and Systems
    • /
    • 제1권4호
    • /
    • pp.444-452
    • /
    • 2003
  • An integration system with multi-measurement sets can be realized via combined application of a centralized and federated Kalman filter. It is difficult for the centralized Kalman filter to remove a failed sensor in comparison with the federated Kalman filter. All varieties of Kalman filters monitor innovation sequence (residual) for detection and isolation of a failed sensor. The innovation sequence, which is selected as an indicator of real time estimation error plays an important role in adaptive mechanism design. In this study, the centralized Kalman filter with adaptive measurement fusion is introduced by means of innovation sequence. The objectives of adaptive measurement fusion are automatic isolation and recovery of some sensor failures as well as inherent monitoring capability. The proposed adaptive filter is applied to the GPS/SDINS integration system with an additional sensor. Simulation studies attest that the proposed adaptive scheme is effective for isolation and recovery of immediate sensor failures.

An Efficient Video Retrieval Algorithm Using Key Frame Matching for Video Content Management

  • Kim, Sang Hyun
    • International Journal of Contents
    • /
    • 제12권1호
    • /
    • pp.1-5
    • /
    • 2016
  • To manipulate large video contents, effective video indexing and retrieval are required. A large number of video indexing and retrieval algorithms have been presented for frame-wise user query or video content query whereas a relatively few video sequence matching algorithms have been proposed for video sequence query. In this paper, we propose an efficient algorithm that extracts key frames using color histograms and matches the video sequences using edge features. To effectively match video sequences with a low computational load, we make use of the key frames extracted by the cumulative measure and the distance between key frames, and compare two sets of key frames using the modified Hausdorff distance. Experimental results with real sequence show that the proposed video sequence matching algorithm using edge features yields the higher accuracy and performance than conventional methods such as histogram difference, Euclidean metric, Battachaya distance, and directed divergence methods.

Correlation Analysis between Regulatory Sequence Motifs and Expression Profiles by Kernel CCA

  • Rhee, Je-Keun;Joung, Je-Gun;Chang, Jeong-Ho;Zhang, Byoung-Tak
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.63-68
    • /
    • 2005
  • Transcription factors regulate gene expression by binding to gene upstream region. Each transcription factor has the specific binding site in promoter region. So the analysis of gene upstream sequence is necessary for understanding regulatory mechanism of genes, under a plausible idea that assumption that DNA sequence motif profiles are closely related to gene expression behaviors of the corresponding genes. Here, we present an effective approach to the analysis of the relation between gene expression profiles and gene upstream sequences on the basis of kernel canonical correlation analysis (kernel CCA). Kernel CCA is a useful method for finding relationships underlying between two different data sets. In the application to a yeast cell cycle data set, it is shown that gene upstream sequence profile is closely related to gene expression patterns in terms of canonical correlation scores. By the further analysis of the contributing values or weights of sequence motifs in the construction of a pair of sequence motif profiles and expression profiles, we show that the proposed method can identify significant DNA sequence motifs involved with some specific gene expression patterns, including some well known motifs and those putative, in the process of the yeast cell cycle.

  • PDF