• Title/Summary/Keyword: random vectors

Search Result 150, Processing Time 0.033 seconds

Bayesian Multiple Change-Point Estimation and Segmentation

  • Kim, Jaehee;Cheon, Sooyoung
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.6
    • /
    • pp.439-454
    • /
    • 2013
  • This study presents a Bayesian multiple change-point detection approach to segment and classify the observations that no longer come from an initial population after a certain time. Inferences are based on the multiple change-points in a sequence of random variables where the probability distribution changes. Bayesian multiple change-point estimation is classifies each observation into a segment. We use a truncated Poisson distribution for the number of change-points and conjugate prior for the exponential family distributions. The Bayesian method can lead the unsupervised classification of discrete, continuous variables and multivariate vectors based on latent class models; therefore, the solution for change-points corresponds to the stochastic partitions of observed data. We demonstrate segmentation with real data.

Sparse Kernel Regression using IRWLS Procedure

  • Park, Hye-Jung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.3
    • /
    • pp.735-744
    • /
    • 2007
  • Support vector machine(SVM) is capable of providing a more complete description of the linear and nonlinear relationships among random variables. In this paper we propose a sparse kernel regression(SKR) to overcome a weak point of SVM, which is, the steep growth of the number of support vectors with increasing the number of training data. The iterative reweighted least squares(IRWLS) procedure is used to solve the optimal problem of SKR with a Laplacian prior. Furthermore, the generalized cross validation(GCV) function is introduced to select the hyper-parameters which affect the performance of SKR. Experimental results are then presented which illustrate the performance of the proposed procedure.

  • PDF

Visualization of flowfield around Two Circular Cylinders by a Discrete Vortex Method (이산와법에 의한 2원주 주위의 유동장 가시화)

  • Ro Ki-Deok;Lee Young-Hoon;Son Yeong-Tae
    • 한국가시화정보학회:학술대회논문집
    • /
    • 2002.11a
    • /
    • pp.63-66
    • /
    • 2002
  • The Flow patterns around two cylinders in various arrangements were studied by a discrete vortex method. The flow for the surface of each cylinder was represented by arranging bound vortices at adequate intervals. The viscous diffusion of fluid was represented by the random walk method. The vortex distributions, streaklines, timelines and velocity vectors around two cylinders were calculated for centre-to-centre pitch rations of P/D=1.5 and 2.5, attack angles of $\alpha=0^{\circ},\;30^{\circ},\;60^{\circ},\;and\;90^{\circ}$, correspond to the photographs by flow visualization and the flow intereference between two cylinders in var ious arrangements was clearly visualized by a numerical simulation.

  • PDF

Estimation of high-dimensional sparse cross correlation matrix

  • Yin, Cao;Kwangok, Seo;Soohyun, Ahn;Johan, Lim
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.6
    • /
    • pp.655-664
    • /
    • 2022
  • On the motivation by an integrative study of multi-omics data, we are interested in estimating the structure of the sparse cross correlation matrix of two high-dimensional random vectors. We rewrite the problem as a multiple testing problem and propose a new method to estimate the sparse structure of the cross correlation matrix. To do so, we test the correlation coefficients simultaneously and threshold the correlation coefficients by controlling FRD at a predetermined level α. Further, we apply the proposed method and an alternative adaptive thresholding procedure by Cai and Liu (2016) to the integrative analysis of the protein expression data (X) and the mRNA expression data (Y) in TCGA breast cancer cohort. By varying the FDR level α, we show that the new procedure is consistently more efficient in estimating the sparse structure of cross correlation matrix than the alternative one.

Comparison of random forest classification performance of autism spectrum disorders according to different component ratios of the functional connectivity matrix and principal component vectors using neuroimaging (뇌기능영상기반 기능적 연결성 행렬의 서로 다른 성분 비율과 주성분 벡터에 따른 자폐 스펙트럼 장애의 랜덤 포레스트 분류성능 비교)

  • Choi, Hyoungshin;Park, Hyunjin
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.351-353
    • /
    • 2021
  • 자폐 스펙트럼 장애는 이질적인 신경 발달 장애로, 뇌기능영상에 기반한 기능적 연결성 행렬을 이용해 연구가 활발하게 진행된다. 기능적 연결성 행렬을 분석하기 위해 주성분 분석방법을 이용하며, 이를 통해 뇌의 기능적 경향성 패턴을 확인할 수 있다. 이 때, 서로 다른 연결성 성분 비율과 주성분 벡터를 이용해서 다양한 기능적 경향성 패턴을 얻을 수 있다. 패턴에 따른 랜덤 포레스트 분류 모델의 성능이 달라지는데 이를 비교해본 결과, 상위 50%의 성분을 이용하여 만든 기능적 경향성 패턴 1 이 데이터의 설명 비율도 높고, 우수한 분류 성능을 보이는 것을 확인했다.

A Study on the Signal Processing for Content-Based Audio Genre Classification (내용기반 오디오 장르 분류를 위한 신호 처리 연구)

  • 윤원중;이강규;박규식
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.6
    • /
    • pp.271-278
    • /
    • 2004
  • In this paper, we propose a content-based audio genre classification algorithm that automatically classifies the query audio into five genres such as Classic, Hiphop, Jazz, Rock, Speech using digital sign processing approach. From the 20 seconds query audio file, the audio signal is segmented into 23ms frame with non-overlapped hamming window and 54 dimensional feature vectors, including Spectral Centroid, Rolloff, Flux, LPC, MFCC, is extracted from each query audio. For the classification algorithm, k-NN, Gaussian, GMM classifier is used. In order to choose optimum features from the 54 dimension feature vectors, SFS(Sequential Forward Selection) method is applied to draw 10 dimension optimum features and these are used for the genre classification algorithm. From the experimental result, we can verify the superior performance of the proposed method that provides near 90% success rate for the genre classification which means 10%∼20% improvements over the previous methods. For the case of actual user system environment, feature vector is extracted from the random interval of the query audio and it shows overall 80% success rate except extreme cases of beginning and ending portion of the query audio file.

A New Low Power Scan BIST Architecture Based on Scan Input Transformation Scheme (스캔입력 변형기법을 통한 새로운 저전력 스캔 BIST 구조)

  • Son, Hyeon-Uk;Kim, You-Bean;Kang, Sung-Ho
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.45 no.6
    • /
    • pp.43-48
    • /
    • 2008
  • Power consumption during test can be much higher than that during normal operation since test vectors are determined independently. In order to reduce the power consumption during test process, a new BIST(Built-In Self Test) architecture is proposed. In the proposed architecture, test vectors generated by an LFSR(Linear Feedback Shift Resister) are transformed into the new patterns with low transitions using Bit Generator and Bit Dropper. Experiments performed on ISCAS'89 benchmark circuits show that transition reduction during scan testing can be achieved by 62% without loss of fault coverage. Therefore the new architecture is a viable solution for reducing both peak and average power consumption.

A Watermarking Method Based on the Informed Coding and Embedding Using Trellis Code and Entropy Masking (Trellis 부호 및 엔트로피 마스킹을 이용한 정보부호화 기반 워터마킹)

  • Lee, Jeong-Hwan
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.13 no.12
    • /
    • pp.2677-2684
    • /
    • 2009
  • In this paper, we study a watermarking method based on the informed coding and embedding by means of trellis code and entropy masking. An image is divided as $8{\times}8$ block with no overlapping and the discrete cosine transform(DCT) is applied to each block. Then the 16 medium-frequency AC terms of each block are extracted. Next it is compared with gaussian random vectors having zero mean and unit variance. As these processing, the embedding vectors with minimum value of linear combination between linear correlation and Watson distance can be obtained by Viterbi algorithm at each stage of trellis coding. For considering the image characteristics, we apply different weight value between the linear correlation and the Watson distance using the entropy masking. To evaluate the performance of proposed method, the average bit error rate of watermark message is calculated from different several images. By the experiments the proposed method is improved in terms of the average bit error rate.

Comparison of Prediction Accuracy Between Classification and Convolution Algorithm in Fault Diagnosis of Rotatory Machines at Varying Speed (회전수가 변하는 기기의 고장진단에 있어서 특성 기반 분류와 합성곱 기반 알고리즘의 예측 정확도 비교)

  • Moon, Ki-Yeong;Kim, Hyung-Jin;Hwang, Se-Yun;Lee, Jang Hyun
    • Journal of Navigation and Port Research
    • /
    • v.46 no.3
    • /
    • pp.280-288
    • /
    • 2022
  • This study examined the diagnostics of abnormalities and faults of equipment, whose rotational speed changes even during regular operation. The purpose of this study was to suggest a procedure that can properly apply machine learning to the time series data, comprising non-stationary characteristics as the rotational speed changes. Anomaly and fault diagnosis was performed using machine learning: k-Nearest Neighbor (k-NN), Support Vector Machine (SVM), and Random Forest. To compare the diagnostic accuracy, an autoencoder was used for anomaly detection and a convolution based Conv1D was additionally used for fault diagnosis. Feature vectors comprising statistical and frequency attributes were extracted, and normalization & dimensional reduction were applied to the extracted feature vectors. Changes in the diagnostic accuracy of machine learning according to feature selection, normalization, and dimensional reduction are explained. The hyperparameter optimization process and the layered structure are also described for each algorithm. Finally, results show that machine learning can accurately diagnose the failure of a variable-rotation machine under the appropriate feature treatment, although the convolution algorithms have been widely applied to the considered problem.

Development of New Vector Systems as Genetic Tools Applicable to Mycobacteria (Mycobacteria에 적용 가능한 genetic tool로서의 새로운 vector system 개발)

  • Jeong, Ji-A;Lee, Ha-Na;Ko, In-Jeong;Oh, Jeong-Il
    • Journal of Life Science
    • /
    • v.23 no.2
    • /
    • pp.290-298
    • /
    • 2013
  • The genus Mycobacterium includes crucial animal and human pathogens such as Mycobacterium tuberculosis, Mycobacterium leprae, and Mycobacterium bovis. Although it is important to understand the genetic basis for their virulence and persistence in host, genetic analysis in mycobacteria was hampered by a lack of sufficient genetic tools. Therefore, many functional vectors as molecular genetic tools have been designed for understanding mycobacterial biology, and the application of these tools to mycobacteria has accelerated the study of mechanisms involved in virulence and gene expression. To overcome the pre-existing problems in genetic manipulation of mycobacteria, this paper reports new vector systems as effective genetic tools in Mycobacterium smegmatis. Three vectors were developed; pKOTs is a suicide vector for mutagenesis containing a temperature-sensitive replication origin (TSRO) and the sacB gene encoding levansucrase as a counterselectable marker. pMV306lacZ is an integrative lacZ transcriptional fusion vector that can be inserted into chromosomal DNA by site-specific recombination. pTnMod-OKmTs is a minitransposon vector harboring the TSRO that can be used in random mutagenesis. It was demonstrated in this study that these vectors effectively worked in M. smegmatis. The vector systems reported here are expected to successfully applicable to future research of mycobacterial molecular genetics.