• Title/Summary/Keyword: State clustering

Search Result 230, Processing Time 0.024 seconds

Variational Autoencoder Based Dimension Reduction and Clustering for Single-Cell RNA-seq Gene Expression (단일세포 RNA-SEQ의 유전자 발현 군집화를 위한 변이 자동인코더 기반의 차원감소와 군집화)

  • Chi, Sang-Mun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.11
    • /
    • pp.1512-1518
    • /
    • 2021
  • Since single cell RNA sequencing provides the expression profiles of individual cells, it provides higher cellular differential resolution than traditional bulk RNA sequencing. Using these single cell RNA sequencing data, clustering analysis is generally conducted to find cell types and understand high level biological processes. In order to effectively process the high-dimensional single cell RNA sequencing data fir the clustering analysis, this paper uses a variational autoencoder to transform a high dimensional data space into a lower dimensional latent space, expecting to produce a latent space that can give more accurate clustering results. By clustering the features in the transformed latent space, we compare the performance of various classical clustering methods for single cell RNA sequencing data. Experimental results demonstrate that the proposed framework outperforms many state-of-the-art methods under various clustering performance metrics.

CLUSTERING DNA MICROARRAY DATA BY STOCHASTIC ALGORITHM

  • Shon, Ho-Sun;Kim, Sun-Shin;Wang, Ling;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference
    • /
    • 2007.10a
    • /
    • pp.438-441
    • /
    • 2007
  • Recently, due to molecular biology and engineering technology, DNA microarray makes people watch thousands of genes and the state of variation from the tissue samples of living body. With DNA Microarray, it is possible to construct a genetic group that has similar expression patterns and grasp the progress and variation of gene. This paper practices Cluster Analysis which purposes the discovery of biological subgroup or class by using gene expression information. Hence, the purpose of this paper is to predict a new class which is unknown, open leukaemia data are used for the experiment, and MCL (Markov CLustering) algorithm is applied as an analysis method. The MCL algorithm is based on probability and graph flow theory. MCL simulates random walks on a graph using Markov matrices to determine the transition probabilities among nodes of the graph. If you look at closely to the method, first, MCL algorithm should be applied after getting the distance by using Euclidean distance, then inflation and diagonal factors which are tuning modulus should be tuned, and finally the threshold using the average of each column should be gotten to distinguish one class from another class. Our method has improved the accuracy through using the threshold, namely the average of each column. Our experimental result shows about 70% of accuracy in average compared to the class that is known before. Also, for the comparison evaluation to other algorithm, the proposed method compared to and analyzed SOM (Self-Organizing Map) clustering algorithm which is divided into neural network and hierarchical clustering. The method shows the better result when compared to hierarchical clustering. In further study, it should be studied whether there will be a similar result when the parameter of inflation gotten from our experiment is applied to other gene expression data. We are also trying to make a systematic method to improve the accuracy by regulating the factors mentioned above.

  • PDF

A Pattern Clustering Approach to the Rule Acquisition for the Fuzzy controller of a CAMCODER (패턴 clustering에 의한 캠코더 퍼지 제어기의 rule 획득)

  • 장경식;정진영;신충식;신중인;방교윤;김재희
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.30B no.1
    • /
    • pp.72-78
    • /
    • 1993
  • While the rules for an expert system are obtained through the interviewing with domain experts or by designer's own experience, these are not adequate for fuzzy controllers dealing quantitative control values. In this paper, by considering a state of the controlled system as a pattern, we propose a method to obtain the control rules by a statistical method. Namely, we propose a method to obtain the control rules by a statistical method. Namely, we propose an rule acquisition method that is objective, mechanical, and inductive inference using a cluster-seeking algorithm, or K-means clustering algorithm. To validate this study, we show an example of an IRIS control in a CAMCODER and analyse the rules acquired from 98 sample patterns consisting of 45 features.

  • PDF

Nonlinear Characteristics of Fuzzy Scatter Partition-Based Fuzzy Inference System

  • Park, Keon-Jun;Huang, Wei;Yu, C.;Kim, Yong K.
    • International journal of advanced smart convergence
    • /
    • v.2 no.1
    • /
    • pp.12-17
    • /
    • 2013
  • This paper introduces the fuzzy scatter partition-based fuzzy inference system to construct the model for nonlinear process to analyze nonlinear characteristics. The fuzzy rules of fuzzy inference systems are generated by partitioning the input space in the scatter form using Fuzzy C-Means (FCM) clustering algorithm. The premise parameters of the rules are determined by membership matrix by means of FCM clustering algorithm. The consequence part of the rules is represented in the form of polynomial functions and the parameters of the consequence part are estimated by least square errors. The proposed model is evaluated with the performance using the data widely used in nonlinear process. Finally, this paper shows that the proposed model has the good result for high-dimension nonlinear process.

A Study on the Non-keyword Models in the Keyword Spotting System using the Phone-Based Hidden Markov Models (음소 HMM을 이용한 Keyword Spotting 시스템에서의 Non-Keyword 모델에 관한 연구)

  • 이활림
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1995.06a
    • /
    • pp.83-87
    • /
    • 1995
  • Keyword Spotting 이란 음성인식의 한 분야로서 입력된 음성에서 미리 정해진 특정단어 또는 복수 개의 단어들 중 어느 것이 포함되어 있는지의 여부를 찾아내고 이 단어를 식별해 내는 작업을 의미한다. 음소모델을 이용하여 Keyword Spotting 시스템을 구성할 경우 새로운 keyword의 추가 또는 변경이 필요할 때 단순히 그 발음사전에 따라 음소모델들을 연결시킴으로써 keyword 모델을 구성할 수 있으므로 단어모델에 의한 방법에 비해 장점이 있다. 본 논문에서는 triphone을 기본단위로 하는 HMM 에 의해 keyword 모델을 구성하고, non-keyword 모델 및 silence 모델을 함께 사용하는 keyword spotting 시스템을 구성하였다. 이러한 시스템에서 non-keyword 모델은 keyword와 keyword가 아닌 음성을 구분 지어주는 역할을 하므로 인식성능의 향상을 위해서는 적절한 non-keyword 모델의 선택이 필요하다. 본 논문에서는 10개의 state를 갖는 단일모델, 조음방법에 의해 음소들을 clustering 한 모델, 그리고 통계적 방법에 의해 음소들을 clustering 한 모델들을 각각 non-keyword 모델로 사용하여 그 성능을 비교하였다. 6개의 keyword를 대상으로 한 화자독립 keyword spotting 실험결과, 통계적 방법에 의해 음소들을 6 또는 7개의 그룹으로 clustering 한 방법이 가장 우수한 인식성능을 나타냈다.

  • PDF

3-D K-means clustering method considering internal chemical state variation of self-dischareg of Li-ion battery (리튬 이온 배터리의 자가 방전에 따른 내부 화학적 상태를 고려한 3-D K-means Clustering 스크리닝 기법 연구)

  • Han, Dongho;Kwon, Sanguk;Kim, Seungwoo;Lim, Cheolwoo;Kim, Jonghoon
    • Proceedings of the KIPE Conference
    • /
    • 2019.11a
    • /
    • pp.150-151
    • /
    • 2019
  • 리튬 이온 배터리가 전기 자동차 및 다양한 어플리케이션에 적용됨에 따라 폐배터리의 수요 또한 증가하고 있다. 내부 화학적 상태가 상이한 배터리의 전기적 특성실험을 통해 파라미터를 선정할 수 있으며 전기적 특성 실험 전 후의 시간차에 따른 파라미터 변화를 반영하는 것이 필수적이다. 제조 공정과정의 파라미터의 측정값과 특성실험 후의 파라미터 재측정값을 비교함으로써 이를 3-D Kmeans Clustering 알고리즘에 반영하여 더욱 정밀한 셀 선별을 실시하였다.

  • PDF

Development of Energy-sensitive Cluster Formation and Cluster Head Selection Technique for Large and Randomly Deployed WSNs

  • Sagun Subedi;Sang Il Lee
    • Journal of information and communication convergence engineering
    • /
    • v.22 no.1
    • /
    • pp.1-6
    • /
    • 2024
  • Energy efficiency in wireless sensor networks (WSNs) is a critical issue because batteries are used for operation and communication. In terms of scalability, energy efficiency, data integration, and resilience, WSN-cluster-based routing algorithms often outperform routing algorithms without clustering. Low-energy adaptive clustering hierarchy (LEACH) is a cluster-based routing protocol with a high transmission efficiency to the base station. In this paper, we propose an energy consumption model for LEACH and compare it with the existing LEACH, advanced LEACH (ALEACH), and power-efficient gathering in sensor information systems (PEGASIS) algorithms in terms of network lifetime. The energy consumption model comprises energy-sensitive cluster formation and a cluster head selection technique. The setup and steady-state phases of the proposed model are discussed based on the cluster head selection. The simulation results demonstrated that a low-energy-consumption network was introduced, modeled, and validated for LEACH.

Theoretical Modeling of High Concentration Bismuth-based Erbium-doped Fiber Amplifier (고농도로 도핑된 Bismuth 기반 어븀첨가 광섬유 증폭기의 이론적 모델링 기법에 관한 연구)

  • Shin, Jae-Hyun;Jung, Min-Wan;Lee, Ju-Han
    • Korean Journal of Optics and Photonics
    • /
    • v.21 no.4
    • /
    • pp.139-145
    • /
    • 2010
  • A complete modeling of erbium-doped Bismuth-oxide fibers with a high doping concentration is presented. A 6-level amplifier system that incorporated clustering-induced concentration quenching, cooperative upconversion, pump excited state absorption (ESA), and signal ESA, was adopted for the modeling. The accuracy of the modeling was verified by comparing the calculated gain and noise figure with experimentally obtained ones.

Decision Tree State Tying Modeling Using Parameter Estimation of Bayesian Method (Bayesian 기법의 모수 추정을 이용한 결정트리 상태 공유 모델링)

  • Oh, SangYeob
    • Journal of Digital Convergence
    • /
    • v.13 no.1
    • /
    • pp.243-248
    • /
    • 2015
  • Recognition model is not defined when you configure a model, Been added to the model after model building awareness, Model a model of the clustering due to lack of recognition models are generated by modeling is causes the degradation of the recognition rate. In order to improve decision tree state tying modeling using parameter estimation of Bayesian method. The parameter estimation method is proposed Bayesian method to navigate through the model from the results of the decision tree based on the tying state according to the maximum probability method to determine the recognition model. According to our experiments on the simulation data generated by adding noise to clean speech, the proposed clustering method error rate reduction of 1.29% compared with baseline model, which is slightly better performance than the existing approach.

A design of the PSDG based semantic slicing model for software maintenance (소프트웨어의 유지보수를 위한 PSDG기반 의미분할모형의 설계)

  • Yeo, Ho-Young;Lee, Kee-O;Rhew, Sung-Yul
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.8
    • /
    • pp.2041-2049
    • /
    • 1998
  • This paper suggests a technique for program segmentation and maintenance using PSDG(Post-State Dependency Graph) that improves the quality of a software by identifying and detecting defects in already fixed source code. A program segmentation is performed by utilizing source code analysis which combines the measures of static, dynamic and semantic slicing when we need understandability of defect in programs for corrective maintanence. It provides users with a segmental principle to split a program by tracing state dependency of a source code with the graph, and clustering and highlighting, Through a modeling of the PSDG, elimination of ineffective program deadcode and generalization of related program segments arc possible, Additionally, it can be correlated with other design modeb as STD(State Transition Diagram), also be used as design documents.

  • PDF