• 제목/요약/키워드: data partition

Search Result 413, Processing Time 0.024 seconds

Bayesian Method for Combining Results from Different Poisson Experiments

  • Cho, Jang Sik;Kim, Dal Ho
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.2
    • /
    • pp.533-540
    • /
    • 2000
  • The problem of information related to I poission experiments, each having a distinct failure rate $\theta$i I=1,2,…,I, is considered. Instead of using a standard exchangeable prior for $\theta$=($\theta$1,$\theta$2,…,$\theta$I), we consider a partition of the experiments and take the $\theta$i's belonging to the same partition subgroup to be exchangeable and the $\theta$i's belonging to distinct subgroups to be independent. And we perform Gibbs sampling approach for Bayesian inference on $\theta$ conditional on a partition. Numerical study using real data is provided.

  • PDF

Nonlinear Characteristics of Fuzzy Scatter Partition-Based Fuzzy Inference System

  • Park, Keon-Jun;Huang, Wei;Yu, C.;Kim, Yong K.
    • International journal of advanced smart convergence
    • /
    • v.2 no.1
    • /
    • pp.12-17
    • /
    • 2013
  • This paper introduces the fuzzy scatter partition-based fuzzy inference system to construct the model for nonlinear process to analyze nonlinear characteristics. The fuzzy rules of fuzzy inference systems are generated by partitioning the input space in the scatter form using Fuzzy C-Means (FCM) clustering algorithm. The premise parameters of the rules are determined by membership matrix by means of FCM clustering algorithm. The consequence part of the rules is represented in the form of polynomial functions and the parameters of the consequence part are estimated by least square errors. The proposed model is evaluated with the performance using the data widely used in nonlinear process. Finally, this paper shows that the proposed model has the good result for high-dimension nonlinear process.

A Study of the Gas Liquid Partition Coefficients of Eleven Normal, Branched and Cyclic Alkanes in Sixty Nine Common Organic Liquids: The Effect of Solute Structure

  • Cheong, Won-Jo
    • Bulletin of the Korean Chemical Society
    • /
    • v.23 no.3
    • /
    • pp.459-468
    • /
    • 2002
  • Literature data measured by the author have been processed to report on the effect of solute structure on gas liquid partition coefficients of eleven normal, branched and cyclic alkanes ranging in carbon number from five to nine in sixty nine low molecular weight liquids. The alkane solutes are n-pentane(p), n-hexane(hx), n-heptane(hp), n-octane(o), n-nonane(n), 2-methylpentane(mp), 2,5-dimethylpentane(dp), 2,5-dimethylhexane(dh), 2,3,4-trimethylpentane(tp), cyclohexane(ch), and ethylcyclohexane(ec). The solvent set encompasses most of those studied by Rohrschneider as well as three homologous series of solvents (n-alkanes, 1-alcohols and 1-nitriles) and several perfluorinated alkanes and highly fluorinated alcohols. An excellent linear relationship was observed between lnK and the carbon number of n-alkanes. The effective carbon numbers of branched and cyclic alkanes were determined in a similar fashion to the method of Kovats index. We found that the logarithm of solute vapor pressure multiplied by solute molar volume was a perfect descriptor for the linear relationship with the median effective carbon number.

Thermodynamic Model for Partition Coefficients in the Two Protein Systems

  • Jung, Chang-Min;Bae, Young-Chan;Kim, Jae-Jun
    • Macromolecular Research
    • /
    • v.15 no.7
    • /
    • pp.682-687
    • /
    • 2007
  • The equation of state developed herein is predicated on a hard-sphere reference with perturbations introduced via a potential function to account for electrostatic forces and for attraction between protein particles. During this process, the generalized Lennard-Jones (GLJ) pair potential function is employed. The GLJ pair potential function is employed to represent the protein-protein interaction in two-protein systems. Via the use of the relation between the equation of state and the chemical potential, the phase behavior in the aqueous two-protein system can be estimated. The partition coefficients can be obtained via these processes. The calculated values of the coefficients agree fairly well with the experimental data in the given pH and ionic strength range, with no additional adjustable model parameters.

The Transport Phenomena of Some Solutes through the Copolymer Membranes of 2-hydroxyethylmethacrylate (HEMA) with Selected Hydrophobic Monomers

  • Kim, Whan-Gun;Jhon, Mu-Shik
    • Bulletin of the Korean Chemical Society
    • /
    • v.6 no.3
    • /
    • pp.128-131
    • /
    • 1985
  • A series of copolymer membranes of 2-hydroxyethylmethacrylate (HEMA) with selected hydrophobic monomers were prepared without crosslinking agents. The equilibrium water content, the partition coefficient, and the permeability of the solutes such as urea, methylurea, 1,3-di-methylurea, and acetamide via these membranes were measured. The partition coefficient data show that as the hydrophobicity of solutes increased, the partition of solutes were dictated by hydrophobic interaction between solute and polymer matrix. Diffusion coefficients obtained in these experiments decrease as the water content of polymer membrane decreases. This decrease is blunt as the excess heat capacities, ${\phi}C^0_p$ (excess) in aqueous solution at infinite dilution of solute increases. To investigate the relationship between water content and diffusion coefficient, the results of the diffusion experiments were examined in light of a free-volume model of diffusive transport. The remarkable increase of urea mobility in the polymer network containing relatively larger bulk water can be considered as water structure breaking effect.

Generation of Efficient Fuzzy Classification Rules Using Evolutionary Algorithm with Data Partition Evaluation (데이터 분할 평가 진화알고리즘을 이용한 효율적인 퍼지 분류규칙의 생성)

  • Ryu, Joung-Woo;Kim, Sung-Eun;Kim, Myung-Won
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.18 no.1
    • /
    • pp.32-40
    • /
    • 2008
  • Fuzzy rules are very useful and efficient to describe classification rules especially when the attribute values are continuous and fuzzy in nature. However, it is generally difficult to determine membership functions for generating efficient fuzzy classification rules. In this paper, we propose a method of automatic generation of efficient fuzzy classification rules using evolutionary algorithm. In our method we generate a set of initial membership functions for evolutionary algorithm by supervised clustering the training data set and we evolve the set of initial membership functions in order to generate fuzzy classification rules taking into consideration both classification accuracy and rule comprehensibility. To reduce time to evaluate an individual we also propose an evolutionary algorithm with data partition evaluation in which the training data set is partitioned into a number of subsets and individuals are evaluated using a randomly selected subset of data at a time instead of the whole training data set. We experimented our algorithm with the UCI learning data sets, the experiment results showed that our method was more efficient at average compared with the existing algorithms. For the evolutionary algorithm with data partition evaluation, we experimented with our method over the intrusion detection data of KDD'99 Cup, and confirmed that evaluation time was reduced by about 70%. Compared with the KDD'99 Cup winner, the accuracy was increased by 1.54% while the cost was reduced by 20.8%.

Impact Analysis of Partition Utility Score in Cluster Analysis (군집분석의 분할 유용도 점수의 영향 분석)

  • Lee, Gye Sung
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.3
    • /
    • pp.481-486
    • /
    • 2021
  • Machine learning algorithms adopt criterion function as a key component to measure the quality of their model derived from data. Cluster analysis also uses this function to rate the clustering result. All the criterion functions have in general certain types of favoritism in producing high quality clusters. These clusters are then described by attributes and their values. Category utility and partition utility play an important role in cluster analysis. These are fully analyzed in this research particularly in terms of how they are related to the favoritism in the final results. In this research, several data sets are selected and analyzed to show how different results are induced from these criterion functions.

Multidimensional Scaling Using the Pseudo-Points Based on Partition Method (분할법에 의한 가상점을 활용한 다차원척도법)

  • Shin, Sang Min;Kim, Eun-Seong;Choi, Yong-Seok
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1171-1180
    • /
    • 2015
  • Multidimensional scaling (MDS) is a graphical technique of multivariate analysis to display dissimilarities among individuals into low-dimensional space. We often have two kinds of MDS which are metric MDS and non-metric MDS. Metric MDS can be applied to quantitative data; however, we need additional information about variables because it only shows relationships among individuals. Gower (1992) proposed a method that can represent variable information using trajectories of the pseudo-points for quantitative variables on the metric MDS space. We will call his method a 'replacement method'. However, the trajectory can not be represented even though metric MDS can be applied to binary data when we apply his method to binary data. Therefore, we propose a method to represent information of binary variables using pseudo-points called a 'partition method'. The proposed method partitions pseudo-points, accounting both the rate of zeroes and ones. Our metric MDS using the proposed partition method can show the relationship between individuals and variables for binary data.

A Movie Recommendation System based on Fuzzy-AHP with User Preference and Partition Algorithm (사용자 선호도와 군집 알고리즘을 이용한 퍼지-계층적 분석 기법 기반 영화 추천 시스템)

  • Oh, Jae-Taek;Lee, Sang-Yong
    • Journal of Digital Convergence
    • /
    • v.15 no.11
    • /
    • pp.425-432
    • /
    • 2017
  • The current recommendation systems have problems including the difficulty of figuring out whether they recommend items that actual users have preference for or have simple interest in, the scarcity of data to recommend proper items due to the extremely small number of users, and the cold-start issue of the dropping system performance to recommend items that can satisfy users according to the influx of new users. In an effort to solve these problems, this study implemented a movie recommendation system to ensure user satisfaction by using the Fuzzy-Analytic Hierarchy Process, which can reflect uncertain situations and problems, and the data partition algorithm to group similar items among the given ones. The data of a survey on movie preference with 61 users was applied to the system, and the results show that it solved the data scarcity problem based on the Fuzzy-AHP and recommended items fit for a user with the data partition algorithm even with the influx of new users. It is thought that research on the density-based clustering will be needed to filter out future noise data or outlier data.

SUPPORT Applications for Classification Trees

  • Lee, Sang-Bock;Park, Sun-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.3
    • /
    • pp.565-574
    • /
    • 2004
  • Classification tree algorithms including as CART by Brieman et al.(1984) in some aspects, recursively partition the data space with the aim of making the distribution of the class variable as pure as within each partition and consist of several steps. SUPPORT(smoothed and unsmoothed piecewise-polynomial regression trees) method of Chaudhuri et al(1994), a weighted averaging technique is used to combine piecewise polynomial fits into a smooth one. We focus on applying SUPPORT to a binary class variable. Logistic model is considered in the caculation techniques and the results are shown good classification rates compared with other methods as CART, QUEST, and CHAID.

  • PDF