• Title/Summary/Keyword: Crisp data


Nonparametric Fuzzy Regression Model

  • Choe, Seung-Hoe;Kim, Hae-Gyeong;Seong, Na-Yeong
    • Proceedings of the Korean Statistical Society Conference / 2003.05a / pp.199-201 / 2003
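  • This study considers a nonparametric fuzzy regression model in which the relationship between independent variables given as crisp data and a dependent variable given as fuzzy data is not expressed by a specific function. To analyze this model, we introduce a ranking of fuzzy numbers and a fuzzy rank transformation method, and we examine the efficiency of the fuzzy rank transformation method through simulation.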


Correlation of Intuitionistic Fuzzy Sets

  • Son, Mi-Jung
    • Journal of the Korean Institute of Intelligent Systems / v.17 no.4 / pp.546-549 / 2007
  • When we deal with crisp data, it is common to compute the correlation between variables. In this paper, we propose a method for calculating the correlation coefficient of intuitionistic fuzzy data by adopting concepts from conventional statistics. The correlation coefficient computed from our formula not only gives the strength of the relationship between intuitionistic fuzzy sets, but also shows whether the sets are positively or negatively related.

A Decision Support System for the Selection of a Rapid Prototyping Process (쾌속조형공정 선정을 위한 지원 시스템)

  • 변홍석;이관행
    • Proceedings of the Korean Society of Precision Engineering Conference / 2003.06a / pp.5-8 / 2003
  • This paper presents a methodology for selecting an appropriate RP system that suits the end use of a part. The evaluation factors used in process selection include major attributes such as accuracy, roughness, strength, elongation, part cost and build time, which greatly affect the performance of RP systems. Crisp values such as accuracy and surface roughness are obtained with a newly developed test part. The test part is designed with conjoint analysis to reflect users' preferences. The part cost and build time, which have only approximate ranges because of cost factors and many variable parameters, are represented by linguistic values that can be described with triangular fuzzy numbers. Based on the evaluation values obtained, an appropriate RP process for a specific part application is selected by using the modified TOPSIS (Technique of Order Preference by Similarity to Ideal Solution) method. It uses crisp data as well as linguistic variables, and each weight on the alternatives is assigned by using a pairwise comparison matrix. The resulting ranking supports decision making in the selection of RP systems.

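As a rough illustration of the ranking step named in the abstract above, the following Python sketch implements the standard crisp TOPSIS procedure. It is not the authors' modified method, which also handles linguistic values expressed as triangular fuzzy numbers. The function name `topsis`, the sample scores, the weights, and the criterion directions are hypothetical and chosen only for illustration.

```python
import math


def topsis(matrix, weights, benefit):
    """Score alternatives with standard (crisp) TOPSIS.

    matrix  : list of rows, one row per alternative, one column per criterion
    weights : one weight per criterion (assumed here to sum to 1)
    benefit : True where larger values are better, False where smaller are better
    Returns one closeness score in [0, 1] per alternative (higher is better).
    Assumes no criterion column is all zeros and the alternatives are not all identical.
    """
    n_crit = len(weights)
    # Vector-normalise each column, then apply the criterion weight.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_crit)] for row in matrix]
    # The ideal and anti-ideal points depend on each criterion's direction.
    columns = list(zip(*weighted))
    ideal = [max(c) if b else min(c) for c, b in zip(columns, benefit)]
    anti = [min(c) if b else max(c) for c, b in zip(columns, benefit)]
    scores = []
    for row in weighted:
        d_pos = math.dist(row, ideal)  # distance to the ideal point
        d_neg = math.dist(row, anti)   # distance to the anti-ideal point
        scores.append(d_neg / (d_pos + d_neg))
    return scores


# Hypothetical data: three processes scored on
# dimensional error (lower is better), roughness (lower is better), strength (higher is better).
example_scores = topsis(
    [[0.1, 5.0, 60.0], [0.2, 8.0, 50.0], [0.3, 12.0, 40.0]],
    weights=[0.4, 0.3, 0.3],
    benefit=[False, False, True],
)
print(example_scores)
```

Alternatives are then ranked by descending score. In a fuzzy variant, the linguistic values would first have to be converted or compared as fuzzy numbers, a step this sketch leaves out.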

Clustering for Analysis of Raman Hyperspectral Dental Data

  • Jung, Sung-Hwan
    • Journal of Korea Multimedia Society / v.16 no.1 / pp.19-28 / 2013
  • In this research, we present an effective clustering method based on ICA for the analysis of large Raman hyperspectral dental data. The hyperspectral dataset, captured by an HR800 micro Raman spectrometer at UMKC-CRISP (University of Missouri-Kansas City Center for Research on Interfacial Structure and Properties), has 569 local points. Each point has 1,005 hyperspectral dentin data values. We compared clustering effectiveness and clustering time between using the whole dataset directly and using the scores obtained after PCA and ICA. In the experiment, using the scores after PCA and ICA gave not only more detailed internal dentin information from the standpoint of medical analysis, but also clustering times about 7 to 19 times shorter. The ICA-based approach also performed better than PCA in terms of both the detailed internal information of dentin and the clustering time. We could therefore confirm the effectiveness of ICA for the analysis of Raman hyperspectral dental data.

A Study on Partial Pattern Estimation for Sequential Agglomerative Hierarchical Nested Model (SAHN 모델의 부분적 패턴 추정 방법에 대한 연구)

  • Jang, Kyung-Won;Ahn, Tae-Chon
    • Proceedings of the KIEE Conference / 2005.10b / pp.143-145 / 2005
  • This paper reports an empirical study of a pattern estimation method intended to reveal underlying data patterns at a relatively reduced computational cost. The presented method performs crisp clustering of n given data samples by means of the sequential agglomerative hierarchical nested model (SAHN). Conventional SAHN-based clustering requires a large computation time in the initial step of the algorithm. To address this, we modified the overall process with a partial approach. At the start of the method, we divide the given data set into several subgroups by uniform sampling, and then apply the SAHN-based method to each subgroup. The advantage of this method is that it reduces the computation time of the original process while giving similar results. The proposed method is applied to several test data sets, and simulation results are presented together with a conceptual analysis.

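The partial approach described in the abstract above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the data are one-dimensional, the linkage is single linkage, "uniform sampling" is read as a random partition into equally sized subgroups, and the names `sahn` and `partial_sahn` are hypothetical. The abstract does not say how the per-subgroup results are combined, so the sketch stops at the per-subgroup clusters.

```python
import random


def sahn(points, k):
    """Naive single-linkage agglomerative clustering of 1-D points into k clusters."""
    clusters = [[p] for p in points]
    while len(clusters) > k:
        best = None
        # Find the pair of clusters with the smallest single-linkage distance.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(abs(a - b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        # Merge cluster j into cluster i (j > i, so index i stays valid after pop).
        clusters[i].extend(clusters.pop(j))
    return clusters


def partial_sahn(points, n_groups, k, seed=0):
    """Split the data into n_groups random subgroups and cluster each one separately."""
    rng = random.Random(seed)
    shuffled = list(points)
    rng.shuffle(shuffled)
    # Stride slicing yields subgroups whose sizes differ by at most one.
    groups = [shuffled[g::n_groups] for g in range(n_groups)]
    return [sahn(group, k) for group in groups]


# Hypothetical data: two well-separated groups of values.
data = [0.0, 0.1, 0.2, 0.3, 10.0, 10.1, 10.2, 10.3]
print(partial_sahn(data, n_groups=2, k=2))
```

The pairwise search runs on each subgroup rather than on the full data set, so the number of distance comparisons per merge step drops with the subgroup size, which is where the time saving in a partial approach of this kind would come from.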

Analysis of Hyperspectral Dentin Data Using Independent Component Analysis

  • Jung, Sung-Hwan
    • Journal of Korea Multimedia Society / v.12 no.12 / pp.1755-1760 / 2009
  • In this research, we analyse Raman hyperspectral dentin data using Independent Component Analysis (ICA) for the first time, to assess whether it could be adopted in dental analysis software. We captured hyperspectral dentin data at 569 spots on a molar with a dental lesion using an HR800 Micro Raman Spectrometer at UMKC-CRISP (University of Missouri at Kansas City-Center for Research on Interfacial Structure and Properties). Each spot has 1,005 hyperspectral data values. We applied ICA to the captured hyperspectral dentin data to evaluate the ICA approach, and compared it with PCA, a well-known multivariate analysis method. In the experiment, the ICA approach showed the local characteristics of dentin better than PCA did. We confirmed that ICA, along with PCA, could be a good method for dental analysis software.


Clustering Approaches to Identifying Gene Expression Patterns from DNA Microarray Data

  • Do, Jin Hwan;Choi, Dong-Kug
    • Molecules and Cells / v.25 no.2 / pp.279-288 / 2008
  • The analysis of microarray data is essential for handling large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as the desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weaknesses and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data, from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

FUZZY REGRESSION TOWARDS A GENERAL INSURANCE APPLICATION

  • Kim, Joseph H.T.;Kim, Joocheol
    • Journal of applied mathematics & informatics / v.32 no.3_4 / pp.343-357 / 2014
  • In many non-life insurance applications, past data are given in a form known as the run-off triangle. Smoothing such data using parametric crisp regression models has long served as the basis for estimating future claim amounts and the reserves set aside to protect the insurer from future losses. In this article, a fuzzy counterpart of the Hoerl curve, a well-known claim reserving regression model, is proposed to analyze past claim data and to determine the reserves. The fuzzy Hoerl curve is more flexible and general than the one considered in the previous fuzzy literature in that it includes a categorical variable with multiple explanatory variables, which requires the development of the fuzzy analysis of covariance, or fuzzy ANCOVA. Using actual insurance run-off claim data, we show that the suggested fuzzy Hoerl curve based on the fuzzy ANCOVA gives reasonable claim reserves without the stringent assumptions needed for the traditional regression approach in claim reserving.

Different approaches towards fuzzy database systems: A Survey

  • Rundensteiner, Elke A.;Hawkes, Lois Wright
    • Journal of the Korean Institute of Intelligent Systems / v.3 no.1 / pp.65-75 / 1993
  • Fuzzy data is a phenomenon often occurring in real life. There is the inherent vagueness of classification terms referring to a continuous scale, the uncertainty of linguistic terms such as "I almost agree" or the vagueness of terms and concepts due to the statistical variability in communication [20] and many more. Previously, such fuzzy data was approximated by non-fuzzy (crisp) data, which obviously did not lead to a correct and precise representation of the real world. Fuzzy set theory has been developed to represent and manipulate fuzzy data [18]. Explicitly managing the degree of fuzziness in databases allows the system to distinguish between what is known, what is not known and what is partially known. Systems in the literature whose specific objective is to handle imprecision in databases present various approaches. This paper is concerned with the different ways uncertainty and imprecision are handled in database design. It outlines the major areas of fuzzification in (relational) database systems.


Predictive Spatial Data Fusion Using Fuzzy Object Representation and Integration: Application to Landslide Hazard Assessment

  • Park, No-Wook;Chi, Kwang-Hoon;Chung, Chang-Jo;Kwon, Byung-Doo
    • Korean Journal of Remote Sensing / v.19 no.3 / pp.233-246 / 2003
  • This paper presents a methodology to account for the partial or gradual changes of environmental phenomena in categorical map information for the fusion/integration of multiple spatial data. A fuzzy set based spatial data fusion scheme is applied in order to account for the fuzziness of boundaries in categorical information showing partial or gradual environmental impacts. The fuzziness or uncertainty of a boundary is represented by two kinds of fuzzy membership functions based on the fuzzy object concept, and their effects are quantitatively evaluated with the help of a cross validation procedure. A case study on landslide hazard assessment demonstrates the better performance of this scheme compared to the traditional crisp boundary representation.