• Title/Summary/Keyword: Similarity function

Search Result 555, Processing Time 0.025 seconds

Analysis of Fuzzy Entropy and Similarity Measure for Non Convex Membership Functions

  • Lee, Sang-H.;Kim, Sang-Jin
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.9 no.1
    • /
    • pp.4-9
    • /
    • 2009
  • Fuzzy entropy is designed for non convex fuzzy membership function using well known Hamming distance measure. Design procedure of convex fuzzy membership function is represented through distance measure, furthermore characteristic analysis for non convex function are also illustrated. Proof of proposed fuzzy entropy is discussed, and entropy computation is illustrated.

Similarity Analysis of Exports Value Added by Country and Implication for Korea's Global Value Added Chains

  • Cho, Jung-Hwan
    • Journal of Korea Trade
    • /
    • v.23 no.4
    • /
    • pp.103-114
    • /
    • 2019
  • Purpose - This paper investigates the structure of exports across countries in terms of value added. Exports value added is examined under two categories, domestic and overseas. Using a statistical classification method by distance based on these two value added categories, this paper estimates the similarity of exports value added across countries including Korea. Design/methodology - The model of study is to employ a generalized distance function and then derive the Manhattan and Euclidean distances. The paper also performs cluster analysis using the Partitioning Around Medoids (PAM) and hierarchical methods to classify the 44 sample countries considered in this study. Findings - Our main findings are as follows. The 44 countries can be classified under 5 groups by their domestic and overseas value added in exports. Korea has a sandwich global value chains (GVCs) position between Japan, China, and Taiwan in the East Asian region. Originality/value - Existing papers point out the double counting problem of trade statistics as the intermediate goods trade across borders increases. This paper addresses the double counting problem by using the World Input-Output Table. The paper shows the need to explore the similarity of value added in exports structure across countries and investigate the GVCs position and role of each country.

Measurement of Rhythmic Similarity for Auditory Memory Game (청각 기억 게임을 위한 리듬 유사도 측정 기술)

  • Kim, Ju-Wan;Lee, Se-Won;Park, Ho-Chong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.30 no.3
    • /
    • pp.136-141
    • /
    • 2011
  • In this paper, a method for measuring rhythmic similarity between two sound signals for auditory memory game is proposed. The proposed method analyzes energy fluctuation, the temporal duration of energy peak, the timbre of two signals, and detects beat positions for each signal. Then, it determines the rhythm vector after compensating a difference in tempo and the number of beats between two signals. Finally, a method for rhythmic similarity measurement is defined as a function of the dissimilarity between two rhythm vectors and a difference in the number of beats. The rhythmic similarity measured by the proposed method and that by the subjective listening test are compared, and the correlation of 0.86 between two results is achieved.

A Tolerant Rough Set Approach for Handwritten Numeral Character Classification

  • Kim, Daijin;Kim, Chul-Hyun
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1998.06a
    • /
    • pp.288-295
    • /
    • 1998
  • This paper proposes a new data classification method based on the tolerant rough set that extends the existing equivalent rough set. Similarity measure between two data is described by a distance function of all constituent attributes and they are defined to be tolerant when their similarity measure exceeds a similarity threshold value. The determination of optimal similarity theshold value is very important for the accurate classification. So, we determine it optimally by using the genetic algorithm (GA), where the goal of evolution is to balance two requirements such that (1) some tolerant objects are required to be included in the same class as many as possible. After finding the optimal similarity threshold value, a tolerant set of each object is obtained and the data set is grounded into the lower and upper approximation set depending on the coincidence of their classes. We propose a two-stage classification method that all data are classified by using the lower approxi ation at the first stage and then the non-classified data at the first stage are classified again by using the rough membership functions obtained from the upper approximation set. We apply the proposed classification method to the handwritten numeral character classification. problem and compare its classification performance and learning time with those of the feed forward neural network's back propagation algorithm.

  • PDF

Identification Performance of Low-Molecular Compounds by Searching Tandem Mass Spectral Libraries with Simple Peak Matching

  • Milman, Boris L.;Zhurkovich, Inna K.
    • Mass Spectrometry Letters
    • /
    • v.9 no.3
    • /
    • pp.73-76
    • /
    • 2018
  • The number of matched peaks (NMP) is estimated as the spectral similarity measure in tandem mass spectral library searches of small molecules. In the high resolution mode, NMP provides the same reliable identification as in the case of a common dot-product function. Corresponding true positive rates are ($94{\pm}3$) % and ($96{\pm}3$) %, respectively.

Clustering Parts Based on the Design and Manufacturing Similarities Using a Genetic Algorithm

  • Lee, Sung-Youl
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.16 no.4
    • /
    • pp.119-125
    • /
    • 2011
  • The part family (PF) formation in a cellular manufacturing has been a key issue for the successful implementation of Group Technology (GT). Basically, a part has two different attributes; i.e., design and manufacturing. The respective similarity in both attributes is often conflicting each other. However, the two attributes should be taken into account appropriately in order for the PF to maximize the benefits of the GT implementation. This paper proposes a clustering algorithm which considers the two attributes simultaneously based on pareto optimal theory. The similarity in each attribute can be represented as two individual objective functions. Then, the resulting two objective functions are properly combined into a pareto fitness function which assigns a single fitness value to each solution based on the two objective functions. A GA is used to find the pareto optimal set of solutions based on the fitness function. A set of hypothetical parts are grouped using the proposed system. The results show that the proposed system is very promising in clustering with multiple objectives.

Coupling Matrix Synthesis Methods for RF/Microwave Filter Design (초고주파용 필터설계를 위한 결합행렬 합성법)

  • Choi, Dong-Muk;Kim, Che-Young
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.32 no.12A
    • /
    • pp.1346-1353
    • /
    • 2007
  • In this paper, the methods are presented for the calculation of general coupling coefficient matrixes used in the band pass filter design. They are calculated from transmission coefficient($S_{21}$) and reflection coefficient($S_{11}$) with desired characteristics derived from the poles of filter function and return loss(RL). The calculated matrixes from this method are transformed to the folded canonical filter structure using similarity transformation which lends us the practical filter design. Based on the resulting matrix, the folded canonical filter has been designed.

Functional Analysis of ESTs from the Flower Bud of Korean Ginseng

  • Yang, Deok-Chun;In, Jun-Gyo;Kim, Moo-Sung;Jeon, Jong-Seong
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • 2003.04a
    • /
    • pp.124-124
    • /
    • 2003
  • In order to study gene expression in a reproductive organ, we constructed a cDNA library of immature flower buds in Korean ginseng and generated expressed sequence tags (ESTs) of 3,360 clones randomly selected. The ESTs could be clustered into 1,844 non-redundant groups. Similarity search of the non-redundant ESTs against public non-redundant databases of both protein and DNA indicated that 1,254 groups show similarity to genes of known function. These ESTs clones were divided into sixteen categories depending upon gene function. The most abundant transcripts were unknown protein (72), chlorophyll a/b-binding protein (48), and stylar glycoprotein. There are no useful informations of gene expression during the development of flower bud in Korean ginseng. These results could help to understand the development of flower bud in Korean ginseng.

  • PDF

Massive Music Resources Retrieval Method Based on Ant Colony Algorithm

  • Yun Meng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.5
    • /
    • pp.1208-1222
    • /
    • 2024
  • Music resources are characterized by quantization, diversification and complication. With the rapid increase of the demand for music resources, the storage of music resources is very large. In order to improve the retrieval effect of music resources, a massive music resources retrieval method based on ant colony algorithm is proposed to effectively use music resources. This paper constructs autocorrelation function to extract pitch feature of music resource, classifies the music resource information by calculating feature similarity. Using ant colony algorithm to correlate the feature of music resource, gain the result of correlative, locate the result of detection and get the result of multi-module. Simulation results show that the proposed method has high precision and recall, short retrieval time and can effectively retrieve massive music resources.

GORank: Semantic Similarity Search for Gene Products using Gene Ontology (GORank: Gene Ontology를 이용한 유전자 산물의 의미적 유사성 검색)

  • Kim, Ki-Sung;Yoo, Sang-Won;Kim, Hyoung-Joo
    • Journal of KIISE:Databases
    • /
    • v.33 no.7
    • /
    • pp.682-692
    • /
    • 2006
  • Searching for gene products which have similar biological functions are crucial for bioinformatics. Modern day biological databases provide the functional description of gene products using Gene Ontology(GO). In this paper, we propose a technique for semantic similarity search for gene products using the GO annotation information. For this purpose, an information-theoretic measure for semantic similarity between gene products is defined. And an algorithm for semantic similarity search using this measure is proposed. We adapt Fagin's Threshold Algorithm to process the semantic similarity query as follows. First, we redefine the threshold for our measure. This is because our similarity function is not monotonic. Then cluster-skipping and the access ordering of the inverted index lists are proposed to reduce the number of disk accesses. Experiments with real GO and annotation data show that GORank is efficient and scalable.