• Title/Summary/Keyword: similarity metric

Search Result 111, Processing Time 0.023 seconds

Signal Peptide Cleavage Site Prediction Using a String Kernel with Real Exponent Metric (실수 지수 메트릭으로 구성된 스트링 커널을 이용한 신호펩티드의 절단위치 예측)

  • Chi, Sang-Mun
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.10
    • /
    • pp.786-792
    • /
    • 2009
  • A kernel in support vector machines can be described as a similarity measure between data, and this measure is used to find an optimal hyperplane that classifies patterns. It is therefore important to effectively incorporate the characteristics of data into the similarity measure. To find an optimal similarity between amino acid sequences, we propose a real exponent exponential form of the two metrices, which are derived from the evolutionary relationships of amino acids and the hydrophobicity of amino acids. We prove that the proposed metric satisfies the conditions to be a metric, and we find a relation between the proposed metric and the metrics in the string kernels which are widely used for the processing of amino acid sequences and DNA sequences. In the prediction experiments on the cleavage site of the signal peptide, the optimal metric can be found in the proposed metrics.

Improved Collaborative Filtering Using Entropy Weighting

  • Kwon, Hyeong-Joon
    • International Journal of Advanced Culture Technology
    • /
    • v.1 no.2
    • /
    • pp.1-6
    • /
    • 2013
  • In this paper, we evaluate performance of existing similarity measurement metric and propose a novel method using user's preferences information entropy to reduce MAE in memory-based collaborative recommender systems. The proposed method applies a similarity of individual inclination to traditional similarity measurement methods. We experiment on various similarity metrics under different conditions, which include an amount of data and significance weighting from n/10 to n/60, to verify the proposed method. As a result, we confirm the proposed method is robust and efficient from the viewpoint of a sparse data set, applying existing various similarity measurement methods and Significance Weighting.

  • PDF

Structural Similarity Based Video Quality Metric using Human Visual System (구조적 유사도 기반의 인간의 시각적 특성을 이용한 비디오 품질 측정 기준)

  • Park, Jin-Cheol;Lee, Sang-Hoon
    • Journal of Broadcast Engineering
    • /
    • v.14 no.1
    • /
    • pp.36-43
    • /
    • 2009
  • Recently, the structural similarity (SSIM) index metric is proposed. In the present paper, a new framework, which is called visual SSIM (VSSIM), is proposed by incorporating crucial human factors into the SSIM. The human factors are foveation, luminance, frequency and motion information. The performance of VSSIM is evaluated by subjective quality test compliant with the Video Quality Expert Group (VQEG) multimedia group test plan. It shows that the visual SSIM is more correlated with the subjective quality result than the conventional SSIM.

A Structural Complexity Metric for Web Application based on Similarity (유사도 기반의 웹 어플리케이션 구조 복잡도)

  • Jung, Woo-Sung;Lee, Eun-Joo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.8
    • /
    • pp.117-126
    • /
    • 2010
  • Software complexity is used to evaluate a target system's maintainability. The existing complexity metrics on web applications are count-based, so it is hard to incorporate the understandability of developers or maintainers. To make up for this shortcomings, entropy-theory can be applied to define complexity, however, it is assumed that information quantity of each paper is identical. In this paper, structural complexity of a web application is defined based on information theory and similarity. In detail, the proposed complexity is defined using entropy as the previous approach, but the information quantity of individual pages is defined using similarity. That is, a page which are similar with many pages has smaller information quantity than a page which are dissimilar to others. Furthermore, various similarity measures can be used for various views, which results in many-sided complexity measures. Finally, several complexity properties are applied to verify the proposed metric and case studies shows the applicability of the metric.

Similarity Analysis of Sibling Nodes in SNOMED CT Terminology System (SNOMED CT 용어체계에서 형제 노드의 유사도 분석 기법)

  • Woo-Seok Ryu
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.1
    • /
    • pp.295-300
    • /
    • 2024
  • This paper discusses the incompleteness of the SNOMED CT and proposes a noble metric which evaluates similarity among sibling nodes as a method to address this incompleteness. SNOMED CT encompasses an extensive range of medical terms, but it faces issues of ontology incompleteness, such as missing concepts in the hierarchy. We propose a noble metric for evaluating similarity among nodes within a node group, composed of multiple sibling nodes, to identify missing concepts, and identify groups with low similarity. Analyzing the similarity of sibling node groups in the March 2023 international release of SNOMED CT, the average similarity of 29,199 sibling node groups, which are sub-concepts of the clinical finding concept and are consist of two or more sibling nodes, was found to be 0.81. The group with the lowest similarity was associated with child concepts of poisoning, with a similarity of 0.0036.

Applying Consistency-Based Trust Definition to Collaborative Filtering

  • Kim, Hyoung-Do
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.3 no.4
    • /
    • pp.366-375
    • /
    • 2009
  • In collaborative filtering, many neighbors are needed to improve the quality and stability of the recommendation. The quality may not be good mainly due to the high similarity between two users not guaranteeing the same preference for products considered for recommendation. This paper proposes a consistency definition, rather than similarity, based on information entropy between two users to improve the recommendation. This kind of consistency between two users is then employed as a trust metric in collaborative filtering methods that select neighbors based on the metric. Empirical studies show that such collaborative filtering reduces the number of neighbors required to make the recommendation quality stable. Recommendation quality is also significantly improved.

An Improved Object Detection Method using Hausdorff Distance Modified by Local Pattern Similarity (국지적 패턴 유사도에 의해 수정된 Hausdorff 거리를 이용한 개선된 객체검출)

  • Cho, Kyoung-Sik;Koo, Ja-Young
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.6
    • /
    • pp.147-152
    • /
    • 2007
  • Face detection is a crucial part of the face recognition system. It determines the performance of the whole recognition system. Hausdorff distance metric has been used in face detection and recognition with good results. It defines the distance metric based only on the geometric similarity between two sets or points. However, not only the geometry but also the local patterns around the points are available in most cases. In this paper a new Hausdorff distance measure is proposed that makes hybrid use of the similarity of the geometry and the local patterns around the points. Several experiments shows that the new method outperforms the conventional method.

  • PDF

An approach for improving the performance of the Content-Based Image Retrieval (CBIR)

  • Jeong, Inseong
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.30 no.6_2
    • /
    • pp.665-672
    • /
    • 2012
  • Amid rapidly increasing imagery inputs and their volume in a remote sensing imagery database, Content-Based Image Retrieval (CBIR) is an effective tool to search for an image feature or image content of interest a user wants to retrieve. It seeks to capture salient features from a 'query' image, and then to locate other instances of image region having similar features elsewhere in the image database. For a CBIR approach that uses texture as a primary feature primitive, designing a texture descriptor to better represent image contents is a key to improve CBIR results. For this purpose, an extended feature vector combining the Gabor filter and co-occurrence histogram method is suggested and evaluated for quantitywise and qualitywise retrieval performance criterion. For the better CBIR performance, assessing similarity between high dimensional feature vectors is also a challenging issue. Therefore a number of distance metrics (i.e. L1 and L2 norm) is tried to measure closeness between two feature vectors, and its impact on retrieval result is analyzed. In this paper, experimental results are presented with several CBIR samples. The current results show that 1) the overall retrieval quantity and quality is improved by combining two types of feature vectors, 2) some feature is better retrieved by a specific feature vector, and 3) retrieval result quality (i.e. ranking of retrieved image tiles) is sensitive to an adopted similarity metric when the extended feature vector is employed.

Similarity Relationship between Basic Species of the Oak by the Numerical Method (수치분석(數値分析)에 의(依)한 참나무 기본종(基本種)의 유연관계(類緣關係))

  • Ma, Sang-Kyu
    • Journal of Korean Society of Forest Science
    • /
    • v.21 no.1
    • /
    • pp.47-51
    • /
    • 1974
  • In order to prove the similarity relationships between the basic species of oak through Electronic Data Processing System(EDPS) and numerical analysis, The analized species and datas were selected from the list of morphological observation in the thesis of T.B. Lee, 1961, "Phytogenetic study of the subgenus Lepidobalanus of the genus Quercus in Korea", and were coded by categories shown in Table 1. The value in the list were transformed into hundred percentage to standardize the observational value by each code into dimensionless. The similarity index between species were computed through formula of non-metric coefficient, $N_{jk}=\sum\limits_{i=1}^{n}\(\frac{{\mid}x_{ij}-x_{ik}{\mid}}{x_{ij}+x_{ik}}\)$, using the UNiVAC-1106, at National Computer Center. Quercus aliena, by analysis result, is most similar to Q. mongolica with the similarity index, 71.6 and Q. dentata is most far apart from Q. serrata in the relationship with index, 121.4. The above thesis of Professor, T. Lee, are closely similar with the result of this research study. But, their similar relationship are proved in quantity through numerical method in our research study. In addition, The relationships among Q. mongolica, Q. aliena and Q. serrata are found to be very similar, but Q. dentata to be enough far in similarity to other species by dendrogram shown at Fig. 1. The numerical classification through EDPS is found to be suitable method also applicable to the plant taxonomy.

  • PDF

Similarity-based Caching Replacement Loss Minimization in Wireless Mobile Proxy Systems (무선 모바일 프록시 시스템에서 유사도 기반의 캐싱 손실 최소화)

  • Lee, Chong-Deuk
    • Journal of Advanced Navigation Technology
    • /
    • v.16 no.3
    • /
    • pp.455-462
    • /
    • 2012
  • The loss due to caching replacement in the wireless mobile proxy caching structure has a significant effect on streaming QoS. This paper proposes a similarity-based caching loss minimization (SCLM) for minimizing the loss caused by the caching replacement. The proposed scheme divides object segments, and then it performs the similarity relation about them. Segments that perform the similarity relation generates similarity relation tree (SRT). The similarity is an important metric for deciding a relevance feedback, and segments that satisfy these requirements in the cache block for caching replacement. Simulation results show that the proposed scheme has better performance than the existing prefix caching scheme, segment-based caching scheme, and bi-directional proxy scheme in terms of QoS, average delayed startup ratio, cache throughput, and cache response ratio.