• Title/Summary/Keyword: 유사성 척도

Search Result 338, Processing Time 0.028 seconds

An Aptitude Test System using Fuzzy Reasoning (퍼지 추론을 적용한 적성 평가 시스템)

  • 안수영;김두완;정환묵
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2002.12a
    • /
    • pp.451-454
    • /
    • 2002
  • 본 논문에서는 개인의 적성을 판단하는 문제를 처리하기 위한 가중치 퍼지추론 알고리즘을 제시하고, 지식표현을 위해 퍼지 집합 이론과 퍼지 생성 규칙들을 이용하였다. 거리척도에 서는 퍼지값이 높은 구간의 척도를 낮은 구간의 척도에 비례하여 유사성을 구하였다. 또한, 가중치를 정량화한 값과 척도값을 연산하여 유사성을 나타냈고, 추출된 항목과 규칙과의 가능성을 구하였다. 여기서, 결과는 수검자들이 응답한 값들에 따라 임의의 직업군이 적당한 지를 나타내기 위해 확신도로 해석하였다.

A Study of Similarity Measures on Multidimensional Data Sequences Using Semantic Information (의미 정보를 이용한 다차원 데이터 시퀀스의 유사성 척도 연구)

  • Lee, Seok-Lyong;Lee, Ju-Hong;Chun, Seok-Ju
    • The KIPS Transactions:PartD
    • /
    • v.10D no.2
    • /
    • pp.283-292
    • /
    • 2003
  • One-dimensional time-series data have been studied in various database applications such as data mining and data warehousing. However, in the current complex business environment, multidimensional data sequences (MDS') become increasingly important in addition to one-dimensional time-series data. For example, a video stream can be modeled as an MDS in the multidimensional space with respect to color and texture attributes. In this paper, we propose the effective similarity measures on which the similar pattern retrieval is based. An MDS is partitioned into segments, each of which is represented by various geometric and semantic features. The similarity measures are defined on the basis of these segments. Using the measures, irrelevant segments are pruned from a database with respect to a given query. Both data sequences and query sequences are partitioned into segments, and the query processing is based upon the comparison of the features between data and query segments, instead of scanning all data elements of entire sequences.

Context-Weighted Metrics for Example Matching (문맥가중치가 반영된 문장 유사 척도)

  • Kim, Dong-Joo;Kim, Han-Woo
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.43 no.6 s.312
    • /
    • pp.43-51
    • /
    • 2006
  • This paper proposes a metrics for example matching under the example-based machine translation for English-Korean machine translation. Our metrics served as similarity measure is based on edit-distance algorithm, and it is employed to retrieve the most similar example sentences to a given query. Basically it makes use of simple information such as lemma and part-of-speech information of typographically mismatched words. Edit-distance algorithm cannot fully reflect the context of matched word units. In other words, only if matched word units are ordered, it is considered that the contribution of full matching context to similarity is identical to that of partial matching context for the sequence of words in which mismatching word units are intervened. To overcome this drawback, we propose the context-weighting scheme that uses the contiguity information of matched word units to catch the full context. To change the edit-distance metrics representing dissimilarity to similarity metrics, to apply this context-weighted metrics to the example matching problem and also to rank by similarity, we normalize it. In addition, we generalize previous methods using some linguistic information to one representative system. In order to verify the correctness of the proposed context-weighted metrics, we carry out the experiment to compare it with generalized previous methods.

Fuzzy Similarity Measure (퍼지 유사도 척도)

  • Lee, Kwang-Hyung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.8 no.6
    • /
    • pp.119-121
    • /
    • 1998
  • For a fuzzy system modeled by a fuzzy hypergraph, two fuzzy similarity measures are proposed:one for the fuzzy similarity between fuzzy sets and the other between elements in fuzzy sets. The proposed measures can represent the realistic similarities which can not be given by the existing measures. With an example, it is shown that it can be used in the system analysis.

  • PDF

Feasibility Test and Design of Korean Translation Memory System (한국어 번역 메모리 시스템의 실현성 분석 및 설계)

  • Ryu, Cheol;Roh, Yoon-Hyung;Lee, Ki-Young;Choi, Sung-Kwon;Park, Sang-Kyu
    • Annual Conference on Human and Language Technology
    • /
    • 2001.10d
    • /
    • pp.281-287
    • /
    • 2001
  • 번역 메모리(Translation Memory) 시스템이란 기존에 번역된 결과를 담고 있는 대용량의 번역 메모리에서 사용자가 제시한 입력문과 가장 유사한 문장을 검색한 후, 유사도 순으로 결과를 제시하여 이후의 번역 작업을 보다 효율적으로 할 수 있도록 도와주는 시스템을 말한다. 이는 기계 번역 시스템과 비교해 볼때, 보다 실현 가능성이 높은 자연어 처리의 응용 분야라고 할 수 있다. 일반적으로 번역 메모리 시스템에서 핵심이 되는 요소는 번역메모리의 구성과 유사성 척도에 대한 정의라고 할 수 있다. 국외의 경우, 이미 많은 상용 시스템들이 개발되어 번역 작업의 시간 및 비용을 줄이는데 많은 도움을 주고 있지만, 국내의 경우 한국어 번역 메모리의 구성 및 한국어 문장간 유사성 척도 등에 대한 연구가 미흡한 실정이다. 따라서 본 논문에서는 한국어를 대상으로 번역 메모리의 효율적인 구성 방법 및 문장간 유사성 척도에 대한 정의를 내리며, 한국어를 대상으로한 번역 메모리 시스템에 대한 실현 가능성을 논한다.

  • PDF

Non-parametric approach for the grouped dissimilarities using the multidimensional scaling and analysis of distance (다차원척도법과 거리분석을 활용한 그룹화된 비유사성에 대한 비모수적 접근법)

  • Nam, Seungchan;Choi, Yong-Seok
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.4
    • /
    • pp.567-578
    • /
    • 2017
  • Grouped multivariate data can be tested for differences between two or more groups using multivariate analysis of variance (MANOVA). However, this method cannot be used if several assumptions of MANOVA are violated. In this case, multidimensional scaling (MDS) and analysis of distance (AOD) can be applied to grouped dissimilarities based on the various distances. A permutation test is a non-parametric method that can also be used to test differences between groups. MDS is used to calculate the coordinates of observations from dissimilarities and AOD is useful for finding group structure using the coordinates. In particular, AOD is mathematically associated with MANOVA if using the Euclidean distance when computing dissimilarities. In this paper, we study the between and within group structure by applying MDS and AOD to the grouped dissimilarities. In addition, we propose a new test statistic using the group structure for the permutation test. Finally, we investigate the relationship between AOD and MANOVA from dissimilarities based on the Euclidean distance.

Multidimensional scaling of categorical data using the partition method (분할법을 활용한 범주형자료의 다차원척도법)

  • Shin, Sang Min;Chun, Sun-Kyung;Choi, Yong-Seok
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.1
    • /
    • pp.67-75
    • /
    • 2018
  • Multidimensional scaling (MDS) is an exploratory analysis of multivariate data to represent the dissimilarity among objects in the geometric low-dimensional space. However, a general MDS map only shows the information of objects without any information about variables. In this study, we used MDS based on the algorithm of Torgerson (Theory and Methods of Scaling, Wiley, 1958) to visualize some clusters of objects in categorical data. For this, we convert given data into a multiple indicator matrix. Additionally, we added the information of levels for each categorical variable on the MDS map by applying the partition method of Shin et al. (Korean Journal of Applied Statistics, 28, 1171-1180, 2015). Therefore, we can find information on the similarity among objects as well as find associations among categorical variables using the proposed MDS map.

Image Recognition by Using Hybrid Coefficient Measure of Correlation and Distance (상관계수과 거리계수의 조합형 척도를 이용한 영상인식)

  • Hong, Seong-Jun;Cho, Yong-Hyun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.20 no.3
    • /
    • pp.343-347
    • /
    • 2010
  • This paper presents an efficient image recognition method using the hybrid coefficient measure of correlation and distance. The correlation coefficient is applied to measure the statistical similarity by using Pearson coefficient, and distance coefficient is also applied to measure the spacial similarity by using city-block. The total similarity among images is calculated by extending the similarity between the feature vectors, then the feature vectors can be extracted by PCA and ICA, respectively. The proposed method has been applied to the problem for recognizing the 960(30 persons * 4 expressions * 2 lights * 4 poses) facial images of 40*50 pixels. The experimental results show that the proposed method of ICA has a superior recognition performances than the method using PCA, and is affected less by the environmental influences so as lighting.

A New Similarity Measure using Fuzzy Logic for User-based Collaborative Filtering (사용자 기반의 협력필터링을 위한 퍼지 논리를 이용한 새로운 유사도 척도)

  • Lee, Soojung
    • The Journal of Korean Association of Computer Education
    • /
    • v.21 no.5
    • /
    • pp.61-68
    • /
    • 2018
  • Collaborative filtering is a fundamental technique implemented in many commercial recommender systems and provides a successful service to online users. This technique recommends items by referring to other users who have similar rating records to the current user. Hence, similarity measures critically affect the system performance. This study addresses problems of previous similarity measures and suggests a new similarity measure. The proposed measure reflects the subjectivity or vagueness of user ratings and the users' rating behavior by using fuzzy logic. We conduct experimental studies for performance evaluation, whose results show that the proposed measure demonstrates outstanding performance improvements in terms of prediction accuracy and recommendation accuracy.

A Hot Coil Quality Design Su, pp.rt System using Case Based Reasoning (사례기반추론을 이용한 열연제품 품질설계지원시스템)

  • 고영관;박상혁;서민수;임여종
    • Journal of Intelligence and Information Systems
    • /
    • v.3 no.1
    • /
    • pp.101-109
    • /
    • 1997
  • 철강제품의 품질설계란 제품의 주문요구조건을 만족시키기 위해 제품의 성분 및 생산공정을 결정하는 과정을 의미한다. 본 연구에서는 품질설계업무를 지원하기 위한 시스템을 개발하였다. 설계업무의 특성을 고려하여 과거사례를 설계에 이용하기 위해, 사례기반추론(Case-based Reasoning)접근방법을 이용하였다. 본 연구에서는 또한 유사사례의 효율적 검색을 위해 품질설계 문제에 적합한 유사성척도를 제안하고 있으며, 문제에 적합한 유사성척도를 제안하고 있으며, 문제에 적합한 지식관리 방법 및 설계조정 방법을 개발하였다.

  • PDF