• Title/Summary/Keyword: similarity-based

Search Result 3,604, Processing Time 0.027 seconds

A Real-Time Concept-Based Text Categorization System using the Thesauraus Tool (시소러스 도구를 이용한 실시간 개념 기반 문서 분류 시스템)

  • 강원석;강현규
    • Journal of KIISE:Software and Applications
    • /
    • v.26 no.1
    • /
    • pp.167-167
    • /
    • 1999
  • The majority of text categorization systems use the term-based classification method. However, because of too many terms, this method is not effective to classify the documents in areal-time environment. This paper presents a real-time concept-based text categorization system,which classifies texts using thesaurus. The system consists of a Korean morphological analyzer, athesaurus tool, and a probability-vector similarity measurer. The thesaurus tool acquires the meaningsof input terms and represents the text with not the term-vector but the concept-vector. Because theconcept-vector consists of semantic units with the small size, it makes the system enable to analyzethe text with real-time. As representing the meanings of the text, the vector supports theconcept-based classification. The probability-vector similarity measurer decides the subject of the textby calculating the vector similarity between the input text and each subject. In the experimentalresults, we show that the proposed system can effectively analyze texts with real-time and do aconcept-based classification. Moreover, the experiment informs that we must expand the thesaurustool for the better system.

Information Management by Data Quantification with FuzzyEntropy and Similarity Measure

  • Siang, Chua Hong;Lee, Sanghyuk
    • Journal of the Korea Convergence Society
    • /
    • v.4 no.2
    • /
    • pp.35-41
    • /
    • 2013
  • Data management with fuzzy entropy and similarity measure were discussed and verified by applying reliable data selection problem. Calculation of certainty or uncertainty for data, fuzzy entropy and similarity measure are designed and proved. Proposed fuzzy entropy and similarity are considered as dissimilarity measure and similarity measure, and the relation between two measures are explained through graphical illustration.Obtained measures are useful to the application of decision theory and mutual information analysis problem. Extension of data quantification results based on the proposed measures are applicable to the decision making and fuzzy game theory.

Mining Clusters of Sequence Data using Sequence Element-based Similarity Measure (시퀀스 요소 기반의 유사도를 이용한 시퀀스 데이터 클러스터링)

  • 오승준;김재련
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2004.11a
    • /
    • pp.221-229
    • /
    • 2004
  • Recently, there has been enormous growth in the amount of commercial and scientific data, such as protein sequences, retail transactions, and web-logs. Such datasets consist of sequence data that have an inherent sequential nature. However, only a few of the existing clustering algorithms consider sequentiality. This study presents a method for clustering such sequence datasets. The similarity between sequences must be decided before clustering the sequences. This study proposes a new similarity measure to compute the similarity between two sequences using a sequence element. Two clustering algorithms using the proposed similarity measure are proposed: a hierarchical clustering algorithm and a scalable clustering algorithm that uses sampling and a k-nearest neighbor method. Using a splice dataset and synthetic datasets, we show that the quality of clusters generated by our proposed clustering algorithms is better than that of clusters produced by traditional clustering algorithms.

  • PDF

APPLICATIONS OF SIMILARITY MEASURES FOR PYTHAGOREAN FUZZY SETS BASED ON SINE FUNCTION IN DECISION-MAKING PROBLEMS

  • ARORA, H.D.;NAITHANI, ANJALI
    • Journal of applied mathematics & informatics
    • /
    • v.40 no.5_6
    • /
    • pp.897-914
    • /
    • 2022
  • Pythagorean fuzzy sets (PFSs) are capable of modelling information with more uncertainties in decision-making problems. The essential feature of PFSs is that they are described by three parameters: membership function, non-membership function and hesitant margin, with the total of the squares of each parameter equal to one. The purpose of this article is to suggest some new similarity measures and weighted similarity measures for PFSs. Numerical computations have been carried out to validate our proposed measures. Applications of these measures have been applied to some real-life decision-making problems of pattern detection and medicinal investigations. Moreover, a descriptive illustration is employed to compare the results of the proposed measures with the existing analogous similarity measures to show their effectiveness.

Ontology Matching Method Based on Word Embedding and Structural Similarity

  • Hongzhou Duan;Yuxiang Sun;Yongju Lee
    • International journal of advanced smart convergence
    • /
    • v.12 no.3
    • /
    • pp.75-88
    • /
    • 2023
  • In a specific domain, experts have different understanding of domain knowledge or different purpose of constructing ontology. These will lead to multiple different ontologies in the domain. This phenomenon is called the ontology heterogeneity. For research fields that require cross-ontology operations such as knowledge fusion and knowledge reasoning, the ontology heterogeneity has caused certain difficulties for research. In this paper, we propose a novel ontology matching model that combines word embedding and a concatenated continuous bag-of-words model. Our goal is to improve word vectors and distinguish the semantic similarity and descriptive associations. Moreover, we make the most of textual and structural information from the ontology and external resources. We represent the ontology as a graph and use the SimRank algorithm to calculate the structural similarity. Our approach employs a similarity queue to achieve one-to-many matching results which provide a wider range of insights for subsequent mining and analysis. This enhances and refines the methodology used in ontology matching.

Analysis of Image Similarity Index of Woven Fabrics and Virtual Fabrics - Application of Textile Design CAD System and Shuttle Loom - (직물과 가상소재의 화상 유사성 분석 연구 - 수직기 및 텍스타일 CAD시스템 활용 -)

  • Yoon, Jung-Won;Kim, Jong-Jun
    • Fashion & Textile Research Journal
    • /
    • v.15 no.6
    • /
    • pp.1010-1017
    • /
    • 2013
  • Current global textiles and fashion industries have gradually shifted focus to high value-added, high sensibility, and multi-functional products based on new human-friendliness and sustainable growth technologies. Textile design CAD systems have been developed in conjunction with computer hardware and software sector advances. This study compares the patterns or images of actual woven fabrics and virtual fabrics prepared with a textile design CAD system. In this study, several weave structures (such as fancy yarn weave and patterns) were prepared with a shuttle loom. The woven textile images were taken using a CCD camera. The same weave structure data and yarn data were fed into a textile design CAD system in order to simulate fabric images as similarly as possible. Similarity Index analysis methods allowed for an analysis of the index between the actual fabric specimen and the simulated image of the corresponding fabric. The results showed that repeated small pattern weaves provide superior similarity index values than those of a fancy yarn weave that indicate some irregularities due to fancy yarn attributes. A Complex Wavelet Structural Similarity(CW-SSIM) index resulted in a better index than other methods such as Multi-Scale(MS) SSIM, and Feature Similarity(FS) SSIM, across fabric specimen images. A correlation analysis of the similarity index based on an image analysis and a similarity evaluation by panel members was also implemented.

The Effect of OCB Profile Similarity between Individual and Colleagues on Experienced Incivility (개인-동료 간 OCB 프로파일 유사도가 무례경험에 미치는 영향 연구)

  • Song, Gi-Ryung;Kim, Kyoung-Seok
    • Asia-Pacific Journal of Business
    • /
    • v.13 no.3
    • /
    • pp.245-259
    • /
    • 2022
  • Purpose - Studies have been continuously carried out by researchers so far to clarify the factors influencing employee's incivility at work. However, the behavior of employees who are the target of incivility not much has been revealed which behavior affects the experience of incivility. Among them, it is interesting that the effect of OCB, a representative of employees' positive behavior in the workplace, on their experienced incivility has not been investigated. Therefore, this study attempted to clarify the relationship between OCB and experienced incivility that previous studies have not yet discovered. Design/methodology/approach - In the process, the concept of profile similarity was introduced and based on this, it was assumed that the OCB profile similarity between individual and colleagues, not the absolute level of OCB, would affect the experienced incivility and demonstrated this. The analysis was conducted by applying the survey data obtained from 205 employees to hierarchical regression analysis. Findings - As a result of the analysis, it was examined that the absolute OCB value used in previous studies did not significantly affect the experienced incivility, but the higher the similarity level, the less experienced incivility. The implications obtained based on this and future research directions are discussed together in the conclusion. Research implications or Originality - This study is the first one that considers OCB's profile similarity as a antecedent of experienced incivility. OCB profile similarity concept was only treated as a theoretical issue even in very early stage of OCB research stream, but this study examines the significant effect of OCB profile similarity. Moreover, behavioral antecedents of experienced incivility has not been identified well, but this study finds out that OCB can be a behavioral antecedent of experienced incivility.

Content based Video Segmentation Algorithm using Comparison of Pattern Similarity (장면의 유사도 패턴 비교를 이용한 내용기반 동영상 분할 알고리즘)

  • Won, In-Su;Cho, Ju-Hee;Na, Sang-Il;Jin, Ju-Kyong;Jeong, Jae-Hyup;Jeong, Dong-Seok
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.10
    • /
    • pp.1252-1261
    • /
    • 2011
  • In this paper, we propose the comparison method of pattern similarity for video segmentation algorithm. The shot boundary type is categorized as 2 types, abrupt change and gradual change. The representative examples of gradual change are dissolve, fade-in, fade-out or wipe transition. The proposed method consider the problem to detect shot boundary as 2-class problem. We concentrated if the shot boundary event happens or not. It is essential to define similarity between frames for shot boundary detection. We proposed 2 similarity measures, within similarity and between similarity. The within similarity is defined by feature comparison between frames belong to same shot. The between similarity is defined by feature comparison between frames belong to different scene. Finally we calculated the statistical patterns comparison between the within similarity and between similarity. Because this measure is robust to flash light or object movement, our proposed algorithm make contribution towards reducing false positive rate. We employed color histogram and mean of sub-block on frame image as frame feature. We performed the experimental evaluation with video dataset including set of TREC-2001 and TREC-2002. The proposed algorithm shows the performance, 91.84% recall and 86.43% precision in experimental circumstance.

Feature-based Similarity Assessment for Re-using CAD Models (CAD 모델 재사용을 위한 특징형상기반 유사도 측정에 관한 연구)

  • Park, Byoung-Keon;Kim, Jay-Jung
    • Korean Journal of Computational Design and Engineering
    • /
    • v.16 no.1
    • /
    • pp.21-30
    • /
    • 2011
  • Similarity assessment of a CAD model is one of important issues from the aspect of model re-using. In real practice, many new mechanical parts are designed by modifying existing ones. The reuse of part enables to save design time and efforts for the designers. Design time would be further reduced if there were an efficient way to search for existing similar designs. This paper proposes an efficient algorithm of similarity assessment for mechanical part model with design history embedded within the CAD model. Since it is possible to retrieve the design history and detailed-feature information using CAD API, we can obtain an accurate and reliable assessment result. For our purpose, our assessment algorithm can be divided by two: (1) we select suitable parts by comparing MSG (Model Signature Graph) extracted from a base feature of the required model; (2) detailed-features' similarities are assessed with their own attributes and reference structures. In addition, we also propose a indexing method for managing a model database in the last part of this article.