• Title/Summary/Keyword: graph similarity

Search Result 142, Processing Time 0.023 seconds

PageRank Algorithm Using Link Context (링크내역을 이용한 페이지점수법 알고리즘)

  • Lee, Woo-Key;Shin, Kwang-Sup;Kang, Suk-Ho
    • Journal of KIISE:Databases
    • /
    • v.33 no.7
    • /
    • pp.708-714
    • /
    • 2006
  • The World Wide Web has become an entrenched global medium for storing and searching information. Most people begin at a Web search engine to find information, but the user's pertinent search results are often greatly diluted by irrelevant data or sometimes appear on target but still mislead the user in an unwanted direction. One of the intentional, sometimes vicious manipulations of Web databases is Web spamming as Google bombing that is based on the PageRank algorithm, one of the most famous Web structuring techniques. In this paper, we regard the Web as a directed labeled graph that Web pages represent nodes and the corresponding hyperlinks edges. In the present work, we define the label of an edge as having a link context and a similarity measure between link context and the target page. With this similarity, we can modify the transition matrix of the PageRank algorithm. A motivating example is investigated in terms of the Singular Value Decomposition with which our algorithm can outperform to filter the Web spamming pages effectively.

Application of diversity of recommender system accordingtouserpreferencechange (사용자 선호도 변화에 따른 추천시스템의 다양성 적용)

  • Na, Hyeyeon;Nam, Kihwan
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.67-86
    • /
    • 2020
  • Recommender Systems have been huge influence users and business more and more. Recently the importance of E-commerce has been reached rapid growth greatly in world-wide COVID-19 pandemic. Recommender system is the center of E-commerce lively. Top ranked E-commerce managers mentioned that recommender systems have a major influence on customer's purchase such as about 50% of Netflix, Amazon sales from their recommender systems. Most algorithms have been focused on improving accuracy of recommender system regardless of novelty, diversity, serendipity etc. Recommender systems with only high accuracy cannot satisfy business long-term profit because of generating sales polarization. In addition, customers do not experience enjoyment of shopping from only focusing accuracy recommender system because customer's preference is changed constantly. Therefore, recommender systems with various values need to be developed for user's high satisfaction. Reranking is the most useful methodology to realize diversity of recommender system. In this paper, diversity of recommender system is represented through constructing high similarity with users who have different preference using each user's purchased item's category algorithm. It is distinguished from past research approach which is changing the algorithm of recommender system without user's diversity preference level. We tried to discover user's diversity preference level and observed the results how the effect was different according to user's diversity preference level. In addition, graph-based recommender system was used to show diversity through user's network, not collaborative filtering. In this paper, Amazon Grocery and Gourmet Food data was used because the low-involvement product, such as habitual product, foods, low-priced goods etc., had high probability to show customer's diversity. First, a bipartite graph with users and items simultaneously is constructed to make graph-based recommender system. However, each users and items unipartite graph also need to be established to show diversity of recommender system. The weight of each unipartite graph has played crucial role changing Jaccard Distance of item's category. We can observe two important results from the user's unipartite network. First, the user's diversity preference level is observed from the network and second, dissimilar users can be discovered in the user's network. Through the research process, diversity of recommender system is presented highly with small accuracy loss and optimalization for higher accuracy is possible controlling diversity ratio. This paper has three important theoretical points. First, this research expands recommender system research for user's satisfaction with various values. Second, the graph-based recommender system is developed newly. Third, the evaluation indicator of diversity is made for diversity. In addition, recommender systems are useful for corporate profit practically and this paper has contribution on business closely. Above all, business long-term profit can be improved using recommender system with diversity and the recommender system can provide right service according to user's diversity level. Lastly, the corporate selling low-involvement products have great effect based on the results.

A discursive approach to analysis of definition of graph in first year middle school textbooks (담론적 관점(discursive approach)에서 중1 수학 교과서의 그래프 정의 분석)

  • Kim, Won;Choi, Sang-Ho;Kim, Dong-Joong
    • Communications of Mathematical Education
    • /
    • v.32 no.3
    • /
    • pp.407-433
    • /
    • 2018
  • In order to analyze textbooks from a discursive approach, the purpose of this study is to structuralize an analytic framework based on previous literature review and apply it to analyzing the meanings and their syntheses developed by words and visual mediators appeared in the definition of graph in first-year middle school textbooks. The discursive approach consists of the communicational approach developed by Sfard(2008) and the systemic functional linguistics developed by Halliday(1985/2004). In this study, ideational meta-functions for ideational meanings and interpersonal meta-functions for interpersonal meanings were employed to analyze the meanings produced by words and visual mediators in textbooks, whereas textual meta-functions for textual meanings were used for analyzing the synthesized relationships between words and visual mediators. Results show that first, density in mathematical discourse was very high and subjects in mathematical activities were ambiguous in the ideational meanings of words, and behavior aspect was more emphasized than thinking aspect in the interpersonal meanings of words which request student participations. In the case of ideational meanings of visual mediators, there was a lack of narrative diagrams, whereas there were qualitative differences in the case of offer. Second, there was a need for promoting a wide range of diverse synthetic relationships between words and visual mediators for developing enriched mathematical meanings through the varying uses like specification, explanation, similarity, and complement. These results are so important that they provide a new analytic framework from a discursive approach to textbook analysis because not only words, but also visual mediators are analyzed as tools for producing meanings in mathematics textbooks and their synthetic relationships are also examined.

Detection of M:N corresponding class group pairs between two spatial datasets with agglomerative hierarchical clustering (응집 계층 군집화 기법을 이용한 이종 공간정보의 M:N 대응 클래스 군집 쌍 탐색)

  • Huh, Yong;Kim, Jung-Ok;Yu, Ki-Yun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.30 no.2
    • /
    • pp.125-134
    • /
    • 2012
  • In this paper, we propose a method to analyze M:N corresponding relations in semantic matching, especially focusing on feature class matching. Similarities between any class pairs are measured by spatial objects which coexist in the class pairs, and corresponding classes are obtained by clustering with these pairwise similarities. We applied a graph embedding method, which constructs a global configuration of each class in a low-dimensional Euclidean space while preserving the above pairwise similarities, so that the distances between the embedded classes are proportional to the overall degree of similarity on the edge paths in the graph. Thus, the clustering problem could be solved by employing a general clustering algorithm with the embedded coordinates. We applied the proposed method to polygon object layers in a topographic map and land parcel categories in a cadastral map of Suwon area and evaluated the results. F-measures of the detected class pairs were analyzed to validate the results. And some class pairs which would not detected by analysis on nominal class names were detected by the proposed method.

An Effective Method for Comparing Control Flow Graphs through Edge Extension (에지 확장을 통한 제어 흐름 그래프의 효과적인 비교 방법)

  • Lim, Hyun-Il
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.2 no.8
    • /
    • pp.317-326
    • /
    • 2013
  • In this paper, we present an effective method for comparing control flow graphs which represent static structures of binary programs. To compare control flow graphs, we measure similarities by comparing instructions and syntactic information contained in basic blocks. In addition, we also consider similarities of edges, which represent control flows between basic blocks, by edge extension. Based on the comparison results of basic blocks and edges, we match most similar basic blocks in two control flow graphs, and then calculate the similarity between control flow graphs. We evaluate the proposed edge extension method in real world Java programs with respect to structural similarities of their control flow graphs. To compare the performance of the proposed method, we also performed experiments with a previous structural comparison for control flow graphs. From the experimental results, the proposed method is evaluated to have enough distinction ability between control flow graphs which have different structural characteristics. Although the method takes more time than previous method, it is evaluated to be more resilient than previous method in comparing control flow graphs which have similar structural characteristics. Control flow graph can be effectively used in program analysis and understanding, and the proposed method is expected to be applied to various areas, such as code optimization, detection of similar code, and detection of code plagiarism.

A Comparative Analysis about Various Editions of Donguibogam (판본별 교감을 통한 『동의보감』의 정본화)

  • Lee, Jeong-Hyeon;Oh, Junho
    • The Journal of Korean Medical History
    • /
    • v.31 no.1
    • /
    • pp.57-70
    • /
    • 2018
  • Much research has already been done on Donguibogam. However, comparison of specific characters was not done because researchers found it difficult to compare different editions of the text in one place. Recently, important editions have been published on the Internet, making comparison possible. In this paper, researchers compare eight editions Donguibogam, including the original edition published in 1613 and seven other editions corrected by the Naeuiwon (Joseon Dynasty National Medical Center). The comparison results were summarized and tabulated. The results of the comparison are analyzed and presented in this article as a chart. The result of comparing the characters and the analyzed graph were in agreement. The authors propose that all written and electronic publications of Donguibogam should refer to other editions implied, quoted or referenced within the text and including with proper citations, and reference the original and first edition. Inadequate referencing will pollute future knowledge of this foundational text of Traditional Korean Medicine and may result in perpetration of mis-information. Based on accumulated knowledge and study of historical Korean Medicine texts, the Namsan edition made a mistake in the editing process. The year of publication of Gabsul-yoengyoeng-gegan Edition needs to be studied again and corrections made where appropriate.

Association between Shopping Items and the Demographics of Foreign Tourists in South Korea

  • Jeong, Dong-Bin
    • East Asian Journal of Business Economics (EAJBE)
    • /
    • v.7 no.3
    • /
    • pp.63-73
    • /
    • 2019
  • Purposes - In this research, we look over and investigate associations between shopping items and the demographics of foreign tourists in South Korea. The related seven variables are gender, age, occupations, country of residence, visit month, visit purpose and trip type. In addition, we can graph twenty-one shopping items in association with the demographics of foreign tourists by computing their dissimilarity or similarity on two dimension planes granting that the association exists between the underlying variables. Research design and methodology - This research is performed by Ministry of Culture and Tourism in 2017 and investigated 13,200 foreign tourists from 20 countries. For analyzing the detailed relationships between shopping items and the demographics of foreign tourists, we take advantage of both independent test and correspondence analysis as key statistical techniques. Results - The findings show that shopping items which foreign tourists purchase are closely associated with the three different demographics variables such as country of residence, tour type and visit purpose by monitoring significant p-value of chi-squared statistic Conclusions - This study suggests Ministry of Culture, Sports and Tourism must explore ways toward tourism infrastructure such as global marketing, municipality strategy for attracting foreign tourists, development of diverse shopping items and services and so on.

A Study on the Spatial Configuration Characteristics of the Apartment Building Type based on the Space Syntax Analysis (공간구문론 분석에 의한 아파트 주동형식별 공간배치 특성에 관한 연구)

  • 박몽섭;박찬돈;하재명
    • Journal of the Korean housing association
    • /
    • v.15 no.1
    • /
    • pp.185-194
    • /
    • 2004
  • This paper deals with the spatial configuration of the unit floor plan based on the apartment building type. It is generally agreed that various apartment building types were organized by diverse apartment unit floor plan. The reason behind this diversity, apartment unit plan are affected by the environmental condition and the diverse spatial composition of the unit floor plan. Apartment building types were classified by the similarity of the justified-graph type. This types classified into two category; One core type which were classified by the shape of the core, and multiple core type which were compounded by more than two core. These categories divided into 5 types, and 2 types. Each types were compared in view of the mean depth, relative asymmetry, integration value. Consequently, these types could be classified in the number of the unit floor plan. It is profitable to 3 unit types which was analyzed in view of an indicator of the spatial configuration. Therefore, 3 unit types is favorable to the composition of the apartment building types.

A Study of an Image Retrieval Method using Binary Subimage (이진 부분영상을 이용한 영상 검색 기법에 관한 연구)

  • 정순영;최민규;남재열
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.2 no.1
    • /
    • pp.28-37
    • /
    • 2001
  • An image retrieval method combining shape information of 2-dimension color histograms with color information of HSI color histograms is proposed in this paper. In addition, the proposed method can find location information of image through the comparison of similarity among subimages. The suggested retrieval method applies the location information to shape and color information and can retrieve region information which is hard to distinguish in the binary image. Some simulation results show that it works very well in the behalf of precision/recall graph compare with conventional method which uses color histogram. Especially, the proposed method brought well effects such as rotations and transitions of the objects in an image was found.

  • PDF

A Covariance-matching-based Model for Musical Symbol Recognition

  • Do, Luu-Ngoc;Yang, Hyung-Jeong;Kim, Soo-Hyung;Lee, Guee-Sang;Dinh, Cong Minh
    • Smart Media Journal
    • /
    • v.7 no.2
    • /
    • pp.23-33
    • /
    • 2018
  • A musical sheet is read by optical music recognition (OMR) systems that automatically recognize and reconstruct the read data to convert them into a machine-readable format such as XML so that the music can be played. This process, however, is very challenging due to the large variety of musical styles, symbol notation, and other distortions. In this paper, we present a model for the recognition of musical symbols through the use of a mobile application, whereby a camera is used to capture the input image; therefore, additional difficulties arise due to variations of the illumination and distortions. For our proposed model, we first generate a line adjacency graph (LAG) to remove the staff lines and to perform primitive detection. After symbol segmentation using the primitive information, we use a covariance-matching method to estimate the similarity between every symbol and pre-defined templates. This method generates the three hypotheses with the highest scores for likelihood measurement. We also add a global consistency (time measurements) to verify the three hypotheses in accordance with the structure of the musical sheets; one of the three hypotheses is chosen through a final decision. The results of the experiment show that our proposed method leads to promising results.