• Title/Summary/Keyword: semantic distance (시맨틱 거리)

A New Semantic Distance Measurement Method using TF-IDF in Linked Open Data (링크드 오픈 데이터에서 TF-IDF를 이용한 새로운 시맨틱 거리 측정 기법)

  • Cho, Jung-Gil
    • Journal of the Korea Convergence Society / v.11 no.10 / pp.89-96 / 2020
  • Linked Data allows structured data to be published in a standard way so that datasets from various domains can be interlinked. With the rapid evolution of Linked Open Data (LOD), researchers are exploiting it to solve particular problems such as semantic similarity assessment. In this paper, we propose a method, built on the basic concept of Linked Data Semantic Distance (LDSD), for calculating the semantic distance between resources for use in an LOD-based recommender system. The proposed semantic distance measurement model is based on a similarity measurement that combines the LOD-based semantic distance with a new link weight using TF-IDF, which is well known in the field of information retrieval. To verify the effectiveness of the approach, its performance was evaluated in the context of an LOD-based recommender system using mixed data from DBpedia and MovieLens. Experimental results show that the proposed method achieves higher accuracy than other similar methods. In addition, it improved the accuracy of the recommender system by expanding the range over which semantic distance is calculated.
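
The abstract does not give the exact formula, so the following is only a minimal sketch of the general idea: a direct-link semantic distance in which each link type carries an IDF-style weight, so that rarer link types count more. The triples, the `idf_weights` and `weighted_direct_distance` helpers, and the smoothing term `+ 1.0` are all hypothetical choices made for illustration, not the paper's model.

```python
import math
from collections import defaultdict

# Toy triples (subject, link type, object); every name here is hypothetical.
TRIPLES = [
    ("film_a", "director", "person_x"),
    ("film_b", "director", "person_x"),
    ("film_a", "related", "film_b"),
    ("film_a", "sequel", "film_c"),
    ("film_b", "related", "film_c"),
    ("film_c", "related", "film_a"),
]

def idf_weights(triples):
    """IDF-style weight per link type: link types used by fewer subjects weigh more."""
    subjects = {s for s, _, _ in triples}
    users = defaultdict(set)
    for s, p, _ in triples:
        users[p].add(s)
    # "+ 1.0" keeps ubiquitous link types from dropping to zero weight (an assumption).
    return {p: math.log(len(subjects) / len(ss)) + 1.0 for p, ss in users.items()}

def weighted_direct_distance(a, b, triples, weights):
    """1 / (1 + total weight of the direct links between a and b, in either direction)."""
    total = sum(weights[p] for s, p, o in triples if (s, o) in ((a, b), (b, a)))
    return 1.0 / (1.0 + total)

weights = idf_weights(TRIPLES)
print(weighted_direct_distance("film_a", "film_b", TRIPLES, weights))
print(weighted_direct_distance("film_a", "film_c", TRIPLES, weights))
```

In this toy graph, `film_a` and `film_c` are joined by a rare `sequel` link plus a `related` link, so they end up closer than `film_a` and `film_b`, which share only a common `related` link. Resources with no direct link get the maximum distance of 1.0.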

A Technique for Generating Semantic Trajectories by Using GPS Moving Trajectories and POI information (GPS 이동 궤적과 관심지점 정보를 이용한 시맨틱 궤적 생성 기법)

  • Jang, Yuhee; Lee, Juwon; Lim, Hyo-Sang
    • Proceedings of the Korea Information Processing Society Conference / 2015.04a / pp.722-725 / 2015
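  • In mobile environments, users' GPS trajectories are being used as a new resource for location-based services (LBS). To extend location-based services, it is necessary not only to display a user's location on a map but also to discover the meaning of the places the user has been. To this end, recent studies combine a user's location data with point-of-interest (POI) information to generate and analyze semantic trajectories. These existing studies generate a semantic trajectory by finding where the user's GPS trajectory overlaps a POI's area information (polygon). However, most publicly available POI data provide only coordinate (point) values rather than the area of the actual place, so the existing methods cannot generate semantic trajectories from them. In this paper, we propose a method that uses the user's GPS trajectory and POI coordinates to estimate the POIs the user is likely to have actually visited and generates a semantic trajectory from them. The proposed technique uses the speed information of the GPS trajectory to identify the intervals in which the user was stationary, and expands each stop interval into a region according to the POI density around it. Then, among the POIs contained in the region, the POI that is closest to the stop interval and was contained for the longest time is judged to be the POI the user visited. This method has the advantage that it can generate semantic trajectories even in the restricted situation where POI area information is unavailable.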
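
As a rough illustration of the stop-detection step, the sketch below groups consecutive GPS samples whose speed falls below a threshold and then picks the POI nearest to the centre of each stop. It is a simplification under stated assumptions: coordinates are planar metres, the 0.5 m/s threshold is arbitrary, and the density-based region expansion and dwell-time criterion from the abstract are omitted. The function names are hypothetical.

```python
import math

def stop_segments(points, max_speed=0.5):
    """Group consecutive samples whose speed from the previous sample is below max_speed (m/s).

    points: list of (t_seconds, x_metres, y_metres), ordered by time.
    """
    segments, current = [], []
    for prev, cur in zip(points, points[1:]):
        dt = cur[0] - prev[0]
        speed = math.dist(prev[1:], cur[1:]) / dt if dt > 0 else float("inf")
        if speed < max_speed:
            if not current:
                current = [prev]      # the stop starts at the earlier sample
            current.append(cur)
        elif current:
            segments.append(current)  # movement resumed: close the stop
            current = []
    if current:
        segments.append(current)
    return segments

def nearest_poi(segment, pois):
    """Return the name of the POI closest to the centroid of a stop segment."""
    cx = sum(p[1] for p in segment) / len(segment)
    cy = sum(p[2] for p in segment) / len(segment)
    return min(pois, key=lambda name: math.dist((cx, cy), pois[name]))

track = [(0, 0, 0), (10, 50, 0), (20, 100, 0), (30, 101, 0),
         (40, 101, 1), (50, 102, 1), (60, 150, 1)]
pois = {"cafe": (103, 2), "library": (140, 0)}
for seg in stop_segments(track):
    print(seg[0][0], seg[-1][0], nearest_poi(seg, pois))
```

Only POI point coordinates are needed here, which is the constraint the abstract highlights; a fuller version would also weigh how long each candidate POI stays inside the expanded stop region.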

Similarity Calculation for Mobile Life Log Data Mining (모바일 라이프로그 데이터 마이닝을 위한 Non-Euclidean 데이터의 유사도 계산)

  • Lee, Young-Seol; Cho, Sung-Bae
    • Proceedings of the Korean Information Science Society Conference / 2011.06a / pp.298-301 / 2011
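  • Much of the information collected on mobile devices contains semantic information, which makes it difficult to apply data mining methods, such as clustering, that are specialized for numerical analysis. Methods that compute relative similarity are therefore widely used, but in many cases even the relative similarity values cannot be converted into Euclidean distances. In this paper, we propose a method that converts similarities with non-Euclidean characteristics into distance values in Euclidean space by applying TF-IDF and pseudo-Euclidean embedding. To show the feasibility of the proposed method, we collect data reflecting the daily life patterns of university students from mobile devices and apply the proposed method to the collected data. We then analyze the results by comparing them with the students' life patterns. We also develop a prototype application that uses the similarity between places.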

The Semantic Zooming Method for Efficient Web Browsing on Internet-connected Digital Television (IPTV 환경에서 효율적인 웹 탐색을 위한 시맨틱 주밍 기법)

  • Chung, Ji-Hye; Lee, Hye-Jeong; Lea, Jong-Ho; Kim, Yeun-Bae
    • Proceedings of the HCI Society of Korea Conference (한국HCI학회:학술대회논문집) / 2008.02a / pp.579-583 / 2008
  • Web pages with complex layouts and small fonts do not display well on large screens such as TVs, because the TV setting imposes constraints: a long viewing distance, a passive user attitude, and a limited input device such as a legacy remote control. We have designed and implemented new semantic zoom browsing facilities to support effective navigation on Internet-connected digital television under these constraints. Our browser partitions the content of an HTML document into semantic blocks. Semantic blocks present summarized information in a more readable style and a modified layout for optimal reading and browsing. The user can select individual blocks and zoom in to see more detailed information. Scrolling on a large display device requires more user interaction, so our browser modifies the layout of an HTML document to remove horizontal scrolling and minimize vertical scrolling. This method lets users view a web page easily by converting it into an optimal reading style and layout, and find information simply by zooming.

Semantic Document-Retrieval Based on Markov Logic (마코프 논리 기반의 시맨틱 문서 검색)

  • Hwang, Kyu-Baek; Bong, Seong-Yong; Ku, Hyeon-Seo; Paek, Eun-Ok
    • Journal of KIISE: Computing Practices and Letters / v.16 no.6 / pp.663-667 / 2010
  • A simple approach to semantic document retrieval is to measure document similarity based on the bag-of-words representation, e.g., cosine similarity between two document vectors. However, such a syntactic method hardly considers the semantic similarity between documents, often producing semantically unsound search results. We circumvent this problem by combining supervised machine learning techniques with ontology information based on Markov logic. Specifically, Markov logic networks are learned from similarity-tagged documents together with an ontology representing the diverse relationships among words. The learned Markov logic networks, the ontology, and the training documents are applied to the semantic document-retrieval task by inferring similarities between a query document and the training documents. In an experimental evaluation on real-world question-answering data, the proposed method outperformed the simple cosine similarity-based approach in terms of retrieval accuracy.
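
For context, the bag-of-words baseline that the abstract compares against can be sketched as below. This is only the cosine-similarity baseline, not the Markov logic method; the whitespace tokenizer and the `cosine_similarity` name are simplifying assumptions.

```python
import math
from collections import Counter

def cosine_similarity(doc_a, doc_b):
    """Cosine similarity between the word-count vectors of two documents."""
    a = Counter(doc_a.lower().split())
    b = Counter(doc_b.lower().split())
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

# Synonymous phrases share no tokens, so the purely syntactic score is zero.
print(cosine_similarity("cheap car", "inexpensive automobile"))
print(cosine_similarity("cheap car for sale", "car for sale"))
```

The first pair illustrates the limitation the abstract describes: two phrases with nearly the same meaning score 0.0 because they have no words in common.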

Development of Subsurface Spatial Information Model with Cluster Analysis and Ontology Model (온톨로지와 군집분석을 이용한 지하공간 정보모델 개발)

  • Lee, Sang-Hoon
    • Journal of the Korean Association of Geographic Information Studies / v.13 no.4 / pp.170-180 / 2010
  • With the development of the earth's subsurface space, the need for reliable subsurface spatial models such as cross-sections and boring logs is increasing. However, the ground mass is inherently uncertain. Model generation is uncertain because of the shortage of data and the absence of geotechnical interpretation standards (non-statistical uncertainty) as well as field environment variables (statistical uncertainty). Therefore, the interpretation of the data and the generation of the model have so far been carried out by highly trained experts. In this study, a geotechnical ontology model was developed using current expert experience and knowledge, and the information content was calculated in the ontology hierarchy. The relative distance between information contents in the ontology model was combined with the distance between cluster centers, and a cluster analysis that considers geotechnical semantics was then performed. In a comparative test of the proposed method, the k-means method, and expert interpretation, the proposed method was the most similar to expert interpretation, and it supports 3D-GIS visualization because it handles massive data easily. We expect the proposed method to generate a more reasonable subsurface spatial information model without the help of geotechnical experts.

A Study on the Performance Evaluation of Semantic Retrieval Engines (시맨틱검색엔진의 성능평가에 관한 연구)

  • Noh, Young-Hee
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.22 no.2 / pp.141-160 / 2011
  • This study proposed a knowledge base and search engine for libraries that hold large-scale data. For this purpose, three knowledge base components (a triple ontology, a concept-based knowledge base, and an inverted file) were constructed, and three search engines (the Jena search engine for rule-based reasoning, a concept-based search engine, and a keyword-based Lucene retrieval engine) were implemented to measure their performance. As a result, the concept-based retrieval engine showed the best performance, followed by the ontology-based Jena retrieval engine and then the ordinary keyword search engine.

Identifying potential buyers in the technology market using a semantic network analysis (시맨틱 네트워크 분석을 이용한 원천기술 분야의 잠재적 기술수요 발굴기법에 관한 연구)

  • Seo, Il Won; Chon, ChaeNam; Lee, Duk Hee
    • Journal of Technology Innovation / v.21 no.1 / pp.279-301 / 2013
  • This study demonstrates how social network analysis can be used to identify potential buyers in technology marketing, and presents the methodology and empirical results. First, we derived the three most important 'seed' keywords from the 'technology description' sections. The technologies were generated by various types of R&D activities organized by South Korea's public research institutes in the fundamental science fields. Second, some 3,000 words were collected from websites related to the three 'seed' keywords. Next, three network matrices (i.e., one matrix per seed keyword) were constructed. To explore the technology network structure, each network was analyzed by degree centrality and Euclidean distance. The network analysis suggests 100 companies with potential demand and identifies seven companies common to the results derived from each network. The usefulness of the result was verified by investigating the business areas listed on the firms' homepages. Finally, five of the seven firms proved to have strong relevance to the target technology. In terms of social network analysis, this study expands the application scope of the methodology by combining semantic network analysis with the technology marketing method. From a practical perspective, the empirical study suggests an illustrative framework for finding prospective companies with demand on the web, raising the possibility of technology commercialization in basic research fields. Future research is planned to examine how the efficiency of the process and the accuracy of the results can be increased.

Scalarization of HPF FORALL Construct (HPF FORALL 구조의 스칼라화(Scalarization))

  • Koo, Mi-Soon
    • Journal of the Korea Society of Computer and Information / v.12 no.5 / pp.121-129 / 2007
  • Scalarization is a process in which a parallel construct, such as a Fortran 90 array statement or an HPF FORALL, is converted into sequential loops that maintain the correct semantics. Most compilers for HPF, which is recognized as a standard data-parallel language, convert an HPF program into a Fortran 77 program with inserted message-passing primitives. During scalarization, a parallel FORALL construct must be translated into Fortran 77 DO loops that maintain the semantics of FORALL. In this paper, we propose a scalarization algorithm that converts a FORALL construct into a DO loop with improved performance. For this, we define and use a relation distance vector to keep the necessary dependence information. We then evaluate the execution times of the code generated by our method and by the PARADIGM compiler method for various array sizes.
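
To show why scalarization must preserve FORALL semantics, here is a small Python model of `FORALL (i = 2:n) a(i) = a(i-1) + 1`, where every right-hand side is evaluated before any element is assigned. It is an illustrative sketch only: the function names are hypothetical, and loop reversal is a standard textbook fix for this particular dependence, not the relation-distance-vector algorithm proposed in the paper.

```python
def forall_reference(a):
    """FORALL semantics: compute all right-hand sides first, then assign."""
    rhs = [a[i - 1] + 1 for i in range(1, len(a))]
    return [a[0]] + rhs

def naive_scalarized(a):
    """Ascending DO loop: each step reads an element that was already overwritten."""
    a = list(a)
    for i in range(1, len(a)):
        a[i] = a[i - 1] + 1
    return a

def reversed_scalarized(a):
    """Descending DO loop: a[i-1] still holds its old value when it is read."""
    a = list(a)
    for i in range(len(a) - 1, 0, -1):
        a[i] = a[i - 1] + 1
    return a

data = [1, 10, 20, 30]
print(forall_reference(data))     # [1, 2, 11, 21]
print(naive_scalarized(data))     # [1, 2, 3, 4], which breaks FORALL semantics
print(reversed_scalarized(data))  # [1, 2, 11, 21]
```

A temporary array would also give the correct result, at the cost of extra memory and a copy loop. Choosing the loop direction from the dependence information avoids that overhead in cases like this one.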

A Methodology of the Information Retrieval System Using Fuzzy Connection Matrix and Document Connectivity Order (색인어 퍼지 관계와 서열기법을 이용한 정보 검색 방법론)

  • Kim, Chul; Lee, Seung-Chai; Kim, Byung-Ki
    • The Transactions of the Korea Information Processing Society / v.3 no.5 / pp.1160-1169 / 1996
  • In this study, an information retrieval experiment using a fuzzy connection matrix of keywords was conducted. A retrieval query was constructed from keywords and Boolean operators such as AND, OR, and NOT. In a workstation environment, the fuzzy retrieval system proved considerably more effective than a system based on crisp set theory. Both the recall ratio and the precision ratio showed that the proposed technique could be a possible alternative for future information retrieval. Special features of this experimental system were: ranking the results in order of connectivity, letting the retrieval results respond flexibly to changes in the threshold value, and trying to align the retrieval process with the retrieval semantics by treating the averse-connectivity (fuzzy value) as a semantic approximation between keywords.
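
The general shape of such a fuzzy retrieval system can be sketched as follows. Everything here is an assumption made for illustration: the keyword relation values, the documents, and the use of the standard min/max/complement operators for AND/OR/NOT. The abstract does not say how the paper's connection matrix is built.

```python
# Hypothetical symmetric fuzzy relation between distinct keywords (values in [0, 1]).
RELATION = {
    frozenset(("retrieval", "ranking")): 0.7,
    frozenset(("fuzzy", "ranking")): 0.4,
    frozenset(("retrieval", "database")): 0.3,
}
# Hypothetical documents, each indexed by a set of keywords.
DOCS = {"d1": {"fuzzy", "retrieval"}, "d2": {"ranking"}, "d3": {"database"}}

def relation(k1, k2):
    """Degree of connection between two keywords; identical keywords connect fully."""
    return 1.0 if k1 == k2 else RELATION.get(frozenset((k1, k2)), 0.0)

def membership(keyword, doc):
    """How strongly a document matches a query keyword via its own keywords."""
    return max(relation(keyword, k) for k in DOCS[doc])

def f_and(x, y): return min(x, y)   # standard fuzzy AND
def f_or(x, y): return max(x, y)    # standard fuzzy OR
def f_not(x): return 1.0 - x        # standard fuzzy NOT

def search(query, threshold):
    """Score every document, keep those at or above the threshold, rank by score."""
    hits = [(doc, query(doc)) for doc in DOCS]
    hits = [(doc, score) for doc, score in hits if score >= threshold]
    return sorted(hits, key=lambda item: item[1], reverse=True)

def query(doc):
    """The query: fuzzy AND retrieval."""
    return f_and(membership("fuzzy", doc), membership("retrieval", doc))

print(search(query, 0.3))  # [('d1', 1.0), ('d2', 0.4)]
print(search(query, 0.5))  # [('d1', 1.0)]
```

A crisp Boolean system would return only `d1` for this query. Lowering the threshold lets the fuzzy system also surface `d2`, whose keyword is merely related to the query terms, which matches the flexible, ranked behaviour the abstract describes.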
