• Title/Summary/Keyword: Search Keywords

Text-mining Based Graph Model for Keyword Extraction from Patent Documents (특허 문서로부터 키워드 추출을 위한 텍스트 마이닝 기반 그래프 모델)

  • Lee, Soon Geun;Leem, Young Moon;Um, Wan Sup
    • Journal of the Korea Safety Management & Science / v.17 no.4 / pp.335-342 / 2015
  • Growing interest in patents has led many individuals and companies to file patent applications in a wide range of areas. Filed patents are stored as electronic documents, and searching and categorizing these documents is a major problem in data mining. Keyword extraction, which retrieves the representative keywords of a document, is especially important. Most keyword extraction techniques are based on the vector space model, which weights terms by their frequency in documents and selects keywords in order of weight. This model is limited in that it cannot reflect the relations between keywords. This paper proposes an improved method that extracts more representative keywords by overcoming this limitation. The proposed model first prepares a candidate set using the vector model, then builds a graph that represents the relations between pairs of candidate keywords in the set, and finally selects the keywords based on this relationship graph.
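
The two-stage idea in this abstract (vector-model candidates, then graph-based selection) can be illustrated with a minimal sketch. The abstract does not say how the graph is built or scored, so the choices here are assumptions: TF-IDF as the vector-model weight, sentence-level co-occurrence as the edge relation, and weighted degree as the ranking score. The function names `tfidf_candidates` and `graph_keywords` and the toy data are hypothetical.

```python
import math
from collections import Counter, defaultdict
from itertools import combinations


def tfidf_candidates(docs, target, top_n):
    """Rank the terms of docs[target] by TF-IDF and keep the top_n as candidates."""
    doc_freq = Counter(term for doc in docs for term in set(doc))
    term_freq = Counter(docs[target])
    scores = {
        term: count * math.log(len(docs) / doc_freq[term])
        for term, count in term_freq.items()
    }
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(scores, key=lambda t: (-scores[t], t))[:top_n]


def graph_keywords(sentences, candidates, k):
    """Link candidates that share a sentence, then rank by weighted degree (assumed score)."""
    candidate_set = set(candidates)
    weight = defaultdict(int)
    for sentence in sentences:
        present = sorted(candidate_set & set(sentence))
        for a, b in combinations(present, 2):
            weight[(a, b)] += 1
    degree = Counter()
    for (a, b), w in weight.items():
        degree[a] += w
        degree[b] += w
    return sorted(candidates, key=lambda t: (-degree[t], t))[:k]


# Toy data: one "patent" split into tokenized sentences, plus two other documents.
patent = [
    ["battery", "cell", "electrode"],
    ["electrode", "coating", "battery"],
    ["coating", "thickness"],
]
other_docs = [["method", "cell"], ["thickness", "method"]]
docs = [[term for sentence in patent for term in sentence]] + other_docs

candidates = tfidf_candidates(docs, target=0, top_n=4)
keywords = graph_keywords(patent, candidates, k=2)
print(candidates)  # ['battery', 'coating', 'electrode', 'cell']
print(keywords)    # ['battery', 'electrode']
```

In this toy run, "coating" has the same TF-IDF weight as "battery" and "electrode" but is less connected in the co-occurrence graph, so the graph stage drops it. That is the kind of relational signal a frequency-only ranking cannot express.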

Keyword Search and Ranking Methods on Semantic Web Documents (시맨틱 웹 문서에 대한 키워드 검색 및 랭킹 기법)

  • Kim, Youn-Hee;Oh, Sung-Kyun
    • Journal of Satellite, Information and Communications / v.7 no.3 / pp.86-93 / 2012
  • In this paper, we propose keyword search and ranking methods for OWL documents that describe metadata and ontologies on the Semantic Web. The proposed keyword search method defines the unit of a keyword search result as an information resource and expands the scope of a query keyword to class names, property names, and literal data. It also reflects information derived by inference in the keyword search by considering elements of OWL documents such as hierarchical relationships between classes or properties and equivalence relationships between classes. In addition, our method can retrieve a large number of information resources relevant to the query keywords, because it includes information resources that are indirectly associated with the query keywords through semantic relationships. Our ranking method can improve users' search satisfaction because it takes a variety of factors into account by considering the characteristics of OWL. The proposed methods can be used to retrieve digital content, such as broadcast programs.

Fast Result Enumeration for Keyword Queries on XML Data

  • Zhou, Junfeng;Chen, Ziyang;Tang, Xian;Bao, Zhifeng;Ling, TokWang
    • Journal of Computing Science and Engineering / v.6 no.2 / pp.127-140 / 2012
  • In this paper, we focus on efficient construction of tightest matched subtree (TMSubtree) results for keyword queries on extensible markup language (XML) data, based on smallest lowest common ancestor (SLCA) semantics. Here, "matched" means that all nodes in a returned subtree satisfy the constraint that the set of distinct keywords of the subtree rooted at each node is not subsumed by that of any of its sibling nodes, while "tightest" means that no two subtrees rooted at two sibling nodes can contain the same set of keywords. Assume that $d$ is the depth of a given TMSubtree and $m$ is the number of keywords of a given query $Q$. We proved that if $d \leq m$, a matched subtree result has at most $2m!$ nodes; otherwise, the size of a matched subtree result is bounded by $(d - m + 2)m!$. Based on this theoretical result, we propose a pipelined algorithm to construct TMSubtree results without rescanning all node labels. Experiments verify the benefits of our algorithm in aiding keyword search over XML data.
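
The size bound quoted in this abstract can be evaluated directly. The sketch below assumes that "2m!" means 2 · (m!) rather than (2m)!. That reading is an assumption, supported only by the fact that the second bound, (d - m + 2) · m!, also equals 2 · m! at d = m, so the two cases meet at the boundary. The function name `matched_subtree_bound` is hypothetical.

```python
from math import factorial


def matched_subtree_bound(d: int, m: int) -> int:
    """Upper bound on the node count of a matched subtree result.

    d is the subtree depth and m the number of query keywords.
    Assumes "2m!" in the abstract means 2 * (m!).
    """
    if d <= m:
        return 2 * factorial(m)
    return (d - m + 2) * factorial(m)


# With m = 3 keywords the bound is constant up to depth 3, then grows linearly in d.
for depth in (2, 3, 4, 5):
    print(depth, matched_subtree_bound(depth, 3))
# 2 12
# 3 12
# 4 18
# 5 24
```

Under this reading the bound depends on the query size and the depth only, not on the size of the XML document.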

A Natural Language Retrieval System for Entertainment Data (엔터테인먼트 데이터를 위한 자연어 검색시스템)

  • Kim, Jung-In
    • Journal of Korea Multimedia Society / v.18 no.1 / pp.52-64 / 2015
  • Recently, as the quality of life has improved, entertainment-related searches have come to represent an increasing share of total Internet portal usage. Information retrieval in the entertainment area depends mainly on the keywords that users enter, and the results are the contents that contain those keywords. In this paper, we propose a search method that takes natural language input and queries a database of entertainment data. The main components of our study are a simple Korean morphological analyzer that uses case particle information, predicate-oriented token generation, generation of standardized patterns consistent with the tokens, and automatic generation of the corresponding SQL queries. We also propose an efficient retrieval system that finds the most relevant results in the database for natural language queries, especially in the restricted domain of music, and we show the effectiveness of our system.

A Study on Contributor to Sports Development Big Data Research Using Oral Records

  • Byun, Jisun
    • Journal of Multimedia Information System / v.8 no.4 / pp.301-308 / 2021
  • The purpose of this study is to analyze the oral records of sports development contributors in order to explore future directions for big data research on such contributors. To this end, the audio file produced in an interview with Lee00, a sports development contributor, was converted into text. The major themes were extracted by analyzing these oral records, the sub-themes were extracted in chronological order, and keywords were extracted by analyzing the sub-themes. The extracted keywords were then searched for in the Google search engine to find and make use of related topics. A Google search for the topic 'Mt. Inwang', extracted from the oral archives of Lee00, finds newspaper articles about President Moon Jae-in climbing Mt. Inwang and opening up Mt. Bukhan. In addition, articles about Mt. Inwang and the mountain climbers that the narrator In-jeong Lee speaks of were found. These articles can be used to derive themes for museum exhibitions, to collect museum exhibits, and as climbing education material.

Design of WWW IR System Based on Keyword Clustering Architecture (색인어 말뭉치 처리를 기반으로 한 웹 정보검색 시스템의 설계)

  • 송점동;이정현;최준혁
    • The Journal of Information Technology / v.1 no.1 / pp.13-26 / 1998
  • In general information retrieval systems, improper keywords are often extracted and the search results differ from the user's aim, because the systems use only term frequency information to select keywords and do not consider their meanings. Because recall and precision are inversely related, improving precision is limited unless the semantics of keywords are considered. In this paper, a system that improves precision without decreasing recall is designed and implemented by introducing a client user module that sends feedback reflecting the user's intention to the server. For this purpose, keywords are selected using relative term frequency and inverse document frequency, and co-occurring words are extracted from the original documents. The keywords are then clustered by their semantics using the calculated mutual information. The system can reject inappropriate documents using the segmented semantic information according to feedback from the client user module. Consequently, the precision of the system is improved without decreasing recall.
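
The clustering step described in this abstract can be sketched roughly as follows. The abstract does not specify the mutual information formula or the clustering rule, so this sketch assumes document-level pointwise mutual information (PMI) and groups terms into the connected components of the graph of pairs whose PMI reaches a threshold. The function names and the toy documents are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def pointwise_mutual_information(docs):
    """Return PMI (base 2) for each pair of terms that co-occur in at least one document."""
    n = len(docs)
    term_count = Counter()
    pair_count = Counter()
    for doc in docs:
        terms = sorted(set(doc))
        term_count.update(terms)
        pair_count.update(combinations(terms, 2))
    return {
        (a, b): math.log2((c / n) / ((term_count[a] / n) * (term_count[b] / n)))
        for (a, b), c in pair_count.items()
    }


def cluster_by_pmi(docs, threshold):
    """Group terms into connected components over pairs with PMI >= threshold."""
    parent = {term: term for doc in docs for term in doc}

    def find(term):
        # Union-find lookup with path halving.
        while parent[term] != term:
            parent[term] = parent[parent[term]]
            term = parent[term]
        return term

    for (a, b), score in pointwise_mutual_information(docs).items():
        if score >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for term in parent:
        clusters.setdefault(find(term), []).append(term)
    return sorted(sorted(group) for group in clusters.values())


# Toy corpus in which "java" appears in two different senses.
docs = [
    ["java", "coffee", "bean"],
    ["java", "coffee"],
    ["java", "compiler", "code"],
    ["compiler", "code"],
]
print(cluster_by_pmi(docs, threshold=0.4))
# [['bean', 'coffee', 'java'], ['code', 'compiler']]
```

In this toy corpus "java" lands in the coffee cluster because its PMI with "compiler" and "code" is negative. A feedback step like the one in the abstract could then use such clusters to reject documents from the unintended sense.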

A Hybrid Collaborative Filtering-based Product Recommender System using Search Keywords (검색 키워드를 활용한 하이브리드 협업필터링 기반 상품 추천 시스템)

  • Lee, Yunju;Won, Haram;Shim, Jaeseung;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems / v.26 no.1 / pp.151-166 / 2020
  • A recommender system recommends the products or services that best meet the preferences of each customer using statistical or machine learning techniques. Collaborative filtering (CF) is the most commonly used algorithm for implementing recommender systems. However, in most cases it uses only purchase history or customer ratings, even though customers provide many other kinds of data. E-commerce customers frequently use a search function to find the products in which they are interested among the vast array of products offered. Such search keyword data may be a very useful information source for modeling customer preferences, but it is rarely used as a source of information for recommender systems. In this paper, we propose a novel hybrid CF model based on the Doc2Vec algorithm using search keywords and purchase history data of online shopping mall customers. To validate the applicability of the proposed model, we empirically tested its performance using real-world online shopping mall data from Korea. As the number of recommended products increased, the recommendation performance of the proposed CF (that is, the hybrid CF based on customers' search keywords) improved, whereas the performance of a conventional CF gradually decreased. As a result, we found that search keyword data effectively represents customer preferences and might contribute to an improvement in conventional CF recommender systems.

An Effective Mobile Web Object Navigation Based on the Steiner Tree Approach (스타이너트리 기반의 효과적인 모바일 웹 오브젝트 네비게이션)

  • Lee, Woo-Key;Song, Justin Jong-Su;Lee, James J.H.
    • Korean Management Science Review / v.28 no.1 / pp.1-10 / 2011
  • One of the fundamental roles of web object navigation is to deliver what the user wants, precisely and efficiently, from the enormous web database to the web browser. As long as the web search results are a list of independent items, it is acceptable for the web browser to display the web objects one by one. However, when the search results are a collection of multiple interrelated web objects, a new mechanism is needed to present the linked web objects at once. We define a unit of web objects derived from a Steiner tree, in which the web objects contain a set of specific keywords and the solutions are extracted based on calculated weights. Even if a single web object does not include all the keywords, the related hyperlinked web objects are derived and displayed on the mobile web browser together with metadata in a single step. In this paper, this approach is applied to the mobile browser so that web contents can be displayed dynamically with Steiner trees until the next navigation request is issued. A new synchronized mobile browsing method is developed so that navigation time is drastically reduced and web navigation efficiency is dramatically enhanced without sacrificing memory consumption.

The Comparison of Certified Emission Reductions Forecasting Model Using Price of Certified Emission Reductions and Related Search Keywords (탄소배출권 가격과 연관검색어를 활용한 탄소배출권 가격 예측 방법론 비교)

  • Kim, Hyeonho;Im, Giseong;Kim, Yujin;Lee, Minwoo;Han, Seungwoo
    • Proceedings of the Korean Institute of Building Construction Conference / 2020.06a / pp.44-45 / 2020
  • Korea had the fourth highest CO2 emissions among OECD countries in 2018, and as of 2019 its total greenhouse gas emissions per capita had increased by about 98.2% compared with 1990. Under the Paris Climate Change Accord, Korea has promised a 37% reduction in greenhouse gas emissions from projected levels by 2030. Currently, many countries use an emissions trading system (ETS) for international carbon management. Since the ETS was implemented in Korea in 2015, the importance of calculating CO2 emissions from construction machinery has increased, so an accurate calculation of environmental charges through the allocated certified emission reductions (CERs) is required. Using the CER price and related search keywords, this paper derives prediction models of the CER price and compares them, focusing on which predicts the CER price more accurately. With this method, the budget needed to establish the initial construction process plan can be calculated based on a more accurately predicted CER price.

An Efficient Retrieval Technique for Spatial Web Objects (공간 웹 객체의 효율적인 검색 기법)

  • Yang, PyoungWoo;Nam, Kwang Woo
    • Journal of KIISE / v.42 no.3 / pp.390-398 / 2015
  • Spatial web objects are web documents that contain geographic information. Recently, services that create spatial web objects have increased greatly because of advances in devices such as smartphones. In services such as Twitter or Facebook, simple text posted by users is stored along with information about the post's location. To search for such spatial web objects, a method that uses spatial information and text information simultaneously is required. Conventional spatial web object search methods mostly use R-tree and inverted file methods. However, these methods have the disadvantage of requiring a large volume of space when building indices. Furthermore, such methods are efficient for searching with many keywords but inefficient for searching with a few keywords. In this paper, we propose a spatial web object search method that uses a quad-tree and a patricia-trie. We show that the proposed technique is more effective than existing ones in searching with a small number of keywords. Furthermore, we show through an experiment that the space required by the proposed technique is much smaller than that required by existing ones.
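
How the paper combines the two structures is not stated in the abstract, so the following is only one possible sketch of a joint text-and-space index. Each point is assigned the quadrant path it would have in a quad-tree over the unit square, and each key "keyword|quadrant-path" is stored in a plain character trie. This is a simplification: a patricia trie would compress single-child chains, which this sketch does not do. The names `quad_code` and `SpatialKeywordIndex`, the `|` separator, and the sample objects are all hypothetical.

```python
def quad_code(x, y, depth):
    """Quadrant path of point (x, y) in the unit square, one digit (0-3) per level."""
    digits = []
    x0, y0, size = 0.0, 0.0, 1.0
    for _ in range(depth):
        size /= 2
        qx = 1 if x >= x0 + size else 0
        qy = 1 if y >= y0 + size else 0
        digits.append(str(2 * qy + qx))
        x0 += qx * size
        y0 += qy * size
    return "".join(digits)


class SpatialKeywordIndex:
    """Plain (uncompressed) trie over 'keyword|quadcode' keys; keywords must not contain '|'."""

    def __init__(self, depth=4):
        self.depth = depth
        self.root = {}

    def insert(self, obj_id, x, y, keywords):
        code = quad_code(x, y, self.depth)
        for keyword in keywords:
            node = self.root
            for ch in keyword + "|" + code:
                node = node.setdefault(ch, {})
            # "ids" cannot clash with child keys, which are single characters.
            node.setdefault("ids", set()).add(obj_id)

    def search(self, keyword, cell_prefix=""):
        """Objects tagged with keyword whose quadrant path starts with cell_prefix."""
        node = self.root
        for ch in keyword + "|" + cell_prefix:
            node = node.get(ch)
            if node is None:
                return set()
        found, stack = set(), [node]
        while stack:
            current = stack.pop()
            for key, child in current.items():
                if key == "ids":
                    found |= child
                else:
                    stack.append(child)
        return found


index = SpatialKeywordIndex(depth=2)
index.insert("a", 0.1, 0.1, ["cafe"])
index.insert("b", 0.3, 0.1, ["cafe", "wifi"])
index.insert("c", 0.9, 0.9, ["cafe"])

print(sorted(index.search("cafe")))        # ['a', 'b', 'c']
print(sorted(index.search("cafe", "0")))   # ['a', 'b']
print(sorted(index.search("cafe", "01")))  # ['b']
```

A single-keyword query walks one trie path and then only the subtree of the requested cell, which gives some intuition for why a trie-based layout might suit queries with few keywords. The space and speed results reported in the abstract apply to the authors' own design, not to this sketch.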