• 제목/요약/키워드: Web search

Search Result 1,646, Processing Time 0.031 seconds

Ontology-based User Customized Search Service Considering User Intention (온톨로지 기반의 사용자 의도를 고려한 맞춤형 검색 서비스)

  • Kim, Sukyoung;Kim, Gunwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.4
    • /
    • pp.129-143
    • /
    • 2012
  • Recently, the rapid progress of a number of standardized web technologies and the proliferation of web users in the world bring an explosive increase of producing and consuming information documents on the web. In addition, most companies have produced, shared, and managed a huge number of information documents that are needed to perform their businesses. They also have discretionally raked, stored and managed a number of web documents published on the web for their business. Along with this increase of information documents that should be managed in the companies, the need of a solution to locate information documents more accurately among a huge number of information sources have increased. In order to satisfy the need of accurate search, the market size of search engine solution market is becoming increasingly expended. The most important functionality among much functionality provided by search engine is to locate accurate information documents from a huge information sources. The major metric to evaluate the accuracy of search engine is relevance that consists of two measures, precision and recall. Precision is thought of as a measure of exactness, that is, what percentage of information considered as true answer are actually such, whereas recall is a measure of completeness, that is, what percentage of true answer are retrieved as such. These two measures can be used differently according to the applied domain. If we need to exhaustively search information such as patent documents and research papers, it is better to increase the recall. On the other hand, when the amount of information is small scale, it is better to increase precision. Most of existing web search engines typically uses a keyword search method that returns web documents including keywords which correspond to search words entered by a user. This method has a virtue of locating all web documents quickly, even though many search words are inputted. However, this method has a fundamental imitation of not considering search intention of a user, thereby retrieving irrelevant results as well as relevant ones. Thus, it takes additional time and effort to set relevant ones out from all results returned by a search engine. That is, keyword search method can increase recall, while it is difficult to locate web documents which a user actually want to find because it does not provide a means of understanding the intention of a user and reflecting it to a progress of searching information. Thus, this research suggests a new method of combining ontology-based search solution with core search functionalities provided by existing search engine solutions. The method enables a search engine to provide optimal search results by inferenceing the search intention of a user. To that end, we build an ontology which contains concepts and relationships among them in a specific domain. The ontology is used to inference synonyms of a set of search keywords inputted by a user, thereby making the search intention of the user reflected into the progress of searching information more actively compared to existing search engines. Based on the proposed method we implement a prototype search system and test the system in the patent domain where we experiment on searching relevant documents associated with a patent. The experiment shows that our system increases the both recall and precision in accuracy and augments the search productivity by using improved user interface that enables a user to interact with our search system effectively. In the future research, we will study a means of validating the better performance of our prototype system by comparing other search engine solution and will extend the applied domain into other domains for searching information such as portal.

Design of Semantic Search System for the Search of Duplicated Geospatial Projects (공간정보사업의 중복사업 검색을 위한 의미기반검색 시스템의 설계)

  • Park, Sangun;Lim, Jay Ick;Kang, Juyoung
    • Journal of Information Technology Services
    • /
    • v.12 no.3
    • /
    • pp.389-404
    • /
    • 2013
  • Geospatial information, which is one of social overhead capital, is predicted as a core growing industry for the future. The production of geospatial information requires a huge budget, so it is very important objective of the policy for geospatial information to prevent the duplication of geospatial projects. In this paper, we proposed a semantic search system which extracts possible duplication of geospatial projects by using ontology for geospatial project administration. In order to achieve our goal, we suggested how to construct and utilize geospatial project ontology, and designed the architecture and process of the semantic search. Moreover, we showed how the suggested semantic search works with a duplicated projects search scenario. The suggested system enables a nonprofessional can easily search for duplicated projects, therefore we expect that our research contributes to effective and efficient duplication review process for geospatial projects.

Development of a XML Web Services Retrieval Engine (XML 웹 서비스 검색 엔진의 개발)

  • Sohn, Seung-Beom;Oh, Il-Jin;Hwang, Yun-Young;Lee, Kyong-Ha;Lee, Kyu-Chul
    • Journal of Information Technology Applications and Management
    • /
    • v.13 no.4
    • /
    • pp.121-140
    • /
    • 2006
  • UDDI (Universal Discovery Description and Integration) Registry is used for Web Services registration and search. UDDI offers the search result to the keyword-based query. UDDI supports WSDL registration but it does not supports WSDL search. So it is required that contents based search and ranking using name and description in UDDI registration information and WSDL. This paper proposes a retrieval engine considering contents of services registered in the UDDI and WSDL. It uses Vector Space Model for similarity comparison between contents of those. UDDI registry information hierarchy and WSDL hierarchy are considered during searching process. This engine suppports two discovery methods. One is Keyword-based search and the other is template-based search supporting ranking for user's query. Template-based search offers how service interfaces correspond to the query for WSDL documents. Proposed retrieval engine can offer search result more accurately than one which UDDI offers and it can retrieve WSDL which is registered in UDDI in detail.

  • PDF

Document Classification Model Using Web Documents for Balancing Training Corpus Size per Category

  • Park, So-Young;Chang, Juno;Kihl, Taesuk
    • Journal of information and communication convergence engineering
    • /
    • v.11 no.4
    • /
    • pp.268-273
    • /
    • 2013
  • In this paper, we propose a document classification model using Web documents as a part of the training corpus in order to resolve the imbalance of the training corpus size per category. For the purpose of retrieving the Web documents closely related to each category, the proposed document classification model calculates the matching score between word features and each category, and generates a Web search query by combining the higher-ranked word features and the category title. Then, the proposed document classification model sends each combined query to the open application programming interface of the Web search engine, and receives the snippet results retrieved from the Web search engine. Finally, the proposed document classification model adds these snippet results as Web documents to the training corpus. Experimental results show that the method that considers the balance of the training corpus size per category exhibits better performance in some categories with small training sets.

Design for RDF-based Semantic Web System (RDF 기반 시맨틱 웹 시스템 설계)

  • Lee, Jong-Won;Jang, Ki-Man;Kim, Kyng-Hwan;Yang, Xitong;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.10a
    • /
    • pp.684-686
    • /
    • 2014
  • It is difficult to effectively search and data management due to the increasing number of web is now. While Semantic Web technologies and the development of next-generation wepin this as a way to overcome them, and monopolize the domestic utilization is not overwhelming introduction to the Semantic Web technology is being used in existing search engines. This causes the development of the Semantic Web is becoming slower, and reluctant to use the Semantic Web users who use search engines as well. In this paper, compared to the currently used web and the next generation of the web, and why utilization is low compared to the search engine you are using an existing Web technology that uses the Semantic Web technology is a search engine, what research was that the inefficient because, as a RDF-based Semantic suggest how to improve the efficiency solved by designing the web.

  • PDF

Evaluation of Mobile Unified Search Contents of Naver and Google Korea (네이버와 구글의 모바일 통합 검색 컨텐츠 평가)

  • Park, So-Yeon
    • Journal of Korean Library and Information Science Society
    • /
    • v.42 no.4
    • /
    • pp.263-280
    • /
    • 2011
  • This study aims to investigate current status of mobile search services of Korean search portals, and analyze mobile unified search contents of Naver and Google Korea. In particular, this study analyzed characteristics of mobile unified search such as number of retrieved documents, collection distribution, and yearly distribution. Also, documents were evaluated in terms of relevance, credibility, and currency. This study compared quality of Naver's unified Web best and unified Web, and Google's best Web documents and Web documents. The correlation between document's ranking and document's relevance was analyzed. The results of this study can be implemented to the portal's effective development of mobile search service.

A Hybrid Query Disambiguation Adaptive Approach for Web Information Retrieval

  • Ibrahim, Roliana;Kamal, Shahid;Ghani, Imran;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.7
    • /
    • pp.2468-2487
    • /
    • 2015
  • In web searching, trustable and precise results are greatly affected by the inherent uncertainty in the input queries. Queries submitted to search engines are by nature ambiguous and constitute a significant proportion of the instances given to web search engines. Ambiguous queries pose real challenges for the web search engines due to versatility of information. Temporal based approaches whereas somehow reduce the uncertainty in queries but still lack to provide results according to users aspirations. Web search science has created an interest for the researchers to incorporate contextual information for resolving the uncertainty in search results. In this paper, we propose an Adaptive Disambiguation Approach (ADA) of hybrid nature that makes use of both the temporal and contextual information to improve user experience. The proposed hybrid approach presents the search results to the users based on their location and temporal information. A Java based prototype of the systems is developed and evaluated using standard dataset to determine its efficacy in terms of precision, accuracy, recall, and F1-measure. Supported by experimental results, ADA demonstrates better results along all the axes as compared to temporal based approaches.

A Study on Online Consumers′Price Sensitivity (온라인 시장에서 가격민감도에 영향을 미치는 요인에 관한 연구)

  • 송형철
    • The Journal of the Korea Contents Association
    • /
    • v.2 no.3
    • /
    • pp.59-69
    • /
    • 2002
  • This article purpose are on the variables of consumer's Doe sensitivity. Our result from sets of data indicate that the web site trust, the web site interactivity and the perceived risk have an effect on price search. Our result is as follows. First, the more trust the web site, the lower the price search. Second, the more interactivity of the web site, the lower the price search. Third, the greater the depth of information at the web site, the higher the price search. forth, the higher the perceived risk, the higher the price search. Fifth, the higher the knowledge of product, the higher the price search.

  • PDF

Rate of Waste in Authority Names for the Web of Science Journals among Saudi Universities

  • Otaibi, Abdullah Al;Sawy, Yaser Mohammad Al
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.7
    • /
    • pp.267-272
    • /
    • 2021
  • The current study aimed at measuring the rate of loss in search results of the actual number of publications in journals indexed by Web of Science when not using the accurate official authority name as indicated by the Ministry of Education. Conducting a search using the authority name does not always yield complete results of all existing publications. Researchers in Saudi universities tend to use up to 10 different random names of universities when searching. This interesting fact has prompted the authors of this paper to conduct a study on the search results of 30 Saudi universities using the authority name as indicated by the Ministry of Education. The statistical analyses revealed that there is a high tendency for the wrong use of authority names. Results show that 8 universities were not found in the search results. Furthermore, other universities are losing between 10 and 30% of search results that reflect the actual number of publications. Consequently, the rank of each university, as well as the general rank of Saudi universities in the Web of Science, will be affected.

Ontology-Based Information Retrieval for Cultural Assets Information (문화재 정보의 온톨로지 기반 검색시스템)

  • Baek Seung-Jae;Cheon Hyeon-Jae;Lee Hong-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.3 s.35
    • /
    • pp.229-236
    • /
    • 2005
  • The Semantic Web enables machines to achieve an effective retrieval, integration, and reuse of web resources. The keyword search method currently used has a limit to accurate search results because of a simple string matching method in web environment. This paper proposes an Ontology-Based Information Retrieval which can solve the problems and retrieve better search results through semantic relations. In this system, we implemented the Cultural Assets Ontology based on OWL with RDQL and Jena API. we also suggest a method to handle properties stored in a database.

  • PDF