• Title/Summary/Keyword: Web Searching

Search Result 565, Processing Time 0.027 seconds

Ontology-based User Customized Search Service Considering User Intention (온톨로지 기반의 사용자 의도를 고려한 맞춤형 검색 서비스)

  • Kim, Sukyoung;Kim, Gunwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.4
    • /
    • pp.129-143
    • /
    • 2012
  • Recently, the rapid progress of a number of standardized web technologies and the proliferation of web users in the world bring an explosive increase of producing and consuming information documents on the web. In addition, most companies have produced, shared, and managed a huge number of information documents that are needed to perform their businesses. They also have discretionally raked, stored and managed a number of web documents published on the web for their business. Along with this increase of information documents that should be managed in the companies, the need of a solution to locate information documents more accurately among a huge number of information sources have increased. In order to satisfy the need of accurate search, the market size of search engine solution market is becoming increasingly expended. The most important functionality among much functionality provided by search engine is to locate accurate information documents from a huge information sources. The major metric to evaluate the accuracy of search engine is relevance that consists of two measures, precision and recall. Precision is thought of as a measure of exactness, that is, what percentage of information considered as true answer are actually such, whereas recall is a measure of completeness, that is, what percentage of true answer are retrieved as such. These two measures can be used differently according to the applied domain. If we need to exhaustively search information such as patent documents and research papers, it is better to increase the recall. On the other hand, when the amount of information is small scale, it is better to increase precision. Most of existing web search engines typically uses a keyword search method that returns web documents including keywords which correspond to search words entered by a user. This method has a virtue of locating all web documents quickly, even though many search words are inputted. However, this method has a fundamental imitation of not considering search intention of a user, thereby retrieving irrelevant results as well as relevant ones. Thus, it takes additional time and effort to set relevant ones out from all results returned by a search engine. That is, keyword search method can increase recall, while it is difficult to locate web documents which a user actually want to find because it does not provide a means of understanding the intention of a user and reflecting it to a progress of searching information. Thus, this research suggests a new method of combining ontology-based search solution with core search functionalities provided by existing search engine solutions. The method enables a search engine to provide optimal search results by inferenceing the search intention of a user. To that end, we build an ontology which contains concepts and relationships among them in a specific domain. The ontology is used to inference synonyms of a set of search keywords inputted by a user, thereby making the search intention of the user reflected into the progress of searching information more actively compared to existing search engines. Based on the proposed method we implement a prototype search system and test the system in the patent domain where we experiment on searching relevant documents associated with a patent. The experiment shows that our system increases the both recall and precision in accuracy and augments the search productivity by using improved user interface that enables a user to interact with our search system effectively. In the future research, we will study a means of validating the better performance of our prototype system by comparing other search engine solution and will extend the applied domain into other domains for searching information such as portal.

PDFindexer: Distributed PDF Indexing system using MapReduce

  • Murtazaev, JAziz;Kihm, Jang-Su;Oh, Sangyoon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.4 no.1
    • /
    • pp.13-17
    • /
    • 2012
  • Indexing allows converting raw document collection into easily searchable representation. Web searching by Google or Yahoo provides subsecond response time which is made possible by efficient indexing of web-pages over the entire Web. Indexing process gets challenging when the scale gets bigger. Parallel techniques, such as MapReduce framework can assist in efficient large-scale indexing process. In this paper we propose PDFindexer, system for indexing scientific papers in PDF using MapReduce programming model. Unlike Web search engines, our target domain is scientific papers, which has pre-defined structure, such as title, abstract, sections, references. Our proposed system enables parsing scientific papers in PDF recreating their structure and performing efficient distributed indexing with MapReduce framework in a cluster of nodes. We provide the overview of the system, their components and interactions among them. We discuss some issues related with the design of the system and usage of MapReduce in parsing and indexing of large document collection.

Smart Adapted Service in Ubiquitous (유비쿼터스 환경에서 사용자의 일정에 따른 지능 정보 제공 시스템)

  • Ahn, Ho-Seok;Sa, In-Kyu;Baek, Young-Min;Ahn, Youn-Seok;Choi, Jin-Young
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.57 no.3
    • /
    • pp.480-487
    • /
    • 2008
  • In this paper, we propose a Smart Adapted Service which can manage a schedule automatically. Smart Adapted Service gives a notice beforehand regarding information associated with the schedule, by searching the Internet. If the user has written down the name of goods or food which he wants to buy, Smart Adapted Service finds the most suitable store nearby him using the user's favorite list. The user's favorite list is created by Outlook Web Access System by analysing the schedule and habits of the user. User can access Smart Document System remotely through the Internet using Outlook Web Access System. We developed an Auto AP Roaming System for seamless communication and Smart Document System for arranging the information. We evaluated the system and verified that it is convenient to use and working well.

RepWeb: A Web-Based Search Tool for Repeat-Related Literatures

  • Woo, Tae-Ha;Kim, Young-Uk;Kwon, Je-Keun;Seo, Jung-Min
    • Genomics & Informatics
    • /
    • v.5 no.2
    • /
    • pp.88-91
    • /
    • 2007
  • Repetitive sequences such as SINE, LINE, and LTR elements form a major part of eukaryotic genomes. A literature search tool that summarizes the information contained within repeat elements would provide biologists in the field of genomics with a useful tool for analyzing genomic sequence features. We developed a java program designed to make literature access easier by using two search engines simultaneously. RepWeb is a web-based search system that provides a user friendly interface for searching the reference data and journals for information related to repeat elements by using the search engines, Google Scholar and PubMed, simultaneously. It provides an interface that displays the repeat element- related biological information, and includes useful functions such as the production of a repeat tree, clickable links to PubMed and Google Scholar, exporting, and sorting a field into date, author, journal and title.

INTEROPERABLE APPLICATION OF 3D GEO-BASED FEATURES ON MOBILE AND WEB

  • Dong, Woo-Cheol;Lee, Ki-Won
    • Proceedings of the KSRS Conference
    • /
    • 2008.10a
    • /
    • pp.274-276
    • /
    • 2008
  • At the stage of content convergence into cell phone, technologies for geo-spatial information sharing and searching are being developed. Currently, 2D portable navigation map for mobile navigation is provided by communication companies, but geobrowers for 3D geo-information in cell phone are under developing. In this study, 3D feature transformation among X3D-M3G-KML, on mobile and web environments, is dealt with as the first stage for the further mobile 3D web application. As well, it is possible to real-time interoperable 3D geo-information exchange issues within both environments.

  • PDF

Implementation of Web-based Street Fashion Design Analysis System (웹 기반(基盤)(Web-based) 스트리트 패션 디자인 분석(分析) 시스템 설계(設計) 및 구현(具顯))

  • Park, Hye-Won;Park, Hee-Chang
    • Journal of Fashion Business
    • /
    • v.9 no.2
    • /
    • pp.160-173
    • /
    • 2005
  • Fashion is hard to expect owing to the rapid change in accordance with consumer taste and environment, and has a tendency toward variety and individuality. Especially street fashion d 21st century is not being regarded as one of the subcultures but is playing an important role as a fountainhead d fashion trend. Therefore, Searching and analyzing street fashions helps us to understand the popular fashions d the next season and also it is important in understanding the consumer fashion sense and commercial area. So, we need to understand fashion styles quantitatively and qualitatively by providing visual data and dividing images. The purpose of this study is to design for street fashion on design analysis using web which can update quantitative and qualitative data. through the on site investigation d street fashion, and put the information onto a database.

Judging Translated Web Document & Constructing Bilingual Corpus (웹 번역문서 판별과 병렬 말뭉치 구축)

  • Jee-hyung, Kim;Yill-byung, Lee
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.10a
    • /
    • pp.787-789
    • /
    • 2004
  • People frequently feel the need of a general searching tool that frees from language barrier when they find information through the internet. Therefore, it is necessary to have a multilingual parallel corpus to search with a word that includes a search keyword and has a corresponding word in another language, Multilingual parallel corpus can be built and reused effectively through the several processes which are judgment of the web documents, sentence alignment and word alignment. To build a multilingual parallel corpus, multi-lingual dictionary should be constructed in each language and HTML should be simplified. And by understanding the meaning and the statistics of document structure, judgment on translated web documents will be made and the searched web pages will be aligned in sentence unit.

  • PDF

WebInfoSync : Using Data Synchronization, Extensible Mobile Information Searching System on Web Environment (WebInfoSync: 데이타 동기화를 이용한 Web환경의 확장형 Mobile 정보검색 시스템)

  • Shin, Soung-Soo;Kook, Youn-Gyou;Kim, Woon-Yong;Choi, Young-Keun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.04b
    • /
    • pp.1607-1610
    • /
    • 2002
  • 모바일 디바이스의 급속한 보급과 사용자층의 증가는 보다 효율적인 정보의 활용능력이 요구된다. 이러한 정보의 활용능력을 증가시키는 방법으로 기준의 웹정보에 대한 모바일 디바이스 활용방법을 들 수 있다. 모바일 디바이스에 대한 기존의 정보 구축방법은 모바일 디바이스 환경에 맞는 정보를 새롭게 구축하고 서비스하는 형태로 진행되었다. 그러나 이러한 방법은 모바일 환경에 적합하도록 정보를 변환하는데 많은 시간과 노력이 필요하다. 이에 본 논문에서는 기존 웹정보를 효율적으로 모바일 디바이스 환경에 활용할 수 있는 방법을 제시한다. 이러한 방법은 웹 정보의 자동 변환과 데이터 동기화를 통해 이루어 질 수 있다. 이를 통하여 정보의 재구축비용을 줄일 수 있고 모바일 디바이스를 이용한 정보활용이 향상되는 효과를 가져온다.

  • PDF

Design of a RDF Metadata System for the Searching of Application Programs (응용프로그램의 검색을 위한 RDF 메타데이터 시스템의 설계)

  • Yoo Weon-Hee;Kouh Hoon-Joon
    • The Journal of the Korea Contents Association
    • /
    • v.5 no.6
    • /
    • pp.1-9
    • /
    • 2005
  • As the amount of data on the web increase, it is difficult to search what we want exactly. Therefore, much researches are attempted to search web resources efficiently. So, W3C established the standard that give meanings to resources on the web using RDF metadata. The RDF metadata had been mainly described a document data on the web. But it is difficult to create automatically the metadata for application programs than the document data. This paper proposes a method to use RDF metadata to search application programs. Firstly, we define RDF data model that stores the information of the application programs and RDF schema that references the RDF data model. And we design a prototype system to search application programs. This system meets expectation, getting the application to fullfill the needs of user, and has the efficiency of the searching function.

  • PDF

A Comparative Study on Models of Web-based Information Seeking Behavior (웹 정보탐색행위 모형의 비교 분석 연구)

  • 김성진
    • Journal of the Korean Society for information Management
    • /
    • v.21 no.2
    • /
    • pp.211-233
    • /
    • 2004
  • The web is a new information environment, which has different characteristics from a traditional IR environment. Needed are more research from a new point of view as well as the adoption of a new research paradigm in order to understand a user-system interaction on the web. The purpose of this study is to review and analyze models of web-based information seeking behavior, which Wang, Hawk & Tenopir, Hsieh-Yee, Choo, Detlor & Turnbull, Chun & Cooper, Rieh, and Spink proposed. The comparative analysis indicates that web-based information seeking models are categorized into three area: interaction model, information seeking behavior model, and evaluation model, and that they are based on a multifaceted interaction and a nonlinear perspective.