• Title/Summary/Keyword: Web-based Retrieval

Search Result 459, Processing Time 0.028 seconds

A Study of Knowledge Based Agent System for Web New-Document Retrieval (지식기반 방식을 이용한 웹 뉴스문서 검색 에이전트 시스템 연구)

  • 이성열;백혜정;박영택
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.10b
    • /
    • pp.102-104
    • /
    • 2000
  • 현재 인터넷상의 정보와 문서의 양은 상상을 초월하는 증가추이를 나타내고 있다. 이와 더불어 표현하려는 목적에 따라 체계적으로 정리되고 정형화된 문서들 또한 증가하고 있다. 이러한 문서들 중에는 각 인터넷 신문사나 웹진과 같은 문서들이 포함되는데, 이러한 문서들은 각각의 내용구성과 표현 형식에 있어서 비슷한 구성을 지니고 있다. 본 논문에서는 이러한 체계적이고 정형화된 웹 뉴스 문서검색을 위하여 '지식기반 방식을 이용한 웹 뉴스문서 검색 에이전트 시스템'을 제안한다. 사용자는 시스템에서 제공하는 지식을 기반으로 검색하고자 하는 대상을 에이전트 시스템에게 요청하게 되고 지식기반을 이용한 에이전트 시스템은 보다 정확한 정보를 사용자에게 제공하게 된다.

  • PDF

Design of XML DTDs for Content-based Retrieval of Web Image (웹 이미지 내용 기반 검색을 위한 XML DTD 설계)

  • 김형근;홍성용;나연묵
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.10a
    • /
    • pp.232-234
    • /
    • 2001
  • 인터넷의 발달과 사용의 확산에 따라 멀티미디어 데이터의 양이 급격히 증가하고 있다. 특히 멀티미디어 정보 가운데에서도 이미지 양은 대규모이므로 사용자가 원하는 이미지를 찾기가 쉽지 않았으며, 이에 따라 이미지 데이타를 검색하기 위한 여러 가지 방법들이 계속해서 제안되고 있다. 본 논문에서는 XML을 활용하여 웹상의 이미지 데이터에 대한 특징 정보를 구조적으로 표현해 웹 이미지에 대한 내용 기반 검색 능력을 개선한다. 관계 테이터베이스에 저장된 색상, 질감, 키워드 등 이미지 데이터에 대한 특징 정보들을 XML 문서로 자동 변환하기 위하여 이들 각각의 대한 DTD를 설계하고, 이들을 통합하여 검색할 수 있도록 통합 DTD를 설계한다. 통합 DTD를 XML 데이터 서버를 이용하여 구현에 실제 웹 상의 상품이미지를 검색하는데 적용함으로써 제안한 결과의 유용성을 보인다.

  • PDF

HMS-based Integration and Retrieval of Hospital Information on the Web (HMS를 기반으로 한 웹 상의 병원정보 통합 및 검색)

  • 양정욱;홍동완;윤지희;주한규
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.10a
    • /
    • pp.76-78
    • /
    • 2001
  • HMS(Hallym Mediator System)는 XML을 기본 데이터 모델로 하여 인터넷에 산재하여 있는 분산 이질 정보에 대한 통합, 검색 기능을 제공하는 미디에이터 시스템이다. 분산이질 정보의 공통 스키마 구조로서 XML DTD를 사용하며, 각종 정보에 대한 가상의 통합 뷰(view) 생성 기능을 제공하여 웹 상의 통합된 가상 정보 구조를 표현한다. 실용성 및 성능평가를 위하여, HMS를 기반으로 하는 병원정보 통합/검색 시스템을 구현하였다. 병원정보 통합/검색 시스템은 가상접근 기법(virtual approach)기반의 정보검색 시스템으로서, 일반 사용자는 웹 상의 각종 병원 정보를 정보의 위치에 상관없이 비쥬얼 사용자 인터떼이스틀 통하여 제공 받게된다

  • PDF

Intelligent Image Retrieval Using Inference-Based Web Ontology (추론기반의 웹 온톨로지를 이용한 지능형 이미지 검색)

  • Kim, Su-Kyoung;Ahan, Kee-Hong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.05a
    • /
    • pp.521-524
    • /
    • 2007
  • 추론 기반의 온톨로지 구축은 시맨틱 웹 응용의 구현을 위한 최소 요건이다. 그러나 현재 시맨틱 웹응용에 적용된 대부분의 온톨로지들은 추론을 통한 지식의 재사용을 제공하지 못하며, 이는 시맨틱 웹응용의 발전에 많은 지장을 주는 요인이다. 따라서 본 연구는 서술 논리와 규칙 언어로 표현된 추론 기반의 웹 온톨로지를 구축하고, 이를 지능형 이미지 검색에 적용하였다. 추론 엔진을 이용한 지능형 이미지 검색 결과 실험으로, 추론 기반의 웹 온톨로지와 주석 기반의 웹 온톨로지를 이미지 검색 시스템에 적용하였으며, 추론 기반의 웹 온톨로지를 적용한 검색 결과가 재현율과 정확율에 있어 더욱 우수한 성능을 보여주었다.

A Study on the Development of Search Algorithm for Identifying the Similar and Redundant Research (유사과제파악을 위한 검색 알고리즘의 개발에 관한 연구)

  • Park, Dong-Jin;Choi, Ki-Seok;Lee, Myung-Sun;Lee, Sang-Tae
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.11
    • /
    • pp.54-62
    • /
    • 2009
  • To avoid the redundant investment on the project selection process, it is necessary to check whether the submitted research topics have been proposed or carried out at other institutions before. This is possible through the search engines adopted by the keyword matching algorithm which is based on boolean techniques in national-sized research results database. Even though the accuracy and speed of information retrieval have been improved, they still have fundamental limits caused by keyword matching. This paper examines implemented TFIDF-based algorithm, and shows an experiment in search engine to retrieve and give the order of priority for similar and redundant documents compared with research proposals, In addition to generic TFIDF algorithm, feature weighting and K-Nearest Neighbors classification methods are implemented in this algorithm. The documents are extracted from NDSL(National Digital Science Library) web directory service to test the algorithm.

Building Intelligent User Interface Agent for Semantically Reformulating User Query in Medicine

  • Yang, Jung-Jin;Lim, Chae-Myung;Chu, Sung-Joon;Lee, Dong-Hoon;Park, Duck-Whan;Park, Tae-Yong
    • Journal of Intelligence and Information Systems
    • /
    • v.9 no.2
    • /
    • pp.101-119
    • /
    • 2003
  • Achieving the beneficiary goal of recent discovery in human genome project still needs a way to retrieve and analyze the exponentially expanding bio-related information. Research on bio-related fields naturally applies knowledge discovered to the current problem and make inferences to extract new information where shared concepts and data containing information need to be defined and used in a coherent way. In such a professional domain, while the need to help users reduce their work and to improve search results has been emerged, methods for systematic retrieval and adequate exchange of relevant information are still in their infancy. The design of our system aims at improving the quality of information retrieval in a professional domain by utilizing both corpus-based and concept-based ontology. Meta-rules of helping users to make an adequate query are formed into an ontology in the domain. The integration of those knowledge permits the system to retrieve relevant information in a more semantic and systematic fashion. This work mainly describes the query models with details of GUI and a secondary query generation of the system.

  • PDF

ChungbukN: An User Location based News Retrieval System (충북N:사용자 위치 기반 뉴스 검색 시스템)

  • Kwon, Sun-Ock;Jeong, Ji-Seong;Kim, Ji-Hoon;Kim, Hee-Ran;Yoo, Kwan-Hee
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.12
    • /
    • pp.524-532
    • /
    • 2012
  • According to increasing in number of smart phone subscribers to offer the convenience of users, wide range of applications in various fields have emerged. Recently, a lot of applications are being developed to provide a way of receiving information according to the user's current location. Also, the news seems difficult to provide the necessary information among the numerous data. Especially, it is difficult to find the news that associated with the region. There are many applications that provide news, but there is no system to provide news information according to the user's location information in domestic, so users not receive the news of the region. In this paper, we propose a news retrieval application which provides users with news around by using the location information of smart phone users. Because this system provides news of the region, it has the advantage to obtain the around information easily. The proposed system, whose name is 'ChungbukN', provides news that receives data at chungbuk comprehensive daily newspaper 'Daily Chungbuk'.

A VoiceXML-based EPG Retrieval System (VoiceXML기반 EPG 검색 시스템)

  • 김한수;황인준
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.10 no.4
    • /
    • pp.351-363
    • /
    • 2004
  • Recent commencement of digital broadcasting has enabled various TV programs through hundreds of channels. As a result, it becomes a time-consuming job for the TV audience to look up newspaper or TV magazines for the schedule of a specific TV program. To relieve this problem, digital broadcasting usually provides an EPG(Electronic Program Guide) for the audience. Currently. most EPG services are focusing on the visual delivery of information through a web site, digital TV or mobile devices. However, this approach could cause a serious restriction to some users including drivers or visually handicapped persons, who can't input keywords for the search. In order to solve this problem, in this paper, we propose a VoiceXML-based EPG retrieval system that enables even such special users to browse EPG. conveniently using a mobile phone. We implemented a prototype system and proved its effectiveness through experiments.

An analysis of user behaviors on the search engine results pages based on the demographic characteristics

  • Bitirim, Yiltan;Ertugrul, Duygu Celik
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.7
    • /
    • pp.2840-2861
    • /
    • 2020
  • The purpose of this survey-based study is to make an analysis of search engine users' behaviors on the Search Engine Results Pages (SERPs) based on the three demographic characteristics gender, age, and program studying. In this study, a questionnaire was designed with 12 closed-ended questions. Remaining questions other than the demographic characteristic related ones were about "tab", "advertisement", "spelling suggestion", "related query suggestion", "instant search suggestion", "video result", "image result", "pagination" and the amount of clicking results. The questionnaire was used and the data collected were analyzed with the descriptive statistics as well as the inferential statistics. 84.2% of the study population was reached. Some of the major results are as follows: Most of each demographic characteristic category (i.e. female, male, under-20, 20-24, above-24, English computer engineering, Turkish computer engineering, software engineering) have rarely or more click for tab, spelling suggestion, related query suggestion, instant search suggestion, video result, image result, and pagination. More than 50.0% of female category click advertisement rarely; however, for the others, 50.0% or more never click advertisement. For every demographic characteristic category, between 78.0% and 85.4% click 10 or fewer results. This study would be the first attempt with its complete content and design. Search engine providers and researchers would gain knowledge to user behaviors about the usage of the SERPs based on the demographic characteristics.

Terminology Recognition System based on Machine Learning for Scientific Document Analysis (과학 기술 문헌 분석을 위한 기계학습 기반 범용 전문용어 인식 시스템)

  • Choi, Yun-Soo;Song, Sa-Kwang;Chun, Hong-Woo;Jeong, Chang-Hoo;Choi, Sung-Pil
    • The KIPS Transactions:PartD
    • /
    • v.18D no.5
    • /
    • pp.329-338
    • /
    • 2011
  • Terminology recognition system which is a preceding research for text mining, information extraction, information retrieval, semantic web, and question-answering has been intensively studied in limited range of domains, especially in bio-medical domain. We propose a domain independent terminology recognition system based on machine learning method using dictionary, syntactic features, and Web search results, since the previous works revealed limitation on applying their approaches to general domain because their resources were domain specific. We achieved F-score 80.8 and 6.5% improvement after comparing the proposed approach with the related approach, C-value, which has been widely used and is based on local domain frequencies. In the second experiment with various combinations of unithood features, the method combined with NGD(Normalized Google Distance) showed the best performance of 81.8 on F-score. We applied three machine learning methods such as Logistic regression, C4.5, and SVMs, and got the best score from the decision tree method, C4.5.