Search | Korea Science

Design and Implementation of Web Search Engine Using Dynamic Category Hierarchy (동적분류체계를 사용한 웹 검색엔진의 설계 및 구현)

Park, Sun;Choi, Bum-Gi
- Proceedings of the Korea Information Processing Society Conference / 2003.05b / pp.747-750 / 2003
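Along with index search, directory (category) search is supported by web search engines as an important component. Index search has high recall, but it returns so many results that finding the desired one is difficult. In addition, while experienced computer users often use index search, most people who are unfamiliar with computers use directory search. For these reasons, directory search is essential in a search engine. However, directory search frequently fails to find a document when the document's category is ambiguous or not clearly known to the user. That is, its precision is high but its recall is low. To address this problem, this paper designs and implements a new web search engine that constructs a dynamic category hierarchy by quantitatively computing the relationship between categories and keywords using fuzzy logic and, on that basis, deriving implication relations between categories. The implemented search engine can raise the recall of directory search by treating categories related by implication as similar subcategories.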

Dynamic Classification of Categories in Web Search Environment (웹 검색 환경에서 범주의 동적인 분류)

Choi Bum-Ghi;Lee Ju-Hong;Park Sun
- Journal of KIISE:Software and Applications / v.33 no.7 / pp.646-654 / 2006
Directory searching and index searching are the two main methods used in web search engines. Most well-known Internet search engines offer both, so users can switch to the other method if they are not satisfied with the results of one. Index searching tends to return too many results, while directory searching makes it difficult to select the proper category and frequently leads users to the wrong one. In this paper, we propose a novel method in which a category hierarchy is dynamically constructed. To do this, a category is regarded as a fuzzy set of keywords. Similar subcategories by which a category can be extended are then found using fuzzy relational products. The merit of this method is that it enhances the recall of directory search by expanding subcategories on the basis of similarity.
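The idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Łukasiewicz implication operator, the mean-based form of the fuzzy triangle product, the 0.9 threshold, and all category names and membership values are assumptions chosen for illustration. Each category is a fuzzy set of keywords, and the degree to which one category is included in another is the average, over all keywords, of how strongly membership in the first implies membership in the second.

```python
from typing import Dict, List

FuzzySet = Dict[str, float]  # keyword -> membership degree in [0, 1]


def implication(a: float, b: float) -> float:
    """Lukasiewicz implication (an assumed choice of operator)."""
    return min(1.0, 1.0 - a + b)


def inclusion_degree(sub: FuzzySet, sup: FuzzySet, keywords: List[str]) -> float:
    """Mean degree to which membership in `sub` implies membership in `sup`."""
    total = sum(implication(sub.get(k, 0.0), sup.get(k, 0.0)) for k in keywords)
    return total / len(keywords)


def similar_subcategories(target: str, categories: Dict[str, FuzzySet],
                          threshold: float = 0.9) -> List[str]:
    """Categories whose inclusion in `target` reaches the (hypothetical) threshold."""
    keywords = sorted({k for fs in categories.values() for k in fs})
    return [
        name for name, fs in categories.items()
        if name != target
        and inclusion_degree(fs, categories[target], keywords) >= threshold
    ]


# Toy data, invented for this example.
categories = {
    "sports": {"soccer": 0.9, "baseball": 0.8, "ticket": 0.3},
    "soccer": {"soccer": 1.0, "ticket": 0.2},
    "finance": {"stock": 0.9, "ticket": 0.1},
}

print(similar_subcategories("sports", categories))
```

With this toy data, "soccer" is almost fully included in "sports" and would be offered as a similar subcategory, while "finance" is not. Note that keywords absent from both categories count as fully satisfied implications, which inflates the score as the keyword universe grows; a real system would need to account for that.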

Dynamic Classification of Web Search Categories (웹 검색 분류어의 동적인 분류)

Choi, Bum-Ghi;Park, Sun;Lee, Ju-Hong
- Proceedings of the Korean Information Science Society Conference / 2003.04d / pp.521-523 / 2003
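To solve the problems of category search in directory search engines on the web, this paper proposes a new method that constructs a dynamic category hierarchy by computing the relationship between categories and keywords using fuzzy logic and deriving implication relations between categories. The advantage of this method is that it can raise the recall of directory search results by treating categories related by implication as similar subcategories.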

The Effect of User-Centered Categorization System of Homepages on Directory Search (사용자 중심의 홈페이지 분류체계가 분류 검색에 미치는 효과)

박창호;염성숙;이정모
- Korean Journal of Cognitive Science / v.11 no.1 / pp.47-65 / 2000
Categorization systems of homepages in search engines are likely to be constructed with only system efficiency in mind and are not user-centered. This study investigated users' mental models of superordinate and subordinate categories using category terms from major Korean search engines. From the results, we constructed two kinds of categorization system: a redundant system and a singular system. In the redundant system, a subordinate category can belong to a number of superordinate categories, whereas in the singular system it belongs to only one superordinate category. Three prototype categorization systems, with 'Simmani', were designed, and search performance with each system was observed repeatedly. Overall results, taking into account the frequency of correct answers, the number of steps, and the time taken to reach a solution, showed that the redundant system was superior to the other two systems. This indicates that category search could be improved with an appropriate categorization system. However, the recognition test score was best in the singular system, which indicates that search performance and recognition memory of categories reflect different aspects of categorization system learning. Issues of category organization, interface design, prior knowledge, exploratory learning, and application areas are discussed further.

An Analysis on Classification Retrieval Operation in University Libraries (대학도서관의 분류검색 운영 분석)

Lee Jong-Moon
- Journal of Korean Library and Information Science Society / v.36 no.2 / pp.165-178 / 2005
This study aims to identify the status of classification retrieval operation by investigating and analyzing the classification retrieval offered for books in university libraries. The investigation concentrated on whether a classification retrieval service is provided, the access method, and the classification retrieval level. The data was collected from the 97 libraries whose URLs were accessible during the survey period, out of 100 libraries selected by systematic sampling. As a result, 92.8% of the 97 libraries provided a classification retrieval service; of these, 52.2% allowed access only by classification number and 47.8% by both classification number and classification directory. Consequently, it was found that the retrieval environment in libraries where access is possible only by classification number should be urgently improved in order to promote classification retrieval.
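The abstract reports only percentages. As a rough consistency check, the sketch below shows the library counts those percentages appear to correspond to. The counts 90, 47, and 43 are inferred here and are not stated in the source; they are the whole numbers that reproduce the reported figures to one decimal place.

```python
def pct(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place, as reported in the abstract."""
    return round(100 * part / whole, 1)


surveyed = 97            # libraries with accessible URLs (from the abstract)
with_service = 90        # inferred, not stated in the source
number_only = 47         # inferred, not stated in the source
number_and_directory = 43  # inferred, not stated in the source

print(pct(with_service, surveyed))              # share offering the service
print(pct(number_only, with_service))           # access by classification number only
print(pct(number_and_directory, with_service))  # access by number and directory
```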

A Web-Based Information System for the Integrated Search for Protein Structure Classifications (단백질 구조 분류의 통합 검색을 위한 웹 정보시스템)

신원준;황의윤;김진홍;안건태;이명준
- Proceedings of the Korean Information Science Society Conference / 2004.04b / pp.274-276 / 2004
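Proteins are often classified on the basis of similar parts when their spatial features are considered. Protein structure classification databases provide classification information based on the diverse structural information of proteins. Representative examples are the CATH and SCOP databases. These databases classify protein structures by different criteria, and each provides its own web service for searching classification information. It would therefore be useful to be able to search several kinds of protein structure classification information from a single web site. This paper describes a web information system that systematically provides integrated search and statistical information over the protein structure classification information defined by CATH and SCOP. The proposed system processes the data provided by CATH and SCOP to build a structured database that supports effective structure classification search. The developed system returns the hierarchical structure classification information from both databases in a single search by PDB identifier, CATH identifier, SCOP identifier, or protein classification name. It also provides useful statistics on protein structures.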

Design of Intelligent Web Image Search Engine (지능적 웹 이미지 검색 엔진의 설계)

박명선;이석호
- Proceedings of the Korean Information Science Society Conference / 1999.10a / pp.51-53 / 1999
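Existing web image search engines use the features of a web image and the text of the HTML document that contains it when searching for images. However, since the meaning of text can vary with context, classifying the search targets in advance can improve search efficiency. This paper proposes an intelligent web image search engine that automatically extracts image description text related to an image from the text of a web document and automatically classifies web images to improve search efficiency. The engine raises classification accuracy by using the degree of association between categories and terms and between terms and terms.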

A Study on the Types of Online Shopping Queries using Topic Modeling and Principal Components Analysis (토픽모델링과 주성분 분석을 활용한 온라인 쇼핑 검색 질의 유형 분류)

Kang, Hyeonah;Lim, Heuiseok
- Proceedings of the Korea Information Processing Society Conference / 2020.11a / pp.765-768 / 2020
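Most prior work on search queries has concentrated on classifying query topics and has analyzed queries through researchers' qualitative judgment of the query itself. Such work is limited in that it did not consider the documents clicked after the search and in that its analysis topics and data were restricted. This study therefore used one year of search logs from a large Korean online shopping mall and defined query topics by performing topic modeling on the search queries and the titles of documents viewed after the search. In addition, to characterize each topic according to search behavior, it extracted key variables through principal component analysis and then analyzed the search behavior characteristics of each topic. The results are expected to contribute to building effective search services and developing search systems. As future work, an automatic classification system could be implemented through text classifier modeling experiments.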
https://doi.org/10.3745/PKIPS.y2020m11a.765

A Study of Personalized Retrieval System Evaluation (개인화 검색시스템 평가에 관한 연구)

Kim, Kwang-Young;Choe, Ho-Seop;Jin, Du-Suk;Kim, Jin-Suk
- Proceedings of the Korean Information Science Society Conference / 2010.06b / pp.39-42 / 2010
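To evaluate a personalized retrieval system based on topic classification, this paper used the Korean test collection (HANTEC v2.0), which is used for evaluating existing Korean information retrieval systems. For the evaluation, first, topic classification of the Korean test collection was performed using the Hankook Ilbo-40075 (한국일보-40075) document classification test collection. Second, the documents of the Korean test collection were classified with a kNN classifier according to the classification scheme of the Hankook Ilbo-40075 collection. Finally, the performance of the topic-classification-based personalized retrieval system was evaluated using the resulting collection.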
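The kNN classification step mentioned in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the bag-of-words representation, cosine similarity, majority voting, the value of k, and the toy documents and labels are all hypothetical and are not taken from the paper or from the HANTEC or Hankook Ilbo-40075 collections.

```python
import math
from collections import Counter
from typing import List, Tuple


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_classify(doc: str, labeled: List[Tuple[str, str]], k: int = 3) -> str:
    """Assign `doc` the majority label among its k most similar labeled documents."""
    query = Counter(doc.split())
    ranked = sorted(
        labeled,
        key=lambda item: cosine(query, Counter(item[0].split())),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy labeled collection, invented for this example.
labeled_docs = [
    ("election vote parliament", "politics"),
    ("vote president election", "politics"),
    ("goal match league", "sports"),
    ("league team match", "sports"),
]

print(knn_classify("president election result", labeled_docs))
```

In the setting the abstract describes, the labeled documents would come from the classified collection and the unlabeled ones from the retrieval test collection; the sketch only shows the voting mechanism itself.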

The selection of Best suited Automatic Web Document Classification Based on Intranet (인트라넷 기반의 최적의 웹문서 자동 분류기법 선정)

김국희;윤희병
- Proceedings of the Korean Institute of Intelligent Systems Conference / 2004.10a / pp.423-426 / 2004
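In intranets, web search engines are being actively introduced to search the growing number of web documents, and most take the form of search engines that are accessed with a known keyword. However, when users do not know what to look for, a web document classification scheme can offer an efficient approach. Some existing classification schemes are built manually and therefore cannot cope efficiently with the growing volume of web documents, so classification using automatic classification techniques would be more efficient. This paper takes the manually built classification scheme of a defense intranet, applies various classification techniques with different term-weighting methods, compares and evaluates their performance, and aims to improve classification performance by applying the result to an automatic web document classification system.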
