• Title/Summary/Keyword: content- based retrieval

Search Result 717, Processing Time 0.027 seconds

Text-Confidence Feature Based Quality Evaluation Model for Knowledge Q&A Documents (텍스트 신뢰도 자질 기반 지식 질의응답 문서 품질 평가 모델)

  • Lee, Jung-Tae;Song, Young-In;Park, So-Young;Rim, Hae-Chang
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.10
    • /
    • pp.608-615
    • /
    • 2008
  • In Knowledge Q&A services where information is created by unspecified users, document quality is an important factor of user satisfaction with search results. Previous work on quality prediction of Knowledge Q&A documents evaluate the quality of documents by using non-textual information, such as click counts and recommendation counts, and focus on enhancing retrieval performance by incorporating the quality measure into retrieval model. Although the non-textual information used in previous work was proven to be useful by experiments, data sparseness problem may occur when predicting the quality of newly created documents with such information. To solve data sparseness problem of non-textual features, this paper proposes new features for document quality prediction, namely text-confidence features, which indicate how trustworthy the content of a document is. The proposed features, extracted directly from the document content, are stable against data sparseness problem, compared to non-textual features that indirectly require participation of service users in order to be collected. Experiments conducted on real world Knowledge Q&A documents suggests that text-confidence features show performance comparable to the non-textual features. We believe the proposed features can be utilized as effective features for document quality prediction and improve the performance of Knowledge Q&A services in the future.

A Proposal of Methods for Extracting Temporal Information of History-related Web Document based on Historical Objects Using Machine Learning Techniques (역사객체 기반의 기계학습 기법을 활용한 웹 문서의 시간정보 추출 방안 제안)

  • Lee, Jun;KWON, YongJin
    • Journal of Internet Computing and Services
    • /
    • v.16 no.4
    • /
    • pp.39-50
    • /
    • 2015
  • In information retrieval process through search engine, some users want to retrieve several documents that are corresponding with specific time period situation. For example, if user wants to search a document that contains the situation before 'Japanese invasions of Korea era', he may use the keyword 'Japanese invasions of Korea' by using searching query. Then, search engine gives all of documents about 'Japanese invasions of Korea' disregarding time period in order. It makes user to do an additional work. In addition, a large percentage of cases which is related to historical documents have different time period between generation date of a document and record time of contents. If time period in document contents can be extracted, it may facilitate effective information for retrieval and various applications. Consequently, we pursue a research extracting time period of Joseon era's historical documents by using historic literature for Joseon era in order to deduct the time period corresponding with document content in this paper. We define historical objects based on historic literature that was collected from web and confirm a possibility of extracting time period of web document by machine learning techniques. In addition to the machine learning techniques, we propose and apply the similarity filtering based on the comparison between the historical objects. Finally, we'll evaluate the result of temporal indexing accuracy and improvement.

Design and Implementation of Web-based Problem Management System for CT Radiological Technologist Education (CT 전문방사선사 교육을 위한 웹기반 문항관리 시스템의 설계 및 구현)

  • Shin Yong-Won;Koo Bong-Oh;Shim Choon-Bo
    • The Journal of the Korea Contents Association
    • /
    • v.5 no.1
    • /
    • pp.27-35
    • /
    • 2005
  • Recently, despite of the rapid progress of information technology in the medical and health fields, the development and management of problem sets about medical and education contents related with radiological technologist has been still achieved by manual and offline method using document editor. In this study, the unique web-based problem management system is designed and implemented. That system can efficiently manage and present various kind of problem set about integrated education and personal license without time and space limitations in order to improve the efficiency of supplementary training and to obtain the professional license for CT radiological technologist. The proposed system is composed of administration module and user module. The former supports several functions such as problem creation, problem categorization, user management, and adjustment of leveled assessment. On the other hand, the latter functions examination applying , problem retrieval, personal score retrieval, and interpretation viewing, and so on. In addition, our system is expected as a useful and practical system which provides problem interpretation and analysis of score results after applying for the examination. It can elevate ability of learning and information interchange among them preparing for CT professional radiological technologist licensing examination

  • PDF

A new approach for overlay text detection from complex video scene (새로운 비디오 자막 영역 검출 기법)

  • Kim, Won-Jun;Kim, Chang-Ick
    • Journal of Broadcast Engineering
    • /
    • v.13 no.4
    • /
    • pp.544-553
    • /
    • 2008
  • With the development of video editing technology, there are growing uses of overlay text inserted into video contents to provide viewers with better visual understanding. Since the content of the scene or the editor's intention can be well represented by using inserted text, it is useful for video information retrieval and indexing. Most of the previous approaches are based on low-level features, such as edge, color, and texture information. However, existing methods experience difficulties in handling texts with various contrasts or inserted in a complex background. In this paper, we propose a novel framework to localize the overlay text in a video scene. Based on our observation that there exist transient colors between inserted text and its adjacent background a transition map is generated. Then candidate regions are extracted by using the transition map and overlay text is finally determined based on the density of state in each candidate. The proposed method is robust to color, size, position, style, and contrast of overlay text. It is also language free. Text region update between frames is also exploited to reduce the processing time. Experiments are performed on diverse videos to confirm the efficiency of the proposed method.

An Analysis on the Conception of Web Navigation in Library and Information Science Research (웹 정보 네비게이션에 대한 개념 분석: 국외 문헌정보학 연구논문을 중심으로)

  • Park, Heejin
    • Journal of the Korean Society for information Management
    • /
    • v.29 no.4
    • /
    • pp.229-249
    • /
    • 2012
  • The study aims to analyze the notion of web navigation in library and information science (LIS) by investigating the concepts and its consequences. The analysis is based on 32 articles published in international LIS journals between 1995 and 2012. The study noted the changing contexts of information retrieval in the Web environment, and discusses the changes over time in the way in which the notion of navigation have been regarded. Three main concepts are identified: navigation behavior, navigation strategies, and navigation designs.

Real-time Face Extraction for Content-based Image Retrieval (내용기반 영상 검색을 위한 실시간 얼굴 영역 추출)

  • 이미숙;이성환
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1996.06a
    • /
    • pp.169-174
    • /
    • 1996
  • 객체 인식은 대용량의 영상 데이터를 분석, 탐색하고 재구성하기 위한 내용기반 영상 검색의 매우 중요한 분야이며, 특히 인간의 얼굴은 검색 영상 내에서 대부분 주요한 장면에 위치하고 있기 때문에 그 비중이 매우 크다. 본 논문에서는 내용기반 영상 검색을 위한 실시간 얼굴 영역 추출 방법을 제안한다. 제안된 방법에서는 다층 피라미드 구조와 간단한 형태의 머리 형판을 사용하여 얼굴의 후보 영역을 추출한 후, 보다 정확한 얼굴 영역을 추출하기 위하여 후보 영역 내에서 눈의 위치를 탐색하고, 두 눈의 위치를 기준으로 최종적인 얼굴 영역을 추출하였다. 얼굴 후보 영역 추출 단계에서는 얼굴의 형태 정보를 포함하고 있는 모자이크 형판을 사용하여 머리와 턱을 포함한 얼굴 영역을 추출하였으며, 눈 위치 추출 단계에서는 눈의 위치 정보를 사용하여 눈의 탐색 영역을 결정하고, 탐색 영역 내에서 이진 영상 형판을 사용하여 눈의 위치를 추출한 후, 눈 영역의 무게 중심을 눈의 중심 위치로 설정하였다. 마지막 얼굴 영역 추출단계에서는 두 눈의 위치를 기준으로 사각형의 영역을 얼굴 영역으로 추출하였다. 제안된 방법의 성능을 검증하기 위하여 1700장의 다양한 영상에 대하여 실험하였으며, 실험 결과 한 장의 영상에서 얼굴 영역을 추출하는데 있어서, Pentium 166Mz의 PC상에서 평균 3.2초의 처리 속도와 91.7%의 추출률을 보임으로써, 실시간 얼굴 영역 추출에 매우 효과적임을 알 수 있었다.

  • PDF

텍스타일 영상에서의 감성 기반 검색 시스템

  • Kim, Young-Rae;Shin, Yun-Hee;Kim, Eun-Yi
    • Proceedings of the Korea Society for Industrial Systems Conference
    • /
    • 2009.05a
    • /
    • pp.82-87
    • /
    • 2009
  • 본 논문에서는 감성 기반으로 텍스타일을 자동으로 색인하고 검색 할 수 있는 시스템을 제안한다. 제안된 시스템은 영상 수집기, 감성 색인기, 검색기(Matcher), 질의 인터페이스로 구성되어 있다. 감성 색인기는 텍스타일 영상에 포함된 컬러와 패턴 정보를 기반으로 감성개념을 인식하고, 이를 이용하여 영상을 색인한다. 이때, 감성 어휘로 고바야시가 정의한 8개 (romantic, natural, casual, elegant, chic, classic, dandy, modern)를 사용한다. 질의 인터페이스에서 사용자는 두 가지 방식으로 질의를 선택할 수 있다. 첫 번째 방법은 감성 키워드를 사용하는 것이고, 두 번째는 사용자의 의도를 설명할 수 있는 영상을 이용하는 예제 기반 질의 방식이다. 질의가 주어지면, 검색기는 랭킹 알고리즘을 사용하여 검색 결과를 생성한다. 이 때, 유사도 비교방식은 선택된 질의방식에 따라 달라진다. 제안된 시스템의 성능을 검증하기 위해 웹 검색에 익숙한 50명(남자: 32명, 여자: 18명)을 대상으로 웹에서 수집한 3,416 장에 대해서 3가지 항목으로 사용자 평가를 하였다. 사용자 평가의 항목인 적합도(Relevance), 노력(Search Effort), 만족도(Satisfaction)의 결과로 사용자가 검색한 결과영상에서 적합도의 수치가 낮게 나왔지만, 만족도와 노력의 수치는 높게 평가되었다. 제안된 시스템에서 사용자는 자신이 선호하는 결과 영상을 상위 40개의 영상 내에서 얻을 수 있었다. 이는 제안된 시스템이 사용자들이 원하는 영상을 효율적으로 검색할 수 있다는 것을 증명했다.

  • PDF

A SHAPE FEATURE EXTRACTION FOR COMPLEX TOPOGRAPHICAL IMAGES

  • Kwon Yong-Il;Park Ho-Hyun;Lee Seok-Lyong;Chung Chin-Wan
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.575-578
    • /
    • 2005
  • Topographical images, in case of aerial or satellite images, are usually similar in colors and textures, and complex in shapes. Thus we have to use shape features of images for efficiently retrieving a query image from topographical image databases. In this paper, we propose a shape feature extraction method which is suitable for topographical images. This method, which improves the existing projection in the Cartesian coordinates, performs the projection operation in the polar coordinates. This method extracts three attributes, namely the number of region pixels, the boundary pixel length of the region from the centroid, the number of alternations between region and background, along each angular direction of the polar coordinates. It extracts the features of complex shape objects which may have holes and disconnected regions. An advantage of our method is that it is invariant to rotation/scale/translation of images. Finally we show the advantages of our method through experiments by comparing it with CSS which is one of the most successful methods in the area of shape feature extraction

  • PDF

Document Clustering Methods using Hierarchy of Document Contents (문서 내용의 계층화를 이용한 문서 비교 방법)

  • Hwang, Myung-Gwon;Bae, Yong-Geun;Kim, Pan-Koo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.10 no.12
    • /
    • pp.2335-2342
    • /
    • 2006
  • The current web is accumulating abundant information. In particular, text based documents are a type used very easily and frequently by human. So, numerous researches are progressed to retrieve the text documents using many methods, such as probability, statistics, vector similarity, Bayesian, and so on. These researches however, could not consider both subject and semantic of documents. So, to overcome the previous problems, we propose the document similarity method for semantic retrieval of document users want. This is the core method of document clustering. This method firstly, expresses a hierarchy semantically of document content ut gives the important hierarchy domain of document to weight. With this, we could measure the similarity between documents using both the domain weight and concepts coincidence in the domain hierarchies.

Color Image Segmentation for Content-based Image Retrieval (내용기반 영상검색을 위한 칼라 영상 분할)

  • Lee, Sang-Hun;Hong, Choong-Seon;Kwak, Yoon-Sik;Lee, Dai-Young
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.9
    • /
    • pp.2994-3001
    • /
    • 2000
  • In this paper. a method for color image segmentation using region merging is proposed. A inhomogeneity which exists in image is reduced by smoothing with non-linear filtering. saturation enhancement and intensity averaging in previous step of image segmentation. and a similar regions are segmented by non-uniform quantization using zero-crossing information of color histogram. A edge strength of initial region is measured using high frequency energy of wavelet transform. A candidate region which is merged in next step is selected by doing this process. A similarity measure for region merging is processed using Euclidean distance of R. G. B color channels. A Proposed method can reduce an over-segmentation results by irregular light sources et. al, and we illustrated that the proposed method is reasonable by simulation.

  • PDF