• Title/Summary/Keyword: Relevant Image Retrieval

Search Result 41, Processing Time 0.019 seconds

MPEG-7 Homogeneous Texture Descriptor

  • Ro, Yong-Man;Kim, Mun-Churl;Kang, Ho-Kyung;Manjunath, B.S.;Kim, Jin-Woong
    • ETRI Journal
    • /
    • v.23 no.2
    • /
    • pp.41-51
    • /
    • 2001
  • MPEG-7 standardization work has started with the aims of providing fundamental tools for describing multimedia contents. MPEG-7 defines the syntax and semantics of descriptors and description schemes so that they may be used as fundamental tools for multimedia content description. In this paper, we introduce a texture based image description and retrieval method, which is adopted as the homogeneous texture descriptor in the visual part of the MPEG-7 final committee draft. The current MPEG-7 homogeneous texture descriptor consists of the mean, the standard deviation value of an image, energy, and energy deviation values of Fourier transform of the image. These are extracted from partitioned frequency channels based on the human visual system (HVS). For reliable extraction of the texture descriptor, Radon transformation is employed. This is suitable for HVS behavior. We also introduce various matching methods; for example, intensity-invariant, rotation-invariant and/or scale-invariant matching. This technique retrieves relevant texture images when the user gives a querying texture image. In order to show the promising performance of the texture descriptor, we take the experimental results with the MPEG-7 test sets. Experimental results show that the MPEG-7 texture descriptor gives an efficient and effective retrieval rate. Furthermore, it gives fast feature extraction time for constructing the texture descriptor.

  • PDF

Relevant Image Retrieval of Korean Documents based on Sentence and Word Importance (문장 및 단어 중요도를 통한 한국어 문서 연관 이미지 검색)

  • Kim, Nam-Gyu;Kang, Shin-Jae
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.3
    • /
    • pp.43-48
    • /
    • 2019
  • While reading text-only documents and finding unknown words, readers will become the focus disturbed and not be able to understand the content of the documents. Because children have little experience, it is difficult to understand correctly if the description in context is unfamiliar or ambiguous. In this paper, in order to help understand the text and increase the interest of the readers, we analyze the texts of documents and select the contents that are considered important, and implement a system that displays the most relevant images automatically from the web and links the texts and the images together. The implementation of the system divides the article into paragraphs, analyzes the text, selects important sentences for each paragraph and the important words that best represent the meaning of the important sentences, searches for images related to the words on the web, and then links the images to each of the previous paragraphs. Experiments have shown how to select important sentences and how to select important words in the sentences. As a result of the experiment, we could get 60% performance by evaluating the accuracy of the relation between three selected images and corresponding important sentences.

The Kernel Trick for Content-Based Media Retrieval in Online Social Networks

  • Cha, Guang-Ho
    • Journal of Information Processing Systems
    • /
    • v.17 no.5
    • /
    • pp.1020-1033
    • /
    • 2021
  • Nowadays, online or mobile social network services (SNS) are very popular and widely spread in our society and daily lives to instantly share, disseminate, and search information. In particular, SNS such as YouTube, Flickr, Facebook, and Amazon allow users to upload billions of images or videos and also provide a number of multimedia information to users. Information retrieval in multimedia-rich SNS is very useful but challenging task. Content-based media retrieval (CBMR) is the process of obtaining the relevant image or video objects for a given query from a collection of information sources. However, CBMR suffers from the dimensionality curse due to inherent high dimensionality features of media data. This paper investigates the effectiveness of the kernel trick in CBMR, specifically, the kernel principal component analysis (KPCA) for dimensionality reduction. KPCA is a nonlinear extension of linear principal component analysis (LPCA) to discovering nonlinear embeddings using the kernel trick. The fundamental idea of KPCA is mapping the input data into a highdimensional feature space through a nonlinear kernel function and then computing the principal components on that mapped space. This paper investigates the potential of KPCA in CBMR for feature extraction or dimensionality reduction. Using the Gaussian kernel in our experiments, we compute the principal components of an image dataset in the transformed space and then we use them as new feature dimensions for the image dataset. Moreover, KPCA can be applied to other many domains including CBMR, where LPCA has been used to extract features and where the nonlinear extension would be effective. Our results from extensive experiments demonstrate that the potential of KPCA is very encouraging compared with LPCA in CBMR.

Design and Implementation of a COncept-based Image Retrieval System: COIRS (개념 기반 이미지 정보 검색 시스템 COIRS의 설계 및 구현)

  • Yang, Hyung-Jeong;Kim, Ho-Young;Yang, Jae-Dong;Hur, Dae-Young
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.12
    • /
    • pp.3025-3035
    • /
    • 1998
  • In this paper, we describe the design and implementationof COIRS COncept,based Image Retricval System). It differs from extant content-based image retrieval systems in that it enables users to query based on concepts- it allows users to get images concepmally relevant. A concept is basically an aggregation of promitive objects in an image. For such a cencept based image retrieval functionality. COIRS aglopts an image descriptor called triple and includes a triple thesaurus used for capturing concepts. There are four facilities in COIRS: a visual image indeses a triple thesaurus, an inverted fiel, and a user query interface. The visnal image indeser facilitates object laeling and the percification of positionof objects. It is an assistant tool designed to minimize manual work when indexing images. The thesarrus captires the concepts by analyzing triples, thereby extracting image semantics. The triples are then for formalating queries as well as indexing images. The user query interiare enables users to formulate...

  • PDF

Development of Computer Vision System for Individual Recognition and Feature Information of Cow (I) - Individual recognition using the speckle pattern of cow - (젖소의 개체인식 및 형상 정보화를 위한 컴퓨터 시각 시스템 개발 (I) - 반문에 의한 개체인식 -)

  • 이종환
    • Journal of Biosystems Engineering
    • /
    • v.27 no.2
    • /
    • pp.151-160
    • /
    • 2002
  • Cow image processing technique would be useful not only for recognizing an individual but also for establishing the image database and analyzing the shape of cows. A cow (Holstein) has usually the unique speckle pattern. In this study, the individual recognition of cow was carried out using the speckle pattern and the content-based image retrieval technique. Sixty cow images of 16 heads were captured under outdoor illumination, which were complicated images due to shadow, obstacles and walking posture of cow. Sixteen images were selected as the reference image for each cow and 44 query images were used for evaluating the efficiency of individual recognition by matching to each reference image. Run-lengths and positions of runs across speckle area were calculated from 40 horizontal line profiles for ROI (region of interest) in a cow body image after 3 passes of 5$\times$5 median filtering. A similarity measure for recognizing cow individuals was calculated using Euclidean distance of normalized G-frame histogram (GH). normalized speckle run-length (BRL), normalized x and y positions (BRX, BRY) of speckle runs. This study evaluated the efficiency of individual recognition of cow using Recall(Success rate) and AVRR(Average rank of relevant images). Success rate of individual recognition was 100% when GH, BRL, BRX and BRY were used as image query indices. It was concluded that the histogram as global property and the information of speckle runs as local properties were good image features for individual recognition and the developed system of individual recognition was reliable.

An XML-based Multimedia News Management System (XML 기반 멀티미디어 뉴스 관리 시스템)

  • Kim Hyon Hee;Park Seung Soo
    • The KIPS Transactions:PartB
    • /
    • v.11B no.7 s.96
    • /
    • pp.785-792
    • /
    • 2004
  • With recent progress of related multimedia computing technologies, it is necessay to retrieve diverse types of multimedia data based on multi-media content and their relationships. However, different from alphanumeric data, it is difficult to provide relevant multimedia information, be-cause multimedia contents and their relationships are implied in multimedia data. Therefore, in case of a multimedia news service system that is a representative multimedia application, most of new services provide relevant news about text articles and retrieval of multimedia news such as video news or image news are provided independently. In this paper, we present an XML-based multimedia news management system, which provides integrating, retrieval, and delivery of relevant multimedia news. Our data model composed of media object, relationship object, and view object represents diverse types of multimedia news content and semantically related multimedia news. In addition, a proposed view mechanism makes it possible to customize multimedia news, and therefore provides multimedia news efficiently.

Comparison Shopping Systems using Image Retrieval based on Semantic Web (시맨틱 웹 기반의 이미지 정색을 이용한 비교 쇼핑 시스템)

  • Lee, Kee-Sung;Yu, Young-Hoon;Jo, Gun-Sik;Kim, Heung-Nam
    • Journal of Intelligence and Information Systems
    • /
    • v.11 no.2
    • /
    • pp.1-15
    • /
    • 2005
  • The explosive growth of the Internet leads to various on-line shopping malls and active E-Commerce. however, as the internet has experienced continuous growth, users have to face a variety and a huge amount of items, and often waste a lot of time on purchasing items that are relevant to their interests. To overcome this problem the comparison shopping systems, which can help to compare items' information with those other shopping malls, have been issued as a solution. However, when users do not have much knowledge what they want to find, a keyword-based searching in the existing comparison shopping systems lead users to waste time for searching information. Thereby, the performance is fell down. To solve this problem in this research, we suggest the Comparison Shopping System using Image Retrieval based on Semantic Web. The proposed system can assist users who don't know items' information that they want to find and serve users for quickly comparing information among the items. In the proposed system we use semantic web technology. We insert the Semantic Annotation based on Ontology into items' image of each shopping mall. Consequently, we employ those images for searching the items instead of using a complex keyword. In order to evaluate performance of the proposed system we compare our experimental results with those of Keyword-based Comparison Shopping System and simple Semantic Web-based Comparison Shopping System. Our result shows that the proposed system has improved performance in comparison with the other systems.

  • PDF

Comparison and Evaluation of Web-based Image Search Engines (이미지정보 탐색을 위한 웹 검색엔진의 비교 평가)

  • Kim, Hyo-Jung
    • Journal of Information Management
    • /
    • v.31 no.4
    • /
    • pp.50-70
    • /
    • 2000
  • Since the contents of internet resources are beginning to include texts, images and sounds, different Web-based image search engines have been developed accordingly. It is a fact that these diversities of multimedia contents have made search process and retrieval of relevant information very difficult. The purpose of the study is to compare and evaluate its special features and performance of the existing image search engines in order to provide user help to select appropriate search engines. The study selected AV Photo Finder, Lycos MultiMedia, Amazing Picture Machine, Image Surfer, WebSeek, Ditto for comparison and evaluation because of their reputations of popularity among users of image search engines. The methodology of the study was to analyze previous related literature and establish criteria for the evaluation of image search engines. The study investigated characteristics, indexing methods, search capabilities, screen display and user interfaces of different search engines for the purpose of comparison of its performance. Finally, the study measured relative recall and precision ratios to evaluate their electiveness of retrieval under the experimental set up. Results of the comparative analysis in regard to its search performance are as follows. AV Photo Finder marked the highest rank among other image search engines. Ditto and WebSeek also showed comparatively high precision ratio. Lycos MultiMedia and Image Surfer follows after them. Amazing Picture Machine stowed the lowest in ranking.

  • PDF

Toward a Key-frame Automatic Extraction Method for Video Storyboard Surrogates Based on Users' EEG Signals and Discriminant Analysis (뇌파측정기술(EEG)과 판별분석을 이용한 영상물의 키프레임 자동 분류 방안 연구)

  • Kim, Hyun-Hee
    • Journal of the Korean Society for information Management
    • /
    • v.32 no.3
    • /
    • pp.377-396
    • /
    • 2015
  • This study proposed a key-frame automatic extraction method for video storyboard surrogates based on users' cognitive responses, EEG signals and discriminant analysis. Using twenty participants, we examined which ERP pattern is suitable for each step, assuming that there are five image recognition and process steps (stimuli attention, stimuli perception, memory retrieval, stimuli/memory comparison, relevance judgement). As a result, we found that each step has a suitable ERP pattern, such as N100, P200, N400, P3b, and P600. Moreover, we also found that the peak amplitude of left parietal lobe (P7) and the latency of FP2 are important variables in distinguishing among relevant, partial, and non-relevant frames. Using these variables, we conducted a discriminant analysis to classify between relevant and non-relevant frames.

Design and Implementation of the Query Processor and Browser for Content-based Retrieval in Video Database (내용기반 검색을 위한 비디오 데이터베이스 질의처리기 및 브라우저의 설계 및 구현)

  • Lee, Hun-Sun;Kim, Yong-Geol;Bae, Yeong-Rae;Jin, Seong-Il
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.8
    • /
    • pp.2008-2019
    • /
    • 1999
  • As computing technologies are rapidly progressed and widely used, the needs of high quality information have been increased. To satisfy these needs, it is essential to develop a system which can provide an efficient storing, managing and retrieving mechanism of complex multimedia data, esp. video data. In this paper, we propose a metadata model which can support content-based retrieval of video data. And we design and implement an integrated user interface for querying and browser for content-based retrieval in video database which can efficiently access and browse the video clip that user want to see. Proposed query processor and browser can support various user queries by integrating image feature, spatial temporal feature and annotation. Our system supports structure browsing of retrieved result, so users can more exactly and efficiently access relevant video clip. Without browsing the whole video clip, users can know the contents of video by seeing the storyboard. This storyboard facility makes users know more quickly the content of video clip.

  • PDF