• Title/Summary/Keyword: visual similarity

Search Result 244, Processing Time 0.022 seconds

KNN-based Image Annotation by Collectively Mining Visual and Semantic Similarities

  • Ji, Qian;Zhang, Liyan;Li, Zechao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.9
    • /
    • pp.4476-4490
    • /
    • 2017
  • The aim of image annotation is to determine labels that can accurately describe the semantic information of images. Many approaches have been proposed to automate the image annotation task while achieving good performance. However, in most cases, the semantic similarities of images are ignored. Towards this end, we propose a novel Visual-Semantic Nearest Neighbor (VS-KNN) method by collectively exploring visual and semantic similarities for image annotation. First, for each label, visual nearest neighbors of a given test image are constructed from training images associated with this label. Second, each neighboring subset is determined by mining the semantic similarity and the visual similarity. Finally, the relevance between the images and labels is determined based on maximum a posteriori estimation. Extensive experiments were conducted using three widely used image datasets. The experimental results show the effectiveness of the proposed method in comparison with state-of-the-arts methods.

An Effect of Similarity Judgement on Human Performance in Inspection Tasks (유사성(類似性) 판단(判斷)과 검사수행도(檢査遂行度)에 관한 연구)

  • Son, Il-Mun;Lee, Dong-Chun;Lee, Sang-Do
    • Journal of Korean Society for Quality Management
    • /
    • v.20 no.2
    • /
    • pp.109-117
    • /
    • 1992
  • An inspection task largely can be seen as a job divided up into a series of visual search and classification subtasks. In these subtasks, an Inspector must performs to compare the standard references proposed in visual environments and recalled in his memory with the visual stimuli to be inspected. It means that the judgement of similarity should be demanded on inspection tasks. Therefore, the inspector's ability for the judgement of similarity and the difference similarity between inspection materials are important factors to effect on performances in inspection tasks. In this paper, to analysis the effect of these factors on inspection time, an inspection task is designed and suggested by means of computer simulator. Especially, the skin conductance responses(SCR) of subjects are measured to evaluate the complexity of tasks due to the difference of similarity between materials. In the results of experiment, the more similar or different the difference of similarity between materials is, the shorter the inspection time is because of the reduction of task complexity. And, When the inspector's cognition for similarity between materials is consistanct, the inpsection time is improved. Concludingly, the consistency of reponses for similarity judgement becomes a measurement to present the performance levels. And the information of inspection time that due to the difference of similarity between materials must be considered in planning and scheduling inspection tasks.

  • PDF

Development of Road-Following Controller for Autonomous Vehicle using Relative Similarity Modular Network (상대분할 신경회로망에 의한 자율주행차량 도로추적 제어기의 개발)

  • Ryoo, Young-Jae;Lim, Young-Cheol
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.5 no.5
    • /
    • pp.550-557
    • /
    • 1999
  • This paper describes a road-following controller using the proposed neural network for autonomous vehicle. Road-following with visual sensor like camera requires intelligent control algorithm because analysis of relation from road image to steering control is complex. The proposed neural network, relative similarity modular network(RSMN), is composed of some learning networks and a partitioniing network. The partitioning network divides input space into multiple sections by similarity of input data. Because divided section has simlar input patterns, RSMN can learn nonlinear relation such as road-following with visual control easily. Visual control uses two criteria on road image from camera; one is position of vanishing point of road, the other is slope of vanishing line of road. The controller using neural network has input of two criteria and output of steering angle. To confirm performance of the proposed neural network controller, a software is developed to simulate vehicle dynamics, camera image generation, visual control, and road-following. Also, prototype autonomous electric vehicle is developed, and usefulness of the controller is verified by physical driving test.

  • PDF

An Efficient Processing Technique for Similarity based Visual Queries (효율적인 유사 시각질의 처리)

  • Hwang, Jun
    • Journal of Internet Computing and Services
    • /
    • v.1 no.1
    • /
    • pp.1-14
    • /
    • 2000
  • Visual information retrieval and image databases are very important applications of spatial access methods. The quaries for these applications are visual and based not on exact match but on dubjective similarity. The individual aperations of spatial access methods are much more expensive than those of conventional one-dimensional access methods. Also, because the visual queries are much more complex than textual queries, an efficient processing technique for visual queries is one of the critical requirements in the development of large and scalable image databases. Therefore, efficient translation and execution for the complex visual queries are not less important than those of textual databases. In this paper, we introduce our cognitive and topological studies that are required to process subjective visual queries effectively. Then, we propose an efficient translation and execution techniques for similarity based visual queries by conducting these related studies.

  • PDF

Shot Group and Representative Shot Frame Detection using Similarity-based Clustering

  • Lee, Gye-Sung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.9
    • /
    • pp.37-43
    • /
    • 2016
  • This paper introduces a method for video shot group detection needed for efficient management and summary of video. The proposed method detects shots based on low-level visual properties and performs temporal and spatial clustering based on visual similarity of neighboring shots. Shot groups created from temporal clustering are further clustered into small groups with respect to visual similarity. A set of representative shot frames are selected from each cluster of the smaller groups representing a scene. Shots excluded from temporal clustering are also clustered into groups from which representative shot frames are selected. A number of video clips are collected and applied to the method for accuracy of shot group detection. We achieved 91% of accuracy of the method for shot group detection. The number of representative shot frames is reduced to 1/3 of the total shot frames. The experiment also shows the inverse relationship between accuracy and compression rate.

Image Denoising via Fast and Fuzzy Non-local Means Algorithm

  • Lv, Junrui;Luo, Xuegang
    • Journal of Information Processing Systems
    • /
    • v.15 no.5
    • /
    • pp.1108-1118
    • /
    • 2019
  • Non-local means (NLM) algorithm is an effective and successful denoising method, but it is computationally heavy. To deal with this obstacle, we propose a novel NLM algorithm with fuzzy metric (FM-NLM) for image denoising in this paper. A new feature metric of visual features with fuzzy metric is utilized to measure the similarity between image pixels in the presence of Gaussian noise. Similarity measures of luminance and structure information are calculated using a fuzzy metric. A smooth kernel is constructed with the proposed fuzzy metric instead of the Gaussian weighted L2 norm kernel. The fuzzy metric and smooth kernel computationally simplify the NLM algorithm and avoid the filter parameters. Meanwhile, the proposed FM-NLM using visual structure preferably preserves the original undistorted image structures. The performance of the improved method is visually and quantitatively comparable with or better than that of the current state-of-the-art NLM-based denoising algorithms.

Web Usability Testing by Using Scanpath Similarity Analysis (탐색경로 일치도 분석을 이용한 웹사이트 사용성 평가)

  • Kim, Youngjun;Kim, Youngjin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.2
    • /
    • pp.793-803
    • /
    • 2013
  • This study was performed to determine the usefulness of scanpath similarity analysis as one of new web usability testing. The 5 websites of public institutions were used and 15 students participated. First of all, eye movements were tracked and visual appeal ratings were measured as participants freely viewed each website for 3 seconds. Subsequently in continuously tracking the eye movements we asked the participants to perform 17 missions. Finally, in interview the participants rated on satisfaction, awareness, and mission difficulty. Results of this study showed that scanpath similarity had a significant relationship with both the visual appeal ratings(first impression) and the satisfaction. In other words, higher the visual appeal ratings were related to higher scanpath similarity. This result showed that measurement such as scanpath similarity of eye movements could become an objective index for usability testing instead of subjective evaluation such as the satisfaction. We discussed possibility that the usability testing by using the scanpath similarity with both fixation and duration on eye movements will find more appropriately inference on observers' experiences in websites.

Exploring Simultaneous Presentation in Online Restaurant Reviews: An Analysis of Textual and Visual Content

  • Lin Li;Gang Ren;Taeho Hong;Sung-Byung Yang
    • Asia pacific journal of information systems
    • /
    • v.29 no.2
    • /
    • pp.181-202
    • /
    • 2019
  • The purpose of this study is to explore the effect of different types of simultaneous presentation (i.e., reviewer information, textual and visual content, and similarity between textual-visual contents) on review usefulness and review enjoyment in online restaurant reviews (ORRs), as they are interrelated yet have rarely been examined together in previous research. By using Latent Dirichlet Allocation (LDA) topic modeling and state-of-the-art machine learning (ML) methodologies, we found that review readability in textual content and salient objects in images in visual content have a significant impact on both review usefulness and review enjoyment. Moreover, similarity between textual-visual contents was found to be a major factor in determining review usefulness but not review enjoyment. As for reviewer information, reputation, expertise, and location of residence, these were found to be significantly related to review enjoyment. This study contributes to the body of knowledge on ORRs and provides valuable implications for general users and managers in the hospitality and tourism industries.

Algorithms for Indexing and Integrating MPEG-7 Visual Descriptors (MPEG-7 시각 정보 기술자의 인덱싱 및 결합 알고리즘)

  • Song, Chi-Ill;Nang, Jong-Ho
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.1
    • /
    • pp.1-10
    • /
    • 2007
  • This paper proposes a new indexing mechanism for MPEG-7 visual descriptors, especially Dominant Color and Contour Shape descriptors, that guarantees an efficient similarity search for the multimedia database whose visual meta-data are represented with MPEG-7. Since the similarity metric used in the Dominant Color descriptor is based on Gaussian mixture model, the descriptor itself could be transform into a color histogram in which the distribution of the color values follows the Gauss distribution. Then, the transformed Dominant Color descriptor (i.e., the color histogram) is indexed in the proposed indexing mechanism. For the indexing of Contour Shape descriptor, we have used a two-pass algorithm. That is, in the first pass, since the similarity of two shapes could be roughly measured with the global parameters such as eccentricity and circularity used in Contour shape descriptor, the dissimilar image objects could be excluded with these global parameters first. Then, the similarities between the query and remaining image objects are measured with the peak parameters of Contour Shape descriptor. This two-pass approach helps to reduce the computational resources to measure the similarity of image objects using Contour Shape descriptor. This paper also proposes two integration schemes of visual descriptors for an efficient retrieval of multimedia database. The one is to use the weight of descriptor as a yardstick to determine the number of selected similar image objects with respect to that descriptor, and the other is to use the weight as the degree of importance of the descriptor in the global similarity measurement. Experimental results show that the proposed indexing and integration schemes produce a remarkable speed-up comparing to the exact similarity search, although there are some losses in the accuracy because of the approximated computation in indexing. The proposed schemes could be used to build a multimedia database represented in MPEG-7 that guarantees an efficient retrieval.

Moving Objects Modeling for Supporting Content and Similarity Searches (내용 및 유사도 검색을 위한 움직임 객체 모델링)

  • 복경수;김미희;신재룡;유재수;조기형
    • Journal of Korea Multimedia Society
    • /
    • v.7 no.5
    • /
    • pp.617-632
    • /
    • 2004
  • Video Data includes moving objects which change spatial positions as time goes by. In this paper, we propose a new modeling method for a moving object contained in the video data. In order to effectively retrieve moving objects, the proposed modeling method represents the spatial position and the size of a moving object. It also represents the visual features and the trajectory by considering direction, distance and speed or moving objects as time goes by. Therefore, It allows various types of retrieval such as visual feature based similarity retrieval, distance based similarity retrieval and trajectory based similarity retrieval and their mixed type of weighted retrieval.

  • PDF