• Title/Summary/Keyword: Image Descriptors

Search Result 160, Processing Time 0.02 seconds

A New Method for Measurement and Prediction of Memorability from Logo Images using Characteristics of Color and Shape (색상 및 형태 특성을 이용한 로고 영상의 기억용이성 측정 및 예측)

  • Oh, Sang-Il;Kang, Hang-Bong
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.12
    • /
    • pp.1509-1518
    • /
    • 2015
  • Because a logo is a medium that connects between consumers and corporations or brands, designing memorable logo images is vital. Although predicting logo's memorability for brand marketing is essential, there have been only few researches that deal with memorability of logo images. In this paper, we analyze the memorability characteristics in logo images by performing experiments based upon our proposed prediction method for logo image's memorability. Our proposed research consists of three phases: crowdsourcing for memorability computing, computational phase for logo image's memorability, and development of a prediction model. Using computed memorability of logo images by "Visual Memory Game," we analyze the different characteristics of logo's memorability. We first developed a novel computational method that reflects logo image's color and shape. Each computational method on color and shape are selected by comparing the correlations between result values and ground truth memorability. Selected computational value is then converged with generic image feature descriptors such as SIFT and HoG to make a prediction model of logo's memorability. Using our method, we obtain reasonable performances in predicting logo image's memorability.

Shape Description and Retrieval Using Included-Angular Ternary Pattern

  • Xu, Guoqing;Xiao, Ke;Li, Chen
    • Journal of Information Processing Systems
    • /
    • v.15 no.4
    • /
    • pp.737-747
    • /
    • 2019
  • Shape description is an important and fundamental issue in content-based image retrieval (CBIR), and a number of shape description methods have been reported in the literature. For shape description, both global information and local contour variations play important roles. In this paper a new included-angular ternary pattern (IATP) based shape descriptor is proposed for shape image retrieval. For each point on the shape contour, IATP is derived from its neighbor points, and IATP has good properties for shape description. IATP is intrinsically invariant to rotation, translation and scaling. To enhance the description capability, multiscale IATP histogram is presented to describe both local and global information of shape. Then multiscale IATP histogram is combined with included-angular histogram for efficient shape retrieval. In the matching stage, cosine distance is used to measure shape features' similarity. Image retrieval experiments are conducted on the standard MPEG-7 shape database and Swedish leaf database. And the shape image retrieval performance of the proposed method is compared with other shape descriptors using the standard evaluation method. The experimental results of shape retrieval indicate that the proposed method reaches higher precision at the same recall value compared with other description method.

Deep Local Multi-level Feature Aggregation Based High-speed Train Image Matching

  • Li, Jun;Li, Xiang;Wei, Yifei;Wang, Xiaojun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.5
    • /
    • pp.1597-1610
    • /
    • 2022
  • At present, the main method of high-speed train chassis detection is using computer vision technology to extract keypoints from two related chassis images firstly, then matching these keypoints to find the pixel-level correspondence between these two images, finally, detection and other steps are performed. The quality and accuracy of image matching are very important for subsequent defect detection. Current traditional matching methods are difficult to meet the actual requirements for the generalization of complex scenes such as weather, illumination, and seasonal changes. Therefore, it is of great significance to study the high-speed train image matching method based on deep learning. This paper establishes a high-speed train chassis image matching dataset, including random perspective changes and optical distortion, to simulate the changes in the actual working environment of the high-speed rail system as much as possible. This work designs a convolutional neural network to intensively extract keypoints, so as to alleviate the problems of current methods. With multi-level features, on the one hand, the network restores low-level details, thereby improving the localization accuracy of keypoints, on the other hand, the network can generate robust keypoint descriptors. Detailed experiments show the huge improvement of the proposed network over traditional methods.

Soil Particle Shape Analysis Using Fourier Descriptor Analysis (퓨리에 기술자 분석을 이용한 단일 흙 입자의 형상 분석)

  • Koo, Bonwhee;Kim, Taesik
    • Journal of the Korean GEO-environmental Society
    • /
    • v.17 no.3
    • /
    • pp.21-26
    • /
    • 2016
  • Soil particle shape analysis was conducted with sands from Jumujun, Korea and Ras Al Khair, Saudi Arabia. Two hundred times enlarged digital images of the particles of those two sands were obtained with an optical microscope. The resolution of the digital images was $640{\times}320$. By conducting digital image processing, the coordinates of the soil particle boundary were extracted. After mapping those coordinates to the complex space, Fourier transformation was performed and the coefficients of each trigonometry term were computed. The coefficients reflect the shape characteristics of the sand grains and are invariant to translation. To evaluate the shape itself excluding the size of the soil particle, the coefficient was normalized by the equivalent radius of soil particle; this is called Fourier descriptor. After analyzing the Fourier descriptors, it was found that the major characteristics of Jumunjin and Ras Al Khair sands were elongation and asymmetry. Furthermore, it was found that the particle shapes reflect the self-similar, fractal nature of the textural features. The effects of resolution on soil particle shape analysis was also studied. Regarding this, it was found that the significant Fourier descriptors were not significantly affected by the image resolution investigated in this study, but the descriptors associated with textural features were affected.

An Efficient Bitmap Indexing Method for Multimedia Data Reflecting the Characteristics of MPEG-7 Visual Descriptors (MPEG-7 시각 정보 기술자의 특성을 반영한 효율적인 멀티미디어 데이타 비트맵 인덱싱 방법)

  • Jeong Jinguk;Nang Jongho
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.32 no.1
    • /
    • pp.9-20
    • /
    • 2005
  • Recently, the MPEG-7 standard a multimedia content description standard is wide]y used for content based image/video retrieval systems. However, since the descriptors standardized in MPEG-7 are usually multidimensional and the problem called 'Curse of dimensionality', previously proposed indexing methods(for example, multidimensional indexing methods, dimensionality reduction methods, filtering methods, and so on) could not be used to effectively index the multimedia database represented in MPEG-7. This paper proposes an efficient multimedia data indexing mechanism reflecting the characteristics of MPEG-7 visual descriptors. In the proposed indexing mechanism, the descriptor is transformed into a histogram of some attributes. By representing the value of each bin as a binary number, the histogram itself that is a visual descriptor for the object in multimedia database could be represented as a bit string. Bit strings for all objects in multimedia database are collected to form an index file, bitmap index, in the proposed indexing mechanism. By XORing them with the descriptors for query object, the candidate solutions for similarity search could be computed easily and they are checked again with query object to precisely compute the similarity with exact metric such as Ll-norm. These indexing and searching mechanisms are efficient because the filtering process is performed by simple bit-operation and it reduces the search space dramatically. Upon experimental results with more than 100,000 real images, the proposed indexing and searching mechanisms are about IS times faster than the sequential searching with more than 90% accuracy.

Member Verification with Deep Learning-based Image Descriptors (깊은 인공 신경망 이미지 기술자를 활용하는 멤버 분류)

  • Jang, Young Kyun;Lee, Seok Hee;Cho, Nam Ik
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.07a
    • /
    • pp.36-39
    • /
    • 2020
  • 최근 딥 러닝을 이용한 방법들이 이미지 분류에서 뛰어난 성능을 보임에 따라, 복잡한 특징을 담고 있는 얼굴 이미지에 대해 이를 적용하려는 시도가 늘어나고 있다. 특히, 이미지로부터 주요한 특징들을 추출하여 간결하게 이미지를 대표할 수 있는 이미지 기술자 (Image descriptor)를 딥 러닝을 통해 생성하는 연구가 인기를 끌고 있다. 이는 딥 러닝 끝 단에 있는 Fully-connected layer 의 출력으로 얻을 수 있으며 이미지의 의미론적 상관관계를 이용하여 학습된다. 구체적으로, 이미지 기술자는 실수형 벡터 데이터로서, 한 장의 이미지를 수치화 하여 비슷한 이미지 사이에는 벡터 거리가 가깝게, 서로 다른 이미지 사이에는 벡터 거리가 멀게 구성된다. 본 연구에서는 미리 학습된 인공 신경망을 통과시켜 얻은 얼굴 이미지 기술자를 활용하여 멤버 분류를 위한 두 개의 인공 신경망을 학습하는 것을 목표로 한다. 제안된 방법을 검증하기 위해 얼굴 인식에 널리 사용되는 벤치 마크 데이터셋을 활용하였고, 그 결과 제안된 방법이 높은 정확도로 멤버를 분류할 수 있다는 것을 확인하였다.

  • PDF

Robust Facial Expression Recognition Based on Local Directional Pattern

  • Jabid, Taskeed;Kabir, Md. Hasanul;Chae, Oksam
    • ETRI Journal
    • /
    • v.32 no.5
    • /
    • pp.784-794
    • /
    • 2010
  • Automatic facial expression recognition has many potential applications in different areas of human computer interaction. However, they are not yet fully realized due to the lack of an effective facial feature descriptor. In this paper, we present a new appearance-based feature descriptor, the local directional pattern (LDP), to represent facial geometry and analyze its performance in expression recognition. An LDP feature is obtained by computing the edge response values in 8 directions at each pixel and encoding them into an 8 bit binary number using the relative strength of these edge responses. The LDP descriptor, a distribution of LDP codes within an image or image patch, is used to describe each expression image. The effectiveness of dimensionality reduction techniques, such as principal component analysis and AdaBoost, is also analyzed in terms of computational cost saving and classification accuracy. Two well-known machine learning methods, template matching and support vector machine, are used for classification using the Cohn-Kanade and Japanese female facial expression databases. Better classification accuracy shows the superiority of LDP descriptor against other appearance-based feature descriptors.

Consumer Needs and Sensory Evaluation of Jacquard Fabrics for Blind Using Low Melting Polyester (저융점 폴리에스터를 이용한 블라인드용 자카드 직물의 소비자 요구도 및 감성구조)

  • Kim, Jeong Hwa;Lee, Jung Soon;Lee, Sung Young;Lee, Seung Gu
    • Korean Journal of Human Ecology
    • /
    • v.22 no.6
    • /
    • pp.673-686
    • /
    • 2013
  • The purpose of this study is to identify consumer needs and sensory evaluation of jacquard fabrics for blind using low melting polyester. Ten kinds of jacquard fabric used for this study were developed. Developed jacquard fabrics were assessed subjectively by 164 consumers using 7-point scale of 22 consumer needs and 43 sensory descriptors. The results were briefly summarized as follows: the most important parameter to choose fabric for blind was 'Easy-use' and the other parameters are 'Lightproof', 'UV-protect', 'Design', 'Price', 'Color', 'Insulation', 'Easy-care'. The image sensibility of jacquard fabrics was explained by six factors: feminine, active, modern, traditional, pure, cozy. Higher preference was found in jacquard fabrics of clear, natural, luxurious, simple, feminine, young, cozy, graceful image. The preference was predicted 38.2% with feminine, modern, pure, cozy, traditional factors. Correlation coefficient between image sensibility factor 1 and preference was 0.437. The 3 factors (feminine, modern, pure)were selected as significant image sensibility affecting preference.

MPEG-7 Homogeneous Texture Descriptor

  • Ro, Yong-Man;Kim, Mun-Churl;Kang, Ho-Kyung;Manjunath, B.S.;Kim, Jin-Woong
    • ETRI Journal
    • /
    • v.23 no.2
    • /
    • pp.41-51
    • /
    • 2001
  • MPEG-7 standardization work has started with the aims of providing fundamental tools for describing multimedia contents. MPEG-7 defines the syntax and semantics of descriptors and description schemes so that they may be used as fundamental tools for multimedia content description. In this paper, we introduce a texture based image description and retrieval method, which is adopted as the homogeneous texture descriptor in the visual part of the MPEG-7 final committee draft. The current MPEG-7 homogeneous texture descriptor consists of the mean, the standard deviation value of an image, energy, and energy deviation values of Fourier transform of the image. These are extracted from partitioned frequency channels based on the human visual system (HVS). For reliable extraction of the texture descriptor, Radon transformation is employed. This is suitable for HVS behavior. We also introduce various matching methods; for example, intensity-invariant, rotation-invariant and/or scale-invariant matching. This technique retrieves relevant texture images when the user gives a querying texture image. In order to show the promising performance of the texture descriptor, we take the experimental results with the MPEG-7 test sets. Experimental results show that the MPEG-7 texture descriptor gives an efficient and effective retrieval rate. Furthermore, it gives fast feature extraction time for constructing the texture descriptor.

  • PDF

Region-based Image retrieval using EHD and CLD of MPEG-7 (MPEG-7의 EHD와 CLD를 조합한 영역기반 영상검색)

  • Ryu Min-Sung;Won Chee Sun
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.43 no.1 s.307
    • /
    • pp.27-34
    • /
    • 2006
  • In this paper, we propose a combined region-based image retrieval system using EHD(Edge Histogram Descriptor) and CLD(Color Layout Descriptor) of MPEG-7 descriptors. The combined descriptor can efficiently describe edge and color features in terms of sub-image regions. That is, the basic unit for the selection of the region-of-interest (ROI) in the image is the sub-image block of the EHD, which corresponds to 16 (i.e., $4{\times}4)$ non-overlapping image blocks in the image space. This implies that, to have a one-to-one region correspondence between ELE and CLD, we need to take an $8{\times}8$ inverse DCT (IDCT) for the CLD. Experimental results show that the proposed retrieval scheme can be used for image retrieval with the ROI based image retrieval for MPEG-7 indexed images.