• Title/Summary/Keyword: Image Feature

Wine Label Recognition System using Image Similarity (이미지 유사도를 이용한 와인라벨 인식 시스템)

  • Jung, Jeong-Mun;Yang, Hyung-Jeong;Kim, Soo-Hyung;Lee, Guee-Sang;Kim, Sun-Hee
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.5
    • /
    • pp.125-137
    • /
    • 2011
  • Systems that take camera-phone images as input are an active research topic. This paper proposes a system that displays wine pictures ordered by their similarity to an input wine label. Image similarity is computed from the representative color of each cell of the image, the recognized text color, the background color, and the distribution of feature points. To measure color differences, RGB is converted to CIE-Lab; the feature points are extracted with the Harris corner detection algorithm. Weights are applied to the representative cell colors, the text color, and the background color. The image similarity is obtained by normalizing the color difference and the distribution of feature points. After the similarity between the input image and each image in the database is computed, the database images are shown in descending order of similarity, which reduces the effort users spend searching the results again for similar wine labels.
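
The color-difference step in this abstract can be illustrated with a minimal sketch. It assumes 8-bit sRGB input, a D65 white point, and the plain Euclidean (CIE76) distance in Lab space; the abstract specifies none of these, and the function names `srgb_to_lab` and `delta_e` are hypothetical.

```python
import numpy as np

# Assumed: sRGB primaries and D65 reference white (not stated in the abstract).
_RGB_TO_XYZ = np.array([
    [0.4124564, 0.3575761, 0.1804375],
    [0.2126729, 0.7151522, 0.0721750],
    [0.0193339, 0.1191920, 0.9503041],
])
_WHITE_D65 = np.array([0.95047, 1.0, 1.08883])


def srgb_to_lab(rgb):
    """Convert one 8-bit sRGB color (r, g, b) to CIE-Lab."""
    c = np.asarray(rgb, dtype=float) / 255.0
    # Undo the sRGB gamma curve to get linear light.
    linear = np.where(c <= 0.04045, c / 12.92, ((c + 0.055) / 1.055) ** 2.4)
    t = (_RGB_TO_XYZ @ linear) / _WHITE_D65
    delta = 6.0 / 29.0
    f = np.where(t > delta ** 3, np.cbrt(t), t / (3 * delta ** 2) + 4.0 / 29.0)
    lightness = 116.0 * f[1] - 16.0
    a = 500.0 * (f[0] - f[1])
    b = 200.0 * (f[1] - f[2])
    return np.array([lightness, a, b])


def delta_e(rgb1, rgb2):
    """Euclidean (CIE76) distance between two sRGB colors in Lab space."""
    return float(np.linalg.norm(srgb_to_lab(rgb1) - srgb_to_lab(rgb2)))
```

A call such as `delta_e(cell_color, query_color)` would give one per-cell color difference; how the paper weights and normalizes these differences is not described in enough detail here to reproduce.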

A study on the lip shape recognition algorithm using 3-D Model (3차원 모델을 이용한 입모양 인식 알고리즘에 관한 연구)

  • 남기환;배철수
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.6 no.5
    • /
    • pp.783-788
    • /
    • 2002
  • Recently, research on communication systems has moved toward using voice data together with the speaker's face image to achieve a higher recognition rate than voice data alone. We therefore present a lipreading method for speech image sequences that uses a 3-D facial shape model. The method uses feature information from the face image, such as the degree of lip opening, the movement of the jaw, and the protrusion height of the lips. First, we fit the 3-D face model to the image sequence of the speaking face. Then, to obtain feature information, we compute the amount of variation of the fitted 3-D shape model over the image sequence and use it as the recognition parameters. We use the intensity gradient values obtained from the variation of the 3-D feature points to separate recognition units in the image sequence. We then apply a discrete HMM algorithm for recognition, based on multiple observation sequences that fully account for the variation of the 3-D feature points. In recognition experiments with 8 Korean vowels and 2 Korean consonants, we obtained a recognition rate of about 80% for the plosives and vowels.

Dual Attention Based Image Pyramid Network for Object Detection

  • Dong, Xiang;Li, Feng;Bai, Huihui;Zhao, Yao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.12
    • /
    • pp.4439-4455
    • /
    • 2021
  • Compared with two-stage object detection algorithms, one-stage algorithms provide a better trade-off between real-time performance and accuracy. However, these methods treat the intermediate features equally and so lack the flexibility to emphasize the information that matters for classification and localization. They also ignore the interaction of contextual information across scales, which is important for detecting medium and small objects. To tackle these problems, we propose an image pyramid network based on a dual attention mechanism (DAIPNet), which builds an image pyramid to enrich the spatial information while emphasizing multi-scale informative features through dual attention mechanisms for one-stage object detection. Our framework uses a pre-trained backbone as the standard detection network, and the designed image pyramid network (IPN) serves as an auxiliary network that provides complementary information. The dual attention mechanism is composed of the adaptive feature fusion module (AFFM) and the progressive attention fusion module (PAFM). AFFM is designed to automatically attend to feature maps of differing importance from the backbone and the auxiliary network, while PAFM adaptively learns channel-attentive information in the context transfer process. Furthermore, in the IPN, we build an image pyramid to extract scale-wise features from downsampled images of different scales, where the features are further fused at different states to enrich scale-wise information and learn more comprehensive feature representations. Experimental results are reported on the MS COCO dataset. Our proposed detector with a 300 × 300 input achieves 32.6% mAP on MS COCO test-dev, a superior result compared with state-of-the-art methods.
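
As a rough illustration of the image pyramid idea only (not of DAIPNet or its attention modules), the sketch below builds downsampled copies of an image by 2 × 2 average pooling. The pooling choice, the factor of 2 per level, and the name `build_pyramid` are assumptions; the abstract does not say how the images are downsampled.

```python
import numpy as np


def build_pyramid(image, num_levels=3):
    """Return [full-size, 1/2-size, 1/4-size, ...] copies of `image`.

    Each level halves height and width by averaging 2x2 pixel blocks
    (an assumed downsampling scheme). Works for HxW and HxWxC arrays.
    """
    levels = [np.asarray(image, dtype=float)]
    for _ in range(num_levels - 1):
        prev = levels[-1]
        # Crop to even dimensions so the image splits cleanly into 2x2 blocks.
        h, w = prev.shape[0] // 2 * 2, prev.shape[1] // 2 * 2
        blocks = prev[:h, :w].reshape(h // 2, 2, w // 2, 2, *prev.shape[2:])
        levels.append(blocks.mean(axis=(1, 3)))
    return levels
```

In a detector of the kind described, each level would be passed through a feature extractor and the resulting maps fused with the backbone features; that part depends on details the abstract does not give.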

Emotion Recognition by CCD Color Image (CCD 컬러영상에 의한 감성인식)

  • Lee, Sang-Yoon;Joo, Young-Hoon;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.12 no.2
    • /
    • pp.97-102
    • /
    • 2002
  • In this paper, we propose a technique for recognizing human emotion from a CCD color image. We first extract the face image from the original color image acquired by the CCD camera using skin color. We then propose a method for finding the facial feature points (eyebrows, eyes, nose, mouth) in the face image and a geometrical method for recognizing human emotion (surprise, anger, happiness, sadness) from the structural correlation of those feature points. The proposed method recognizes human emotion by training a neural network. Finally, we demonstrate the effectiveness of the proposed method through experiments.

Region of Interest Detection Based on Visual Attention and Threshold Segmentation in High Spatial Resolution Remote Sensing Images

  • Zhang, Libao;Li, Hao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.7 no.8
    • /
    • pp.1843-1859
    • /
    • 2013
  • The continuous increase in the spatial resolution of remote sensing images poses a great challenge to image analysis and processing. Traditional prior knowledge-based region detection and target recognition algorithms for high resolution remote sensing images generally employ a global search, which results in prohibitive computational complexity. In this paper, a more efficient region of interest (ROI) detection algorithm based on visual attention and threshold segmentation (VA-TS) is proposed, in which a visual attention mechanism is used to avoid applying image segmentation and feature detection to the entire image. The input image is subsampled to decrease the amount of data, and the discrete moment transform (DMT) feature is extracted to provide a finer description of the edges. The feature maps are combined with weights according to the number of "strong points" and "salient points". A threshold segmentation strategy is employed to obtain more accurate ROI shape information at very low computational complexity. Experimental statistics show that the proposed algorithm is computationally efficient and provides more visually accurate detection results. The calculation time is only about 0.7% of that of the traditional Itti model.

Image Retrieval Using Color feature and GLCM and Direction in Wavelet Transform Domain (Wavelet 변환 영역에서 칼라 정보와 GLCM 및 방향성을 이용한 영상 검색)

  • 이정봉
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2002.05a
    • /
    • pp.585-589
    • /
    • 2002
  • In this paper, a hierarchical retrieval system based on efficient feature extraction is proposed, with the aim of retrieving images robustly under geometric transformations such as translation, scaling, and rotation. After performing a 2-level wavelet transform on the image, we extract moments from the low-level subband, which is subdivided into subimages, together with a texture feature, the contrast of the GLCM (Gray Level Co-occurrence Matrix). We first retrieve candidate images from the database using these features. To perform more accurate retrieval, the edge information in the high-level subband is subdivided into horizontal, vertical, and diagonal directions. The edge energy rate in each direction is then determined and compared between images for higher accuracy.
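
The GLCM contrast feature mentioned in this abstract can be sketched as follows. This is a generic illustration, not the paper's implementation: it assumes the image has already been quantized to integer gray levels in `[0, levels)`, uses a single non-negative pixel offset, and the name `glcm_contrast` is hypothetical.

```python
import numpy as np


def glcm_contrast(img, levels=8, dx=1, dy=0):
    """Contrast of the gray-level co-occurrence matrix of `img`.

    `img` holds integer gray levels in [0, levels). Pixel pairs are
    (y, x) and (y + dy, x + dx), with dx, dy >= 0.
    """
    img = np.asarray(img, dtype=int)
    h, w = img.shape
    first = img[:h - dy, :w - dx].ravel()
    second = img[dy:, dx:].ravel()
    # Count how often each (level_i, level_j) pair occurs.
    glcm = np.zeros((levels, levels), dtype=float)
    np.add.at(glcm, (first, second), 1.0)
    glcm /= glcm.sum()
    # Contrast weights each pair by the squared gray-level difference.
    i, j = np.indices((levels, levels))
    return float(np.sum(glcm * (i - j) ** 2))
```

A flat region gives a contrast of 0, while strongly alternating textures give large values, which is why the measure is useful as a texture descriptor for a subband.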

Panorama Image Construction Method By Automatic Shot (자동 촬영에 의한 파노라마 영상 생성 방법)

  • Kim, Tae-Woo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.8 no.6
    • /
    • pp.1524-1529
    • /
    • 2007
  • In this paper, an automatic-shot panorama construction method is presented. To construct a panorama image, conventional panoramic techniques take the two panorama members manually, whereas the proposed method takes the panorama members automatically as the camera moves and then constructs the panorama image. The panorama members are automatically selected and captured by tracking a region across the image stream from the camera. A matching region for the panorama, which includes the tracking region in the members, is selected and used by an invariant-feature panoramic method. Our method can shoot panorama members automatically and has the merit of high processing speed. Experiments showed that the algorithm required about 0.89 seconds of processing time for color images of 320 × 240 size, about two times shorter than the existing invariant-feature-based method (6).

A Comparison Study on Back-Propagation Neural Network and Support Vector Machines for the Image Classification Problems (영상분류문제를 위한 역전파 신경망과 Support Vector Machines의 비교 연구)

  • Seo, Kwang-Kyu
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.9 no.6
    • /
    • pp.1889-1893
    • /
    • 2008
  • This paper explores the classification performance of support vector machines (SVMs) on image classification problems. We extract color, texture, and shape features from natural images and compare classification performance using each individual feature and the integrated features. The experimental results show that classification accuracy based on the color feature is better than that based on the texture and shape features, and that integrating the features provides better and more robust performance than any individual feature. In addition, we show that the proposed SVM-based classifier outperforms the BPNN on the image classification problems.

Fast Image Retrieval Based on Object Regions Using Bidirectional Round Filter (양방향 반올림 필터를 이용한 객체 영역 기반 고속 영상 검색)

  • 류권열;강경원
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.2
    • /
    • pp.240-246
    • /
    • 2003
  • In this paper, we propose a fast image retrieval method based on object regions, using a bidirectional round filter in the wavelet transform domain. A conventional method that extracts feature vectors from the whole subband has reduced retrieval efficiency because of unnecessary background information. The proposed method extracts feature vectors only from the object region of the subband by using the bidirectional round filter, and improves retrieval efficiency by removing background information. It also maintains retrieval efficiency when the number of feature vectors is reduced according to color information. Consequently, retrieval efficiency improves by 2.5% to 5.3%, varying slightly with the characteristics of the image.

A Study on Method of Automatic Geospatial Feature Extraction through Relative Radiometric Normalization of High-resolution Satellite Images (고해상도 위성영상의 상대방사보정을 통한 자동화 지향 공간객체추출 방안 연구)

  • Lee, Dong-Gook;Lee, Hyun-Jik
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.5_2
    • /
    • pp.917-927
    • /
    • 2020
  • The Ministry of Land, Infrastructure and Transport of Korea is developing the CAS 500-1/2 satellites, capable of capturing images at a GSD of about 0.5 m, along with technology to make use of them. This study therefore attempted to develop an automation-oriented geospatial feature extraction technique for CAS 500-1/2 satellite images. KOMPSAT-3A satellite images, which are expected to be the most similar to those of CAS 500-1/2, were used for the research, and the possibility of automating geospatial feature extraction was analyzed through relative radiometric normalization. For this purpose, the same parameters and thresholds were applied to the reference images and the relative radiometric normalized images, and the geospatial features were extracted. A qualitative analysis examined whether the geospatial features extracted from the reference image and from the relative radiometric normalized image have similar forms. A quantitative analysis examined whether the classification accuracy meets the target accuracy of 90% or more set in this study. As a result, the shapes of the geospatial features extracted from the reference image and the relative radiometric normalized image were confirmed to be similar, and the classification accuracy analysis showed that both satisfy the target accuracy of 90% or more. It is therefore believed that automation will be possible when extracting spatial objects through relative radiometric normalization.
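
Relative radiometric normalization can take several forms, and the abstract does not say which one the study used. The sketch below shows one simple variant, mean and standard deviation matching of a single band, purely as an illustration; the function name `mean_std_normalize` is hypothetical.

```python
import numpy as np


def mean_std_normalize(subject, reference):
    """Linearly rescale `subject` so its mean and standard deviation
    match those of `reference` (one band at a time).

    Assumes `subject` is not constant; a zero standard deviation would
    make the gain undefined.
    """
    subject = np.asarray(subject, dtype=float)
    reference = np.asarray(reference, dtype=float)
    gain = reference.std() / subject.std()
    offset = reference.mean() - gain * subject.mean()
    return gain * subject + offset
```

After such a step, thresholds tuned on the reference image can be applied unchanged to the normalized image, which is the property the study relies on when it applies identical parameters to both.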