• Title/Summary/Keyword: Visual Classification


IMAGE CLASSIFICATION OF HIGH RESOLUTION MULTISPECTRAL IMAGERY VIA PANSHARPENING

  • Lee, Sang-Hoon
    • Proceedings of the KSRS Conference / 2008.10a / pp.18-21 / 2008
  • Lee (2008) proposed a pansharpening method that reconstructs multispectral images at the higher resolution so that they agree with the spectral values observed by the lower-resolution sensor. It outperformed several current techniques in a statistical analysis with quantitative measures and generated imagery of good quality for visual interpretation. However, if a small object stretches over two adjacent pixels with different spectral characteristics at the lower resolution, the pixels of the object at the higher resolution may have different multispectral values depending on their location, even though they have the same intensity in the higher-resolution panchromatic image. To correct this problem, this study employed an iterative technique similar to the image restoration scheme of Point-Jacobian iterative MAP estimation. The effect of pansharpening on image segmentation/classification was assessed for various techniques. The method was applied to an IKONOS image acquired over the area around Anyang City, Korea.


Analysis of Montage Pattern of e-book as a Film Language (영상 언어로써 이북(e-book)의 몽타주 패턴)

  • Shin, Seungyun
    • Journal of Korea Multimedia Society / v.19 no.7 / pp.1216-1224 / 2016
  • This study analyzed the montage patterns of e-books in a bid to include the e-book in the film language system. To this end, it examined three e-books that reproduce theatrical animations by the Disney Company. The study investigated the characteristics of montage in films and the kinds of film vectors, and defined the montage of e-books as 'Montage of Experience' according to the media characteristics of the e-book. By examining the analysis objects, it derived a montage pattern with 3 classes and 10 detailed subclasses. The study is significant in that it pioneers a new perspective in e-book research, which has been biased towards a functional perspective. Analyzing the point of contact where existing visual media and new media meet could be a driving force for the growth of the relevant industry, and such analysis can help improve the quality of e-book content.

Human Hand Detection Using Color Vision (컬러 시각을 이용한 사람 손의 검출)

  • Kim, Jun-Yup;Do, Yong-Tae
    • Journal of Sensor Science and Technology / v.21 no.1 / pp.28-33 / 2012
  • The visual sensing of human hands plays an important part in many man-machine interaction/interface systems. Most existing vision-based hand detection techniques depend on the color cues of human skin. The RGB color image from a vision sensor is often transformed to another color space as a preprocessing step for hand detection, because the color space transformation is assumed to increase detection accuracy. However, the actual effect of color space transformation has not been well investigated in the literature. This paper presents a comparative evaluation of the pixel classification performance of hand skin detection in four widely used color spaces: RGB, YIQ, HSV, and normalized rgb. The experimental results indicate that using the normalized red-green color values is the most reliable approach under different backgrounds, lighting conditions, individuals, and hand postures. Nonlinear classification of pixel colors with a multilayer neural network is also proposed to improve detection accuracy.
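
To make the normalized rgb idea concrete, the following is a minimal sketch in Python with NumPy. It converts RGB pixels to normalized red-green chromaticity, r = R/(R+G+B) and g = G/(R+G+B), and applies a simple box threshold. The function names and the threshold ranges are hypothetical placeholders chosen for illustration; they are not values from the paper, which evaluates trained pixel classifiers rather than fixed thresholds.

```python
import numpy as np

def normalized_rg(image):
    """Convert an H x W x 3 RGB array to normalized (r, g) chromaticity."""
    rgb = image.astype(np.float64)
    total = rgb.sum(axis=-1, keepdims=True)
    total[total == 0] = 1.0  # avoid division by zero on pure black pixels
    chroma = rgb / total
    return chroma[..., 0], chroma[..., 1]

def skin_mask(image, r_range=(0.36, 0.55), g_range=(0.26, 0.36)):
    """Mark pixels whose (r, g) fall inside a box.

    The default ranges are illustrative assumptions, not published values.
    """
    r, g = normalized_rg(image)
    in_r = (r >= r_range[0]) & (r <= r_range[1])
    in_g = (g >= g_range[0]) & (g <= g_range[1])
    return in_r & in_g
```

Because r and g are ratios, scaling a pixel's brightness (for example, from (200, 120, 100) to (100, 60, 50)) leaves them unchanged, which is one plausible reason this color space is robust to lighting changes.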

Crop Field Extraction Method using NDVI and Texture from Landsat TM Images

  • Shibasaki, Ryosuke;Suzaki, Junichi
    • Proceedings of the KSRS Conference / 1998.09a / pp.159-162 / 1998
  • Land cover and land use classification on a huge scale, e.g. national or continental scale, has become more and more important because environmental research needs land cover and land use data on such scales. We developed a crop field extraction method, which is one of the steps in our land cover classification system for a huge area. First, a crop field model is defined to characterize a "crop field" in terms of NDVI value and textural information. Textural information is represented by the density of straight lines, which are extracted by wavelet transform. Second, candidate NDVI threshold values are determined by the "scale-space filtering" method. The most appropriate threshold among the candidates is determined by evaluating the line density of the area extracted with each threshold. Finally, the crop field is extracted by applying level slicing to the Landsat TM image with the threshold determined above. The experiment demonstrates that the area extracted by this method coincides very well with the one extracted by visual interpretation.

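The final level-slicing step of the crop field abstract above can be sketched in Python with NumPy. The sketch computes NDVI = (NIR - Red) / (NIR + Red) per pixel (for Landsat TM, band 4 is near-infrared and band 3 is red) and keeps pixels at or above a threshold. The function names and the threshold of 0.3 are hypothetical; the paper selects its threshold through scale-space filtering and line-density evaluation, which this sketch does not attempt.

```python
import numpy as np

def ndvi(nir, red):
    """NDVI = (NIR - Red) / (NIR + Red), computed per pixel."""
    nir = np.asarray(nir, dtype=np.float64)
    red = np.asarray(red, dtype=np.float64)
    denom = nir + red
    out = np.zeros_like(denom)
    # Where both bands are 0 the ratio is undefined; NDVI is left at 0 there.
    np.divide(nir - red, denom, out=out, where=denom != 0)
    return out

def level_slice(ndvi_map, threshold):
    """Boolean mask of pixels whose NDVI is at or above the threshold."""
    return ndvi_map >= threshold

# Toy 2 x 2 arrays standing in for TM band 4 (NIR) and band 3 (red).
nir_band = np.array([[80, 20], [0, 60]])
red_band = np.array([[20, 20], [0, 40]])
# 0.3 is an arbitrary illustrative threshold, not a value from the paper.
crop_mask = level_slice(ndvi(nir_band, red_band), threshold=0.3)
```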

Hand Shape Classification using Contour Distribution (윤곽 분포를 이용한 이미지 기반의 손모양 인식 기술)

  • Lee, Changmin;Kim, DaeEun
    • Journal of Institute of Control, Robotics and Systems / v.20 no.6 / pp.593-598 / 2014
  • Vision-based hand gesture recognition is a challenging task in human-robot interaction. The finger spelling alphabet of sign language has been tested as a kind of hand gesture. In this paper, we test hand gesture recognition by detecting the contour shape and orientation of the hand in a visual image. The method has three stages: finding the hand component separated from the background image, extracting the contour feature over the hand component, and comparing the feature with the reference features in the database. Here, finger spelling alphabets are used to verify the performance of our system, and our method performs well in discriminating finger alphabets.

Improvement of Photogrammetry Image Merging in Satellite Image Processing (인공위성 영상처리를 위한 사진접합정확도 향상기법)

  • Kang, In-Joon;Choi, Chul-Ung
    • Journal of Korean Society for Geospatial Information Science / v.2 no.1 s.3 / pp.93-98 / 1994
  • This image of Kangseogu in Pusan is a digital merge of aerial photos at the scale of a 1/1,200 map. The merge was carried out using a 2nd affine transformation and bilinear interpolation. It can improve digital classification by helping to choose training sites and interpret classification results, and it can improve visual interpretation, as in this case, by adding detailed information to the multispectral TM data.


Visual inspection algorithm of cold rolled strips by wavelet frame transform (Wavelet frame 변환을 이용한 냉연 시각검사 알고리듬)

  • Lee, Chang-Su;Choi, Jong-Ho
    • Journal of Institute of Control, Robotics and Systems / v.4 no.3 / pp.372-377 / 1998
  • This paper deals with the detection, feature extraction, and classification of surface defects in cold rolled strips. Inspection systems are one of the most important fields in factory automation. Defects such as slipmark and dullmark can be detected effectively with a Gaussian matched filter because their shapes are similar to a Gaussian. We show that the proposed WF (Wavelet Frame) method can be regarded as a multiscale Gaussian matched filter applicable to the inspection of cold rolled strips. After a wavelet frame transform, entropies and moments are computed for each subband, which passes through both a local low-pass filter and a nonlinear operator. With these features as input, an MLP (Multi Layer Perceptron) is used as a classifier. The proposed inspection method was applied to real images with defects and showed good performance. The role of each extracted feature is analyzed by the KLT (Karhunen-Loeve Transform).


Recognition of Korean Vowels using Bayesian Classification with Mouth Shape (베이지안 분류 기반의 입 모양을 이용한 한글 모음 인식 시스템)

  • Kim, Seong-Woo;Cha, Kyung-Ae;Park, Se-Hyun
    • Journal of Korea Multimedia Society / v.22 no.8 / pp.852-859 / 2019
  • With the development of IT and smart devices, various applications utilizing image information are being developed. To provide an intuitive interface for pronunciation recognition, there is a growing need for research on pronunciation recognition using mouth feature values. In this paper, we propose a system that distinguishes Korean vowel pronunciations by detecting feature points of the lip region in images and applying a Bayesian learning model. Because the recognition system is based on Bayes' theorem, the accuracy of speech recognition can be improved by accumulating input data, regardless of whether the system is speaker-independent or speaker-dependent with a small amount of learning data. Experimental results show that Korean vowels can be distinguished effectively by applying probability-based Bayesian classification using only visual information such as mouth shape features.
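
As a rough illustration of Bayesian classification on mouth shape features, the following Python sketch implements Gaussian naive Bayes with NumPy. This specific model is an assumption: the abstract says only that the system is based on Bayes' theorem. The two features (a normalized mouth width and height), the vowel labels, and all numbers are made-up toy values, not data from the paper.

```python
import numpy as np

class GaussianNaiveBayes:
    """Gaussian naive Bayes; an assumed stand-in for the paper's Bayesian model."""

    def fit(self, X, y):
        X = np.asarray(X, dtype=np.float64)
        y = np.asarray(y)
        self.classes_ = np.unique(y)
        self.means_ = np.array([X[y == c].mean(axis=0) for c in self.classes_])
        # A small constant keeps every variance strictly positive.
        self.vars_ = np.array([X[y == c].var(axis=0) for c in self.classes_]) + 1e-9
        self.log_priors_ = np.log(np.array([(y == c).mean() for c in self.classes_]))
        return self

    def predict(self, X):
        X = np.asarray(X, dtype=np.float64)
        # Per-class Gaussian log-likelihood, summed over independent features.
        diff = X[:, None, :] - self.means_[None, :, :]
        log_lik = -0.5 * (
            np.log(2 * np.pi * self.vars_)[None] + diff**2 / self.vars_[None]
        ).sum(axis=-1)
        # Bayes' theorem in log form: posterior is proportional to likelihood x prior.
        return self.classes_[np.argmax(log_lik + self.log_priors_, axis=1)]

# Toy (width, height) features; every value here is invented for illustration.
X_train = [[0.90, 0.20], [0.85, 0.25], [0.95, 0.15],
           [0.40, 0.80], [0.45, 0.75], [0.35, 0.85]]
y_train = ["i", "i", "i", "o", "o", "o"]
model = GaussianNaiveBayes().fit(X_train, y_train)
predicted = model.predict([[0.88, 0.22], [0.42, 0.78]])
```

Refitting the model as new labeled samples accumulate updates the class means, variances, and priors, which loosely mirrors the abstract's point about improving accuracy by accumulating input data.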

3D Res-Inception Network Transfer Learning for Multiple Label Crowd Behavior Recognition

  • Nan, Hao;Li, Min;Fan, Lvyuan;Tong, Minglei
    • KSII Transactions on Internet and Information Systems (TIIS) / v.13 no.3 / pp.1450-1463 / 2019
  • Crowd behavior recognition in seriously clustered scenes is extremely challenging because of variable scales and non-uniformity. This paper proposes a crowd behavior classification framework based on a transferred hybrid network that blends 3D res-net with inception-v3. First, the 3D res-inception network is presented to learn augmented visual features from UCF 101. Then the target dataset is used to fine-tune the network parameters in order to classify behavior in densely crowded scenes. Finally, a transferred entropy function is used to calculate the probability of multiple labels from these features. Experimental results show that the proposed method could greatly improve the accuracy of crowd behavior recognition and enhance the accuracy of multiple label classification.

Research on Design of Mixed Reality Interface Based on Spatial Perception

  • Wei, Li;Cho, Dong-Mi
    • Journal of Korea Multimedia Society / v.24 no.6 / pp.815-824 / 2021
  • Based on the theory of space perception, this paper concludes that a mixed reality application has a three-level definition of visual hierarchy, and then analyzes the component elements of interface design and the classification mode of interface windows. Next, it carries out case practice research using this theoretical definition, and finally conducts a survey and analyzes the questionnaire data, verifying that mixed reality interface design based on spatial perception theory meets the user experience elements of Usability, Availability, and Attraction. The conclusion is that the constituent elements of interface design and the window classification mode can provide specific and practical design specifications for mixed reality interface design, reduce the interaction cost of completing tasks, reduce users' cognitive load, and make it easier for users to receive interface information.