• Title/Summary/Keyword: 불변 인식

Search Result 162, Processing Time 0.022 seconds

Fundamental Frequency Estimation of Voiced Speech Signals Based on the Inflection Point Detection (변곡점 검출에 기반한 음성의 기본 주파수 추정)

  • Byeonggwan Iem
    • Journal of IKEEE
    • /
    • v.27 no.4
    • /
    • pp.472-476
    • /
    • 2023
  • Fundamental frequency/pitch period are major characteristics of speech signals. They are used in many speech applications like speech coding, speech recognition, speaker identification, and so on. In this paper, some of inflection points are used to estimate the pitch which is the inverse of the fundamental frequency. The inflection points are defined as points where local maxima, local minima or the slope changes occur. The speech signal is preprocessed to remove unnecessary inflection points due to the high frequency components using a low pass filter. Only the inflection points from local maxima are used to get the pitch period. While the existing pitch estimation methods process speech signals in blockwise, the proposed method detects the inflection points in sample and produces the pitch period/fundamental frequency estimates along the time. Computer simulation shows the usefulness of the proposed method as a fundamental frequency estimator.

Automatic Face and Eyes Detection: A Scale and Rotation Invariant Approach based on Log-Polar Mapping (Log-Polar 사상의 크기와 회전 불변 특성을 이용한 얼굴과 눈 검출)

  • Choi, Il;Chien, Sung-Il
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.36S no.8
    • /
    • pp.88-100
    • /
    • 1999
  • Detecting human face and facial landmarks automatically in an image is as essential step to a fully automatic face recognition system. In this paper, we present a new approach to detect automatically face and its eyes of input image with scale and rotation variations of faces by using an intensity based template matching with a single log-polar face template. In a template-based matching it is necessary to normalize the scale changes and rotations of an input image to a template ones. The log-polar mapping which simulates space-variant human visual system converts scale changes and rotations of input image into constant horizontal and cyclic vertical shifts in the output plane. Intelligent use of this property allows us to shift of the candidate log-polar faces mapped at various fixation points of an input image to be matched to a template over the log-polar plane. Thus, the proposed method eliminates the need of adapting multitemplate and multiresolution schemes, which inevitably give rise to intensive computation involved to cope with scale and rotation variations of faces. Through this scale and rotation involved to cope with scale and method can lead to detecting face and its eyes simultaneously. Experimental results on a database of 795 images show over 98% detection rate.

  • PDF

Invariant Classification and Detection for Cloth Searching (의류 검색용 회전 및 스케일 불변 이미지 분류 및 검색 기술)

  • Hwang, Inseong;Cho, Beobkeun;Jeon, Seungwoo;Choe, Yunsik
    • Journal of Broadcast Engineering
    • /
    • v.19 no.3
    • /
    • pp.396-404
    • /
    • 2014
  • The field of searching clothing, which is very difficult due to the nature of the informal sector, has been in an effort to reduce the recognition error and computational complexity. However, there is no concrete examples of the whole progress of learning and recognizing for cloth, and the related technologies are still showing many limitations. In this paper, the whole process including identifying both the person and cloth in an image and analyzing both its color and texture pattern is specifically shown for classification. Especially, deformable search descriptor, LBPROT_35 is proposed for identifying the pattern of clothing. The proposed method is scale and rotation invariant, so we can obtain even higher detection rate even though the scale and angle of the image changes. In addition, the color classifier with the color space quantization is proposed not to loose color similarity. In simulation, we build database by training a total of 810 images from the clothing images on the internet, and test some of them. As a result, the proposed method shows a good performance as it has 94.4% matching rate while the former Dense-SIFT method has 63.9%.

Light-Ontology Classification for Efficient Object Detection using a Hierarchical Tree Structure (효과적인 객체 검출을 위한 계층적 트리 구조를 이용한 조명 온톨로지 분류)

  • Kang, Sung-Kwan;Lee, Jung-Hyun
    • Journal of Digital Convergence
    • /
    • v.10 no.10
    • /
    • pp.215-220
    • /
    • 2012
  • This paper proposes a ontology of tree structure approach for adaptive object recognition in a situation-variant environment. In this paper, we introduce a new concept, ontology of tree structure ontology, for context sensitivity, as we found that many developed systems work in a context-invariant environment. Due to the effects of illumination on a supreme obstinate designing context-sensitive recognition system, we have focused on designing such a context-variant system using ontology of tree structure. Ontology can be defined as an explicit specification of conceptualization of a domain typically captured in an abstract model of how people think about things in the domain. People produce ontologies to understand and explain underlying principles and environmental factors. In this research, we have proposed context ontology, context modeling, context adaptation, and context categorization to design ontology of tree structure based on illumination criteria. After selecting the proper light-ontology domain, we benefit from selecting a set of actions that produces better performance on that domain. We have carried out extensive experiments on these concepts in the area of object recognition in a dynamic changing environment, and we have achieved enormous success, which will enable us to proceed on our basic concepts.

The Image of Bamboo from Yeoheon Chang Hyun-kwang's Poetry - Focusing on his view of world and attitude to life (여헌(旅軒) 한시(漢詩)에 나타난 대나무(竹)의 이미지 -현실인식(現實認識)과 삶의 지향(志向)을 중심으로-)

  • Kwon, hyok myong
    • (The)Study of the Eastern Classic
    • /
    • no.41
    • /
    • pp.91-120
    • /
    • 2010
  • In this paper, I tried to analyze the image of bamboo from Chang Hyun-kwang's poetry. Especially I focused on defining the symbolic meaning of bamboo as a poetic material. To achieve this, I inspected his view of world and politic life. He considered his contemporary period as 'ethically corrupted' world. During his life time, he served for three kings and also suffered from the fires of war four times. That means, he had to undergo the political unrest and national crisis during his life time. Due to the chaotic state of affairs, all the people were busy pursuing their own interests. It made Chang had no choice but retiring from active life. While Chang revealed his attitude to life through writing poetry, he used the image of bamboo as a completion of one's mature character. And the image of bamboo what he used was somewhat different from general one. Chang used the image of bamboo in two ways. The one is seeking live in retirement, and the other is chasing completion of his own mature character in academic way.

Visual Representations for Improving Proportional Reasoning in Solving Word Problems (비례 추론을 돕는 시각적 모델에 대하여: 초등 수학 교과서의 비례식과 비례배분 실생활 문제를 대상으로)

  • Yim, Jae Hoon;Lee, Hyung Sook
    • Journal of Educational Research in Mathematics
    • /
    • v.25 no.2
    • /
    • pp.189-206
    • /
    • 2015
  • There has been a recurring call for using visual representations in textbooks to improve the teaching and learning of proportional reasoning. However, the quantity as well as quality of visual representations used in textbooks is still very limited. In this article, we analyzed visual representations presented in a Grade 6 textbook from two perspectives of proportional reasoning, multiple-batches perspective and variable-parts perspective, and discussed the potential of the double number line and the double tape diagram to help develop the idea 'things covary while something stays the same', which is critical to reason proportionally. We also classified situations that require proportional reasoning into five categories and provided ways of using the double number line and the double tape diagram for each category.

Review of the Role of Dragging in Dynamic Geometry Environments (역동기하 환경에서 "끌기(dragging)"의 역할에 대한 고찰)

  • Cho, Cheong Soo;Lee, Eun Suk
    • School Mathematics
    • /
    • v.15 no.2
    • /
    • pp.481-501
    • /
    • 2013
  • The purpose of this study is to review the role of dragging in dynamic geometry environments. Dragging is a kind of dynamic representations that dynamically change geometric figures and enable to search invariances of figures and relationships among them. In this study dragging in dynamic geometry environments is divided by three perspectives: dynamic representations, instrumented actions, and affordance. Following this review, six conclusions are suggested for future research and for teaching and learning geometry in school geometry as well: students' epistemological change of basic geometry concepts by dragging, the possibilities to converting paper-and-pencil geometry into experimental mathematics, the role of dragging between conjecturing and proving, geometry learning process according to the instrumental genesis perspective, patterns of communication or discourse generated by dragging, and the role of measuring function as an affordance of DGS.

  • PDF

Texture Classification Using Rotation Invariant Local Directional Pattern (Rotation Invariant Local Directional Pattern을 이용한 텍스처 분류 방법)

  • Lee, Tae Hwan;Chae, Ok Sam
    • Convergence Security Journal
    • /
    • v.17 no.3
    • /
    • pp.21-29
    • /
    • 2017
  • Accurate encoding of local patterns is a very important factor in texture classification. However, LBP based methods w idely studied have fundamental problems that are vulnerable to noise. Recently, LDP method using edge response and dire ction information was proposed in facial expression recognition. LDP is more robust to noise than LBP and can accommod ate more information in it's pattern code, but it has drawbacks that it is sensitive to rotation transforms that are critical to texture classification. In this paper, we propose a new local pattern coding method called Rotation Invariant Local Direc tional Pattern, which combines rotation-invariant transform to LDP. To prove the texture classification performance of the proposed method in this paper, texture classification was performed on the widely used UIUC and CUReT datasets. As a result, the proposed RILDP method showed better performance than the existing methods.

Improving Matching Performance of SURF Using Color and Relative Position (위치와 색상 정보를 사용한 SURF 정합 성능 향상 기법)

  • Lee, KyungSeung;Kim, Daehoon;Rho, Seungmin;Hwang, Eenjun
    • Journal of Advanced Navigation Technology
    • /
    • v.16 no.2
    • /
    • pp.394-400
    • /
    • 2012
  • SURF is a robust local invariant feature descriptor and has been used in many applications such as object recognition. Even though this algorithm has similar matching accuracy compared to the SIFT, which is another popular feature extraction algorithm, it has advantage in matching time. However, these descriptors do not consider relative location information of extracted interesting points to guarantee rotation invariance. Also, since they use gray image of original color image, they do not use the color information of images, either. In this paper, we propose a method for improving matching performance of SURF descriptor using the color and relative location information of interest points. The location information is built from the angles between the line connecting the centers of interest points and the orientation line constructed for the center of each interest points. For the color information, color histogram is constructed for the region of each interest point. We show the performance of our scheme through experiments.

SIFT based Image Similarity Search using an Edge Image Pyramid and an Interesting Region Detection (윤곽선 이미지 피라미드와 관심영역 검출을 이용한 SIFT 기반 이미지 유사성 검색)

  • Yu, Seung-Hoon;Kim, Deok-Hwan;Lee, Seok-Lyong;Chung, Chin-Wan;Kim, Sang-Hee
    • Journal of KIISE:Databases
    • /
    • v.35 no.4
    • /
    • pp.345-355
    • /
    • 2008
  • SIFT is popularly used in computer vision application such as object recognition, motion tracking, and 3D reconstruction among various shape descriptors. However, it is not easy to apply SIFT into the image similarity search as it is since it uses many high dimensional keypoint vectors. In this paper, we present a SIFT based image similarity search method using an edge image pyramid and an interesting region detection. The proposed method extracts keypoints, which is invariant to contrast, scale, and rotation of image, by using the edge image pyramid and removes many unnecessary keypoints from the image by using the hough transform. The proposed hough transform can detect objects of ellipse type so that it can be used to find interesting regions. Experimental results demonstrate that the retrieval performance of the proposed method is about 20% better than that of traditional SIFT in average recall.