• 제목/요약/키워드: Image Signal Recognition

검색결과 185건 처리시간 0.03초

가우스 분류기를 이용한 입술영역 추출 (Lip Region Extraction by Gaussian Classifier)

  • 김정엽
    • 한국멀티미디어학회논문지
    • /
    • 제20권2호
    • /
    • pp.108-114
    • /
    • 2017
  • Lip reading is a field of image processing to assist the process of sound recognition. In some environment, the capture of sound signal usually has significant noise and therefore, the recognition rate of sound signal decreases. Lip reading can be a good feature for the increase of recognition rates. Conventional lip extraction methods have been proposed widely. Maia et. al. proposed a method by the sum of Cr and Cb. However, there are two problems as follows: the point with maximum saturation is not always regarded as lips region and the inner part of lips such as oral cavity and teeth can be classified as lips. To solve these problems, this paper proposes a method which adopts the histogram-based classifier for the extraction of lips region. The proposed method consists of two stages, learning and test. The amount of computation is minimized because this method has no color conversion. The performance of proposed method gives 66.8% of detection rate compared to 28% of conventional ones.

라돈변환을 통한 광 패턴인식에 관한 연구 (A Study on an Optical Pattern Recognition Via the Radon Transform)

  • 반재경;김남;박한규
    • 대한전자공학회논문지
    • /
    • 제24권5호
    • /
    • pp.880-886
    • /
    • 1987
  • This paper proposes a new pattern recognition system using Radon transform and analyzes the performances of the system for given input patterns. The proposed system uses many optical parts in order to utilize the high speed characteristics of light and processes a signal easily by transforming 2-D image into a 1-D signal to increase flexibility. The squared Mahalanobis distance obtained from means and standard deviations of the features for the given input patterns is used for discrimination. As a result, this system represents a better recognition rate than any other systems using the same input patterns.

  • PDF

다중 센서 융합 알고리즘을 이용한 사용자의 감정 인식 및 표현 시스템 (Emotion Recognition and Expression System of User using Multi-Modal Sensor Fusion Algorithm)

  • 염홍기;주종태;심귀보
    • 한국지능시스템학회논문지
    • /
    • 제18권1호
    • /
    • pp.20-26
    • /
    • 2008
  • 지능형 로봇이나 컴퓨터가 일상생활 속에서 차지하는 비중이 점점 높아짐에 따라 인간과의 상호교류도 점점 중요시되고 있다. 이렇게 지능형 로봇(컴퓨터) - 인간의 상호 교류하는데 있어서 감정 인식 및 표현은 필수라 할 수 있겠다. 본 논문에서는 음성 신호와 얼굴 영상에서 감정적인 특징들을 추출한 후 이것을 Bayesian Learning과 Principal Component Analysis에 적용하여 5가지 감정(평활, 기쁨, 슬픔, 화남, 놀람)으로 패턴을 분류하였다. 그리고 각각 매개체의 단점을 보완하고 인식률을 높이기 위해서 결정 융합 방법과 특징 융합 방법을 적용하여 감정 인식 실험을 하였다. 결정 융합 방법은 각각 인식 시스템을 통해 얻어진 인식 결과 값을 퍼지 소속 함수에 적용하여 감정 인식 실험을 하였으며, 특징 융합 방법은 SFS(Sequential Forward Selection) 특징 선택 방법을 통해 우수한 특징들을 선택한 후 MLP(Multi Layer Perceptron) 기반 신경망(Neural Networks)에 적용하여 감정 인식 실험을 실행하였다. 그리고 인식된 결과 값을 2D 얼굴 형태에 적용하여 감정을 표현하였다.

독립 성분 분석을 이용한 얼굴인식 (Face recognition by using independent component analysis)

  • 김종규;장주석;김영일
    • 전자공학회논문지C
    • /
    • 제35C권10호
    • /
    • pp.48-58
    • /
    • 1998
  • 신호처리 분야에서 미지의 신호원 분리에 주로 응용되는 독립 성분 분석법을 이용하여 얼굴인식을 할 수 있는 한 방식을 제안하였다. 하나의 얼굴영상 자체가 통계적으로 서로 독립인 어떤 미지의 특징영상의 합으로 표현될 수 있다고 가정하고 이 특징영상을 독립성분분석을 이용하여 구한 후, 새로운 얼굴이나 변화된 얼굴을 특징영상의 공간에 투영시켜 투영된 성분을 기준 얼굴영상과 비교하는 방법으로 인식을 수행하였다. 변화가 심한 여러 얼굴영상으로 구성된 데이터베이스(한 사람 당 10개씩의 변화된 400개의 얼굴 영상)에 대해 얼굴인식 실험을 수행하였고 또한 주성분 분석에 기초한 고유얼굴을 이용한 인식률과 비교 분석하였다. 제안된 방식은 주성분 분석법에 비해 높은 인식률을 제공하며, 특히 입력 얼굴 영상에 첨가되는 랜덤 잡음에 대단히 강한 특성을 갖는다.

  • PDF

영상처리를 위한 SIMT 기반 Image Signal Processor 구현 (Implementation of the SIMT based Image Signal Processor for the Image Processing)

  • 황윤섭;전희경;이관호;이광엽
    • 전기전자학회논문지
    • /
    • 제20권1호
    • /
    • pp.89-93
    • /
    • 2016
  • 본 논문에서는 다양한 영상 전처리 알고리즘들을 적용할 수 있고, 영상 인식과 같이 ISP 응용 프로그램을 병렬로 처리 가능한 SIMT(Single Instruction Multiple Threads) 기반 Image Signal Processor를 제안하였다. 기존의 ISP는 입력 영상의 품질 개선을 위하여 영상 개선 알고리즘이 하드웨어로 설계되어 처리 속도는 빠르지만 다양한 영상 처리 알고리즘에 따라 성능 최적화에 어려움이 있었다. 제안한 ISP는 명령어를 기반으로 한 프로세서로서 다양한 영상 처리 알고리즘을 수행하고 SIMT 구조를 적용하여 알고리즘을 병렬로 수행해 성능을 개선하였다. 제안하는 ISP를 검증하기 위해 Xilinx Virtex-7을 탑재한 VC707 Board를 사용하였으며 cell multicore processor와 비교했을 경우 수행시간이 약 71%, ARM Cortex-A9과 ARM Cortex-A15와 비교하였을 경우 각각 63%, 33% 성능을 개선하였다.

Adaptive low-resolution palmprint image recognition based on channel attention mechanism and modified deep residual network

  • Xu, Xuebin;Meng, Kan;Xing, Xiaomin;Chen, Chen
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권3호
    • /
    • pp.757-770
    • /
    • 2022
  • Palmprint recognition has drawn increasingly attentions in the past decade due to its uniqueness and reliability. Traditional palmprint recognition methods usually use high-resolution images as the identification basis so that they can achieve relatively high precision. However, high-resolution images mean more computation cost in the recognition process, which usually cannot be guaranteed in mobile computing. Therefore, this paper proposes an improved low-resolution palmprint image recognition method based on residual networks. The main contributions include: 1) We introduce a channel attention mechanism to refactor the extracted feature maps, which can pay more attention to the informative feature maps and suppress the useless ones. 2) The ResStage group structure proposed by us divides the original residual block into three stages, and we stabilize the signal characteristics before each stage by means of BN normalization operation to enhance the feature channel. Comparison experiments are conducted on a public dataset provided by the Hong Kong Polytechnic University. Experimental results show that the proposed method achieve a rank-1 accuracy of 98.17% when tested on low-resolution images with the size of 12dpi, which outperforms all the compared methods obviously.

Improvement of self-mixing semiconductor laser range finder and its application to range-image recognition of slowly moving object

  • Suzuki, Takashi;Shinohara, Shigenobu;Yoshida, Hirofumi;Ikeda, Hiroaki;Saitoh, Yasuhiro;Nishide, Ken-Ichi;Sumi, Masao
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1992년도 한국자동제어학술회의논문집(국제학술편); KOEX, Seoul; 19-21 Oct. 1992
    • /
    • pp.388-393
    • /
    • 1992
  • An infrared range finder using a self-mixing laser diode (SM-LD), which has been proposed and developed by the Authors, can measure not only a range of a moving target but its velocity simultaneously. In this paper, described is that the precise mode-hop pulse train can be obtained by employing a new signal processing circuit even when the backscattered light returning into the SM-LD is much more weaker. As a result, the distance to a tilted square sheet made from aluminium or white paper, which is placed 10 cm through 60 cm from the SM-LD, is measured with accuracy of a few percent even when the tilting angle is less than 75 degrees or 85 degrees, respectively. And in this paper, described is the range-image recognition of a plane object under the condition of standstill. The output laser beam is scanned by scanning two plane mirrors-equipped with each stepping motor. And we succeeded in the acquisition of the range-image of a plane object in a few tens of seconds. Furthermore, described is a feasibility study about the range-image recognition of a slowly moving plane object.

  • PDF

광공진 현상을 이용한 입체 영상센서 및 신호처리 기법 (Optical Resonance-based Three Dimensional Sensing Device and its Signal Processing)

  • 박용화;유장우;박창영;윤희선
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2013년도 추계학술대회 논문집
    • /
    • pp.763-764
    • /
    • 2013
  • A three-dimensional image capturing device and its signal processing algorithm and apparatus are presented. Three dimensional information is one of emerging differentiators that provides consumers with more realistic and immersive experiences in user interface, game, 3D-virtual reality, and 3D display. It has the depth information of a scene together with conventional color image so that full-information of real life that human eyes experience can be captured, recorded and reproduced. 20 Mega-Hertz-switching high speed image shutter device for 3D image capturing and its application to system prototype are presented[1,2]. For 3D image capturing, the system utilizes Time-of-Flight (TOF) principle by means of 20MHz high-speed micro-optical image modulator, so called 'optical resonator'. The high speed image modulation is obtained using the electro-optic operation of the multi-layer stacked structure having diffractive mirrors and optical resonance cavity which maximizes the magnitude of optical modulation[3,4]. The optical resonator is specially designed and fabricated realizing low resistance-capacitance cell structures having small RC-time constant. The optical shutter is positioned in front of a standard high resolution CMOS image sensor and modulates the IR image reflected from the object to capture a depth image (Figure 1). Suggested novel optical resonator enables capturing of a full HD depth image with depth accuracy of mm-scale, which is the largest depth image resolution among the-state-of-the-arts, which have been limited up to VGA. The 3D camera prototype realizes color/depth concurrent sensing optical architecture to capture 14Mp color and full HD depth images, simultaneously (Figure 2,3). The resulting high definition color/depth image and its capturing device have crucial impact on 3D business eco-system in IT industry especially as 3D image sensing means in the fields of 3D camera, gesture recognition, user interface, and 3D display. This paper presents MEMS-based optical resonator design, fabrication, 3D camera system prototype and signal processing algorithms.

  • PDF

테라헤르츠 신호를 이용한 영상의 글자 추출을 위한 화질 개선처리에 대한 연구 (A Study of Image Enhancement Processing for Letter Extraction of Image Using Terahertz Signal)

  • 김성윤;최현근;박인호;김영섭;이용환
    • 반도체디스플레이기술학회지
    • /
    • 제16권3호
    • /
    • pp.111-115
    • /
    • 2017
  • Terahertz waves are superior to conventional X-ray or Magnetic Resonance Tomography(MRI), and the amount of information that can be transmitted is as large as thousands of times that conventional X-ray or MRI. In addition, Terahertz waves have great performance in analyzing an object which have some layered structure. By using this advantage, we can extract the letters of a page by analyzing information such as absorption amount and reflection amount by irradiating a closed book with pulses of various frequencies within gap of a terahertz wave. However, in the image of each page using the Terahertz wave might be obtained various kinds of noise and the different character occlusion region. So, to extract letters from the terahertz image, we must take the noise and occlusion region away. We have been working to enhancement the image quality in various ways, and keep on studying de-noising processing for enhancement about the image quality and high resolution. Finally, we also keep on studying about OCR(Optical Character Recognition) technology, which based on pattern matching technique, to read letters.

  • PDF

Performance Comparison on Speech Codecs for Digital Watermarking Applications

  • Mamongkol, Y.;Amornraksa, T.
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 ITC-CSCC -1
    • /
    • pp.466-469
    • /
    • 2002
  • Using intelligent information contained within the speech to identify the specific hidden data in the watermarked multimedia data is considered to be an efficient method to achieve the speech digital watermarking. This paper presents the performance comparison between various types of speech codec in order to determine an appropriate one to be used in digital watermarking applications. In the experiments, the speech signal encoded by four different types of speech codec, namely CELP, GSM, SBC and G.723.1codecs is embedded into a grayscale image, and theirs performance in term of speech recognition are compared. The method for embedding the speech signal into the host data is borrowed from a watermarking method based on the zerotrees of wavelet packet coefficients. To evaluate efficiency of the speech codec used in watermarking applications, the speech signal after being extracted from the attacked watermarked image will be played back to the listeners, and then be justified whether its content is intelligible or not.

  • PDF