• Title/Summary/Keyword: Image Signal Recognition

Search Result 187, Processing Time 0.025 seconds

Vehicle License Plate Recognition System By Edge-based Segment Image Generation (에지기반 세그먼트 영상 생성에 의한 차량 번호판 인식 시스템)

  • Kim, Jin-Ho;Noh, Duck-Soo
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.3
    • /
    • pp.9-16
    • /
    • 2012
  • The research of vehicle license plate recognition has been widely studied for the smart city project. The license plate recognition can be hard due to the geometric distortion and the image quality degradation in case of capturing the driving car image at CCTV without trigger signal on the road. In this paper, the high performance vehicle license plate recognition system using edge-based segment image is introduced which is robust in the geometric distortion and the image quality degradation according to non-trigger signal. The experimental results of the proposed real time license plate recognition algorithm which is implemented at the CCTV on the road show that the plate detection rate was 97.5% and the overall character recognition rate of the detected plates was 99.3% in a day average 1,535 vehicles for a week operation.

Lipreading과 음성인식에 의한 향상된 화자 인증 시스템

  • 지승남;이종수
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2000.10a
    • /
    • pp.274-274
    • /
    • 2000
  • In the future, the convenient speech command system will become an widely-using interface in automation systems. But the previous research in speech recognition didn't give satisfactory recognition results for the practical realization in the noise environment. The purpose of this research is the development of a practical system, which reliably recognizes the speech command of the registered users, by complementing an existing research which used the image information with the speech signal. For the lip-reading feature extraction from a image, we used the DWT(Discrete Wavelet Transform), which reduces the size and gives useful characteristics of the original image. And to enhance the robustness to the environmental changes of speakers, we acquired the speech signal by stereo method. We designed an economic stand-alone system, which adopted a Bt829 and an AD1819B with a TMS320C31 DSP based add-on board.

  • PDF

A Multimodal Emotion Recognition Using the Facial Image and Speech Signal

  • Go, Hyoun-Joo;Kim, Yong-Tae;Chun, Myung-Geun
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.5 no.1
    • /
    • pp.1-6
    • /
    • 2005
  • In this paper, we propose an emotion recognition method using the facial images and speech signals. Six basic emotions including happiness, sadness, anger, surprise, fear and dislike are investigated. Facia] expression recognition is performed by using the multi-resolution analysis based on the discrete wavelet. Here, we obtain the feature vectors through the ICA(Independent Component Analysis). On the other hand, the emotion recognition from the speech signal method has a structure of performing the recognition algorithm independently for each wavelet subband and the final recognition is obtained from the multi-decision making scheme. After merging the facial and speech emotion recognition results, we obtained better performance than previous ones.

Design of a Tree-Structured Fuzzy Neural Networks for Aircraft Target Recognition (비행체 표적식별을 위한 트리 구조의 퍼지 뉴럴 네트워크 설계)

  • Han, Chang-Wook
    • Journal of IKEEE
    • /
    • v.24 no.4
    • /
    • pp.1034-1038
    • /
    • 2020
  • In order to effectively process target recognition using radar, accurate signal information for the target is required. However, such a target signal is usually mixed with noise, and this part of the study is continuously carried out. Especially, image processing, target signal processing and target recognition for the target are examples. Since the field of target recognition is important from a military point of view, this paper carried out research on target recognition of aircraft using a tree-structured fuzzy neural networks. Fuzzy neural networks are learned by using reflected signal data for an aircraft to optimize the model, and then test data for the target are used for the optimized model to perform an experiment on target recognition. The effectiveness of the proposed method is verified by the simulation results.

Integrated Visual and Speech Parameters in Korean Numeral Speech Recognition

  • Lee, Sang-won;Park, In-Jung;Lee, Chun-Woo;Kim, Hyung-Bae
    • Proceedings of the IEEK Conference
    • /
    • 2000.07b
    • /
    • pp.685-688
    • /
    • 2000
  • In this paper, we used image information for the enhancement of Korean numeral speech recognition. First, a noisy environment was made by Gaussian generator at each 10 dB level and the generated signal was added to original Korean numeral speech. And then, the speech was analyzed to recognize Korean numeral speech. Speech through microphone was pre-emphasized with 0.95, Hamming window, autocorrelation and LPC analysis was used. Second, the image obtained by camera, was converted to gray level, autocorrelated, and analyzed using LPC algorithm, to which was applied in speech analysis, Finally, the Korean numerial speech recognition with image information was more ehnanced than speech-only, especially in ‘3’, ‘5’and ‘9’. As the same LPC algorithm and simple image management was used, additional computation a1gorithm like a filtering was not used, a total speech recognition algorithm was made simple.

  • PDF

The Real-time Printed Alphabets Recognition using Artificial Neural Networks (인공신경망을 이용한 실시간 영문인쇄체 인식)

  • 심성균;정원용
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2001.06a
    • /
    • pp.149-152
    • /
    • 2001
  • The goals of this papper are not only to maximize of performance but also to reduce the response time for the real-time printed alphabets recognition system using the backpropagation algorithm in the artificial neural network. The Genesis board and MIL(Matrox Image Library) package were used to real-time acquisition, processing and display of images. Through this experiment proved the possibility of real-time recognition processing by comparing response times of the system and proposing the method to reduce of order of the output vectors.

  • PDF

Multiple Plankton Detection and Recognition in Microscopic Images with Homogeneous Clumping and Heterogeneous Interspersion

  • Soh, Youngsung;Song, Jaehyun;Hae, Yongsuk
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.19 no.2
    • /
    • pp.35-41
    • /
    • 2018
  • The analysis of plankton species distribution in sea or fresh water is very important in preserving marine ecosystem health. Since manual analysis is infeasible, many automatic approaches were proposed. They usually use images from in situ towed underwater imaging sensor or specially designed, lab mounted microscopic imaging system. Normally they assume that only single plankton is present in an image so that, if there is a clumping among multiple plankton of same species (homogeneous clumping) or if there are multiple plankton of different species scattered in an image (heterogeneous interspersion), they have a difficulty in recognition. In this work, we propose a deep learning based method that can detect and recognize individual plankton in images with homogeneous clumping, heterogeneous interspersion, or combination of both.

Research on Damage Identification of Buried Pipeline Based on Fiber Optic Vibration Signal

  • Weihong Lin;Wei Peng;Yong Kong;Zimin Shen;Yuzhou Du;Leihong Zhang;Dawei Zhang
    • Current Optics and Photonics
    • /
    • v.7 no.5
    • /
    • pp.511-517
    • /
    • 2023
  • Pipelines play an important role in urban water supply and drainage, oil and gas transmission, etc. This paper presents a technique for pattern recognition of fiber optic vibration signals collected by a distributed vibration sensing (DVS) system using a deep learning residual network (ResNet). The optical fiber is laid on the pipeline, and the signal is collected by the DVS system and converted into a 64 × 64 single-channel grayscale image. The grayscale image is input into the ResNet to extract features, and finally the K-nearest-neighbors (KNN) algorithm is used to achieve the classification and recognition of pipeline damage.

Speech Recognition Model Based on CNN using Spectrogram (스펙트로그램을 이용한 CNN 음성인식 모델)

  • Won-Seog Jeong;Haeng-Woo Lee
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.4
    • /
    • pp.685-692
    • /
    • 2024
  • In this paper, we propose a new CNN model to improve the recognition performance of command voice signals. This method obtains a spectrogram image after performing a short-time Fourier transform (STFT) of the input signal and improves command recognition performance through supervised learning using a CNN model. After Fourier transforming the input signal for each short-time section, a spectrogram image is obtained and multi-classification learning is performed using a CNN deep learning model. This effectively classifies commands by converting the time domain voice signal to the frequency domain to express the characteristics well and performing deep learning training using the spectrogram image for the conversion parameters. To verify the performance of the speech recognition system proposed in this study, a simulation program using Tensorflow and Keras libraries was created and a simulation experiment was performed. As a result of the experiment, it was confirmed that an accuracy of 92.5% could be obtained using the proposed deep learning algorithm.

Speech Activity Decision with Lip Movement Image Signals (입술움직임 영상신호를 고려한 음성존재 검출)

  • Park, Jun;Lee, Young-Jik;Kim, Eung-Kyeu;Lee, Soo-Jong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.1
    • /
    • pp.25-31
    • /
    • 2007
  • This paper describes an attempt to prevent the external acoustic noise from being misrecognized as the speech recognition target. For this, in the speech activity detection process for the speech recognition, it confirmed besides the acoustic energy to the lip movement image signal of a speaker. First of all, the successive images are obtained through the image camera for PC. The lip movement whether or not is discriminated. And the lip movement image signal data is stored in the shared memory and shares with the recognition process. In the meantime, in the speech activity detection Process which is the preprocess phase of the speech recognition. by conforming data stored in the shared memory the acoustic energy whether or not by the speech of a speaker is verified. The speech recognition processor and the image processor were connected and was experimented successfully. Then, it confirmed to be normal progression to the output of the speech recognition result if faced the image camera and spoke. On the other hand. it confirmed not to output of the speech recognition result if did not face the image camera and spoke. That is, if the lip movement image is not identified although the acoustic energy is inputted. it regards as the acoustic noise.