• Title/Summary/Keyword: temporal feature

Search Result 313, Processing Time 0.034 seconds

Active Sonar Target/Nontarget Classification Using Real Sea-trial Data (실제 해상 실험 데이터를 이용한 능동소나 표적/비표적 식별)

  • Seok, J.W.
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.10
    • /
    • pp.1637-1645
    • /
    • 2017
  • Target/Nontarget classification can be divided into the study of shape estimation of the target analysing reflected echo signal and of type classification of the target using acoustical features. In active sonar system, the feature vectors are extracted from the signal reflected from the target, and an classification algorithm is applied to determine whether the received signal is a target or not. However, received sonar signals can be distorted in the underwater environments, and the spatio-temporal characteristics of active sonar signals change according to the aspect of the target. In addition, it is very difficult to collect real sea-trial data for research. In this paper, target/non-target classification were performed using real sea-trial data. Feature vectors are extracted using MFCC(Mel-Frequency Cepstral Coefficients), filterbank energy in the Fourier spectrum and wavelet domain. For the performance verification, classification experiments were performed using backpropagation neural network classifiers.

Benthic Organisms and Environmental Variability in Antarctica: Responses to Seasonal, Decadal and Long-term Change

  • Clarke, Andrew
    • Ocean and Polar Research
    • /
    • v.23 no.4
    • /
    • pp.433-440
    • /
    • 2001
  • Marine organisms in Antarctica live in an environment which exhibits variability in physical processes over a wide range of temporal scales, from seconds to millennia. This time scale tends to be correlated with the spatial scale over which a given process operates, though this relationship is influenced by biology. The way organisms respond to variability in the physical environment depends on the time-scale of that variability in relation to life-span. Short-term variations are perceived largely as noise and probably have little direct impact on ecology. Of much greater importance to organisms in Antarctica are seasonal and decadal variations. Although seasonality has long been recognised as a key feature of polar environments, the realization that decadal scale variability is important is relatively recent. Long-term change has always been a feature of polar environments and may be a key factor in the evolution of the communities we see today.

  • PDF

A new approach for content-based video retrieval

  • Kim, Nac-Woo;Lee, Byung-Tak;Koh, Jai-Sang;Song, Ho-Young
    • International Journal of Contents
    • /
    • v.4 no.2
    • /
    • pp.24-28
    • /
    • 2008
  • In this paper, we propose a new approach for content-based video retrieval using non-parametric based motion classification in the shot-based video indexing structure. Our system proposed in this paper has supported the real-time video retrieval using spatio-temporal feature comparison by measuring the similarity between visual features and between motion features, respectively, after extracting representative frame and non-parametric motion information from shot-based video clips segmented by scene change detection method. The extraction of non-parametric based motion features, after the normalized motion vectors are created from an MPEG-compressed stream, is effectively fulfilled by discretizing each normalized motion vector into various angle bins, and by considering the mean, variance, and direction of motion vectors in these bins. To obtain visual feature in representative frame, we use the edge-based spatial descriptor. Experimental results show that our approach is superior to conventional methods with regard to the performance for video indexing and retrieval.

Weighting Method based on Motion Information for Objective Video Quality Assessment (객관적 영상 화질 평가 기준를 위한 움직임 정보에 따른 중요도 결정 기법)

  • Park, Su-Young;Kim, Tae-Wan;Lee, Sang-Hoon
    • Proceedings of the IEEK Conference
    • /
    • 2008.06a
    • /
    • pp.909-910
    • /
    • 2008
  • For evaluating the performance of some codecs, many researchers have study and develop new objective video quality assessments. However, it's not sufficient for evaluating the temporal feature of video data yet, which is a distinguishable and representative characteristic when compared with other multimedia. This paper propose the method to apply the weight to SSIM (Structural SIMilarity) according to the cognitive psychological feature. And, we presented that the performance of objective video quality assessment applied the weight to SSIM by using the proposed method is superior to one of original SSIM.

  • PDF

HMM-based missing feature reconstruction for robust speech recognition in additive noise environments (가산잡음환경에서 강인음성인식을 위한 은닉 마르코프 모델 기반 손실 특징 복원)

  • Cho, Ji-Won;Park, Hyung-Min
    • Phonetics and Speech Sciences
    • /
    • v.6 no.4
    • /
    • pp.127-132
    • /
    • 2014
  • This paper describes a robust speech recognition technique by reconstructing spectral components mismatched with a training environment. Although the cluster-based reconstruction method can compensate the unreliable components from reliable components in the same spectral vector by assuming an independent, identically distributed Gaussian-mixture process of training spectral vectors, the presented method exploits the temporal dependency of speech to reconstruct the components by introducing a hidden-Markov-model prior which incorporates an internal state transition plausible for an observed spectral vector sequence. The experimental results indicate that the described method can provide temporally consistent reconstruction and further improve recognition performance on average compared to the conventional method.

Human Action Recognition via Depth Maps Body Parts of Action

  • Farooq, Adnan;Farooq, Faisal;Le, Anh Vu
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.5
    • /
    • pp.2327-2347
    • /
    • 2018
  • Human actions can be recognized from depth sequences. In the proposed algorithm, we initially construct depth, motion maps (DMM) by projecting each depth frame onto three orthogonal Cartesian planes and add the motion energy for each view. The body part of the action (BPoA) is calculated by using bounding box with an optimal window size based on maximum spatial and temporal changes for each DMM. Furthermore, feature vector is constructed by using BPoA for each human action view. In this paper, we employed an ensemble based learning approach called Rotation Forest to recognize different actions Experimental results show that proposed method has significantly outperforms the state-of-the-art methods on Microsoft Research (MSR) Action 3D and MSR DailyActivity3D dataset.

Signal Synthesis and Feature Extraction for Active Sonar Target Classification (능동소나 표적 인식을 위한 신호합성 및 특징추출)

  • Uh, Y.;Seok, J.W.
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.1
    • /
    • pp.9-16
    • /
    • 2015
  • Various approaches to process active sonar signals are under study, but there are many problems to be considered. The sonar signals are distorted by the underwater environment, and the spatio-temporal and spectral characteristics of active sonar signals change in accordance with the aspect of the target even though they come from the same one. And it has difficulties in collecting actual underwater data. In this paper, we synthesized active target echoes based on ray tracing algorithm using target model having 3-dimensional highlight distribution. Then, Fractional Fourier transform was applied to synthesized target echoes to extract feature vector. Recognition experiment was performed using probabilistic neural network classifier.

Decision-Tree-Based Markov Model for Phrase Break Prediction

  • Kim, Sang-Hun;Oh, Seung-Shin
    • ETRI Journal
    • /
    • v.29 no.4
    • /
    • pp.527-529
    • /
    • 2007
  • In this paper, a decision-tree-based Markov model for phrase break prediction is proposed. The model takes advantage of the non-homogeneous-features-based classification ability of decision tree and temporal break sequence modeling based on the Markov process. For this experiment, a text corpus tagged with parts-of-speech and three break strength levels is prepared and evaluated. The complex feature set, textual conditions, and prior knowledge are utilized; and chunking rules are applied to the search results. The proposed model shows an error reduction rate of about 11.6% compared to the conventional classification model.

  • PDF

A STUDY ON THE RECOGNITION OF SPOKEN KOREAN LOCAL-NAMES USING SPATIO TEMPORAL

  • Song, Do-Sun;Kim, Suk-Dong;Lee, Haing-Sei
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.1003-1008
    • /
    • 1994
  • This paper is about an experiment of speaker-independent automation Korean spoken words recognition using Multi-Layered Perceptron and Error Back-propagation algorithm. The words were not segmented into syllables or phonemes, and some feature components extracted from the words in equal gap were applied to the neural network. This paper tried to find out the optimum conditions through various experiment which are comparison between total and pre-classified training.

  • PDF

Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion

  • Zhou, Xuan
    • Journal of Information Processing Systems
    • /
    • v.17 no.2
    • /
    • pp.337-351
    • /
    • 2021
  • Automatically recognizing facial expressions in video sequences is a challenging task because there is little direct correlation between facial features and subjective emotions in video. To overcome the problem, a video facial expression recognition method using spatiotemporal recurrent neural network and feature fusion is proposed. Firstly, the video is preprocessed. Then, the double-layer cascade structure is used to detect a face in a video image. In addition, two deep convolutional neural networks are used to extract the time-domain and airspace facial features in the video. The spatial convolutional neural network is used to extract the spatial information features from each frame of the static expression images in the video. The temporal convolutional neural network is used to extract the dynamic information features from the optical flow information from multiple frames of expression images in the video. A multiplication fusion is performed with the spatiotemporal features learned by the two deep convolutional neural networks. Finally, the fused features are input to the support vector machine to realize the facial expression classification task. The experimental results on cNTERFACE, RML, and AFEW6.0 datasets show that the recognition rates obtained by the proposed method are as high as 88.67%, 70.32%, and 63.84%, respectively. Comparative experiments show that the proposed method obtains higher recognition accuracy than other recently reported methods.