• Title/Summary/Keyword: a model mismatch

Search Result 227, Processing Time 0.02 seconds

Robust Feature Normalization Scheme Using Separated Eigenspace in Noisy Environments (분리된 고유공간을 이용한 잡음환경에 강인한 특징 정규화 기법)

  • Lee Yoonjae;Ko Hanseok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.4
    • /
    • pp.210-216
    • /
    • 2005
  • We Propose a new feature normalization scheme based on eigenspace for achieving robust speech recognition. In general, mean and variance normalization (MVN) is Performed in cepstral domain. However, another MVN approach using eigenspace was recently introduced. in that the eigenspace normalization Procedure Performs normalization in a single eigenspace. This Procedure consists of linear PCA matrix feature transformation followed by mean and variance normalization of the transformed cepstral feature. In this method. 39 dimensional feature distribution is represented using only a single eigenspace. However it is observed to be insufficient to represent all data distribution using only a sin91e eigenvector. For more specific representation. we apply unique na independent eigenspaces to cepstra, delta and delta-delta cepstra respectively in this Paper. We also normalize training data in eigenspace and get the model from the normalized training data. Finally. a feature space rotation procedure is introduced to reduce the mismatch of training and test data distribution in noisy condition. As a result, we obtained a substantial recognition improvement over the basic eigenspace normalization.

A Landmark Based Localization System using a Kinect Sensor (키넥트 센서를 이용한 인공표식 기반의 위치결정 시스템)

  • Park, Kwiwoo;Chae, JeongGeun;Moon, Sang-Ho;Park, Chansik
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.63 no.1
    • /
    • pp.99-107
    • /
    • 2014
  • In this paper, a landmark based localization system using a Kinect sensor is proposed and evaluated with the implemented system for precise and autonomous navigation of low cost robots. The proposed localization method finds the positions of landmark on the image plane and the depth value using color and depth images. The coordinates transforms are defined using the depth value. Using coordinate transformation, the position in the image plane is transformed to the position in the body frame. The ranges between the landmarks and the Kinect sensor are the norm of the landmark positions in body frame. The Kinect sensor position is computed using the tri-lateral whose inputs are the ranges and the known landmark positions. In addition, a new matching method using the pin hole model is proposed to reduce the mismatch between depth and color images. Furthermore, a height error compensation method using the relationship between the body frame and real world coordinates is proposed to reduce the effect of wrong leveling. The error analysis are also given to find out the effect of focal length, principal point and depth value to the range. The experiments using 2D bar code with the implemented system show that the position with less than 3cm error is obtained in enclosed space($3,500mm{\times}3,000mm{\times}2,500mm$).

Design of the Fuzzy Logic Cross-Coupled Controller using a New Contouring Modeling (새로운 윤곽 모델링에 의한 퍼지논리형 상호결합제어기 설계)

  • Kim, Jin-Hwan;Lee, Je-Hie;Huh, Uk-Youl
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.37 no.1
    • /
    • pp.10-18
    • /
    • 2000
  • This paper proposes a fuzzy logic cross-coupled controller using a new contouring modeling for a two-axis servo system. The general decoupled control approach may result in degraded contouring performance due to such factors as mismatch of axial dynamics and axial loop gains. In practice, such systems contain many uncertainties. The cross-coupled controller utilizes all axis position error information simultaneously to produce accurate contours. However, the conventional cross-coupled controllers cannot overcome friction, backlash, and parameter variations. Also since, it is difficult to obtain an accurate mathematical model of multi-axis system, here we investigate a fuzzy logic cross-coupled controller of servo system. In addition, new contouring error vector computation method is presented. The experimental results are presented to illustrate the performance of the proposed algorithm.

  • PDF

Improvements in Speaker Adaptation Using Weighted Training (가중 훈련을 이용한 화자 적응 시스템의 향상)

  • 장규철;우수영;진민호;박용규;유창동
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.3
    • /
    • pp.188-193
    • /
    • 2003
  • Regardless of the distribution of the adaptation data in the testing environment, model-based adaptation methods that have so far been reported in various literature incorporates the adaptation data undiscriminatingly in reducing the mismatch between the training and testing environments. When the amount of data is small and the parameter tying is extensive, adaptation based on outlier data can be detrimental to the performance of the recognizer. The distribution of the adaptation data plays a critical role on the adaptation performance. In order to maximally improve the recognition rate in the testing environment using only a small number of adaptation data, supervised weighted training is applied to the structural maximum a posterior (SMAP) algorithm. We evaluate the performance of the proposed weighted SMAP (WSMAP) and SMAP on TIDIGITS corpus. The proposed WSMAP has been found to perform better for a small amount of data. The general idea of incorporating the distribution of the adaptation data is applicable to other adaptation algorithms.

Academic Program Operation for the Industry Professional Practice Implementation (장기현장실습(IPP) 제도를 위한 학사운영 방안)

  • Oh, Chang-Heon;Ha, Jun-Hong;Kim, Namho;Cho, Jae-Soo;Om, Kiyong
    • The Journal of Korean Institute for Practical Engineering Education
    • /
    • v.4 no.2
    • /
    • pp.110-115
    • /
    • 2012
  • IPP (Industry Professional Practice) is an educational model that combines academic study and industrial work through university-industry cooperation. Students would decide suitable career based on their IPP experience, that will lead a university graduate to improve their recruitment potential. IPP could also be a key to solve national employment problems as well as a chronic manpower supply and demand mismatch issue between university and industry. This paper discusses about an academic program operation for the IPP implementation, that includes operation plan for semester-based quarter system, a guideline for new curriculum, an academic credit allocation, evaluation guideline, a capstone design class operation, and interim measures.

  • PDF

Numerical Homogenization in Concrete Materials Using Multi-Resolution Analysis (다중해상도해석을 이용한 콘크리트 재료의 수치적 동질화)

  • Rhee In-Kyu;Roh Young-Sook
    • Journal of the Korea Concrete Institute
    • /
    • v.17 no.6 s.90
    • /
    • pp.939-946
    • /
    • 2005
  • The stiffness properties of heterogeneous concrete materials and their degradation were investigated at different-levels of observations with aids of the opportunities and limitations of multi-resolution wavelet analysis. The successive Haw transformations lead to a recursive separation of the stiffness properties and the response into coarse-and fine-scale features. In the limit, this recursive process results in a homogenization parameter which is an average measure of stiffness and strain energy capacity at the coarse scale. The basic concept of multi-resolution analysis is illustrated with one and two-dimensional model problems of a two-phase particulate composite representative of the morphology of concrete materials. The computational studies include the meso-structural features of concrete in the form of a hi-material system of aggregate particles which are immersed in a hardened cement paste taking due to account of the mismatch of the two elastic constituents.

Frequency Domain Channel Estimation for MIMO SC-FDMA Systems with CDM Pilots

  • Kim, Hyun-Myung;Kim, Dongsik;Kim, Tae-Kyoung;Im, Gi-Hong
    • Journal of Communications and Networks
    • /
    • v.16 no.4
    • /
    • pp.447-457
    • /
    • 2014
  • In this paper, we investigate the frequency domain channel estimation for multiple-input multiple-output (MIMO) single-carrier frequency-division multiple-access (SC-FDMA) systems. In MIMO SC-FDMA, code-division multiplexed (CDM) pilots such as cyclic-shifted Zadoff-Chu sequences have been adopted for channel estimation. However, most frequency domain channel estimation schemes were developed based on frequency-division multiplexing of pilots. We first develop a channel estimation error model by using CDM pilots, and then analyze the mean-square error (MSE) of various minimum MSE (MMSE) frequency domain channel estimation techniques. We show that the cascaded one-dimensional robust MMSE (C1D-RMMSE) technique is complexity-efficient, but it suffers from performance degradation due to the channel correlation mismatch when compared to the two-dimensional MMSE (2D-MMSE) technique. To improve the performance of C1D-RMMSE, we design a robust iterative channel estimation (RITCE) with a frequency replacement (FR) algorithm. After deriving the MSE of iterative channel estimation, we optimize the FR algorithm in terms of the MSE. Then, a low-complexity adaptation method is proposed for practical MIMO SC-FDMA systems, wherein FR is performed according to the reliability of the data estimates. Simulation results show that the proposed RITCE technique effectively improves the performance of C1D-RMMSE, thus providing a better performance-complexity tradeoff than 2D-MMSE.

Image Mood Classification Using Deep CNN and Its Application to Automatic Video Generation (심층 CNN을 활용한 영상 분위기 분류 및 이를 활용한 동영상 자동 생성)

  • Cho, Dong-Hee;Nam, Yong-Wook;Lee, Hyun-Chang;Kim, Yong-Hyuk
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.9
    • /
    • pp.23-29
    • /
    • 2019
  • In this paper, the mood of images was classified into eight categories through a deep convolutional neural network and video was automatically generated using proper background music. Based on the collected image data, the classification model is learned using a multilayer perceptron (MLP). Using the MLP, a video is generated by using multi-class classification to predict image mood to be used for video generation, and by matching pre-classified music. As a result of 10-fold cross-validation and result of experiments on actual images, each 72.4% of accuracy and 64% of confusion matrix accuracy was achieved. In the case of misclassification, by classifying video into a similar mood, it was confirmed that the music from the video had no great mismatch with images.

Word Embeddings-Based Pseudo Relevance Feedback Using Deep Averaging Networks for Arabic Document Retrieval

  • Farhan, Yasir Hadi;Noah, Shahrul Azman Mohd;Mohd, Masnizah;Atwan, Jaffar
    • Journal of Information Science Theory and Practice
    • /
    • v.9 no.2
    • /
    • pp.1-17
    • /
    • 2021
  • Pseudo relevance feedback (PRF) is a powerful query expansion (QE) technique that prepares queries using the top k pseudorelevant documents and choosing expansion elements. Traditional PRF frameworks have robustly handled vocabulary mismatch corresponding to user queries and pertinent documents; nevertheless, expansion elements are chosen, disregarding similarity to the original query's elements. Word embedding (WE) schemes comprise techniques of significant interest concerning QE, that falls within the information retrieval domain. Deep averaging networks (DANs) defines a framework relying on average word presence passed through multiple linear layers. The complete query is understandably represented using the average vector comprising the query terms. The vector may be employed for determining expansion elements pertinent to the entire query. In this study, we suggest a DANs-based technique that augments PRF frameworks by integrating WE similarities to facilitate Arabic information retrieval. The technique is based on the fundamental that the top pseudo-relevant document set is assessed to determine candidate element distribution and select expansion terms appropriately, considering their similarity to the average vector representing the initial query elements. The Word2Vec model is selected for executing the experiments on a standard Arabic TREC 2001/2002 set. The majority of the evaluations indicate that the PRF implementation in the present study offers a significant performance improvement compared to that of the baseline PRF frameworks.

Feature Extraction to Detect Hoax Articles (낚시성 인터넷 신문기사 검출을 위한 특징 추출)

  • Heo, Seong-Wan;Sohn, Kyung-Ah
    • Journal of KIISE
    • /
    • v.43 no.11
    • /
    • pp.1210-1215
    • /
    • 2016
  • Readership of online newspapers has grown with the proliferation of smart devices. However, fierce competition between Internet newspaper companies has resulted in a large increase in the number of hoax articles. Hoax articles are those where the title does not convey the content of the main story, and this gives readers the wrong information about the contents. We note that the hoax articles have certain characteristics, such as unnecessary celebrity quotations, mismatch in the title and content, or incomplete sentences. Based on these, we extract and validate features to identify hoax articles. We build a large-scale training dataset by analyzing text keywords in replies to articles and thus extracted five effective features. We evaluate the performance of the support vector machine classifier on the extracted features, and a 92% accuracy is observed in our validation set. In addition, we also present a selective bigram model to measure the consistency between the title and content, which can be effectively used to analyze short texts in general.