• Title/Summary/Keyword: Variation feature

Search Result 464, Processing Time 0.024 seconds

Robust Speech Recognition Parameters for Emotional Variation (감정 변화에 강인한 음성 인식 파라메터)

  • Kim Weon-Goo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.15 no.6
    • /
    • pp.655-660
    • /
    • 2005
  • This paper studied the feature parameters less affected by the emotional variation for the development of the robust speech recognition technologies. For this purpose, the effect of emotional variation on the speech recognition system and robust feature parameters of speech recognition system were studied using speech database containing various emotions. In this study, LPC cepstral coefficient, met-cepstral coefficient, root-cepstral coefficient, PLP coefficient, RASTA met-cepstral coefficient were used as a feature parameters. And CMS and SBR method were used as a signal bias removal techniques. Experimental results showed that the HMM based speaker independent word recognizer using RASTA met-cepstral coefficient :md its derivatives and CMS as a signal bias removal showed the best performance of $7.05\%$ word error rate. This corresponds to about a $52\%$ word error reduction as compare to the performance of baseline system using met - cepstral coefficient.

Image Retrieval using Local Color Histogram and Shape Feature (지역별 색상 분포 히스토그램과 모양 특징을 이용한 영상 검색)

  • 정길선;김성만;이양원
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 1999.05a
    • /
    • pp.50-54
    • /
    • 1999
  • This paper is proposed to image retrieval system using color and shape feature. Color feature used to four maximum value feature among the maximum value extracted from local color distribution histogram. The preprocessing of shape feature consist of edge extraction and weight central point extraction and angular sampling. The sum of distance from weight central point to contour and variation and max/min used to shape feature. The similarity is estimated compare feature of query image with the feature of images in database and the candidate of image is retrieved in order of similarity. We evaluate the effectiveness of shape feature and color feature in experiment used to two hundred of the closed image. The Recall and the Precision is each 0.72 and 0.53 in the result of average experiment. So the proposed method is presented useful method.

  • PDF

The Variation of Prosody by Focus (의미의 강조에 의한 운율특징 -음향음성학적 관점에 의한 분석-)

  • Kim Seonhi
    • MALSORI
    • /
    • no.40
    • /
    • pp.51-63
    • /
    • 2000
  • There are sentences where sentence stress is imposed on a specific word. These sentences are called 'focused sentences'. The purpose of this paper is to investigate the variation of pitch, duration, amplitude in focused words. It is noted that pitch of a focused word is higher than that of unfocused words irrespective of the accentual pattern, and that contour tones such as HL or LH are realized longer when these tones appear in focused words. Not only the noun but also the following particle like '-boda' is higher when these words are in focus. Hence pitch is proved to be the most salient prosodic feature of the focused sentence.

  • PDF

Underwater Transient Signal Classification Using Eigen Decomposition Based on Wigner-Ville Distribution Function (위그너-빌 분포 함수 기반의 고유치 분해를 이용한 수중 천이 신호 식별)

  • Bae, Keun-Sung;Hwang, Chan-Sik;Lee, Hyeong-Uk;Lim, Tae-Gyun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.3
    • /
    • pp.123-128
    • /
    • 2007
  • This Paper Presents new transient signal classification algorithms for underwater transient signals. In general. the ambient noise has small spectral deviation and energy variation. while a transient signal has large fluctuation. Hence to detect the transient signal, we use the spectral deviation and power variation. To classify the detected transient signal. the feature Parameters are obtained by using the Wigner-Ville distribution based eigenvalue decomposition. The correlation is then calculated between the feature vector of the detected signal and all the feature vectors of the reference templates frame-by-frame basis, and the detected transient signal is classified by the frame mapping rate among the class database.

A Study on Trend Sharing in Segmental-feature HMM (분절 특징 은닉 마코프 모델에서의 경향 공유에 관한 연구)

  • 윤영선
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.7
    • /
    • pp.641-647
    • /
    • 2002
  • In this paper, we propose the reduction method of the number of parameters in the segmental-feature HMM using trend quantization method. The proposed method shares the trend information of the polynomial trajectories by quantization. The trajectory is obtained by the sequence of feature vectors of speech signals and can be divided by trend and location information. The trend indicates the variation of consequent frame features, while the location points to the positional difference of the trajectories. Since the trend occupies the large portion of SFHMM, if the trend is shared, the number of parameters maybe decreases. To exploit the proposed system the experiments are performed on TIMIT corpus. The experimental results show that the performance of the proposed system is roughly similar to that of previous system. Therefore, the proposed system can be considered one of parameter reduction method.

Reinforced Feature of Dynamic Search Area for the Discriminative Model Prediction Tracker based on Multi-domain Dataset (다중 도메인 데이터 기반 구별적 모델 예측 트레커를 위한 동적 탐색 영역 특징 강화 기법)

  • Lee, Jun Ha;Won, Hong-In;Kim, Byeong Hak
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.16 no.6
    • /
    • pp.323-330
    • /
    • 2021
  • Visual object tracking is a challenging area of study in the field of computer vision due to many difficult problems, including a fast variation of target shape, occlusion, and arbitrary ground truth object designation. In this paper, we focus on the reinforced feature of the dynamic search area to get better performance than conventional discriminative model prediction trackers on the condition when the accuracy deteriorates since low feature discrimination. We propose a reinforced input feature method shown like the spotlight effect on the dynamic search area of the target tracking. This method can be used to improve performances for deep learning based discriminative model prediction tracker, also various types of trackers which are used to infer the center of the target based on the visual object tracking. The proposed method shows the improved tracking performance than the baseline trackers, achieving a relative gain of 38% quantitative improvement from 0.433 to 0.601 F-score at the visual object tracking evaluation.

Analysis of Inductance Variation Characteristics in Interior Permanent Magnet Synchronous Motor (매입형 영구자석 동기 전동기의 인덕턴스 리플 특성 분석)

  • Lee, Sang-Yub;Kwak, Sang-Yeop;Jung, Hyun-Kyo
    • Proceedings of the KIEE Conference
    • /
    • 2006.07b
    • /
    • pp.821-822
    • /
    • 2006
  • In the case of the interior permanent magnet synchronous motor (IPMSM), it is important to know the accurate machine parameters in the design step. In particular, d- and q- axis inductance are expected to have ripple characteristics, due to the mechanical structure and the degree of magnetic saturation. In this paper, this feature is expressed as inductance variation. Inductance variation of the IPMSM is calculated with finite element analysis, and the reason for inductance variation is identified. Finally the validity of this paper is verified by the comparison with the experimental results.

  • PDF

Content-based retrieval system using wavelet transform (웨이브렛 변환을 이용한 내용기반 검색 시스템)

  • 반가운;유기형;박정호;최재호;곽훈성
    • Proceedings of the IEEK Conference
    • /
    • 1998.06a
    • /
    • pp.733-736
    • /
    • 1998
  • In this paper, we propose a new method for content-based retrieval system using wavelet transform and correlation, which has were used in signal processing and image compressing. The matching method is used not perfect matching but similar matching. Used feature vector is the lowest frequency(LL) itself, energy value, and edge information of 4-layer, after computng a 4-layer 2-D fast wavelet transform on image. By the proosed algorithm, we got the result that was faste rand more accurate than the traditional algorithm. Because used feature vector was compressed 256:1 over original image, retrieval speed was highly improved. By using correlation, moving object with size variation was reterieved without additional feature information.

  • PDF

A Study On the Comparison of the Geometric Invariance From A Single-View Image (단일 시각방향 영상에서의 기하 불변량의 특성 비교에 관한 연구)

  • 이영재;박영태
    • Proceedings of the IEEK Conference
    • /
    • 1999.06a
    • /
    • pp.639-642
    • /
    • 1999
  • There exist geometrically invariant relations in single-view images under a specific geometrical structure. This invariance may be utilized for 3D object recognition. Two types of invariants are compared in terms of the robustness to the variation of the feature points. Deviation of the invariant relations are measured by adding random noise to the feature point location. Zhu’s invariant requires six points on adjacent planes having two sets of four coplanar points, whereas the Kaist method requires four coplanar points and two non-coplanar points. Experimental results show that the latter method has the advantage in choosing feature points while suffering from weak robustness to the noise.

  • PDF

Nonlinear Tolerance Allocation for Assembly Components (조립품을 위한 비선형 공차할당)

  • Kim, Kwang-Soo;Choi, Hoo-Gon
    • IE interfaces
    • /
    • v.16 no.spc
    • /
    • pp.39-44
    • /
    • 2003
  • As one of many design variables, the role of dimension tolerances is to restrict the amount of size variation in a manufactured feature while ensuring functionality. In this study, a nonlinear integer model has been modeled to allocate the optimal tolerance to each individual feature at a minimum manufacturing cost. While a normal distribution determines statistically worst tolerances with its symmetrical property in many previous tolerance allocation studies, a asymmetrical distribution is more realistic because its mean is not always coincident with a process center. A nonlinear integer model is modeled to allocate the optimal tolerance to a feature based on a beta distribution at a minimum total cost. The total cost as a function of tolerances is defined by machining cost and quality loss. After the convexity of manufacturing cost is checked by the Hessian matrix, the model is solved by the Complex Method. Finally, a numerical example is presented demonstrating successful model implementation for a nonlinear design case.