• Title/Summary/Keyword: coding parameters

Search Result 279, Processing Time 0.023 seconds

A MFCC-based CELP Speech Coder for Server-based Speech Recognition in Network Environments (네트워크 환경에서 서버용 음성 인식을 위한 MFCC 기반 음성 부호화기 설계)

  • Lee, Gil-Ho;Yoon, Jae-Sam;Oh, Yoo-Rhee;Kim, Hong-Kook
    • MALSORI
    • /
    • no.54
    • /
    • pp.27-43
    • /
    • 2005
  • Existing standard speech coders can provide speech communication of high quality while they degrade the performance of speech recognition systems that use the reconstructed speech by the coders. The main cause of the degradation is that the spectral envelope parameters in speech coding are optimized to speech quality rather than to the performance of speech recognition. For example, mel-frequency cepstral coefficient (MFCC) is generally known to provide better speech recognition performance than linear prediction coefficient (LPC) that is a typical parameter set in speech coding. In this paper, we propose a speech coder using MFCC instead of LPC to improve the performance of a server-based speech recognition system in network environments. However, the main drawback of using MFCC is to develop the efficient MFCC quantization with a low-bit rate. First, we explore the interframe correlation of MFCCs, which results in the predictive quantization of MFCC. Second, a safety-net scheme is proposed to make the MFCC-based speech coder robust to channel error. As a result, we propose a 8.7 kbps MFCC-based CELP coder. It is shown from a PESQ test that the proposed speech coder has a comparable speech quality to 8 kbps G.729 while it is shown that the performance of speech recognition using the proposed speech coder is better than that using G.729.

  • PDF

A Random Deflected Subgradient Algorithm for Energy-Efficient Real-time Multicast in Wireless Networks

  • Tan, Guoping;Liu, Jianjun;Li, Yueheng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.10
    • /
    • pp.4864-4882
    • /
    • 2016
  • In this work, we consider the optimization problem of minimizing energy consumption for real-time multicast over wireless multi-hop networks. Previously, a distributed primal-dual subgradient algorithm was used for finding a solution to the optimization problem. However, the traditional subgradient algorithms have drawbacks in terms of i) sensitivity to iteration parameters; ii) need for saving previous iteration results for computing the optimization results at the current iteration. To overcome these drawbacks, using a joint network coding and scheduling optimization framework, we propose a novel distributed primal-dual Random Deflected Subgradient (RDS) algorithm for solving the optimization problem. Furthermore, we derive the corresponding recursive formulas for the proposed RDS algorithm, which are useful for practical applications. In comparison with the traditional subgradient algorithms, the illustrated performance results show that the proposed RDS algorithm can achieve an improved optimal solution. Moreover, the proposed algorithm is stable and robust against the choice of parameter values used in the algorithm.

Dimmable Spatial Intensity Modulation for Visible-light Communication: Capacity Analysis and Practical Design

  • Kim, Byung Wook;Jung, Sung-Yoon
    • Current Optics and Photonics
    • /
    • v.2 no.6
    • /
    • pp.532-539
    • /
    • 2018
  • Multiple LED arrays can be utilized in visible-light communication (VLC) to improve communication efficiency, while maintaining smart illumination functionality through dimming control. This paper proposes a modulation scheme called "Spatial Intensity Modulation" (SIM), where the effective number of turned-on LEDs is employed for data modulation and dimming control in VLC systems. Unlike the conventional pulse-amplitude modulation (PAM), symbol intensity levels are not determined by the amplitude levels of a VLC signal from each LED, but by counting the number of turned-on LEDs, illuminating with a single amplitude level. Because the intensity of a SIM symbol and the target dimming level are determined solely in the spatial domain, the problems of conventional PAM-based VLC and related MIMO VLC schemes, such as unstable dimming control, non uniform illumination functionality, and burdens of channel prediction, can be solved. By varying the number and formation of turned-on LEDs around the target dimming level in time, the proposed SIM scheme guarantees homogeneous illumination over a target area. An analysis of the dimming capacity, which is the achievable communication rate under the target dimming level in VLC, is provided by deriving the turn-on probability to maximize the entropy of the SIM-based VLC system. In addition, a practical design of dimmable SIM scheme applying the multilevel inverse source coding (MISC) method is proposed. The simulation results under a range of parameters provide baseline data to verify the performance of the proposed dimmable SIM scheme and applications in real systems.

Support vector ensemble for incipient fault diagnosis in nuclear plant components

  • Ayodeji, Abiodun;Liu, Yong-kuo
    • Nuclear Engineering and Technology
    • /
    • v.50 no.8
    • /
    • pp.1306-1313
    • /
    • 2018
  • The randomness and incipient nature of certain faults in reactor systems warrant a robust and dynamic detection mechanism. Existing models and methods for fault diagnosis using different mathematical/statistical inferences lack incipient and novel faults detection capability. To this end, we propose a fault diagnosis method that utilizes the flexibility of data-driven Support Vector Machine (SVM) for component-level fault diagnosis. The technique integrates separately-built, separately-trained, specialized SVM modules capable of component-level fault diagnosis into a coherent intelligent system, with each SVM module monitoring sub-units of the reactor coolant system. To evaluate the model, marginal faults selected from the failure mode and effect analysis (FMEA) are simulated in the steam generator and pressure boundary of the Chinese CNP300 PWR (Qinshan I NPP) reactor coolant system, using a best-estimate thermal-hydraulic code, RELAP5/SCDAP Mod4.0. Multiclass SVM model is trained with component level parameters that represent the steady state and selected faults in the components. For optimization purposes, we considered and compared the performances of different multiclass models in MATLAB, using different coding matrices, as well as different kernel functions on the representative data derived from the simulation of Qinshan I NPP. An optimum predictive model - the Error Correcting Output Code (ECOC) with TenaryComplete coding matrix - was obtained from experiments, and utilized to diagnose the incipient faults. Some of the important diagnostic results and heuristic model evaluation methods are presented in this paper.

Analytical Approximation Algorithm for the Inverse of the Power of the Incomplete Gamma Function Based on Extreme Value Theory

  • Wu, Shanshan;Hu, Guobing;Yang, Li;Gu, Bin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.12
    • /
    • pp.4567-4583
    • /
    • 2021
  • This study proposes an analytical approximation algorithm based on extreme value theory (EVT) for the inverse of the power of the incomplete Gamma function. First, the Gumbel function is used to approximate the power of the incomplete Gamma function, and the corresponding inverse problem is transformed into the inversion of an exponential function. Then, using the tail equivalence theorem, the normalized coefficient of the general Weibull distribution function is employed to replace the normalized coefficient of the random variable following a Gamma distribution, and the approximate closed form solution is obtained. The effects of equation parameters on the algorithm performance are evaluated through simulation analysis under various conditions, and the performance of this algorithm is compared to those of the Newton iterative algorithm and other existing approximate analytical algorithms. The proposed algorithm exhibits good approximation performance under appropriate parameter settings. Finally, the performance of this method is evaluated by calculating the thresholds of space-time block coding and space-frequency block coding pattern recognition in multiple-input and multiple-output orthogonal frequency division multiplexing. The analytical approximation method can be applied to other related situations involving the maximum statistics of independent and identically distributed random variables following Gamma distributions.

Feature Parameter Extraction and Analysis in the Wavelet Domain for Discrimination of Music and Speech (음악과 음성 판별을 위한 웨이브렛 영역에서의 특징 파라미터)

  • Kim, Jung-Min;Bae, Keun-Sung
    • MALSORI
    • /
    • no.61
    • /
    • pp.63-74
    • /
    • 2007
  • Discrimination of music and speech from the multimedia signal is an important task in audio coding and broadcast monitoring systems. This paper deals with the problem of feature parameter extraction for discrimination of music and speech. The wavelet transform is a multi-resolution analysis method that is useful for analysis of temporal and spectral properties of non-stationary signals such as speech and audio signals. We propose new feature parameters extracted from the wavelet transformed signal for discrimination of music and speech. First, wavelet coefficients are obtained on the frame-by-frame basis. The analysis frame size is set to 20 ms. A parameter $E_{sum}$ is then defined by adding the difference of magnitude between adjacent wavelet coefficients in each scale. The maximum and minimum values of $E_{sum}$ for period of 2 seconds, which corresponds to the discrimination duration, are used as feature parameters for discrimination of music and speech. To evaluate the performance of the proposed feature parameters for music and speech discrimination, the accuracy of music and speech discrimination is measured for various types of music and speech signals. In the experiment every 2-second data is discriminated as music or speech, and about 93% of music and speech segments have been successfully detected.

  • PDF

Parametric identification of the Bouc-Wen model by a modified genetic algorithm: Application to evaluation of metallic dampers

  • Shu, Ganping;Li, Zongjing
    • Earthquakes and Structures
    • /
    • v.13 no.4
    • /
    • pp.397-407
    • /
    • 2017
  • With the growing demand for metallic dampers in engineering practice, it is urgent to establish a reasonable approach to evaluating the mechanical performance of metallic dampers under seismic excitations. This paper introduces an effective method for parameter identification of the modified Bouc-Wen model and its application to evaluating the fatigue performance of metallic dampers (MDs). The modified Bouc-Wen model which eliminates the redundant parameter is used to describe the hysteresis behavior of MDs. Relations between the parameters of the modified Bouc-Wen model and the mechanical performance parameters of MDs are studied first. A modified Genetic Algorithm using real-integer hybrid coding with relative fitness as well as adaptive crossover and mutation rates (called RFAGA) is then proposed to identify the parameters of the modified Bouc-Wen model. A reliable approach to evaluating the fatigue performance of the MDs with respect to the Chinese Code for Seismic Design of Buildings (GB 50011-2010) is finally proposed based on the research results. Experimental data are employed to demonstrate the process and verify the effectiveness of the proposed approach. It is shown that the RFAGA is able to converge quickly in the identification process, and the simulation curves based on the identification results fit well with the experimental hysteresis curves. Furthermore, the proposed approach is shown to be a useful tool for evaluating the fatigue performance of MDs with respect to the Chinese Code for Seismic Design of Buildings (GB 50011-2010).

Efficient Blind Estimation of Block Interleaver Parameters (효율적인 블록 인터리버 파라미터 블라인드 추정 기법)

  • Jeong, Jin-Woo;Choi, Sung-Hwan;Yoon, Dong-Weon;Park, Cheol-Sun;Yoon, Sang-Bom
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.37 no.5C
    • /
    • pp.384-392
    • /
    • 2012
  • Recently, much research on blind estimation of the interleaver parameters has been performed by using Gauss-Jordan elimination to find the linearity of the block channel code. When using Gauss-Jordan elimination, the input data to be calculated needs to run as long as the square multiple of the number of the interleaver period. Thus, it has a limit in estimating the interleaver parameters with insufficient input data. In this paper, we introduce and analyze an estimation algorithm which can estimate interleaver parameters by using only 15 percent of the input data length required in the above algorithm. The shorter length of input data to be calculated makes it possible to estimate the interleaver parameters even when limited data is received. In addition, a 80 percent reduction in the number of the interleaver period candidates increases the efficiency of analysis. It is also feasible to estimate both the type and size of the interleaver and the type of channel coding.

Design of video encoder using Multi-dimensional DCT (다차원 DCT를 이용한 비디오 부호화기 설계)

  • Jeon, S.Y.;Choi, W.J.;Oh, S.J.;Jeong, S.Y.;Choi, J.S.;Moon, K.A.;Hong, J.W.;Ahn, C.B.
    • Journal of Broadcast Engineering
    • /
    • v.13 no.5
    • /
    • pp.732-743
    • /
    • 2008
  • In H.264/AVC, 4$\times$4 block transform is used for intra and inter prediction instead of 8$\times$8 block transform. Using small block size coding, H.264/AVC obtains high temporal prediction efficiency, however, it has limitation in utilizing spatial redundancy. Motivated on these points, we propose a multi-dimensional transform which achieves both the accuracy of temporal prediction as well as effective use of spatial redundancy. From preliminary experiments, the proposed multi-dimensional transform achieves higher energy compaction than 2-D DCT used in H.264. We designed an integer-based transform and quantization coder for multi-dimensional coder. Moreover, several additional methods for multi-dimensional coder are proposed, which are cube forming, scan order, mode decision and updating parameters. The Context-based Adaptive Variable-Length Coding (CAVLC) used in H.264 was employed for the entropy coder. Simulation results show that the performance of the multi-dimensional codec appears similar to that of H.264 in lower bit rates although the rate-distortion curves of the multi-dimensional DCT measured by entropy and the number of non-zero coefficients show remarkably higher performance than those of H.264/AVC. This implies that more efficient entropy coder optimized to the statistics of multi-dimensional DCT coefficients and rate-distortion operation are needed to take full advantage of the multi-dimensional DCT. There remains many issues and future works about multi-dimensional coder to improve coding efficiency over H.264/AVC.

Biomechanical Evaluation of Cement type hip Implants as Conditions of bone Cement and Variations of Stem Design (골시멘트 특성 및 스템 형상에 따른 시멘트 타입 인공관절의 생체역학적 평가)

  • Park, H.S.;Chun, H.J.;Youn, I.C.;Lee, M.K.;Choi, K.W.
    • Journal of Biomedical Engineering Research
    • /
    • v.29 no.3
    • /
    • pp.212-221
    • /
    • 2008
  • The total hip replacement (THR) has been used as the most effective way to restore the function of damaged hip joint. However, various factors have caused some side effects after the THR. Unfortunately, the success of the THR have been decided only by the proficiency of surgeons so far. Hence, It is necessary to find the way to minimize the side effect caused by those factors. The purpose of this study was to suggest the definite data, which can be used to design and choose the optimal hip implant. Using finite element analysis (FEA), the biomechanical condition of bone cement was evaluated. Stress patterns were analyzed in three conditions: cement mantle, procimal femur and stem-cement contact surface. Additionally, micro-motion was analyzed in the stem-cement contact surface. The 3-D femur model was reconstructed from 2-D computerized tomography (CT) images. Raw CT images were preprocessed by image processing technique (i.e. edge detection). In this study, automated edge detection system was created by MATLAB coding for effective and rapid image processing. The 3-D femur model was reconstructed based on anatomical parameters. The stem shape was designed using that parameters. The analysis of the finite element models was performed with the variation of parameters. The biomechanical influence of each parameter was analyzed and derived optimal parameters. Moreover, the results of FE A using commercial stem model (Zimmer's V erSys) were similar to the results of stem model that was used in this study. Through the study, the improved designs and optimal factors for clinical application were suggested. We expect that the results can suggest solutions to minimize various side effects.