• Title/Summary/Keyword: coding parameters

Search Result 275, Processing Time 0.029 seconds

On Altering the Pitch of Speech Signals in Waveform Coding -(Altering Method by the LPC and the Pitch Halving)- (음성 파형코딩의 음원피치 변경에 관한 연구 - LPC와 주기반분법에 의한 피치변경법 -)

  • 민경중
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1991.06a
    • /
    • pp.45-49
    • /
    • 1991
  • In area of the speech synthesis, the waveform coding with high quality are mainly used to the synthesis by analysis. However, it is difficult to applying the waveform coding to the synthesis by rule, because the parameters of this coding are not classified as either excitation parameters and vocal tract parameters. In this paper, we proposed a new pitch change method that can alter the pitch periods in the waveform coding. The proposed method expands the pitch period by the LPC synthesis method, and then the period is compressed by the waveform halving technique. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing.

  • PDF

Estimation of AVC Coding Efficiency (AVC 부호화 효율의 추정)

  • Dung, Luong Ngoc Thuy;Sohn, Won
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2011.11a
    • /
    • pp.310-313
    • /
    • 2011
  • This study investigates some schemes to estimate the coding efficiency of a video sequence. The texture complexity and motion are considered as two major parameters to decide the coding efficiency, and the methods to estimate the parameters are discussed. For a fixed values of PSNR, the bit rate of a video sequence is estimated using some schemes based on the estimated parameters, and compared with the bit rate by MPEG-4 AVC.

  • PDF

Performance Evaluation of Different Factors According to ROI Coding Methods in JPEG2000

  • Kim, Ho-Yong;Shim, Jong-Chae;Seo, Yeong-Geon
    • Journal of Digital Contents Society
    • /
    • v.7 no.3
    • /
    • pp.183-191
    • /
    • 2006
  • Currently, the preferred processing of a user-centered ROI(Region-of-Interest) or a specific region of image to transmission and decompression of a full image is needed in different applications, specifically mobile applications. Here, we have to study how different factors affect ROI coding methods. Therefore, an application can select an ROI coding method and several parameters suitable for the environments. The ROI coding methods used in the study are Maxshift and Implicit and the parameters are tile size, image size, code block size, ROI importance and the number of lowest resolution levels. This study shows the experimental results between the different parameters and the two ROI coding methods.

  • PDF

Enhanced RGB Video Coding Based on Correlation in the Adjacent Block (인접블록의 상관관계에 기반한 RGB video coding 개선 알고리즘)

  • Kim, Yang-Soo;Jeong, Jin-Woo;Choe, Yoon-Sik
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.58 no.12
    • /
    • pp.2538-2541
    • /
    • 2009
  • H.264/AVC High 4:4:4 Intra/Predictive profiles supports RGB 4:4:4 sequences for high fidelity video. RGB color planes rather than YCbCr color planes are preferred by high-fidelity video applications such as digital cinema, medical imaging, and UHDTV. Several RGB coding tools have therefore been developed to improve the coding efficiency of RGB video. In this paper, we propose a new method to extract more accurate correlation parameters for inter-plane prediction. We use a searching method to determine the matched macroblock (MB) that has a similar inter-color relation to the current MB. Using this block, we can infer more accurate correlation parameters to predict chroma MB from luma MB. Our proposed inter-plane prediction mode shows an average bits saving of 15.6% and a PSNR increase of 0.99 dB compared with H.264 high4:4:4 intra-profile RGB coding. Furthermore, extensive performance evaluation revealed that our proposed algorithm has better coding efficiency than existing algorithms..

Error Resilience Method of MPEG-2 Header Parameters by using LSB Coding for Robust DTV Video Transmission (견실한 DTV 영상 전송을 위해 LSB 부호화를 이용한 MPEG-2 헤러 정보의 오류 복원 방법)

  • Lim Tae-gyun;Lee Sang-hak
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.9 no.5
    • /
    • pp.1019-1024
    • /
    • 2005
  • MPEG-2 achieves high compression radio, by exploiting the temporal and spatial correlations in real image sequence, using the motion compensated prediction and the transform coding, respectively. However, as the image sequence is more highly compressed, the encoded bitstream becomes more vulnerable to transmission error over the noisy channels. Furthermore, er개rs in the headers are fatal to decoding processes, because the header parameters in the video coding standard include a lot of important information connected to the syntax elements, fables, and decoding process. In this paper, we propose a new error resilience method using LSB coding for header parameters in MPEG-2 coded video transmissions. The experimental results for football and susie video sequence demonstrate that the proposed error resilience method for header parameters in MPEG-2 bitstream has good performance.

On a Pitch Change of the Waveform Coding by the Cepstrum Analysis of Speech Waveforms (켑스트럼 분석에 의한 파형부호화의 피치변경에 관한 연구)

  • Bae, Myung-Jin;Lee, Mi-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.11 no.4
    • /
    • pp.14-21
    • /
    • 1992
  • The waveform coding is concerned with simply preserving the wave shape of speech signal through a redundancy reduction process. In area of the speech synthesis, the waveform codings with high quality are mainly used to the synthesis by analysis. However, because the parameters of this coding are not classified as either excitation parameters and vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In this paper, we proposed a new pitch alternation method that can change the pitch periods in the waveform coding by using the cepstrum analysis. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing.

  • PDF

Largest Coding Unit Level Rate Control Algorithm for Hierarchical Video Coding in HEVC

  • Yoon, Yeo-Jin;Kim, Hoon;Baek, Seung-Jin;Ko, Sung-Jea
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.1 no.3
    • /
    • pp.171-181
    • /
    • 2012
  • In the new video coding standard, called high efficiency video coding (HEVC), the coding unit (CU) is adopted as a basic unit of a coded block structure. Therefore, the rate control (RC) methods of H.264/AVC, whose basic unit is a macroblock, cannot be applied directly to HEVC. This paper proposes the largest CU (LCU) level RC method for hierarchical video coding in a HEVC. In the proposed method, the effective bit allocation is performed first based on the hierarchical structure, and the quantization parameters (QP) are then determined using the Cauchy density based rate-quantization (RQ) model. A novel method based on the linear rate model is introduced to estimate the parameters of the Cauchy density based RQ model precisely. The experimental results show that the proposed RC method not only controls the bitrate accurately, but also generates a constant number of bits per second with less degradation of the decoded picture quality than with the fixed QP coding and latest RC method for HEVC.

  • PDF

Analysis of the Weak Manual Assembly Process with Part Coding System (부품 코드체계를 이용한 수조립 애로공정의 파악)

  • Mok, Hak-Soo;Moon, Kwang-Sup;Park, Hong-Seok
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.18 no.4
    • /
    • pp.85-96
    • /
    • 2001
  • In this paper, part features are classified and then its coding system is constructed by the considered characteristics of features in assemble process. Analyzing the characteristics of features, code values about part features are determined. Assembly process is divided into five functions such as transporting, handing, approaching, alignment and joining, and then the detail parameters of each functions such as determined. Code values about assembly process are determined according to detail parameters. The detail parameters are kinds of available working method and assembly tools when each assembly function is going on. By the coding system, available assembly process can be grasped and perceived for the part that it is difficult to assemble.

  • PDF

On a Pitch Alteration Method Compensated with the Spectrum for High Quality Speech Synthesis (스펙트럼 보상된 고음질 합성용 피치 변경법)

  • 문효정
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1995.06a
    • /
    • pp.123-126
    • /
    • 1995
  • The waveform coding are concerned with simply preserving the wave shape of speech signal through a redundancy reduction process. In the case of speech synthesis, the wave form coding with high quality are mainly used to the synthesis by analysis. However, because the parameters of this coding are not classified as either excitation and vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In this paper, we proposed a new pitch alteration method that can change the pitch period in waveform coding by using scaling the time-axis and compensating the spectrum. This is a time-frequency domain method that is preserved in the phase components of the waveform and that has a little spectrum distortion with 2.5% and less for 50% pitch change.

  • PDF

Two Cases Using the Praat-Based Automatic Voice Analysis Program as an Alternative to CSL (사례 적용 Praat 기반 CSL 대체 자동화 음성분석 프로그램)

  • Kang, Young Ae;Chang, Jae Won;Koo, Bon Seok
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.32 no.2
    • /
    • pp.87-93
    • /
    • 2021
  • There are a number of voice analysis programs around the world. Domestic voice analysis is performed by relying heavily on specific commercial program. We intend to develop coding for voice analysis using Praat and apply it to clinical practice. This study consisted of Experiment 1 and Experiment 2. Experiment 1 was the development of automated voice analysis coding based on Praat. The coding was largely divided into a recording, an analysis, and a storage section. Experiment 2 was applied to the voice analysis of 2 male patients pre- and post-operation with this coding. The analysis parameters of this coding provided 26 parameters for vowel /a/, nine parameters for sentence analysis, and a total of 4 parameters for voice range profile analysis. In two male patients, the pitch and the intensity increased, the voice quality improved, and the sentence length decreased after surgery. The coding was well made, so the output was good in real time. The code is automated as much as possible to block manual errors and increases convenience and efficiency by generating the result sheet in real time.