• 제목/요약/키워드: coding parameters

검색결과 275건 처리시간 0.022초

음성 파형코딩의 음원피치 변경에 관한 연구 - LPC와 주기반분법에 의한 피치변경법 - (On Altering the Pitch of Speech Signals in Waveform Coding -(Altering Method by the LPC and the Pitch Halving)-)

  • 민경중
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1991년도 학술발표회 논문집
    • /
    • pp.45-49
    • /
    • 1991
  • In area of the speech synthesis, the waveform coding with high quality are mainly used to the synthesis by analysis. However, it is difficult to applying the waveform coding to the synthesis by rule, because the parameters of this coding are not classified as either excitation parameters and vocal tract parameters. In this paper, we proposed a new pitch change method that can alter the pitch periods in the waveform coding. The proposed method expands the pitch period by the LPC synthesis method, and then the period is compressed by the waveform halving technique. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing.

  • PDF

AVC 부호화 효율의 추정 (Estimation of AVC Coding Efficiency)

  • 융 응옥 튜이 둥;손원
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2011년도 추계학술대회
    • /
    • pp.310-313
    • /
    • 2011
  • This study investigates some schemes to estimate the coding efficiency of a video sequence. The texture complexity and motion are considered as two major parameters to decide the coding efficiency, and the methods to estimate the parameters are discussed. For a fixed values of PSNR, the bit rate of a video sequence is estimated using some schemes based on the estimated parameters, and compared with the bit rate by MPEG-4 AVC.

  • PDF

Performance Evaluation of Different Factors According to ROI Coding Methods in JPEG2000

  • 김호용;심종채;서영건
    • 디지털콘텐츠학회 논문지
    • /
    • 제7권3호
    • /
    • pp.183-191
    • /
    • 2006
  • Currently, the preferred processing of a user-centered ROI(Region-of-Interest) or a specific region of image to transmission and decompression of a full image is needed in different applications, specifically mobile applications. Here, we have to study how different factors affect ROI coding methods. Therefore, an application can select an ROI coding method and several parameters suitable for the environments. The ROI coding methods used in the study are Maxshift and Implicit and the parameters are tile size, image size, code block size, ROI importance and the number of lowest resolution levels. This study shows the experimental results between the different parameters and the two ROI coding methods.

  • PDF

인접블록의 상관관계에 기반한 RGB video coding 개선 알고리즘 (Enhanced RGB Video Coding Based on Correlation in the Adjacent Block)

  • 김양수;정진우;최윤식
    • 전기학회논문지
    • /
    • 제58권12호
    • /
    • pp.2538-2541
    • /
    • 2009
  • H.264/AVC High 4:4:4 Intra/Predictive profiles supports RGB 4:4:4 sequences for high fidelity video. RGB color planes rather than YCbCr color planes are preferred by high-fidelity video applications such as digital cinema, medical imaging, and UHDTV. Several RGB coding tools have therefore been developed to improve the coding efficiency of RGB video. In this paper, we propose a new method to extract more accurate correlation parameters for inter-plane prediction. We use a searching method to determine the matched macroblock (MB) that has a similar inter-color relation to the current MB. Using this block, we can infer more accurate correlation parameters to predict chroma MB from luma MB. Our proposed inter-plane prediction mode shows an average bits saving of 15.6% and a PSNR increase of 0.99 dB compared with H.264 high4:4:4 intra-profile RGB coding. Furthermore, extensive performance evaluation revealed that our proposed algorithm has better coding efficiency than existing algorithms..

견실한 DTV 영상 전송을 위해 LSB 부호화를 이용한 MPEG-2 헤러 정보의 오류 복원 방법 (Error Resilience Method of MPEG-2 Header Parameters by using LSB Coding for Robust DTV Video Transmission)

  • 임태균;이상학
    • 한국정보통신학회논문지
    • /
    • 제9권5호
    • /
    • pp.1019-1024
    • /
    • 2005
  • MPEG-2로 부호화 된 영상에서 발생하는 전송 오류는 화질의 열화를 가져오고, 시공간적으로 오류를 전파시킨다. 특히 비디오 비트열에서 헤더 정보의 오류는 복호화 과정 전체에 영향을 미치므로 데이터 정보의 오류와 달리 전체 영상에 심각한 화질의 열화를 일으킬 수 있다. 따라서 헤더 정보에서의 오류를 복원하는 것은 데이터 정보에서 오류를 복원하는 것보다 더 중요하다. 본 논문에서는 LSB(least significant bit) 부호화를 이용하여 헤더 정보를 양자화 된 DCT(discrete cosine transform) 계수에 반복적으로 삽입하여 전송함으로써 MPEG-2의 신택스 구조 그대로 유지하면서 헤더 정보의 오류를 복원할 수 있는 방법을 제안한다.

켑스트럼 분석에 의한 파형부호화의 피치변경에 관한 연구 (On a Pitch Change of the Waveform Coding by the Cepstrum Analysis of Speech Waveforms)

  • 배명진;이미숙
    • 한국음향학회지
    • /
    • 제11권4호
    • /
    • pp.14-21
    • /
    • 1992
  • 음성신호의 합성기법들 중에서 파형부호화법은 음질이 우수하기 때문에 분석에 의한 합성법으로 많이 사용되고 있다. 그렇지만 음원과 성도의 특성을 분리하지 않고 파형의 잉여분만을 제거한 후에 파형자체를 저장하기 때문에 규칙에 의한 합성기법으로 사용하기에는 어려움이 많다. 본 논문에서는 파형부호화법 중에서 선형 PCM부호화법으로 저장된 음성파형에 대해 피치주기를 조절할 수 있는 켑스트럼 분석법을 제안하여 파형자체의 음원을 분리하지 않고 피치주기를 변경시킬 수 있는 새로운 피치 변경법을 제안하였다. 따라서 음질이 우수한 파형부호화 합성법으로 규칙에 의한 합성을 수행할 수 있다.

  • PDF

Largest Coding Unit Level Rate Control Algorithm for Hierarchical Video Coding in HEVC

  • Yoon, Yeo-Jin;Kim, Hoon;Baek, Seung-Jin;Ko, Sung-Jea
    • IEIE Transactions on Smart Processing and Computing
    • /
    • 제1권3호
    • /
    • pp.171-181
    • /
    • 2012
  • In the new video coding standard, called high efficiency video coding (HEVC), the coding unit (CU) is adopted as a basic unit of a coded block structure. Therefore, the rate control (RC) methods of H.264/AVC, whose basic unit is a macroblock, cannot be applied directly to HEVC. This paper proposes the largest CU (LCU) level RC method for hierarchical video coding in a HEVC. In the proposed method, the effective bit allocation is performed first based on the hierarchical structure, and the quantization parameters (QP) are then determined using the Cauchy density based rate-quantization (RQ) model. A novel method based on the linear rate model is introduced to estimate the parameters of the Cauchy density based RQ model precisely. The experimental results show that the proposed RC method not only controls the bitrate accurately, but also generates a constant number of bits per second with less degradation of the decoded picture quality than with the fixed QP coding and latest RC method for HEVC.

  • PDF

부품 코드체계를 이용한 수조립 애로공정의 파악 (Analysis of the Weak Manual Assembly Process with Part Coding System)

  • 목학수;문광섭;박홍석
    • 한국정밀공학회지
    • /
    • 제18권4호
    • /
    • pp.85-96
    • /
    • 2001
  • In this paper, part features are classified and then its coding system is constructed by the considered characteristics of features in assemble process. Analyzing the characteristics of features, code values about part features are determined. Assembly process is divided into five functions such as transporting, handing, approaching, alignment and joining, and then the detail parameters of each functions such as determined. Code values about assembly process are determined according to detail parameters. The detail parameters are kinds of available working method and assembly tools when each assembly function is going on. By the coding system, available assembly process can be grasped and perceived for the part that it is difficult to assemble.

  • PDF

스펙트럼 보상된 고음질 합성용 피치 변경법 (On a Pitch Alteration Method Compensated with the Spectrum for High Quality Speech Synthesis)

  • 문효정
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1995년도 제12회 음성통신 및 신호처리 워크샵 논문집 (SCAS 12권 1호)
    • /
    • pp.123-126
    • /
    • 1995
  • The waveform coding are concerned with simply preserving the wave shape of speech signal through a redundancy reduction process. In the case of speech synthesis, the wave form coding with high quality are mainly used to the synthesis by analysis. However, because the parameters of this coding are not classified as either excitation and vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In this paper, we proposed a new pitch alteration method that can change the pitch period in waveform coding by using scaling the time-axis and compensating the spectrum. This is a time-frequency domain method that is preserved in the phase components of the waveform and that has a little spectrum distortion with 2.5% and less for 50% pitch change.

  • PDF

사례 적용 Praat 기반 CSL 대체 자동화 음성분석 프로그램 (Two Cases Using the Praat-Based Automatic Voice Analysis Program as an Alternative to CSL)

  • 강영애;장재원;구본석
    • 대한후두음성언어의학회지
    • /
    • 제32권2호
    • /
    • pp.87-93
    • /
    • 2021
  • There are a number of voice analysis programs around the world. Domestic voice analysis is performed by relying heavily on specific commercial program. We intend to develop coding for voice analysis using Praat and apply it to clinical practice. This study consisted of Experiment 1 and Experiment 2. Experiment 1 was the development of automated voice analysis coding based on Praat. The coding was largely divided into a recording, an analysis, and a storage section. Experiment 2 was applied to the voice analysis of 2 male patients pre- and post-operation with this coding. The analysis parameters of this coding provided 26 parameters for vowel /a/, nine parameters for sentence analysis, and a total of 4 parameters for voice range profile analysis. In two male patients, the pitch and the intensity increased, the voice quality improved, and the sentence length decreased after surgery. The coding was well made, so the output was good in real time. The code is automated as much as possible to block manual errors and increases convenience and efficiency by generating the result sheet in real time.