• Title/Summary/Keyword: Residual Coding

Search Result 124, Processing Time 0.028 seconds

A Study on Pitch Extraction Method using FIR-STREAK Digital Filter (FIR-STREAK 디지털 필터를 사용한 피치추출 방법에 관한 연구)

  • Lee, Si-U
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.1
    • /
    • pp.247-252
    • /
    • 1999
  • In order to realize a speech coding at low bit rates, a pitch information is useful parameter. In case of extracting an average pitch information form continuous speech, the several pitch errors appear in a frame which consonant and vowel are coexistent; in the boundary between adjoining frames and beginning or ending of a sentence. In this paper, I propose an Individual Pitch (IP) extraction method using residual signals of the FIR-STREAK digital filter in order to restrict the pitch extraction errors. This method is based on not averaging pitch intervals in order to accomodate the changes in each pitch interval. As a result, in case of Ip extraction method suing FIR-STREAK digital filter, I can't find the pitch errors in a frame which consonant and vowel are consistent; in the boundary between adjoining frames and beginning or ending of a sentence. This method has the capability of being applied to many fields, such as speech coding, speech analysis, speech synthesis and speech recognition.

  • PDF

Volumetric Image System for High Efficiency Video Coding (고효율 비디오코딩을 위한 입체영상시스템)

  • Kim, Sang Hyun
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.1
    • /
    • pp.515-520
    • /
    • 2016
  • Volumetric image system has many applications recently in education, 3D movie, medical images but these applications have several problems that need to be overcome. Volumetric display may process a amount of visual data and design the high efficient vision system for realtime display. In case of stereo system for volumetric display motion vectors, disparity vectors from the stereoscopic sequences and residual images with the reference images has been transmitted, and the stereoscopic sequences have been reconstructed at the receiver for volumetric display. So central issue for the design of efficient volumetric image system lies in selecting an appropriate stereo matching and robust vision system. In this paper, we proposed high efficient vision system, which design vision stage with rotating and moving horizontally, and match the successive stereo image efficiently. In experimental results with volumetric image system, the proposed method represents high efficiency with minimizing error and low computational load for volumetric display.

Distributed Video Coding based on Adaptive Block Quantization Using Received Motion Vectors (수신된 움직임 벡터를 이용한 적응적 블록 양자화 기반 분산 비디오 코딩 방법)

  • Min, Kyung-Yeon;Park, Sea-Nae;Nam, Jung-Hak;Sim, Dong-Gyu;Kim, Sang-Hyo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.2C
    • /
    • pp.172-181
    • /
    • 2010
  • In this paper, we propose an adaptive block quantization method. The propose method perfrect reconstructs side information without high complexity in the encoder side, as transmitting motion vectors from a decoder to an encoder side. Also, at the encoder side, residual signals between reconstructed side information and original frame are adaptively quantized to minimize parity bits to be transmitted to the decoder. The proposed method can effectively allocate bits based on bit error rate of side information. Also, we can achieved bit-saving by transmission of parity bits based on the error correction ability of the LDPC channel decoder, because we can know bit error rate and positions of error bit in encoder side. Experimental results show that the proposed algorithm achieves bit-saving by around 66% and delay of feedback channel, compared with the convntional algorithm.

VLSI Design of H.264/AVC CAVLC encoder for HDTV Application (실시간 HD급 영상 처리를 위한 H.264/AVC CAVLC 부호화기의 하드웨어 구조 설계)

  • Woo, Jang-Uk;Lee, Won-Jae;Kim, Jae-Seok
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.44 no.7 s.361
    • /
    • pp.45-53
    • /
    • 2007
  • In this paper, we propose an efficient hardware architecture for H.264/AVC CAVLC (Context-based Adaptive Variable Length Coding) encoding. Previous CAVLC architectures search all of the coefficients to find statistic characteristics in a block. However, it is unnecessary information that zero coefficients following the last position of a non-zero coefficient when CAVLC encodes residual coefficients. In order to reduce this unnecessary operation, we propose two techniques, which detect the first and last position of non-zero coefficients and arrange non-zero coefficients sequentially. By adopting these two techniques, the required processing time was reduced about 23% compared with previous architecture. It was designed in a hardware description language and total logic gate count is 16.3k using 0.18um standard cell library Simulation results show that our design is capable of real-time processing for $1920{\times}1088\;30fps$ videos at 81MHz.

Quality Improvement of Karaoke Mode in SAOC using Cross Prediction based Vocal Estimation Method (교차 예측 기반의 보컬 추정 방법을 이용한 SAOC Karaoke 모드에서의 음질 향상 기법에 대한 연구)

  • Lee, Tung Chin;Park, Young-Cheol;Youn, Dae Hee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.3
    • /
    • pp.227-236
    • /
    • 2013
  • In this paper, we present a vocal suppression algorithm that can enhance the quality of music signal coded using Spatial Audio Object Coding (SAOC) in Karaoke mode. The residual vocal component in the coded music signal is estimated by using a cross prediction method in which the music signal coded in Karaoke mode is used as the primary input and the vocal signal coded in Solo mode is used as a reference. However, the signals are extracted from the same downmix signal and highly correlated, so that the music signal can be severely damaged by the cross prediction. To prevent this, a psycho-acoustic disturbance rule is proposed, in which the level of disturbance to the reference input of the cross prediction filter is adapted according to the auditory masking property. Objective and subjective test were performed and the results confirm that the proposed algorithm offers improved quality.

Construction of an Industrial Brewing Yeast Strain to Manufacture Beer with Low Caloric Content and Improved Flavor

  • Wang, Jin-Jing;Wang, Zhao-Yue;Liu, Xi-Feng;Guo, Xue-Na;He, Xiu-Ping;Wense, Pierre Christian;Zhang, Bo-Run
    • Journal of Microbiology and Biotechnology
    • /
    • v.20 no.4
    • /
    • pp.767-774
    • /
    • 2010
  • In this study, the problems of high caloric content, increased maturation time, and off-flavors in commercial beer manufacture arising from residual sugar, diacetyl, and acetaldehyde levels were addressed. A recombinant industrial brewing yeast strain (TQ1) was generated from T1 [Lipomyces starkeyi dextranase gene (LSD1) introduced, ${\alpha}$-acetohydroxyacid synthase gene (ILV2) disrupted] by introducing Saccharomyces cerevisiae glucoamylase (SGA1) and a strong promoter (PGK1), while disrupting the gene coding alcohol dehydrogenase (ADH2). The highest glucoamylase activity for TQ1 was 93.26 U/ml compared with host strain T1 (12.36 U/ml) and wild-type industrial yeast strain YSF5 (10.39 U/ml), respectively. European Brewery Convention (EBC) tube fermentation tests comparing the fermentation broths of TQ1 with T1 and YSF5 showed that the real extracts were reduced by 15.79% and 22.47%; the main residual maltotriose concentrations were reduced by 13.75% and 18.82%; the caloric contents were reduced by 27.18 and 35.39 calories per 12 oz. Owing to the disruption of the ADH2 gene in TQ1, the off-flavor acetaldehyde concentrations in the fermentation broth were 9.43% and 13.28%, respectively, lower than that of T1 and YSF5. No heterologous DNA sequences or drug resistance genes were introduced into TQ1. Hence, the gene manipulations in this work properly solved the addressed problems in commercial beer manufacture.

Design and Implementation of Simple Text-to-Speech System using Phoneme Units (음소단위를 이용한 소규모 문자-음성 변환 시스템의 설계 및 구현)

  • Park, Ae-Hee;Yang, Jin-Woo;Kim, Soon-Hyob
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.3
    • /
    • pp.49-60
    • /
    • 1995
  • This paper is a study on the design and implementation of the Korean Text-to-Speech system which is used for a small and simple system. In this paper, a parameter synthesis method is chosen for speech syntheiss method, we use PARCOR(PARtial autoCORrelation) coefficient which is one of the LPC analysis. And we use phoneme for synthesis unit which is the basic unit for speech synthesis. We use PARCOR, pitch, amplitude as synthesis parameter of voice, we use residual signal, PARCOR coefficients as synthesis parameter of unvoice. In this paper, we could obtain the 60% intelligibility by using the residual signal as excitation signal of unvoiced sound. The result of synthesis experiment, synthesis of a word unit is available. The controlling of phoneme duration is necessary for synthesizing of a sentence unit. For setting up the synthesis system, PC 486, a 70[Hz]-4.5[KHz] band pass filter for speech input/output, amplifier, and TMS320C30 DSP board was used.

  • PDF

A Hybrid Optimized Deep Learning Techniques for Analyzing Mammograms

  • Bandaru, Satish Babu;Deivarajan, Natarajasivan;Gatram, Rama Mohan Babu
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.10
    • /
    • pp.73-82
    • /
    • 2022
  • Early detection continues to be the mainstay of breast cancer control as well as the improvement of its treatment. Even so, the absence of cancer symptoms at the onset has early detection quite challenging. Therefore, various researchers continue to focus on cancer as a topic of health to try and make improvements from the perspectives of diagnosis, prevention, and treatment. This research's chief goal is development of a system with deep learning for classification of the breast cancer as non-malignant and malignant using mammogram images. The following two distinct approaches: the first one with the utilization of patches of the Region of Interest (ROI), and the second one with the utilization of the overall images is used. The proposed system is composed of the following two distinct stages: the pre-processing stage and the Convolution Neural Network (CNN) building stage. Of late, the use of meta-heuristic optimization algorithms has accomplished a lot of progress in resolving these problems. Teaching-Learning Based Optimization algorithm (TIBO) meta-heuristic was originally employed for resolving problems of continuous optimization. This work has offered the proposals of novel methods for training the Residual Network (ResNet) as well as the CNN based on the TLBO and the Genetic Algorithm (GA). The classification of breast cancer can be enhanced with direct application of the hybrid TLBO- GA. For this hybrid algorithm, the TLBO, i.e., a core component, will combine the following three distinct operators of the GA: coding, crossover, and mutation. In the TLBO, there is a representation of the optimization solutions as students. On the other hand, the hybrid TLBO-GA will have further division of the students as follows: the top students, the ordinary students, and the poor students. The experiments demonstrated that the proposed hybrid TLBO-GA is more effective than TLBO and GA.

Efficient Coding of Motion Vector Predictor using Phased-in Code (Phased-in 코드를 이용한 움직임 벡터 예측기의 효율적인 부호화 방법)

  • Moon, Ji-Hee;Choi, Jung-Ah;Ho, Yo-Sung
    • Journal of Broadcast Engineering
    • /
    • v.15 no.3
    • /
    • pp.426-433
    • /
    • 2010
  • The H.264/AVC video coding standard performs inter prediction using variable block sizes to improve coding efficiency. Since we predict not only the motion of homogeneous regions but also the motion of non-homogeneous regions accurately using variable block sizes, we can reduce residual information effectively. However, each motion vector should be transmitted to the decoder. In low bit rate environments, motion vector information takes approximately 40% of the total bitstream. Thus, motion vector competition was proposed to reduce the amount of motion vector information. Since the size of the motion vector difference is reduced by motion vector competition, it requires only a small number of bits for motion vector information. However, we need to send the corresponding index of the best motion vector predictor for decoding. In this paper, we propose a new codeword table based on the phased-in code to encode the index of motion vector predictor efficiently. Experimental results show that the proposed algorithm reduces the average bit rate by 7.24% for similar PSNR values, and it improves the average image quality by 0.36dB at similar bit rates.

Weighted Prediction Using Residual Information for Multi-view Video Coding (레지듀얼 정보를 이용한 다시점 동영상 부호화의 가중치 예측)

  • Kim, Ji-Young;Kim, Young-Tae;Seo, Jung-Dong;Sohn, Kwang-Hoon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2007.02a
    • /
    • pp.9-12
    • /
    • 2007
  • 다시점 동영상 부호화기는 서로 다른 카메라에 의해 영상을 획득하므로 카메라 내부 파라미터의 차이나 조명의 차이 및 변화 등에 의한 시점 간 명도 성분의 불균형을 가지고 있다. 이로 인해 잘못된 변이 추정이 이루어질 수 있으며, 따라서 전체적인 다시점 동영상 부호화의 성능을 크게 저하시킬 수 있다. 본 논문에서는 레지듀얼이 가지고 있는 밝기 차 정보를 이용하여 시점 간의 불균형을 해소하는 가중치 예측 알고리듬을 제안한다. 주변의 인과적인 블록의 레지듀얼 정보를 이용하여 현재 블록과 참조 블록의 밝기 차를 예측하고, 이 값을 이용해 시점 간 불균형을 보정 한 후 변이 추정을 수행한다. 변이 보상 후 계산된 현재 블록의 레지듀얼 평균값을 앞에서 예측된 밝기 차의 값에 누적하여 다음 블록의 밝기 차 예측에 사용한다. 제안된 방법을 실험 영상에 적용한 결과 평균적으로 약 0.2dB의 이득을 얻었다.

  • PDF