Search | Korea Science

2.4kbps Speech Coding Algorithm Using the Sinusoidal Model (정현파 모델을 이용한 2.4kbps 음성부호화 알고리즘)

백성기;배건성
- The Journal of Korean Institute of Communications and Information Sciences / v.27 no.3A / pp.196-204 / 2002
Sinusoidal Transform Coding (STC) is a vocoding scheme based on a sinusoidal model of the speech signal. Low bit-rate speech coding based on the sinusoidal model represents and synthesizes speech in the frequency domain using the fundamental frequency and its harmonics, the spectral envelope, and the phase. In this paper, we propose a 2.4 kbps low-rate speech coding algorithm that uses the sinusoidal model of the speech signal. In the proposed coder, the pitch frequency is estimated by choosing the frequency that minimizes the mean squared error between speech synthesized from all spectral peaks and speech synthesized from the candidate frequency and its harmonics. The spectral envelope is estimated using the SEEVOC (Spectral Envelope Estimation VOCoder) algorithm and the discrete all-pole model. The phase information is obtained from the time at which the pitch pulse occurs, i.e., the onset time, together with the phase of the vocal tract system. Experimental results show that the synthetic speech preserves both the formant and phase information of the original speech very well. The coder was evaluated with a MOS test based on informal listening tests and achieved a MOS above 3.1.
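The pitch criterion described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the zero-phase synthesis, and the piecewise-linear envelope used to assign harmonic amplitudes are all assumptions made for illustration (the paper itself uses SEEVOC and a discrete all-pole model for the envelope).

```python
import math

def synthesize(components, n_samples, fs):
    """Sum of zero-phase cosines; components is a list of (freq_hz, amplitude)."""
    return [sum(a * math.cos(2 * math.pi * f * n / fs) for f, a in components)
            for n in range(n_samples)]

def envelope(peaks, f):
    """Hypothetical envelope: linear interpolation between sorted spectral peaks."""
    if f <= peaks[0][0]:
        return peaks[0][1]
    if f >= peaks[-1][0]:
        return peaks[-1][1]
    for (f0, a0), (f1, a1) in zip(peaks, peaks[1:]):
        if f0 <= f <= f1:
            return a0 + (a1 - a0) * (f - f0) / (f1 - f0)

def estimate_pitch(peaks, candidates, n_samples=256, fs=8000.0):
    """Return (f0, mse) for the candidate whose harmonic synthesis best matches
    the signal synthesized from all measured peaks."""
    peaks = sorted(peaks)
    reference = synthesize(peaks, n_samples, fs)
    f_max = peaks[-1][0]
    best_f0, best_err = None, float("inf")
    for f0 in candidates:
        n_harm = int(f_max / f0 + 1e-9)
        harmonics = [(m * f0, envelope(peaks, m * f0)) for m in range(1, n_harm + 1)]
        model = synthesize(harmonics, n_samples, fs)
        err = sum((r - s) ** 2 for r, s in zip(reference, model)) / n_samples
        if err < best_err:
            best_f0, best_err = f0, err
    return best_f0, best_err
```

Sampling amplitudes from an envelope, rather than copying the nearest peak, is what penalizes subharmonic candidates in this sketch: a candidate at half the true pitch adds harmonics between the measured peaks and so raises the error.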

H.264/AVC Video coding rate Control Algorithm Using linear statistical characteristic for Intra frame (선형적 통계특성을 이용한 H.264/AVC 인트라 프레임의 비트율 제어 알고리즘)

Joo, Won-Hee;Kim, Myoung-Jin;Hong, Min-Cheol
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2009.11a / pp.255-258 / 2009
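When video is transmitted over a constrained channel, bit allocation that achieves the best picture quality within a limited bit budget plays an important role in encoding and is an important research topic. The rate control scheme of the H.264/AVC standard allocates bits according to picture complexity, but it fails to predict accurately the QP of the intra frame, which is the first frame. Because the number of bits allocated to an intra frame strongly affects the quality of the frames that follow it, predicting the intra frame's complexity and choosing an appropriate QP is very important. This paper proposes an adaptive rate control scheme for intra mode aimed at real-time H.264/AVC. Using the linear relationship between intra frames and inter frames observed in statistical experiments, we derive an equation relating the two. Based on this equation, we propose a rate control algorithm that accurately predicts the QP of an intra frame that follows inter frames.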
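The idea of exploiting a linear intra/inter relationship can be sketched as an ordinary least-squares line fit. The abstract does not state which quantities are related, so the choice below (average QP of the preceding inter frames as input, intra-frame QP as output) and both function names are assumptions for illustration only. The clamp to 0..51 reflects the QP range of H.264/AVC.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def predict_intra_qp(avg_inter_qp, slope, intercept):
    """Hypothetical predictor: map the average inter-frame QP to an intra QP,
    clamped to the valid H.264/AVC range 0..51."""
    qp = round(slope * avg_inter_qp + intercept)
    return max(0, min(51, qp))
```

In practice the fitted coefficients would come from training statistics gathered offline, and the predictor would be evaluated once per intra frame.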

A Fast Inter Prediction Encoding Technique for Real-time Compression of H.264/AVC (H.264/AVC의 실시간 압축을 위한 고속 인터 예측 부호화 기술)

Kim, Young-Hyun;Choi, Hyun-Jun;Seo, Young-Ho;Kim, Dong-Wook
- The Journal of Korean Institute of Communications and Information Sciences / v.31 no.11C / pp.1077-1084 / 2006
This paper proposes a fast algorithm that reduces the amount of computation for inter prediction, which accounts for a large share of the encoding time in H.264/AVC. The algorithm decides a search range according to the direction of the predicted motion vector and then performs an adaptive spiral search over the candidates using the JM (Joint Model) FME (Fast Motion Estimation), which employs rate-distortion optimization (RDO). At the same time, it decides a threshold cost for each of the variable block sizes and performs motion estimation over the variable search ranges using that threshold. These steps greatly reduce the complexity of inter prediction encoding. Experiments applying the proposed method to various video sequences showed that the processing time decreased by up to 80% compared with previous prediction methods. The video quality degraded by only 0.05 dB to 0.19 dB, and the compression ratio decreased by only 0.58% on average. We therefore conclude that the proposed method is efficient for fast inter prediction.
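A threshold-terminated search around a predicted motion vector can be sketched as follows. This is a simplified illustration, not the JM FME code or the paper's algorithm: it uses plain SAD instead of an RDO cost, visits candidates ring by ring as an approximation of a spiral, and all function names and parameters are hypothetical.

```python
def sad(cur, ref, bx, by, dx, dy, size):
    """Sum of absolute differences between a block in cur and a displaced block in ref."""
    return sum(abs(cur[by + y][bx + x] - ref[by + y + dy][bx + x + dx])
               for y in range(size) for x in range(size))

def spiral_offsets(search_range):
    """Yield offsets in rings of increasing distance from the centre."""
    yield (0, 0)
    for r in range(1, search_range + 1):
        for dy in range(-r, r + 1):
            for dx in range(-r, r + 1):
                if max(abs(dx), abs(dy)) == r:
                    yield (dx, dy)

def spiral_search(cur, ref, bx, by, size, pred_mv, search_range, threshold):
    """Search around pred_mv; stop as soon as the best cost is <= threshold.
    Returns (best_mv, best_cost, candidates_checked)."""
    h, w = len(ref), len(ref[0])
    best_mv, best_cost, checked = None, float("inf"), 0
    for ox, oy in spiral_offsets(search_range):
        dx, dy = pred_mv[0] + ox, pred_mv[1] + oy
        # Skip candidates whose reference block falls outside the frame.
        if bx + dx < 0 or by + dy < 0 or bx + dx + size > w or by + dy + size > h:
            continue
        cost = sad(cur, ref, bx, by, dx, dy, size)
        checked += 1
        if cost < best_cost:
            best_mv, best_cost = (dx, dy), cost
        if best_cost <= threshold:
            break
    return best_mv, best_cost, checked
```

The closer the predicted vector is to the true motion, the earlier the threshold test fires, which is where the savings in such a scheme come from.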

Transmission Rate Decision of Live Video Based on Coding Information (부호화 정보에 기반한 라이브 비디오의 전송률 결정)

Lee Myeong-jin
- Journal of Korea Multimedia Society / v.8 no.9 / pp.1216-1226 / 2005
In this paper, a preventive transmission rate decision algorithm, called PTRD, is proposed for the transmission of live video over networks with dynamic bandwidth allocation capability. A frame analyzer predicts the bit rates of future frames before encoding by analyzing source information such as spatial variances and the degree of scene change. Using the predicted bit rates, transmission rate bounds are derived from the constraints of the encoder and decoder buffers. To keep the renegotiation cost from growing through frequent renegotiations, the PTRD algorithm decides transmission rates by considering the time elapsed since the most recent renegotiation and the perceived video quality. Simulation results show that, compared with the normalized LMS based method, PTRD achieves high channel utilization with low renegotiation cost and no delay violation.

A Wavefront Array Processor Utilizing a Recursion Equation for ME/MC in the frequency Domain (주파수 영역에서의 움직임 예측 및 보상을 위한 재귀 방정식을 이용한 웨이브프런트 어레이 프로세서)

Lee, Joo-Heung;Ryu, Chul
- The Journal of Korean Institute of Communications and Information Sciences / v.31 no.10C / pp.1000-1010 / 2006
This paper proposes a new architecture for DCT-based motion estimation and compensation. Previous methods do not take sufficient advantage of the sparseness of 2-D DCT coefficients to reduce execution time. We first derive a recursion equation to perform DCT domain motion estimation more efficiently; we then use it to develop a wavefront array processor (WAP) consisting of processing elements. In addition, we show that the recursion equation makes it possible to produce motion-predicted images with different frequency bands, for example, from images with only low frequency components to images with both low and high frequency components. The wavefront array processor can be reconfigured for different motion estimation algorithms, such as logarithmic search and three step search, without architectural modifications. These properties can be used effectively to reduce the energy required for video encoding and decoding. The proposed WAP architecture achieves a significant reduction in computational complexity and processing time. It is also shown that the motion estimation algorithm in the transform domain using the SAD (Sum of Absolute Differences) matching criterion maximizes PSNR and the compression ratio for practical video coding applications when compared to the motion estimation algorithm in the spatial domain using either SAD or SSD.

A Method for Improvement of Coding Efficiency in Scalability Extension of H.264/AVC (H.264/AVC Scalability Extension의 부호화 효율 향상 기법)

Kang, Chang-Soo
- Journal of the Institute of Electronics Engineers of Korea IE (전자공학회논문지 IE) / v.47 no.2 / pp.21-26 / 2010
This paper proposes an efficient algorithm that reduces the amount of computation for the Scalability Extension, which accounts for a large share of the encoding time in H.264/AVC. The algorithm decides a search range according to the direction of the predicted motion vector and then performs an adaptive spiral search over the candidates using the JM (Joint Model) FME (Fast Motion Estimation), which employs rate-distortion optimization (RDO). Experiments applying the proposed method to various video sequences showed that the processing time decreased by up to 80% compared with previous prediction methods. The video quality degraded by only 0.05 dB to 0.19 dB, and the compression ratio decreased by only 0.58% on average. We therefore conclude that the proposed method is efficient for fast inter prediction.

Basic Technology of Digital Compression (디지탈압축 기초기술)

The Korea Society of Space Technology
- Satellite Communications and Space Industry / v.2 no.2 / pp.115-124 / 1994
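The use of digital compression is expanding rapidly, and advances in memory and logic circuits are making it more economical. Digital compression passes through prediction, transform, quantization, and coding, compressing the image information at each stage and greatly reducing the total amount of data to be transmitted. Particularly important is the transform, which analyzes the frequency components of the image information; the DCT (Discrete Cosine Transform) is one such technique now being put into practical use. In particular, standardization bodies for compression schemes, such as MPEG (Moving Picture Experts Group), center their research on the DCT.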
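The transform and quantization stages mentioned in this abstract can be illustrated with a minimal sketch: an orthonormal 1-D DCT-II, its inverse, and a uniform quantizer. The function names and the single fixed step size are assumptions for illustration; real codecs apply a 2-D transform to blocks and use per-coefficient quantization tables.

```python
import math

def dct(x):
    """Orthonormal 1-D DCT-II."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                               for i in range(n)))
    return out

def idct(c):
    """Inverse of the orthonormal DCT-II (a DCT-III with the same scaling)."""
    n = len(c)
    out = []
    for i in range(n):
        total = 0.0
        for k in range(n):
            scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
            total += scale * c[k] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
        out.append(total)
    return out

def quantize(coeffs, step):
    """Uniform quantization: map each coefficient to an integer level."""
    return [round(c / step) for c in coeffs]

def dequantize(levels, step):
    """Reconstruct approximate coefficients from integer levels."""
    return [level * step for level in levels]
```

For a smooth input, most of the energy lands in the first few coefficients, so after quantization many levels are zero and the coding stage has little to transmit.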

A Study on the Pulse-Train Code Excited Linear Prediction Coder: PT-CELP (Pulse-Train code 여기 선형 예측 (PT-CELP) 부호화기에 관한 연구)

김흥국
- Proceedings of the Acoustical Society of Korea Conference / 1995.06a / pp.246-249 / 1995
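This paper describes the structure of a speech coder with a bit rate of 4.16 kbps. The proposed coder is a CELP coder that has an open-loop pitch detector and uses pulse trains generated from its output as the codebook. The pulse-train codebook is generated for each analysis frame at both the encoder and the decoder, and it carries the pitch and formant information of the speech. Compared with CELP using a random codebook, the implemented PT-CELP can use a smaller codebook, and because the codebook reflects the characteristics of the speech well, the quality of the synthesized speech can be improved.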
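A codebook built from pitch-spaced pulse trains can be sketched as below. The abstract does not give the actual construction, so this is only one plausible reading: each entry is a unit pulse train with the detected pitch period and a different starting offset. The search here matches the target directly and omits the synthesis filter and perceptual weighting that a CELP coder would apply; all names are hypothetical.

```python
def pulse_train_codebook(pitch_lag, length, max_entries=64):
    """One entry per starting offset: unit pulses every pitch_lag samples."""
    entries = []
    for offset in range(min(pitch_lag, max_entries)):
        entries.append([1.0 if n >= offset and (n - offset) % pitch_lag == 0 else 0.0
                        for n in range(length)])
    return entries

def search_codebook(target, codebook):
    """Return (index, gain) minimizing the squared error ||target - gain * code||^2."""
    best_idx, best_gain, best_err = None, 0.0, float("inf")
    for idx, code in enumerate(codebook):
        energy = sum(c * c for c in code)
        if energy == 0.0:
            continue  # offset beyond the subframe: no pulses to match
        gain = sum(t * c for t, c in zip(target, code)) / energy
        err = sum((t - gain * c) ** 2 for t, c in zip(target, code))
        if err < best_err:
            best_idx, best_gain, best_err = idx, gain, err
    return best_idx, best_gain
```

Because both ends can rebuild the same entries from the transmitted pitch lag, only the index and gain need to be sent, which is consistent with the small codebook size the abstract mentions.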

On the Reduction of Pitch Search Time for G.723.1 Using the Skipping Technique (G.723.1에서 Skipping Technique을 이용한 피치검색시간 단축에 관한 연구)

김정진
- Proceedings of the Acoustical Society of Korea Conference / 1998.06e / pp.285-288 / 1998
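G.723.1 provides high speech quality at low bit rates, but the analysis-by-synthesis structure of CELP-type coders demands a large amount of processing time and computation. This paper proposes a method that uses a skipping technique to reduce the computation of the pitch search in G.723.1 and thereby the overall processing time of the encoder. The skipping technique is applied in the open-loop pitch estimation stage, which finds the predicted pitch. Because the correlation waveform computed during pitch estimation alternates between positive and negative lobes, the negative lobes are skipped in the computation. When the proposed pitch search was applied to real speech samples, the average encoding time decreased by about 10%. In a MOS evaluation comparing the original G.723.1 with G.723.1 using the proposed method, the original scored an average of 3.76 and the proposed method 3.73, so there was almost no subjective quality degradation.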
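One way to read the skipping idea is sketched below: during the open-loop lag search, a lag with non-positive cross-correlation cannot be the pitch, so the energy normalization is not computed and a few following lags are jumped over. The fixed jump size, the function name, and the return values are assumptions for illustration and are not taken from the paper or from the G.723.1 reference code.

```python
def open_loop_pitch(signal, frame_start, frame_len, min_lag, max_lag, skip=2):
    """Return (best_lag, lags_evaluated), maximizing corr^2 / energy over lags.
    frame_start must be at least max_lag so that signal[n - lag] is valid."""
    frame = range(frame_start, frame_start + frame_len)
    best_lag, best_score, evaluated = min_lag, -1.0, 0
    lag = min_lag
    while lag <= max_lag:
        corr = sum(signal[n] * signal[n - lag] for n in frame)
        evaluated += 1
        if corr <= 0.0:
            # Negative lobe: skip the normalization and jump ahead (assumed step).
            lag += 1 + skip
            continue
        energy = sum(signal[n - lag] ** 2 for n in frame)
        score = corr * corr / energy
        if score > best_score:
            best_lag, best_score = lag, score
        lag += 1
    return best_lag, evaluated
```

The jump is safe only while it is shorter than the positive lobe around the true pitch lag, so in practice `skip` would need to be chosen conservatively.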

Point Cloud Sequence Compression by Matching between Graphs (그래프 간 정합을 이용한 포인트 클라우드 시퀀스 압축)

Lee, Seonho;Kim, Ji-Su;Lee, Se-Ho;Kim, Chang-Su
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2018.06a / pp.22-23 / 2018
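This paper proposes a point cloud sequence compression technique based on matching between graphs. First, graphs are used to represent the time-varying geometric structure of the point cloud sequence, and motion estimation between adjacent frames is performed by matching feature vectors extracted from the graphs with a wavelet transform. Then, among the motion vectors obtained by motion estimation, a small number with high matching scores are interpolated to obtain the motion field of the whole frame. Finally, the difference between the target frame and the predicted frame obtained from the motion information is coded with a selective entropy coding scheme to compress the point cloud sequence. Experimental results confirm that the proposed technique compresses 3D point cloud sequences effectively.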

