• Title/Summary/Keyword: 음성압축

Search Result 218, Processing Time 0.03 seconds

A Study on Real Time Pitch Alteration of Speech Signal (음성신호의 실시간 피치변경에 관한 연구)

  • 김종국;박형빈;배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.1
    • /
    • pp.82-89
    • /
    • 2004
  • This paper describes how to reduce the effect of an occupation threshold by that the transform of mixture components of HMM parameters is controlled in hierarchical tree structure to prevent from over-adaptation. To reduce correlations between data elements and to remove elements with less variance, we employ PCA (principal component analysis) and ICA (independent component analysis) that would give as good a representation as possible, and decline the effect of over-adaptation. When we set lower occupation threshold and increase the number of transformation function, ordinary WLLR adaptation algorithm represents lower recognition rate than SI models, whereas the proposed MLLR adaptation algorithm represents the improvement of over 2% for the word recognition rate as compared to performance of SI models.

레이다와 전파신호처리 기술(I)

  • 곽영길
    • The Proceeding of the Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.5 no.1
    • /
    • pp.100-110
    • /
    • 1994
  • 레이다 신호는 대표적인 전자파 신호로서 주변환경에 따라 시간, 주파수, 공간 영역에서 고유한 신호특성을 가지고 있으며, 신호처리 기법도 다양하다. 본 논문에서는 먼저 레이다를 위한 전파 신호처리 의정의와 필요성을 언급한뒤, 레이다 신호환경 특성을 살펴보고 신호처리를 위한 신호의 시간 및 스펙트럼 특성에 대해 기술하였다. 그리고, 신호특성에 적합한 신호처리기의 구현을 위해 레이다 신호처리에 관 련된 주요 기법에 대해 개괄적으로 설명하였다. 레이다 신호처리 분야는 일반적으로 잘 알려진 음성이 나 영상신호처리 분야와 달리 고유한 알고리듬과 구조가 요구된다. 신호처리기법으로서 레이다 파형설 계, 해상도 모호성, 펄스압축, 클러터제거, 도플러처리, 일정오경보탐지, 클러터 지도, 표적군 형성/ 추출, 표적식별, 레이다영상기법, 적응배열처리 등에 관해 개괄적으로 설명하였다. 레이다 선호처리 기술은 "스마트"한 레이다를 위한 두뇌 역할을 하기때문에 그 필요성과 중요성이 증가하고 있다. 그러나, 고속, 대용량의 신호를 주어진 빔 주사시간동안에 실시간으로 처리하여 표적 정보를 추출해야 하기 때문에 아직도 상용 프로세서의 속도 한계내에서 알고리듬의 수행에 다소 제약을 받고 있으나, 최근 디지탈 신호처리 전용의 고속 칩의 출현으로 많은 발전을 가져오고 있다. 끝으로, 향후 레이다 신호처리 발전 추세와 응용분야에 대해 살펴보았다. 응용분야는 군수 및 민수용의 겸용 파급효과가 매우 크고, 군용의 대공탐색 및 조기경보, 전장감시뿐만 아니라 전투기 탑재용으로 필수적이며, 특히 민수용의 공 항, 항공기, 선박, 위성 등 매우 다양하다. 최근 발전추세에 따른 기술로서 다중모드 신호처리, 고집적 회로기술, 적응배열, 디지탈 빔형성, 적응성, 고분해능 및 방향성, 표적식별, 다차원 신호처리에 대해 언급 하였다.

  • PDF

An Audio Coding Technique Employing the Inter-channel Phase Difference Skip (채널 간 위상차 파라미터 생략 기법을 이용한 오디오 부호화)

  • Kim, Hyun-Hwi;Kim, Rin-Chul
    • Journal of Broadcast Engineering
    • /
    • v.21 no.3
    • /
    • pp.369-379
    • /
    • 2016
  • This paper deals with an efficient method for skipping inter-channel phase differences (IPD) in the MPEG surround of the unified speech and audio coding (USAC). Based on the psycho-acoustic sensitivity on the IPD, we estimate a threshold on IPD, below which we can not notice degradation in spatial cue. We propose an IPD skip method, in which any IPDs within the threshold are set to zero and are not transmitted. The proposed IPD skip method gives about 38% savings in terms of bit amount for IPD. Nevertheless, in the MUSHRA test, the proposed method does not show any noticeable degradation in the decoded audio quality.

The Vocabulary Recognition Optimize using Acoustic and Lexical Search (음향학적 및 언어적 탐색을 이용한 어휘 인식 최적화)

  • Ahn, Chan-Shik;Oh, Sang-Yeob
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.4
    • /
    • pp.496-503
    • /
    • 2010
  • Speech recognition system is developed of standalone, In case of a mobile terminal using that low recognition rate represent because of limitation of memory size and audio compression. This study suggest vocabulary recognition highest performance improvement system for separate acoustic search and lexical search. Acoustic search is carry out in mobile terminal, lexical search is carry out in server processing system. feature vector of speech signal extract using GMM a phoneme execution, recognition a phoneme list transmission server using Lexical Tree Search algorithm lexical search recognition execution. System performance as a result of represent vocabulary dependence recognition rate of 98.01%, vocabulary independence recognition rate of 97.71%, represent recognition speed of 1.58 second.

Implementation of Video chatting System for the Consultation of Gas Safety using H.263 CODEC (가스안전 상담용 H.263 코덱을 이용한 영상채팅시스템 구현)

  • Jeong, Ae-Jeong;Park, Gyou-Tae;Han, Sang-In;Kwon, Jeong-Rock
    • Proceedings of the KIEE Conference
    • /
    • 2008.10b
    • /
    • pp.503-504
    • /
    • 2008
  • 최근 정보통신 기술들이 빠르게 발전하고 있다. 다양한 통신 기술들 중에서도 업무의 효율을 높이고자 회사 및 가정, 학교 등에서 자주 사용되고 있는 영상채팅시스템을 구현해보고자 한다. 쿼타임 코덱 중 가장 보편적인 코덱으로 인코딩이 쉽고 저사양의 CPU만으로도 실시간 스트리밍이 가능한 H.263 코덱을 사용하여 영상채팅시스템을 Visual C++로 구현을 하였다. 전송로의 지연을 줄이기 위하여 영상, 음성, 텍스트 등을 압축하고 복원하는 데 걸리는 시간을 최소화기 위하여 데이터의 전송대역폭을 적절히 조절하는 알고리듬을 제안하여 전송지연을 최소화하였다. 또한 P2P 방식을 사용하여 다양한 영상 환경에 대하여 영상 및 텍스트 데이터의 안정성과 화질이 우수함을 보였으며, 실시간 가스안전관리 상담에 이용하여 업무의 효율을 높이고자 한다.

  • PDF

Estimation of Lifetime Data Storage Capacity for Human Senses (인간 감각 정보를 위한 평생 기억용량 평가)

  • You, Young-Gap;Song, Young-Jun;Kim, Dong-Woo
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.1
    • /
    • pp.23-29
    • /
    • 2009
  • This paper presents a capacity estimation of a storage system accumulating all data sensed during the lifetime of an individual human being. The calculation assumes modern data compression and data collection schemes based on wearable or implanted devices under ubiquitous environment. More than 76% of the storage area is found to be used for video data storage of common TV image quality. The remaining storage area is for data from other sensing organs including audio, taste, olfactory and tactual systems in addition to indexing information. Total storage area of around 600 tera bytes is needed to cover 100 years of human life including his fetal period.

Multimedia Data Security of Video Conferencing System (영상회의 시스템에서의 멀티미디어 데이터 보안)

  • 이원호;한군희
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2003.05a
    • /
    • pp.231-236
    • /
    • 2003
  • Video conferencing system it is various at internet and uses the reading is become accomplished. Research of like this portion synchronization of audio, the video compression technique and multimedia data, supports the video conference the research of the Mbone of the If multicast for being active, being become accomplished the multimedia service which is various an video from internet, the line speed of communication becomes high-speed anger and to follow leads is become accomplished. The video conference from opening elder brother dispersion internet network environment the problem against the image which is an image conference data and a voice security is serious and it raises its head. To sleep it presents the security method which from the video conference it follows in quality of multimedia data from the dissertation which it sees and it does.

  • PDF

Structrral Analysis of Bridge Pier with 40MPa High Strength Concrete (설계강도 40MPa 고강도 콘크리트를 적용한 교량 교각 구조물의 구조해석)

  • Hur, Jae-Hun;Yi, Sang-Keun;Gwak, Seok-Hwan;Huh, Suk-Bum;Park, Chang-Min
    • Proceedings of the Korea Concrete Institute Conference
    • /
    • 2009.05a
    • /
    • pp.157-158
    • /
    • 2009
  • In this study, We analyze structural behavior feature of column under reinforced-bar and concrete strength and load conditions and analyze optimal column diameter and construction cost through parameter study. In case we use the 40MPa high strength concrete instead of 27MPa concrete in pier, the results show positive effect in appearance of pier and cost because of small column diameter and low construction cost. Also, practical effect is proved by applying this results in pier of Shin Hou Bridge on Hum-Sung ${\sim}$ Chung-Ju highway construction work.

  • PDF

Implementation of Real Time Multi-User Communication System with MPEG-4 CELP (MPEG-4 CELP를 이용한 실시간 다자간 통신시스템의 구현)

  • 김헌중;우광희;차형태
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.3
    • /
    • pp.57-62
    • /
    • 2000
  • In recent, the innovative improvement of a internet and computing environment make users desire the capability of processing information in real time. In this paper we implement a PC-to-PC real time multi-user communication system on the internet environment using the efficient algorithm for a real time processing and the MFEG-4 CELP codec which can be used for a low bit-rate coding from 6 to 24kbps. The implemented system produces a compressed bit-streams with the MPEG-4 CEU Mode-I 18200bps mode. There is 5 frames for a package and 1 frame has 160 samples. We can use this system to communicate with 4 users simultaneously in real time. The system is designed and examined on the Windows operating system.

  • PDF

A fast DCT algorithm with reduced propagation error in the fixed-point compuitation (고정 소수점 연산시 오차의 전파를 줄이는 고속 이산 여현 변환 알고리즘)

  • 정연식;이임건;최영호;박규태
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.23 no.9A
    • /
    • pp.2365-2371
    • /
    • 1998
  • Discrete cosine transform (DCT) has wide applications in speech and image coding. In this paper, we propose a novel fast dCT scheme with the property of reduced multiplication stages and the smaller number of additions and multiplications. This exploits the symmetry property of the DCT kernel to decompose the N-point dCT to N/2 point, and can be generally applied recursively to $2^{m}$-point. The proposed algorithm has a structure that most of multiplications tend to be performed at final stage, and this reduces propagation of truncation error which could occur in the fixed-point computation. Also the minimization of the multiplication stages further decreases the error.

  • PDF