통합 검색 | Korea Science

잡음음성에서의 음성 활성화 구간 검출 방법 (Speech Active Interval Detection Method in Noisy Speech)

이광석;추연규;김현덕
- 한국정보통신학회:학술대회논문집
- /
- 한국해양정보통신학회 2008년도 추계종합학술대회 B
- /
- pp.779-782
- /
- 2008
음성통신 및 음성인식에 있어서 잡음이 섞인 음성으로부터 음성의 활성화 구간의 검출은 대단히 중요한 과정으로 알려져 있다. 따라서 본 연구에서는 잡음음성으로부터 음성의 활성화 구간을 검출하기 위하여 스펙트럴 엔트로피와 복합으로 구성하는 특징 파라미터를 제안하고 에너지를 기반으로 음성 활성화 구간을 검출하는 방식과 성능 비교 실험을 행하였다. 실험결과, 노이즈 환경에서 다른 파라미터에 비하여 제안한 파라미터에 의한 음성 활성화 구간 검출의 성능이 우수함을 확인할 수 있었다.
PDF

움직임 추정 보정을 이용한 잡음 제거 및 디인터레이싱 기법 (Noise reduction and De-interlacing with motion vector validation)

정재한;양승준
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2003년도 신호처리소사이어티 추계학술대회 논문집
- /
- pp.149-152
- /
- 2003
This paper presents a method to find motion vectors that are closer to true motion with noisy images for simultaneous noise reduction and do-interlacing. The proposed method requires four interlaced field images: one noisy field image and three field images from which noise is already removed. The validation of motion provides accurate motion vectors and allows us to utilize them even in very noisy environment. The validated motion vectors are first used for the noise reduction, buffered and used later for the noise reduction and de -interlacing.
PDF

Detecting Object of Interest from a Noisy Image Using Human Visual Attention

Cheoi Kyung-Joo
- International Journal of Contents
- /
- 제2권1호
- /
- pp.5-8
- /
- 2006
This paper describes a new mechanism of detecting object of interest from a noisy image, without using any a-priori knowledge about the target. It employs a parallel set of filters inspired upon biological findings of mammalian vision. In our proposed system, several basic features are extracted directly from original input visual stimuli, and these features are integrated based on their local competitive relations and statistical information. Through integration process, unnecessary features for detecting the target are spontaneously decreased, while useful features are enhanced. Experiments have been performed on a set of computer generated and real images corrupted with noise.
PDF

Development of Statistical Edge Detector in Noisy Images and Implementation on the Web

이동훈
- 한국데이터정보과학회:학술대회논문집
- /
- 한국데이터정보과학회 2004년도 춘계학술대회
- /
- pp.197-201
- /
- 2004
We describe a new edge detector based on the robust rank-order (RRO) test which is a useful alternative to Wilcoxon test, using $r{\times}r$ window for detecting edges of all possible orientations in noisy images. Some experiments of statistical edge detectors based on the Wilcoxon test and T test with our RRO detector are carried out on synthetic and real images corrupted by both Gaussian and impulse noise. We also implement these edge detectors using Java on the Web.
PDF

Noisy Data Aggregation with Independent Sensors: Insights and Open Problems

Murayama, Tatsuto;Davis, Peter
- Journal of Multimedia Information System
- /
- 제3권2호
- /
- pp.21-26
- /
- 2016
Our networked world has been growing exponentially fast. The explosion in volume of machine-to-machine (M2M) transactions threatens to exceed the transport capacity of the networks that link them. Therefore, it is quite essential to reconsider the tradeoff between using many data sets versus using good data sets. We focus on this tradeoff in the context of the quality of information aggregated from many sensors in a noisy environment. We start with a basic theoretical model considered in the famous "CEO problem'' in the field of information theory. From a point of view of large deviations, we successfully find a simple statement for the optimal strategies under the limited network capacity condition. Moreover, we propose an open problem for a sensor network scenario and report a numerical result.
https://doi.org/10.9717/JMIS.2016.3.2.21 인용 PDF

Noise Whitening-Based Pitch Detection for Speech Highly Corrupted by Colored Noise

Byun, Kyung-Jin;Jeong, Sang-Bae;Kim, Hoi-Rin;Hahn, Min-Soo
- ETRI Journal
- /
- 제25권1호
- /
- pp.49-51
- /
- 2003
Pitch estimation is important in various speech research areas, but when the speech is noisy, accurate pitch estimation with conventional pitch detectors is almost impossible. To solve this problem, we propose a new pitch detection algorithm for noisy speech using a noise whitening technique on the background noise and obtain successful results.
PDF

Estimation of Moving Information for Tracking of Moving Objects

Park, Jong-An;Kang, Sung-Kwan;Jeong, Sang-Hwa
- Journal of Mechanical Science and Technology
- /
- 제15권3호
- /
- pp.300-308
- /
- 2001
Tracking of moving objects within video streams is a complex and time-consuming process. Large number of moving objects increases the time for computation of tracking the moving objects. Because of large computations, there are real-time processing problems in tracking of moving objects. Also, the change of environment causes errors in estimation of tracking information. In this paper, we present a new method for tracking of moving objects using optical flow motion analysis. Optical flow represents an important family of visual information processing techniques in computer vision. Segmenting an optical flow field into coherent motion groups and estimating each underlying motion are very challenging tasks when the optical flow field is projected from a scene of several moving objects independently. The problem is further complicated if the optical flow data are noisy and partially incorrect. Optical flow estimation based on regulation method is an iterative method, which is very sensitive to the noisy data. So we used the Combinatorial Hough Transform (CHT) and Voting Accumulation for finding the optimal constraint lines. To decrease the operation time, we used logical operations. Optical flow vectors of moving objects are extracted, and the moving information of objects is computed from the extracted optical flow vectors. The simulation results on the noisy test images show that the proposed method finds better flow vectors and more correctly estimates the moving information of objects in the real time video streams.
PDF

A Noisy Videos Background Subtraction Algorithm Based on Dictionary Learning

Xiao, Huaxin;Liu, Yu;Tan, Shuren;Duan, Jiang;Zhang, Maojun
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제8권6호
- /
- pp.1946-1963
- /
- 2014
Most background subtraction methods focus on dynamic and complex scenes without considering robustness against noise. This paper proposes a background subtraction algorithm based on dictionary learning and sparse coding for handling low light conditions. The proposed method formulates background modeling as the linear and sparse combination of atoms in the dictionary. The background subtraction is considered as the difference between sparse representations of the current frame and the background model. Assuming that the projection of the noise over the dictionary is irregular and random guarantees the adaptability of the approach in large noisy scenes. Experimental results divided in simulated large noise and realistic low light conditions show the promising robustness of the proposed approach compared with other competing methods.
https://doi.org/10.3837/tiis.2014.06.008 인용 PDF KSCI KPUBS HTML

신경망 기반 음성, 영상 및 문맥 통합 음성인식 (Speech Recognition by Integrating Audio, Visual and Contextual Features Based on Neural Networks)

김명원;한문성;이순신;류정우
- 전자공학회논문지CI
- /
- 제41권3호
- /
- pp.67-77
- /
- 2004
최근 잡음환경에서 신뢰도 높은 음성인식을 위해 음성정보와 영상정보를 융합하는 방법이 활발히 연구되고 있다. 본 논문에서는 이절적인 정보의 융합에 적합한 신경망 모델을 기반으로 음성, 영상 및 문맥 정보 등 다양한 정보를 융합하여 잡음 환경에서 고려단어를 인식하는 음성인식 기법에 대하여 기술한다. 음성과 영상 특징을 이용한 이중 모드 신경망 BMNN(BiModal Neural Network)을 제안한다. BMM은 4개 층으로 이루어진 다층퍼셉트론의 구조를 가지며 각 층은 입력 특징의 추상화 기능을 수행한다. BMNN에서는 제 3층이 잡음에 의한 음성 정보의 손실을 보상하기 위하여 음성과 영상 특징을 통합하는 기능을 수행한다. 또한, 잡음환경에서 음성 인식률을 향상시키기 위해 사용자가 말한 단어들의 순차 패턴을 나타내는 문맥정보를 이용한 후처리 방법을 제안한다. 잡음환경에서 BMNN은 단순히 음성만을 사용한 것 보다 높은 성능을 보임으로써 그 타당성을 확인할 수 있을 뿐 아니라, 특히 문맥을 이용한 후처리를 하였을 경우 잡음 환경에서 90%이상의 인식률을 달성하였다 본 연구는 잡음환경에서 강인한 음성인식을 위해 다양한 추가 정보를 사용함으로써 성능을 향상시킬 수 있음을 제시한다.
PDF KSCI

잡음 환경하에서 환경 군집화를 이용한 고속화자 적응 (Fast Speaker Adaptation in Noisy Environment using Environment Clustering)

김영국;송화전;김형순
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
- /
- pp.33-36
- /
- 2007
In this paper, we investigate a fast speaker adaptation method based on eigenvoice in several noisy environments. In order to overcome its weakness against noise, we propose a noisy environment clustering method which divides the noisy adaptation utterances into utterance groups with similar environments by the vector quantization based clustering using a cepstral mean as a feature vector. Then each utterance group is used for adaptation to make an environment dependent model. According to our experiment, we obtained 19-37 % relative improvement in error rate compared with the simultaneous speaker adaptation and environmental compensation method
PDF

검색결과 697건 처리시간 0.02초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)