Search | Korea Science

Quality Assessment and Predistortion Evaluation of the Multi-channel Audio Codec according to the bitrate changing (압축율 변화에 따른 멀티채널 오디오의 품질 및 Predistortion 의 영향 평가)

Cha, Kyung-Hwan;Jang, Dae-Young;Kim, Sung-Han;Kim, Chun-Duck
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.2
- /
- pp.55-60
- /
- 1996
This paper describes the subjective assessment of the multi-channel audio quality according to the bitrate changing and evaluates the predistortion effect to avoid the unmasked noise after matrixing/dematrxing process in transmission and regeneration of the multi-channel audio. The simulation is processed by the perceptual coding that is MPEG-2 Audio layer II algorithm. We evaluate the quality improvement about predistortion using or not by 384, 320, 256, 128kbps. As the result of the double blind subjective assessment, 5 Grade-Impairment Scale is scored under minus one to 320kbps and so audio quality is evaluated to be perceptible, but not annoying in 3/2 channel. The effect of the predistortion is improved one level in 128kbps and especially speech test material I better improved than music test materials.
PDF

A Study on Brand Image Positioning for Ladies' Ready-to wear According to Fashion Involvement - As Object of working women (유행관여에 따른 여성기성복 상표이미지 포지셔닝 연구 -20대 직장여성을 중심으로-)

Park Hye Won;Lim Sook Ja
- Journal of the Korean Society of Clothing and Textiles
- /
- v.16 no.4 s.44
- /
- pp.393-403
- /
- 1992
This Study intended to provide positioning strategies of brand Image for ladies' ready to wear by analysing the perceptual dimensions of working women. The subjects were devided into two groups according to the fashion involvement, and in each group, a positioning map was composed by use of multidimensional scaling. 251 subjects of this study were gathered into stratified sample groups from working women in Seoul, being subdivided according to their each occupation and age. The data were analysed by frequency, percentage, average, $x^{2}-test$, 1-test, Factor Analysis, cronbach's $\alpha$. Also, KYST, PROFIT, PREFMAP for multidimensional scaling were used. The results were as follows. 1. Two groups were identified according to degree of fashion involvement: high-involvement group, and low-involvement group. 2. From the analysis of the similarity of brand image, high involvement group percieved greater difference in brand image than low involvement group. 3. From the analysis of the evaluation of brand attributes, the evaluations in self expression, fashionability, design, sales promotion activity, sociality, quality, fit showed differences bet-ween high involvement group and low involvement group. 4. From the analysis of the preference of brand image, the distribution of preference and ideal point were different between high involvement group and low involvement group.
PDF

The Comparisons of GRBAS Perceptual Judgments according to Levels of Utterances

Pyo, Hwa-Young;Sim, Hyun-Sub
- Speech Sciences
- /
- v.8 no.1
- /
- pp.135-142
- /
- 2001
The present study was performed to investigate adequate levels of utterances which can give essential as well as useful information about the patients' voice, by examining the degrees of correlation between the levels of utterances (vowels, words, and phrase paragraph reading) and the entire utterance including all of the levels. For this purpose, a total of 10 individual utterance samples (5 vowels, 3 words, 1 phrase, 1 paragraph reading) were collected from each of the 30 subjects with voice disorder patients, and four experienced voice therapists evaluated them using GRBAS. The results showed that four therapists highly agreed upon on 'G' parameter. The coefficient of the correlation between each level of utterance and entire utterance tended to be above 0.70. Judgements of the vowel /$\varepsilon$/ as well as /o/ highly correlated with the judgement of the entire utterance. Regardless of severity, the judgement of the entire utterance highly correlated with the judgements of the vowel /u/ and the paragraph reading. These results suggest that experienced voice therapists can precisely evaluate patients' voice quality with only one sustained vowel in the clinic field, as is done with the entire utterance evaluation.
PDF

Image saliency detection based on geodesic-like and boundary contrast maps

Guo, Yingchun;Liu, Yi;Ma, Runxin
- ETRI Journal
- /
- v.41 no.6
- /
- pp.797-810
- /
- 2019
Image saliency detection is the basis of perceptual image processing, which is significant to subsequent image processing methods. Most saliency detection methods can detect only a single object with a high-contrast background, but they have no effect on the extraction of a salient object from images with complex low-contrast backgrounds. With the prior knowledge, this paper proposes a method for detecting salient objects by combining the boundary contrast map and the geodesics-like maps. This method can highlight the foreground uniformly and extract the salient objects efficiently in images with low-contrast backgrounds. The classical receiver operating characteristics (ROC) curve, which compares the salient map with the ground truth map, does not reflect the human perception. An ROC curve with distance (distance receiver operating characteristic, DROC) is proposed in this paper, which takes the ROC curve closer to the human subjective perception. Experiments on three benchmark datasets and three low-contrast image datasets, with four evaluation methods including DROC, show that on comparing the eight state-of-the-art approaches, the proposed approach performs well.
https://doi.org/10.4218/etrij.2018-0039 인용 PDF KSCI

A Study of Subjective Speech Quality Measurement in VoIP (VoIP 음질의 주관적 평가에 관한 연구)

강영도;강진석;최연성;김장형
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.5 no.2
- /
- pp.279-287
- /
- 2001
In this paper, we discuss the scale of subjective speech quality measurement over VoIP(Voice over IP) network which is a component of broadband networks. Objective parameters of multimedia services like PSNR or jitter can easily measured and defined, but these factors are not easily meet the user's perceptual recognition. We suggest the speech quality measurement scale through the subjective measurement for end-to-end speech quality composed of sender-side quality, transmission quality, receiver-side quality, which provide the degree of correctness of representation of speaker, the degree of impairment caused by various factors, the degree of recognition of processed speech, respectively. Also, we examined the proposed method and verify it's availability.
PDF

Speech Enhancement Based on IMCRA Incorporating noise classification algorithm (잡음 환경 분류 알고리즘을 이용한 IMCRA 기반의 음성 향상 기법)

Song, Ji-Hyun;Park, Gyu-Seok;An, Hong-Sub;Lee, Sang-Min
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.61 no.12
- /
- pp.1920-1925
- /
- 2012
In this paper, we propose a novel method to improve the performance of the improved minima controlled recursive averaging (IMCRA) in non-stationary noisy environment. The conventional IMCRA algorithm efficiently estimate the noise power by averaging past spectral power values based on a smoothing parameter that is adjusted by the signal presence probability in frequency subbands. Since the minimum of smoothing parameter is defined as 0.85, it is difficult to obtain the robust estimates of the noise power in non-stationary noisy environments that is rapidly changed the spectral characteristics such as babble noise. For this reason, we proposed the modified IMCRA, which adaptively estimate and updata the noise power according to the noise type classified by the Gaussian mixture model (GMM). The performances of the proposed method are evaluated by perceptual evaluation of speech quality (PESQ) and composite measure under various environments and better results compared with the conventional method are obtained.
https://doi.org/10.5370/KIEE.2012.61.12.1920 인용 PDF KSCI

Subjective Evaluation of Sound Quality in Vehicle Passenger Compartment during Acceleration (자동차 주행 가속 차실 소음의 주관적 음질 평가)

Kang, S.W.;Lee, J.M.
- Proceedings of the KSME Conference
- /
- 2001.06b
- /
- pp.187-191
- /
- 2001
Sound quality engineering in automobile noise applications has become more and more important under the current quiet driving condition that the interior noise level is below 65 dBA, because various noise components masked under high noise level can be audible in quieter driving situation. Many researches have been carried out for subjective and objective assessments on automobile sounds and noises. In particular, the interior sound quality has been one of research fields that can give high-quality feature to automobile products. Although many works related to the interior sound quality have been progressed or completed in foreign countries, limited research results are presented in the country. In the study, as a base step necessary to objective assessments on car interior noises, subjective assessments are performed with 20 subjects. For this purpose, perceptual adjectives suitable to the assessment of acceleration noises are selected as assessment scales through questionnaire procedures using 35 subjects. Mean values and standard deviations are calculated for noises created through digital filtering of acceleration noises measured. In addition, the correlation analysis and the factor analysis are carried out to investigate the dependence of the assessment scales selected.
PDF

Study on optimal number of latent source in speech enhancement based Bayesian nonnegative matrix factorization (베이지안 비음수 행렬 인수분해 기반의 음성 강화 기법에서 최적의 latent source 개수에 대한 연구)

Lee, Hye In;Seo, Ji Hun;Lee, Young Han;Kim, Je Woo;Lee, Seok Pil
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2015.07a
- /
- pp.418-420
- /
- 2015
본 논문은 베이지안 비음수 행렬 인수분해 (Bayesian nonnegative matrix factorization, BNMF) 기반의 음성 강화 기법에서 음성과 잡음 성분의 latent source 수에 따른 강화성능에 대해 서술한다. BNMF 기반의 음성 강화 기법은 입력 신호를 서브 신호들의 합으로 분해한 후, 잡음 성분을 제거하는 방식으로 그 성능이 기존의 NMF 기반의 방법들보다 우수한 것으로 알려져 있다. 그러나 많은 계산량과 latent source 의 수에 따라 성능의 차이가 있다는 단점이 있다. 이러한 단점을 개선하기 위해 본 논문에서는 BNMF 기반의 음성 강화 기법에서 최적의 latent source 개수를 찾기 위한 실험을 진행하였다. 실험은 잡음의 종류, 음성의 종류, 음성과 잡음의 latent source 의 개수, 그리고 SNR 을 바꿔가며 진행하였고, 성능 평가 방법으로 PESQ (perceptual evaluation of speech quality) 를 이용하였다. 실험 결과, 음성의 latent source 개수는 성능에 영향을 주지 않지만, 잡음의 latent source 개수는 많을수록 성능이 좋은 것으로 확인되었다.
PDF

An Adaptive Wind Noise Reduction Method Based on a priori SNR Estimation for Speech Eenhancement (음성 강화를 위한 a priori SNR 추정기반 적응 바람소리 저감 방법)

Seo, Ji-Hun;Lee, Seok-Pil
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.64 no.12
- /
- pp.1756-1760
- /
- 2015
This paper focuses on a priori signal to noise ratio (SNR) estimation method for the speech enhancement. There are many researches for speech enhancement with several ambient noise cancellation methods. The method based on spectral subtraction (SS) which is widely used in noise reduction has a trade-off between the performance and the distortion of the signals. So the need of adaptive method like an estimated a priori SNR being able to making a high performance and low distortion is increasing. The decision directed (DD) approach is used to determine a priori SNR in noisy speech signals. A priori SNR is estimated by using only the magnitude components and consequently follows a posteriori SNR with one frame delay. We propose a modified a priori SNR estimator and the weighted rational transfer function for speech enhancement with wind noises. The experimental result shows the performance of our proposed estimator is better Perceptual Evaluation of Speech Quality scores (PESQ, ITU-T P.862) compare to the conventional DD approach-based systems and different noise reduction methods.
https://doi.org/10.5370/KIEE.2015.64.12.1756 인용 PDF KSCI

Two-Channel Noise Reduction Using Beamforming and DOA-Based Masking (빔포밍 및 DOA 기반의 마스킹을 이용한 2채널 잡음제거)

Kim, Youngil;Jeong, Sangbae
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.17 no.1
- /
- pp.32-40
- /
- 2013
In this paper, we propose a multi-channel speech enhancement algorithm using beamforming and direction-of-arrival (DOA)-based masking. The proposed algorithm enhances noisy speech basically by the linearly constrained minimum variance (LCMV) algorithm and then a mel-scale Wiener filter designed using DOA-based masking is applied to remove still remaining noises. To improve the performance, we optimize the learning rate of the adaptive filters in LCMV and the DOA threshold to detect target speech spectrum. As performance indices, the perceptual evaluation of speech quality (PESQ) score and output SNRs are measured. Experimantal results show that the proposed algorithm outperforms the conventional LCMV beamformer by 0.09 in PESQ score and 5.75 dB in output SNR, respectively.
https://doi.org/10.6109/jkiice.2013.17.1.32 인용 PDF KSCI

Search Result 248, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)