통합 검색 | Korea Science

Lightweight Quality Metric Based on No-Reference Bitstream for H.264/AVC Video

Kim, Yo-Han;Shin, Ji-Tae;Kim, Ho-Kyom
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제6권5호
- /
- pp.1388-1399
- /
- 2012
This paper proposes a quality metric based on a No-Reference Bitstream (NR-B) having least computational complexity for the assessment of the human-perceptual quality of H.264 encoded video. The proposed NR-B method performs a modeling of encoding distortion with three bit-stream information (i.e. frame-rate, motion-vector, and quantization-parameter) that can be directly extractable from the encoded bitstream and does not require additional complex processing of final pictures. From performance evaluation using 165 compressed video sequences, the experiment results show that the proposed metric has a higher correlation with subjective quality than is achieved with other comparable methods.
https://doi.org/10.3837/tiis.2012.05.008 인용 PDF KSCI

혜택세분화와 인식도에 의한 진의류 브랜드 이미지 연구(II) -인식도에 의한 브랜드 이미지 분석- (Brand Image: Analysis of Domestic Jeans Market through Benefit Segmentation and Perceptual Mapping(II))

최일경;고애란
- 한국의류학회지
- /
- 제19권5호
- /
- pp.699-712
- /
- 1995
The purpose of this study was 1) to identify the constructing factors of jeans brand image 2) to analyze the domestic jeans market using perceptual maps of three benefit segments based on stdy(I). The questionnaire consisted of brand preference, attribute of brand image and wearer image was selected from the previous studies or developed for this study. The subjects were 350 male and female university students who have purchased at least one of the nine jeans wear brand selected for the study. For statistical analysis, reliability test, factor analysis, MANOVA, and multiple regression were used. The results of this study were as follows: 1. Symbolism, quality, and economy were found out as constricting factors of brand image in the attribute dimensions, while innovative and active image were found out in the wearer image dimensions. 2. 9 Perceptual maps of attribute dimensions and 3 perceptual maps of wearer image dimensions were constructed and each ideal vector was drawn.
PDF

인지 왜곡 척도를 사용한 프랙탈 영상 압축 (Fractal image compression with perceptual distortion measure)

문용호;박기웅;손경식;김윤수;김재호
- 한국통신학회논문지
- /
- 제21권3호
- /
- pp.587-599
- /
- 1996
In general fractal imge compression, each range block is approximated by a contractive transform of the matching domain block under the mean squared error criterion. In this paper, a distortion measure reflecting the properties of human visual system is defined and applied to a fractal image compression. the perceptual distortion measure is obtained by multiplying the mean square error and the noise sensitivity modeled by using the background brightness and spatial masking. In order to compare the performance of the mean squared error and perceptual distortion measure, a simulation is carried out by using the 512*512 Lena and papper gray image. Compared to the results, 6%-10% compression ratio improvements under improvements under the same image quality are achieved in the perceptual distortion measure.
PDF

명료발화와 보통발화에서 파킨슨병환자 음성의 켑스트럼 및 스펙트럼 분석 (Characteristics of voice quality on clear versus casual speech in individuals with Parkinson's disease)

신희백;심희정;정훈;고도흥
- 말소리와 음성과학
- /
- 제10권2호
- /
- pp.77-84
- /
- 2018
The purpose of this study is to examine the acoustic characteristics of Parkinsonian speech, with respect to different utterance conditions, by employing acoustic/auditory-perceptual analysis. The subjects of the study were 15 patients (M=7, F=8) with Parkinson's disease who were asked to read out sentences under different utterance conditions (clear/casual). The sentences read out by each subject were recorded, and the recorded speech was subjected to cepstrum and spectrum analysis using Analysis of Dysphonia in Speech and Voice (ADSV). Additionally, auditory-perceptual evaluation of the recorded speech was conducted with respect to breathiness and loudness. Results indicate that in the case of clear speech, there was a statistically significant increase in the cepstral peak prominence (CPP), and a decrease in the L/H ratio SD (ratio of low to high frequency spectral energy SD) and CPP F0 SD values. In the auditory-perceptual evaluation, a decrease in breathiness and an increase in loudness were noted. Furthermore, CPP was found to be highly correlated to breathiness and loudness. This provides objective evidence of the immediate usefulness of clear speech intervention in improving the voice quality of Parkinsonian speech.
https://doi.org/10.13064/KSSS.2018.10.2.077 인용 PDF KSCI

모바일 VoIP 음성통신을 위한 대화음질 측정 시스템 (Conversational Quality Measurement System for Mobile VoIP Speech Communication)

조재만;김형국
- 한국ITS학회 논문지
- /
- 제10권4호
- /
- pp.71-77
- /
- 2011
본 논문에서는 고품질 모바일 VoIP 음성통신에 대한 객관적인 QoS를 제공하는 대화음질 측정시스템을 구현하였다. 대화음질 측정을 위해서 VoIP로 연결된 두 대의 스마트폰에 에코 및 잡음 제거, 음성 인코딩 및 디코딩, RTP (Real-TimeProtocol)을 적용한 패킷 생성, 지터버퍼 콘트롤, LC (Loss Concealment)를 포함한 POS (Play-out Schedule)로 구성된 VoIP음성 통화시스템을 구현하였다. 대화음질 측정 시스템은 VoIP로 연결된 두 스마트폰의 마이크, 그리고 스피커와 연결되어 각 화자별로 음성신호를 녹음한 후에, 녹음된 음성신호를 이용하여 CE (Conversational Efficiency), CS (Conversational Symmetry) 및 PESQ (Perceptual Evaluation of Speech Quality)를 측정하고, CE-CS-PESQ에 대한 상관관계를 측정한다. 본 논문에서는 다양한 SNR, IP 네트워크망 변동에 따른 지연, 손실 변화에 따른 CE, CS, PESQ를 측정하여 대화음질 측정시스템을 검증하였다.
PDF KSCI

다양한 손실 함수를 이용한 음성 향상 성능 비교 평가 (Performance comparison evaluation of speech enhancement using various loss functions)

황서림;변준;박영철
- 한국음향학회지
- /
- 제40권2호
- /
- pp.176-182
- /
- 2021
본 논문은 다양한 손실 함수에 따른 Deep Nerual Network(DNN) 기반 음성 향상 모델의 성능을 비교 평가한다. 베이스라인 모델로는 음성의 위상 정보를 고려할 수 있는 복소 네트워크를 사용하였다. 손실 함수는 두 가지 유형의 기본 손실 함수, Mean Squared Error(MSE)와 Scale-Invariant Source-to-Noise Ratio(SI-SNR)를 사용하였으며 두 가지 유형의 지각 기반 손실 함수 Perceptual Metric for Speech Quality Evaluation(PMSQE)과 Log Mel Spectra(LMS)를 사용한다. 성능은 각 손실 함수의 다양한 조합을 사용하여 얻은 출력을 객관적인 평가와 청취 테스트를 통해 측정하였다. 실험 결과, 지각기반 손실 함수를 MSE 또는 SI-SNR과 결합하였을 때 전반적으로 성능이 향상되며, 지각기반 손실함수를 사용하면 객관적 지표에서 약세를 보이는 경우라도 청취 테스트에서 우수한 성능을 보임을 확인하였다.
https://doi.org/10.7776/ASK.2021.40.2.176 인용 PDF KSCI

S-JND 기반의 HEVC 주관적 율 제어 알고리즘 (S-JND based Perceptual Rate Control Algorithm of HEVC)

김재련;심동규
- 방송공학회논문지
- /
- 제22권3호
- /
- pp.381-396
- /
- 2017
본 논문에서는 주관적 화질 기반의 비트 분배를 수행하는 율 제어 알고리즘을 수행하는 HEVC (High Efficiency Video Coding) 부호화 방법을 위한 연구를 진행하였다. 본 논문은 이러한 단점을 해소하고자 율 왜곡 최적화 시의 화질 측정에서 주관적 화질을 고려할 수 있는 율 제어 알고리즘을 통한 HEVC 부호화 방법을 제안한다. 제안하는 방법은 영상을 하나의 CTU 마다 인지 시각적 중요도를 측정하여, 이를 이용하여 픽쳐 단위, CTU 단위에의 비트 분배 시 적응적인 분배를 수행한다. 본 논문에서 제안하는 방법은 HEVC 참조 소프트웨어 16.9 버전 대비 CTC (Common Test Condition) Class B 영상에서 평균적으로 BD-rate 3.12%의 성능향상과 BD-PSNR의 0.08dB 향상 및 목표 비트율에의 비트 정확도 0.07% 증가를 보였다. 또한 주관적 화질 측정 결과도 기존 HEVC의 참조 소프트웨어에 적용된 율 제어 알고리즘 대비 DSCQS 스케일에서 평균 0.16 향상된 것을 확인하였다.
https://doi.org/10.5909/JBE.2017.22.3.381 인용 PDF KSCI KPUBS

가상 음질 분석을 이용한 자동차 실내소음 음질 평가 (Sound Quality Evaluation of Vehicle Interior Noise Using Virtual Sound Quality Analysis)

강상욱
- 한국소음진동공학회논문집
- /
- 제27권1호
- /
- pp.100-106
- /
- 2017
Sound quality engineering in automobile noise applications has become more and more important under the current quiet driving condition because various noise components masked under high noise level can be audible in quieter driving situation. Many researches have been carried out for subjective and objective assessments on automobile sounds and noises. In particular, the interior sound quality has been one of research fields that can give high-quality feature to automobile products. Although many works related to the interior sound quality have been progressed or completed in foreign countries, limited research results are presented in the country. In the study, subjective assessments are first performed with 20 subjects to select perceptual adjectives suitable to the assessment of car interior noises during acceleration. The selected perceptual adjectives are employed as the assessment scales to evaluate the acceleration noises in questionnaire procedures using 35 subjects, for which several noises are created through digital filtering of the acceleration noises measured. Mean values and standard deviations for subjective assessment scores obtained by the questionnaire procedures are calculated and their reliability are also verified. Finally, various statistical analyses such as the correlation analysis and the factor analysis are carried out to reveal the interrelationship between the assessment scales and the spectrum components of the acceleration noises.
https://doi.org/10.5050/KSNVE.2017.27.1.100 인용 PDF KSCI

A Perception-based Color Correction Method for Multi-view Images

Shao, Feng;Jiang, Gangyi;Yu, Mei;Peng, Zongju
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제5권2호
- /
- pp.390-407
- /
- 2011
Three-dimensional (3D) video technologies are becoming increasingly popular, as it can provide users with high quality and immersive experiences. However, color inconsistency between the camera views is an urgent problem to be solved in multi-view imaging. In this paper, a perception-based color correction method for multi-view images is proposed. In the proposed method, human visual sensitivity (VS) and visual attention (VA) models are incorporated into the correction process. Firstly, the VS property is used to reduce the computational complexity by removing these visual insensitive regions. Secondly, the VA property is used to improve the perceptual quality of local VA regions by performing VA-dependent color correction. Experimental results show that compared with other color correction methods, the proposed method can greatly promote the perceptual quality of local VA regions greatly and reduce the computational complexity, and obtain higher coding performance.
https://doi.org/10.3837/tiis.2011.02.009 인용 PDF KSCI

An Objective No-Reference Perceptual Quality Assessment Metric based on Temporal Complexity and Disparity for Stereoscopic Video

Ha, Kwangsung;Bae, Sung-Ho;Kim, Munchurl
- IEIE Transactions on Smart Processing and Computing
- /
- 제2권5호
- /
- pp.255-265
- /
- 2013
3DTV is expected to be a promising next-generation broadcasting service. On the other hand, the visual discomfort/fatigue problems caused by viewing 3D videos have become an important issue. This paper proposes a perceptual quality assessment metric for a stereoscopic video (SV-PQAM). To model the SV-PQAM, this paper presents the following features: temporal variance, disparity variation in intra-frames, disparity variation in inter-frames and disparity distribution of frame boundary areas, which affect the human perception of depth and visual discomfort for stereoscopic views. The four features were combined into the SV-PQAM, which then becomes a no-reference stereoscopic video quality perception model, as an objective quality assessment metric. The proposed SV-PQAM does not require a depth map but instead uses the disparity information by a simple estimation. The model parameters were estimated based on linear regression from the mean score opinion values obtained from the subjective perception quality assessments. The experimental results showed that the proposed SV-PQAM exhibits high consistency with subjective perception quality assessment results in terms of the Pearson correlation coefficient value of 0.808, and the prediction performance exhibited good consistency with a zero outlier ratio value.
PDF

검색결과 344건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)