• Title/Summary/Keyword: Perceptual quality

Search Result 344, Processing Time 0.033 seconds

HVS design for Internet Shopping-Mall Image Watermarking (인터넷 쇼핑몰 이미지 워터마킹을 위한 HVS 설계 방법)

  • Seo, Yong-Seok;Kim, Won-Gyum;Lee, Seon-Hwa;Hwang, Chi-Jung
    • Proceedings of the IEEK Conference
    • /
    • 2006.06a
    • /
    • pp.443-444
    • /
    • 2006
  • In this paper, a spatial-based perceptual watermarking considering human visual system (HVS) that is proposed for small-size images such as internet shopping-mall image. In our method, a multi-bit data can be embedded in luminance component of color images still keeping the perceptual quality of image. Experimental results demonstrated that watermarks can be strongly embedded while preserving a good fidelity.

  • PDF

Voice Quality of Dysarthric Speakers in Connected Speech (연결발화에서 마비말화자의 음질 특성)

  • Seo, Inhyo;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.33-41
    • /
    • 2013
  • This study investigated the perceptual and cepstral/spectral characteristics of phonation and their relationships in dysarthria in connected speech. Twenty-two participants were divided into two groups; the eleven dysarthric speakers were paired with matching age and gender healthy control participants. A perceptual evaluation was performed by three speech pathologists using the GRBAS scale to measure the cepstrual/spectral characteristics of phonation between the two groups' connected speech. Correlations showed dysarthric speakers scored significantly worse (with a higher rating) with severities in G (overall dysphonia grade), B (breathiness), and S (strain), while the smoothed prominence of the cepstral peak (CPPs) was significantly lower. The CPPs were significantly correlated with the perceptual ratings, including G, B, and S. The utility of CPPs is supported by its high relationship with perceptually rated dysphonia severity in dysarthric speakers. The receiver operating characteristic (ROC) analysis showed that the threshold of 5.08 dB for the CPPs achieved a good classification for dysarthria, with 63.6% sensitivity and the perfect specificity (100%). Those results indicate the CPPs reliably distinguished between healthy controls and dysarthric speakers. However, the CPP frequency (CPP F0) and low-high spectral ratio (L/H ratio) were not significantly different between the two groups.

Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding (저전송률 오디오 부호화에서 음성 신호의 성능 개선을 위한 마스킹 임계값 적응기법 향상)

  • Lee, Chang-Heon;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.1
    • /
    • pp.62-68
    • /
    • 2010
  • This paper proposes a new masking threshold adjustment strategy to improve the performance for speech signals in low bit-rate audio coding. After determining formant regions, the masking threshold is adjusted by using the energy ratio of each sub-band to the average energy of each formant. More quantization noises are added to the bands that have relatively large energy, but less distortion is allowed in spectral valley regions by allocating more bits, which reflects the concept of perceptual weighting widely used in speech coding. From the results of objective speech quality measure, we verified that the proposed method improves quality for the speech input signals compared to the conventional one.

A Correlation Study between Acoustic and Perceptual Parameters of the Singing Voice in Singing Students (성악 전공 학생의 가창 시 음성의 음향학적 매개 변수와 지각적 매개 변수사이의 상관 연구)

  • Jo, Sung-Mi;Lee, Sang-Ouk;Jeong, Ok-Ran
    • Proceedings of the KSPS conference
    • /
    • 2004.05a
    • /
    • pp.219-222
    • /
    • 2004
  • The purpose of this study was to determine a correlation between acoustic and perceptual parameters of the singing voice in singing students and compare them with the results with previous studies, and a more sensitive parameters in analyzing professional vocal usage. This study measured acoustic and perceptual parameters in 41 singing students. Digital audio recordings were made in sung vowels acoustic analysis. Each sample was judged by 1 experienced singing teacher and 1 voice pathologist on two semantic bipolar 7-point scales (ringing-dull, rich-thin). The results showed that SPP1 (p<0.01), SPP2 (p<0.01), and P1(p<0.01) had significant correlations with ringing and richness quality.

  • PDF

Desgin of Foveated Frequency Sensitivity (Foveated Frequency Sensitivity의 구현)

  • Tran, Nhat Huy;Bui, Minh Trung;Kim, Wonha
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2014.11a
    • /
    • pp.248-251
    • /
    • 2014
  • We develop the signal processing method for implementing the human perceptual variant on frequency and space. The human visual perceptual sensitivity varies as frequency components and the human perceivable resolution diminishes as the distances further from the eye-focused point. For realizing the frequency sensitivity, we developed the signal direction adaptive multiband energy scaling method to weight the frequency components. The low-pass filtering is designed on the developed energy scaling method for diminishing perceivable resolutions as the deviated distance from the eye-focused point. The developed method not only enhances the frequency components of image signals at the eye-focused region but also smoothes non-perceivable detailed image signals at non-focused regions. The proposed method is verified by the subjective and objective evaluations that it can improve human perceptual visual quality.

  • PDF

Perceptual weighting on English lexical stress by Korean learners of English

  • Goun Lee
    • Phonetics and Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.19-24
    • /
    • 2022
  • This study examined which acoustic cue(s) that Korean learners of English give weight to in perceiving English lexical stress. We manipulated segmental and suprasegmental cues in 5 steps in the first and second syllables of an English stress minimal pair "object". A total of 27 subjects (14 native speakers of English and 13 Korean L2 learners) participated in the English stress judgment task. The results revealed that native Korean listeners used the F0 and intensity cues in identifying English stress and weighted vowel quality most strongly, as native English listeners did. These results indicate that Korean learners' experience with these cues in L1 prosody can help them attend to these cues in their L2 perception. However, L2 learners' perceptual attention is not entirely predicted by their linguistic experience with specific acoustic cues in their native language.

A comparison of the perceptual-auditory voice quality evaluation (GRBAS) and voice-related quality of life (K-VRQOL) according to choir type of elderly women choir members (여성 노인 합창단원의 합창단 유형에 따른 청지각적 음성평가(GRBAS) 및 음성관련 삶의 질(K-VRQOL) 비교)

  • Lee, Hyeonjung;Kang, Binna;Kim, Soo Ji
    • Phonetics and Speech Sciences
    • /
    • v.12 no.2
    • /
    • pp.51-61
    • /
    • 2020
  • The purpose of this study is to compare voice characteristics and voice-related quality of life (K-VRQOL) of the elderly female choir members using perceptual-auditory voice quality evaluation (GRBAS) and K-VRQOL scales. The participants were 77 women over 60 years old who were actively engaged in the choir in either Seoul or Busan. There are two kinds of choirs that indicate different engagement levels: regular choir and church choir. The perceptual-auditory vocal quality evaluation was listened to by / a / vowels and were graded by experts using the GRBAS scale. As a result, when comparing the differences between groups, the elderly female participants of the regular choir showed higher satisfaction in speech using the subjective speech recognition level than the elderly female members who performed in the church choir. In addition, the analysis showed that the satisfaction level was high in the physical function area of the K-VRQOL scale. This study confirmed that choral activities could yield positive results not only in terms of improving voice function in old age, but also to improve the subjective perception level of voice use, thus suggesting the necessity of systematic music programs to improve voices that are aging.

A Model-Based Image Steganography Method Using Watson's Visual Model

  • Fakhredanesh, Mohammad;Safabakhsh, Reza;Rahmati, Mohammad
    • ETRI Journal
    • /
    • v.36 no.3
    • /
    • pp.479-489
    • /
    • 2014
  • This paper presents a model-based image steganography method based on Watson's visual model. Model-based steganography assumes a model for cover image statistics. This approach, however, has some weaknesses, including perceptual detectability. We propose to use Watson's visual model to improve perceptual undetectability of model-based steganography. The proposed method prevents visually perceptible changes during embedding. First, the maximum acceptable change in each discrete cosine transform coefficient is extracted based on Watson's visual model. Then, a model is fitted to a low-precision histogram of such coefficients and the message bits are encoded to this model. Finally, the encoded message bits are embedded in those coefficients whose maximum possible changes are visually imperceptible. Experimental results show that changes resulting from the proposed method are perceptually undetectable, whereas model-based steganography retains perceptually detectable changes. This perceptual undetectability is achieved while the perceptual quality - based on the structural similarity measure - and the security - based on two steganalysis methods - do not show any significant changes.

A Study on the Evaluation Method of Perceptual Contrast with CIECAM02

  • Chong, Jong-Ho;Lee, Seung-Bae;Lee, Sang-Myung;Choi, Young-Chul;Bae, Jae-Woo;Kim, Hun-Soo;Chung, Ho-Kyoon
    • 한국정보디스플레이학회:학술대회논문집
    • /
    • 2007.08b
    • /
    • pp.1661-1663
    • /
    • 2007
  • The contrast of display is one of the important specifications. Even if the contrast indicates luminance range which is a capability of the display and is greater in lower luminance or higher luminance, we consider that the greater contrast gets not the better performance. It is not the same value in human visual system. In practice, it is difficult to achieve the full dynamic range seen by human beings using electronic equipment. Therefore, we consider ambient condition and human perception to calculate perceptual contrast using the CIECAM02. In this paper, we propose perceptual contrast that is calculated using the brightness of CIECAM02.

  • PDF

Visual-Attention-Aware Progressive RoI Trick Mode Streaming in Interactive Panoramic Video Service

  • Seok, Joo Myoung;Lee, Yonghun
    • ETRI Journal
    • /
    • v.36 no.2
    • /
    • pp.253-263
    • /
    • 2014
  • In the near future, traditional narrow and fixed viewpoint video services will be replaced by high-quality panorama video services. This paper proposes a visual-attention-aware progressive region of interest (RoI) trick mode streaming service (VA-PRTS) that prioritizes video data to transmit according to the visual attention and transmits prioritized video data progressively. VA-PRTS enables the receiver to speed up the time to display without degrading the perceptual quality. For the proposed VA-PRTS, this paper defines a cutoff visual attention metric algorithm to determine the quality of the encoded video slice based on the capability of visual attention and the progressive streaming method based on the priority of RoI video data. Compared to conventional methods, VA-PRTS increases the bitrate saving by over 57% and decreases the interactive delay by over 66%, while maintaining a level of perceptual video quality. The experiment results show that the proposed VA-PRTS improves the quality of the viewer experience for interactive panoramic video streaming services. The development results show that the VA-PRTS has highly practical real-field feasibility.