• Title/Summary/Keyword: Perceptual model

Search Result 219, Processing Time 0.032 seconds

Coding Unit-level Multi-loop Encoding Method based on JND for Perceptual Coding (JND 모델을 사용한 코딩 유닛 레벨 멀티-루프 인코딩 기반의 비디오 압축 방법)

  • Lim, Woong;Sim, Donggyu
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.52 no.5
    • /
    • pp.147-154
    • /
    • 2015
  • In this paper, we employed a model which defines the sensitivity according to the background luminance, so called JND (Just Noticeable Difference), and applied to the video coding. The proposed method finds out the maximum possible quantization parameter for the current unit based on the threshold of JND model and reduce the bitrate with similar perceptual quality. It selects the higher quantization parameter and reduce the bitrate when the reconstructed signal which is coded with higher quantization parameter is in a range of allowance based on the JND threshold, i.e. the signal has the similar perceptual quality compared to that is coded with the initial quantization parameter. The proposed algorithm was implemented on HM16.0, which is a reference software of the latest video coding standard HEVC (High Efficiency Video Coding) and the coding performance was evaluated. Compared to HM16.0, the proposed algorithm achieved maximum 20.21% and 6.18% of average bitrate reduction with the similar perceptual quality.

A Multi-category Task for Bitrate Interval Prediction with the Target Perceptual Quality

  • Yang, Zhenwei;Shen, Liquan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.12
    • /
    • pp.4476-4491
    • /
    • 2021
  • Video service providers tend to face user network problems in the process of transmitting video streams. They strive to provide user with superior video quality in a limited bitrate environment. It is necessary to accurately determine the target bitrate range of the video under different quality requirements. Recently, several schemes have been proposed to meet this requirement. However, they do not take the impact of visual influence into account. In this paper, we propose a new multi-category model to accurately predict the target bitrate range with target visual quality by machine learning. Firstly, a dataset is constructed to generate multi-category models by machine learning. The quality score ladders and the corresponding bitrate-interval categories are defined in the dataset. Secondly, several types of spatial-temporal features related to VMAF evaluation metrics and visual factors are extracted and processed statistically for classification. Finally, bitrate prediction models trained on the dataset by RandomForest classifier can be used to accurately predict the target bitrate of the input videos with target video quality. The classification prediction accuracy of the model reaches 0.705 and the encoded video which is compressed by the bitrate predicted by the model can achieve the target perceptual quality.

A Study on the Wayfinding Model of Outpatient Department in General Hospital (종합병원 외래진료부 진로인지계획 모형에 관한 연구)

  • Han, Gi-Jeung;Lee, Teuk-Koo
    • Journal of The Korea Institute of Healthcare Architecture
    • /
    • v.13 no.2
    • /
    • pp.27-36
    • /
    • 2007
  • Recently, hospital patients experience anxiety, confusion, and stress about wayfinding as the spacial layout and treatment circulatory system of hospitals have become complicated due to their oversized and complex structure. As part of finding a solution to the problem, this study seeks to examine what are the essential elements of the wayfinding planning of O.P.D. in general hospitals, to develop the model of wayfinding, and to suggest the methods of improving the wayfinding system. The research methods of this study adopted were literature review in wayfinding cognition, plan analysis of ten general hospitals, space analysis of these hospitals through space syntax, analysis of the system of visual-perceptual information through a field study, and analysis of surveys and follow-up surveys conducted to support the results. Based on these results, the proposals for finding decision points, providing the information, and developing a model planning are listed as follows. 1) The comprehensive understanding of O.P.D. spacial layout and the visual-perceptual information system is necessary to find the essential elements of wayfinding. 2) The decision points are found through the full understanding of spacial functions, circulation systems, and facility configuration, considering the spacial layout, the bound of the visual-perceptual information system, and the circulatory system. Furthermore, the information decision points could be confined by space syntax. 3) The checklist and color compound & color codes, developed through the planning of signage system and color system could be applied to the methods of providing the information. 4) The planning of wayfinding system according to the whole process of practices for outpatients was mentioned above. The system of visual-perceptual information developed through the process of this study should be integrated in the spacial layout of the whole O.P.D.

  • PDF

Speech Enhancement Based on Psychoacoustic Model (심리음향모델에 근거한 음성개선)

  • Lee Jingeol
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.337-338
    • /
    • 2000
  • The perceptual filter for speech enhancement was analytically derived where the frequency content of the input noisy signal was made the same as that of the estimated clean signal in auditory domain. However, the analytical derivation should rely on the deconvolution associated with the spreading function in the psychoacoustic model, which results in an ill-conditioned problem. In order to cope with the problem associated with the deconvolution, we propose a novel psychoacoustic model based speech enhancement filter whose principle is the same as the perceptual filter, however the filter is derived by a constrained optimization which provides solutions to the ill-conditioned problem.

  • PDF

Comparion of Noise Suppression Methods in Voice CODEC (음성코덱에서의 잡음제거 방식 비교)

  • Lee, Jin-Geol
    • The Journal of Engineering Research
    • /
    • v.3 no.1
    • /
    • pp.43-46
    • /
    • 1998
  • Considerable research in the last three decades has examined the problem of enhancement of speech degraded by additive background noise. We compare traditional methods such as spectral subtraction and Wiener filter, recently proposed psychoacoustic model based methods such as perceptual filter and noise suppression in EVRC in terms of performance and complexity.

  • PDF

The Study on the Factors of film's Processing Fluency Inducing Film's Preference Fluency (영화의 선호도 유창성에 영향을 미치는 영화의 처리 유창성 요인에 관한 연구)

  • Choi, Nak Hwan;Lim, Ahyoung
    • Asia Marketing Journal
    • /
    • v.13 no.4
    • /
    • pp.29-54
    • /
    • 2012
  • Recently, as film has been admitted as the artistic merit and higher value-added business, the past studies on film have been lively carried on various fields. Especially, the film business is thought to be value-added business and has explored the causes of influencing spectators. However, there are no researches enough to explain what induces the choice and diffusion of the film. The film is not only hedonic products but also typical example of experiential products. So how to process film formation plays important roles in explaining the procedures of forming preference on the film and the movies spread more widely. But the great part of study of films has been concentrated on exploring hedonic factors of influencing spectator's choice. Until now there is not enough study for the relationship between experiential information processing and film preference. To explain film's preference, our study focuses on preference fluency and processing fluency that can provide an insight for our question about the relationship. In this article, to explain the procedures of processing experiential information and forming preference on the film, our study focuses on finding the relationship between film's processing fluency and film's preference fluency and explores the factors that affect film's processing fluency. To achieve the goal of this study, we distinguish factors of film's conceptual fluency from factors of film's perceptual fluency and explore the paths from the factors to film's preference fluency. The factors which have effects on perceptual fluency are hypothesized to be distinction of image expression, distinction of sound expression, correspondence between actors' image and their role. The factors which have effect on conceptual fluency are supposed to be well-organized story, suitability of lines expression. The experiments in which students were sampled at 'C' university were conducted in 2010 (december). Data collection was proceeded through questionnaires. We test the hypothesized model by using structural equation modeling(Amos 17.0). The fit indices for the model are as follows : x2=416.266(df=213, p=0.00), GFI=0.855, AGFI=0.812, RMSEA =0.069, IFI=O.925, CFI=0.920, TLI=0.905. According to the guidelines, there is evidence that our measurement model fits data. The results of empirical study are as follows. The path from film's perceptual fluency to film's preference fluency is supported(estimate: 0.223, C.R: 2.641). The path from film's conceptual fluency to film's preference fluency is supported(estimate: 0.397, C.R: 4.863). The path from distinction of image expression to film's perceptual fluency is not supported(estimate: 0.113, C.R: 1.665). The path from distinction of sound expression to film's perceptual fluency is supported (estimate: 0.190, C.R: 2.042). The path from correspondence between actors' image and their role to film's perceptual fluency is supported(estimate: 0.686, C.R: 5.566). The path from well-organized story to film's conceptual fluency is supported(estimate: 0.396, C.R: 4.023). The path from suitability of lines expression to film's conceptual fluency is supported(estimate: 0.536, C.R: 5.441). Concludingly, our study explored the influencing factors of film's processing fluency inducing film's Preference fluency. First, film's perceptual fluency and film's conceptual fluency have positive effects on film's preference fluency. Second, distinction of image expression is not significant on film's perceptual fluency, but distinction of sound expression and image's correspondence of actors' image and their role have positive effects on film's perceptual fluency. Lastly, well-organized story and suitability of lines expression have positive effects on film's conceptual fluency.

  • PDF

A Study on the Evaluation Method of Perceptual Contrast with CIECAM02

  • Chong, Jong-Ho;Lee, Seung-Bae;Lee, Sang-Myung;Choi, Young-Chul;Bae, Jae-Woo;Kim, Hun-Soo;Chung, Ho-Kyoon
    • 한국정보디스플레이학회:학술대회논문집
    • /
    • 2007.08b
    • /
    • pp.1661-1663
    • /
    • 2007
  • The contrast of display is one of the important specifications. Even if the contrast indicates luminance range which is a capability of the display and is greater in lower luminance or higher luminance, we consider that the greater contrast gets not the better performance. It is not the same value in human visual system. In practice, it is difficult to achieve the full dynamic range seen by human beings using electronic equipment. Therefore, we consider ambient condition and human perception to calculate perceptual contrast using the CIECAM02. In this paper, we propose perceptual contrast that is calculated using the brightness of CIECAM02.

  • PDF

SPATIAL EXPLANATIONS OF SPEECH PERCEPTION: A STUDY OF FRICATIVES

  • Choo, Won;Mark Huckvale
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.399-403
    • /
    • 1996
  • This paper addresses issues of perceptual constancy in speech perception through the use of a spatial metaphor for speech sound identity as opposed to a more conventional characterisation with multiple interacting acoustic cues. This spatial representation leads to a correlation between phonetic, acoustic and auditory analyses of speech sounds which can serve as the basis for a model of speech perception based on the general auditory characteristics of sounds. The correlations between the phonetic, perceptual and auditory spaces of the set of English voiceless fricatives /f $\theta$ s $\int$ h / are investigated. The results show that the perception of fricative segments may be explained in terms of 2-dimensional auditory space in which each segment occupies a region. The dimensions of the space were found to be the frequency of the main spectral peak and the 'peakiness' of spectra. These results support the view that perception of a segment is based on its occupancy of a multi-dimensional parameter space. In this way, final perceptual decisions on segments can be postponed until higher level constraints can also be met.

  • PDF

Perceptual weighting on English lexical stress by Korean learners of English

  • Goun Lee
    • Phonetics and Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.19-24
    • /
    • 2022
  • This study examined which acoustic cue(s) that Korean learners of English give weight to in perceiving English lexical stress. We manipulated segmental and suprasegmental cues in 5 steps in the first and second syllables of an English stress minimal pair "object". A total of 27 subjects (14 native speakers of English and 13 Korean L2 learners) participated in the English stress judgment task. The results revealed that native Korean listeners used the F0 and intensity cues in identifying English stress and weighted vowel quality most strongly, as native English listeners did. These results indicate that Korean learners' experience with these cues in L1 prosody can help them attend to these cues in their L2 perception. However, L2 learners' perceptual attention is not entirely predicted by their linguistic experience with specific acoustic cues in their native language.

Tonality Detection based on Spectrum Energy in Perceptual Audio Coder (지각 오디오 부호화기에서의 스펙트럼 에너지 기반 톤 성분 검출 알고리듬)

  • 이근섭;연규철;박영철;윤대희
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.6C
    • /
    • pp.770-776
    • /
    • 2004
  • The goal of perceptual audio coder is to reduce redundancy and irrelevancy of audio signal based on the concept of masking. Several studies on masking effect reveal that the masking threshold varies as a function of the noise-like or tone-like nature of audio signals. Therefore, tonality of audio signal influences significantly the quality and efficiency of perceptual audio coder In this paper, we propose a new effective algorithm for tonality measure using spectrum energy. Since the proposed algorithm consists of a few transcendental functions and simple operations, it has lower complexity than MPEG psychoacoustic model-II. The proposed algorithm was tested with some audio signals, and DSP implementation showed that the proposed algorithm could be implemented with 3 MIPS. These results illustrate the efficiency of proposed algorithm in both performance and complexity.