• Title/Summary/Keyword: Perceptual region


Adaptive Watermark Detection Algorithm Using Perceptual Model and Statistical Decision Method Based on Multiwavelet Transform

  • Hwang Eui-Chang;Kim Dong Kyue;Moon Kwang-Seok;Kwon Ki-Ryong
    • Journal of Korea Multimedia Society / v.8 no.6 / pp.783-789 / 2005
  • This paper proposes a watermarking technique for copyright protection of multimedia content. We propose an adaptive watermark detection algorithm that uses a stochastic perceptual model and a statistical decision method in the DMWT (discrete multiwavelet transform) domain. The stochastic perceptual model calculates the NVF (noise visibility function) from statistical characteristics in the DMWT domain. The watermark detection algorithm uses a likelihood ratio that follows from Bayes' decision theory, with a reliable detection measure and the Neyman-Pearson criterion. To reduce visual artifacts in the image, the number of embedded watermarks is decided adaptively in the DMWT domain, and the embedding strength is set differently for edge and texture regions than for flat regions so that embedding minimizes image distortion. In the experimental results, the proposed statistical decision method in the multiwavelet domain was able to decide whether the watermark was detected.

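
The Neyman-Pearson step named in the abstract above can be illustrated with a much simpler model than the paper's. The sketch below assumes a zero-mean Gaussian host with known standard deviation and an additive ±1 watermark, neither of which comes from the abstract; `np_threshold` and `detect` are hypothetical names. Under these assumptions the likelihood ratio test reduces to thresholding the correlation between the coefficients and the watermark, and the threshold is fixed by the allowed false-alarm probability.

```python
import math
import random
from statistics import NormalDist


def np_threshold(watermark, sigma, p_fa):
    """Threshold on T = sum(w * y) for a target false-alarm probability.

    Assumption: with no watermark present, y ~ N(0, sigma^2) i.i.d.,
    so T ~ N(0, sigma^2 * sum(w^2)).
    """
    energy = sum(w * w for w in watermark)
    return sigma * math.sqrt(energy) * NormalDist().inv_cdf(1.0 - p_fa)


def detect(coeffs, watermark, sigma, p_fa=1e-3):
    """Return (decision, statistic) for the correlation detector."""
    stat = sum(w * y for w, y in zip(watermark, coeffs))
    return stat > np_threshold(watermark, sigma, p_fa), stat


# Illustrative synthetic data only (not from the paper).
rng = random.Random(0)
n, sigma, strength = 4096, 10.0, 2.0
watermark = [rng.choice((-1.0, 1.0)) for _ in range(n)]
host = [rng.gauss(0.0, sigma) for _ in range(n)]
marked = [x + strength * w for x, w in zip(host, watermark)]

marked_decision, marked_stat = detect(marked, watermark, sigma)
host_decision, host_stat = detect(host, watermark, sigma)
```

Lowering `p_fa` raises the threshold, which trades missed detections for fewer false alarms; the paper's detector works on DMWT coefficients with its own statistical model.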

Adaptive Digital Watermarking Based on Wavelet Transform Using Successive Subband Quantization and Perceptual Model

  • Kim, Ju-Young;Kwon, Seong-geun;Hwang, Hee-Chul;Kwon, Ki-Ryong;Kim, Duk-Gyoo
    • Proceedings of the IEEK Conference / 2002.07b / pp.1240-1243 / 2002
  • In this paper, we propose an adaptive digital image watermarking algorithm that uses successive subband quantization (SSQ) and a perceptual model in the wavelet domain. The watermark is embedded into the perceptually significant coefficients (PSCs) of the image. The PSCs in the baseband are selected according to the amplitude of the coefficients, and those in the high-frequency subbands are selected by SSQ. To embed the watermark, we use a perceptual model. The perceptual model is based on computing the noise visibility function (NVF), and it embeds the watermark more strongly in texture and edge regions.

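
The abstract above does not give its NVF formula. A minimal sketch, assuming the form commonly used in the watermarking literature for a non-stationary Gaussian image model, NVF = 1 / (1 + θ·σ²) with θ = D / σ²max, shows how the function steers embedding strength. The window radius, the constant `d`, the strengths `s_edge` and `s_flat`, and all function names are illustrative assumptions, and the sketch works on raw pixels rather than wavelet coefficients.

```python
def local_variance(img, r=1):
    """Variance of each pixel's (2r+1) x (2r+1) neighbourhood, clipped at borders."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            window = [img[a][b]
                      for a in range(max(0, i - r), min(h, i + r + 1))
                      for b in range(max(0, j - r), min(w, j + r + 1))]
            mean = sum(window) / len(window)
            out[i][j] = sum((v - mean) ** 2 for v in window) / len(window)
    return out


def nvf(img, d=75.0, r=1):
    """NVF near 1 in flat areas, near 0 in textured or edge areas."""
    var = local_variance(img, r)
    vmax = max(max(row) for row in var)
    if vmax == 0.0:                      # completely flat image
        return [[1.0] * len(row) for row in var]
    theta = d / vmax
    return [[1.0 / (1.0 + theta * v) for v in row] for row in var]


def embed(img, wm, s_edge=8.0, s_flat=1.0, d=75.0):
    """Add the watermark with strength s_edge where NVF is low, s_flat where it is high."""
    vis = nvf(img, d)
    return [[x + ((1.0 - n) * s_edge + n * s_flat) * w
             for x, n, w in zip(row_x, row_n, row_w)]
            for row_x, row_n, row_w in zip(img, vis, wm)]


# Toy image: flat left half, flat right half, one vertical step edge between them.
img = [[10.0] * 3 + [200.0] * 3 for _ in range(6)]
wm = [[1.0] * 6 for _ in range(6)]
vis = nvf(img)
marked = embed(img, wm)
```

In this toy image the pixels next to the step edge receive nearly the full `s_edge` strength, while pixels in the flat interior receive only `s_flat`.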

SPATIAL EXPLANATIONS OF SPEECH PERCEPTION: A STUDY OF FRICATIVES

  • Choo, Won;Mark Huckvale
    • Proceedings of the KSPS conference / 1996.10a / pp.399-403 / 1996
  • This paper addresses issues of perceptual constancy in speech perception through the use of a spatial metaphor for speech sound identity as opposed to a more conventional characterisation with multiple interacting acoustic cues. This spatial representation leads to a correlation between phonetic, acoustic and auditory analyses of speech sounds which can serve as the basis for a model of speech perception based on the general auditory characteristics of sounds. The correlations between the phonetic, perceptual and auditory spaces of the set of English voiceless fricatives /f θ s ʃ h/ are investigated. The results show that the perception of fricative segments may be explained in terms of a 2-dimensional auditory space in which each segment occupies a region. The dimensions of the space were found to be the frequency of the main spectral peak and the 'peakiness' of the spectra. These results support the view that perception of a segment is based on its occupancy of a multi-dimensional parameter space. In this way, final perceptual decisions on segments can be postponed until higher level constraints can also be met.


Design of Foveated Frequency Sensitivity (Foveated Frequency Sensitivity의 구현)

  • Tran, Nhat Huy;Bui, Minh Trung;Kim, Wonha
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2014.11a / pp.248-251 / 2014
  • We develop a signal processing method that models how human perception varies with frequency and space. Human visual sensitivity varies across frequency components, and the perceivable resolution diminishes with distance from the eye-focused point. To realize the frequency sensitivity, we developed a signal-direction-adaptive multiband energy scaling method that weights the frequency components. Low-pass filtering is designed on top of this energy scaling method to reduce the perceivable resolution as the distance from the eye-focused point grows. The developed method not only enhances the frequency components of image signals in the eye-focused region but also smooths non-perceivable image detail in non-focused regions. Subjective and objective evaluations verify that the proposed method can improve perceptual visual quality.


Image Processing based on Tensor Voting and its Applications (텐서 보팅에 기반한 영상처리 및 응용)

  • Park, Jong Hyun;Park, Soonyoung;Lee, Guee Sang
    • Smart Media Journal / v.1 no.2 / pp.23-33 / 2012
  • This paper surveys the characteristics of tensor voting, which is used extensively in image processing and computer vision. In general, tensor voting can infer structural features such as junctions, curves, regions and surfaces from n-dimensional data given as points, curve elements or surface patch elements. Various perceptual grouping methods based on such structural inference are currently being studied and are used in diverse applications on images and scenes. Tensor voting is robust to noise and has proved efficient in many applications.


Adaptive Image Watermarking Using a Stochastic Multiresolution Modeling

  • Kim, Hyun-Chun;Kwon, Ki-Ryong;Kim, Jong-Jin
    • Proceedings of the IEEK Conference / 2002.07a / pp.172-175 / 2002
  • This paper presents a perceptual model with a stochastic multiresolution characteristic that can be applied to watermark embedding in the biorthogonal wavelet domain. The perceptual model, combined with an adaptive watermarking algorithm, embeds the watermark more strongly in texture and edge regions by means of SSQ (successive subband quantization). The watermark embedding is based on computing an NVF (noise visibility function) that reflects local image properties. The method uses a non-stationary Gaussian model and a stationary generalized Gaussian (GG) model because the watermark has noise properties. To determine the optimal NVF, we treat the watermark as noise. Embedding with the stationary GG model uses the shape parameter and variance of each subband region in the multiresolution decomposition. To estimate the shape parameter, we use a moment matching method. The non-stationary Gaussian model uses the local mean and variance of each subband. Simulation results showed excellent invisibility and robustness. Robustness against distortion was tested with the StirMark benchmark.

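
The moment matching step for the generalized Gaussian shape parameter can be sketched as follows. This assumes the usual estimator for a zero-mean GG density, which matches the sample ratio (E|x|)² / E[x²] to its closed form Γ(2/γ)² / (Γ(1/γ)·Γ(3/γ)) and solves for the shape γ; the abstract does not say which moments the authors matched. The function names and the bisection bounds are illustrative.

```python
import math
import random


def gg_ratio(shape):
    """(E|x|)^2 / E[x^2] for a zero-mean generalized Gaussian with this shape."""
    return math.exp(2.0 * math.lgamma(2.0 / shape)
                    - math.lgamma(1.0 / shape)
                    - math.lgamma(3.0 / shape))


def estimate_shape(samples, lo=0.1, hi=10.0, iters=60):
    """Solve gg_ratio(shape) = sample ratio by bisection (the ratio grows with shape)."""
    n = len(samples)
    m1 = sum(abs(x) for x in samples) / n
    m2 = sum(x * x for x in samples) / n
    # Clamp the target so that a root exists inside [lo, hi].
    target = min(max(m1 * m1 / m2, gg_ratio(lo)), gg_ratio(hi))
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if gg_ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Synthetic check: Gaussian data is the GG special case with shape 2.
rng = random.Random(1)
samples = [rng.gauss(0.0, 3.0) for _ in range(20000)]
shape_hat = estimate_shape(samples)
```

A shape of 2 corresponds to a Gaussian and a shape of 1 to a Laplacian, so the estimate indicates how heavy-tailed a subband's coefficients are.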

Salient Object Detection via Adaptive Region Merging

  • Zhou, Jingbo;Zhai, Jiyou;Ren, Yongfeng
    • KSII Transactions on Internet and Information Systems (TIIS) / v.10 no.9 / pp.4386-4404 / 2016
  • Most existing salient object detection algorithms employ segmentation techniques to eliminate background noise and reduce computation by treating each segment as a processing unit. However, individual small segments provide little information about global content, so such schemes have limited capability for modeling global perceptual phenomena. In this paper, a novel salient object detection algorithm based on region merging is proposed. An adaptive merging scheme is developed to reassemble regions based on their color dissimilarities. The merging strategy is that a region R is merged with its adjacent region Q if Q has the lowest dissimilarity with R among all of Q's adjacent regions. To guide the merging process, superpixels located at the boundary of the image are treated as seeds. However, a boundary of the input image may be occupied by the foreground object. To avoid this case, we optimize the boundary influences by locating and eliminating erroneous boundaries before the region merging. We show that even though only three simple saliency measurements are adopted for each region, encouraging performance can be obtained. Experiments on four benchmark datasets, MSRA-B, SOD, SED and iCoSeg, show that the proposed method yields uniform object enhancement and achieves state-of-the-art performance in comparison with nine existing methods.
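
The merging rule in the abstract above can be sketched on a toy region adjacency graph. The sketch reads the rule as "R merges with Q when R is the least dissimilar of Q's neighbours", uses Euclidean distance between mean colors as the dissimilarity, and leaves out the seed and boundary handling; all of these are assumptions, and `should_merge` is a hypothetical name.

```python
import math


def dissimilarity(color_a, color_b):
    """Assumed measure: Euclidean distance between mean region colors."""
    return math.dist(color_a, color_b)


def should_merge(r, q, colors, adjacency):
    """True if r is adjacent to q and is q's least dissimilar neighbour."""
    if q not in adjacency[r]:
        return False
    best = min(adjacency[q], key=lambda n: dissimilarity(colors[q], colors[n]))
    return best == r


# Toy graph: A and B are similar dark regions, C is a distinct reddish region.
colors = {"A": (10, 10, 10), "B": (12, 11, 10), "C": (200, 50, 50)}
adjacency = {"A": ["B"], "B": ["A", "C"], "C": ["B"]}

merge_a_into_b = should_merge("A", "B", colors, adjacency)
merge_c_into_b = should_merge("C", "B", colors, adjacency)
```

Here A qualifies for merging with B, while C does not because B has a closer neighbour.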

Target Detection Based on Moment Invariants

  • Wang, Jiwu;Sugisaka, Masanori
    • 제어로봇시스템학회:학술대회논문집 / 2003.10a / pp.677-680 / 2003
  • Perceptual landmarks are an effective way for a mobile robot to achieve steady, reliable long-distance navigation. The prerequisite is that the landmarks be detected and recognized robustly and quickly under various lighting conditions. This complicates image processing, so speed and reliability cannot both be satisfied at the same time. Color-based target detection can quickly separate target color regions from non-target color regions in an image, but it gives good results only under good lighting conditions. Moreover, when other objects share the target color, other target features must be considered to tell the target apart from them. This always happens when a target is detected from a single characteristic. In addition, usually only one target can be searched for at a time, so landmarks cannot be used efficiently, especially when several landmarks are meant to work together. In this paper, by using the moment invariants of each landmark, we can not only search for a specified target within the separated color regions but also find multiple targets at the same time if necessary. This lets a finite set of landmarks serve more functions. Because moment invariants combine easily with low-level image processing techniques such as color-based target detection and gradient-run-based target detection, and because they are more reliable features of each target, the target detection rate improved. Experiments were carried out to verify the robustness and efficiency of the method.

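
The abstract above does not say which moment invariants it uses. As an illustration, the sketch below assumes Hu's invariants and computes the first two for a binary region mask, then checks that they are unchanged by a 90-degree rotation. `hu_first_two` and `rotate90` are hypothetical helper names, and the mask is assumed to contain at least one foreground pixel.

```python
def hu_first_two(mask):
    """First two Hu invariants of a non-empty binary mask (list of rows of 0/1)."""
    pts = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    m00 = float(len(pts))
    cx = sum(x for x, _ in pts) / m00
    cy = sum(y for _, y in pts) / m00

    def eta(p, q):
        # Central moment, normalized so that the result is scale invariant.
        mu = sum((x - cx) ** p * (y - cy) ** q for x, y in pts)
        return mu / m00 ** (1 + (p + q) / 2)

    e20, e02, e11 = eta(2, 0), eta(0, 2), eta(1, 1)
    phi1 = e20 + e02
    phi2 = (e20 - e02) ** 2 + 4.0 * e11 ** 2
    return phi1, phi2


def rotate90(mask):
    """Rotate the mask by 90 degrees clockwise."""
    return [list(row) for row in zip(*mask[::-1])]


# L-shaped toy landmark region.
shape = [[1, 1, 1, 1],
         [1, 0, 0, 0],
         [1, 0, 0, 0]]
original = hu_first_two(shape)
rotated = hu_first_two(rotate90(shape))
```

Because the two feature vectors agree, a landmark segmented by color could be matched by these values regardless of its orientation in the image, which is the property the abstract relies on.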

Visual-Attention-Aware Progressive RoI Trick Mode Streaming in Interactive Panoramic Video Service

  • Seok, Joo Myoung;Lee, Yonghun
    • ETRI Journal / v.36 no.2 / pp.253-263 / 2014
  • In the near future, traditional narrow, fixed-viewpoint video services will be replaced by high-quality panoramic video services. This paper proposes a visual-attention-aware progressive region of interest (RoI) trick mode streaming service (VA-PRTS) that prioritizes video data for transmission according to visual attention and transmits the prioritized video data progressively. VA-PRTS enables the receiver to shorten the time to display without degrading the perceptual quality. For the proposed VA-PRTS, this paper defines a cutoff visual attention metric algorithm to determine the quality of the encoded video slice based on the capability of visual attention, and a progressive streaming method based on the priority of RoI video data. Compared to conventional methods, VA-PRTS increases the bitrate saving by over 57% and decreases the interactive delay by over 66%, while maintaining a level of perceptual video quality. The experimental results show that the proposed VA-PRTS improves the quality of the viewer experience for interactive panoramic video streaming services. The development results show that VA-PRTS is highly feasible in real-world deployment.

Effects of F1/F2 Manipulation on the Perception of Korean Vowels /o/ and /u/ (F1/F2의 변화가 한국어 /오/, /우/ 모음의 지각판별에 미치는 영향)

  • Yun, Jihyeon;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.5 no.3 / pp.39-46 / 2013
  • This study examined the perception of two Korean vowels using F1/F2-manipulated synthetic vowels. Previous studies indicated that the acoustic spaces of Korean /o/ and /u/ overlap in terms of the first two formants. A continuum of eleven synthetic vowels was used as stimuli. The experiment consisted of three tasks: an /o/ identification task (yes-no), an /u/ identification task (yes-no), and a forced-choice identification task (/o/-/u/). ROC (receiver operating characteristic) analysis and logistic regression were performed to calculate the boundary criterion of the two vowels along the stimulus continuum and to predict the perceptual judgment from F1 and F2. The results indicated that the perceptual boundary region between /o/ and /u/ lies between stimulus no.5 (F1 = 342 Hz, F2 = 691 Hz) and no.6 (F1 = 336 Hz, F2 = 700 Hz), while stimulus no.0 (F1 = 405 Hz, F2 = 666 Hz) and no.10 (F1 = 321 Hz, F2 = 743 Hz) were at opposite ends of the continuum. The influence of F2 was predominant over F1 on the perception of the vowel categories.
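
One way to read the boundary estimate in the abstract above is through a one-predictor logistic model. As an assumption (the abstract does not state the model form), let $s$ be the stimulus number and let $\beta_0$, $\beta_1$ be hypothetical fitted coefficients for the probability of one of the two vowel responses. The boundary is then the point where that probability equals one half:

$$P(\text{response} \mid s) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 s)}}, \qquad P = \tfrac{1}{2} \iff s^{*} = -\frac{\beta_0}{\beta_1}.$$

Under this reading, a boundary located between stimulus no.5 and no.6 corresponds to $5 < s^{*} < 6$.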