• Title/Summary/Keyword: perceptual experiment

A Study on Colour Perception according to the Edge Brightness in Graphic Images (그래픽 영상에서 테두리 밝기에 따른 색채 인지 연구)

  • Hong, Ji-Young
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.21 no.4 / pp.31-36 / 2021
  • Colour properties can be classified by hue, brightness, and saturation, and the characteristics of colour perception vary by colour property. In colour perception, surrounding colours are viewed together rather than independently, and the perception of the original colour may vary depending on the colour characteristics of the surroundings. Thus far, research on colour perception and sensibility has focused on a monotonic environment, which differs from colour perception in real life. As such, a fundamental study on colour perception that considers surrounding colours must be conducted. In this study, a psychophysical experiment was conducted, and the results were analysed to study the characteristics of colour perception according to edge brightness. Edge brightness was set as the variable because vision is most sensitive to brightness, and the experiment used the Munsell colour system to specify the stimulus colours. Based on the analysis of the experimental outcomes, this study demonstrates the significance of fundamental research data on the visual and perceptual characteristics of colour perception with regard to edge brightness and provides directions for future research.

Front-End Processing for Speech Recognition in the Telephone Network (전화망에서의 음성인식을 위한 전처리 연구)

  • Jun, Won-Suk;Shin, Won-Ho;Yang, Tae-Young;Kim, Weon-Goo;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea / v.16 no.4 / pp.57-63 / 1997
  • In this paper, we study efficient feature vector extraction methods and front-end processing to improve the performance of a speech recognition system, using the KT (Korea Telecommunication) database collected through various telephone channels. First, we compare the recognition performance of feature vectors known to be robust to noise and environmental variation, and verify the performance enhancement obtained with weighted cepstral distance measures. The experimental results show that the recognition rate is increased by using either PLP (Perceptual Linear Prediction) or MFCC (Mel Frequency Cepstral Coefficient) features in comparison with the LPC cepstrum used in the KT recognition system. Among cepstral distance measures, weighted functions such as RPS (Root Power Sums) and BPL (Band-Pass Lifter) improve recognition. Applying the spectral subtraction method decreases the recognition rate because of the distortion it introduces. However, RASTA (RelAtive SpecTrAl) processing, CMS (Cepstral Mean Subtraction), and SBR (Signal Bias Removal) enhance recognition performance. In particular, the CMS method is simple but yields a large improvement in recognition. Finally, the performance of modified methods for real-time implementation of CMS is compared, and an improved method is suggested to prevent performance degradation.
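
The CMS step mentioned in this abstract can be illustrated with a minimal sketch. It assumes per-utterance (batch) mean subtraction over a matrix of cepstral frames; the function name `cepstral_mean_subtraction` and the toy data are hypothetical, and the paper's real-time variants of CMS are not reproduced here.

```python
import numpy as np


def cepstral_mean_subtraction(cepstra: np.ndarray) -> np.ndarray:
    """Subtract the per-utterance mean of each cepstral coefficient.

    cepstra has shape (num_frames, num_coeffs). A fixed telephone channel
    acts as a roughly constant additive offset in the cepstral domain,
    so removing the mean over time removes that offset.
    """
    return cepstra - cepstra.mean(axis=0, keepdims=True)


# Toy data (an assumption for illustration): random "clean" cepstra plus
# a constant channel offset applied to every frame.
rng = np.random.default_rng(0)
clean = rng.normal(size=(200, 13))
channel_offset = rng.normal(size=13)
observed = clean + channel_offset

normalized = cepstral_mean_subtraction(observed)
```

In this toy setting the normalized features are the same whether or not the channel offset was present, which is the property that makes CMS attractive for varying telephone channels.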

An Objective Estimation for Simulating of Asymmetrical Auditory Filter of the Hearing Impaired According to Hearing Loss Degree (난청인의 난청 정도에 따른 비대칭 청각 필터 구현의 객관적 평가)

  • Joo, S.I.;Jeon, Y.Y.;Song, Y.R.;Lee, S.M.
    • Journal of rehabilitation welfare engineering & assistive technology / v.3 no.1 / pp.27-34 / 2009
  • The shape of hearing loss varies from person to person among the hearing impaired, so the existing frequency-band method based on symmetrical auditory filters could not properly simulate these varied hearing loss shapes. The shapes of auditory filters are asymmetrical and differ with each center frequency and each input level. The auditory filter of a hearing-impaired person changes in a different way from that of a normal-hearing person, and speech passed through it yields a different speech quality value. In this study, the asymmetrical auditory filter was simulated, and tests were performed to estimate the filter's performance objectively. Perceptual evaluation of speech quality (PESQ) and log likelihood ratio (LLR) were used to evaluate the performance of the simulated auditory filter on speech passed through it. In the tests, the objective speech quality and distortion of the processed speech were evaluated using PESQ and LLR values. When hearing loss was applied, the PESQ and LLR values differed greatly between the symmetrical and asymmetrical auditory filters. This means that differences in the shape of the auditory filter may affect speech quality. In particular, when hearing loss was present, the asymmetrical change in the auditory filter at each center frequency affected the speech quality perceived by the hearing impaired.

A study on combination of loss functions for effective mask-based speech enhancement in noisy environments (잡음 환경에 효과적인 마스크 기반 음성 향상을 위한 손실함수 조합에 관한 연구)

  • Jung, Jaehee;Kim, Wooil
    • The Journal of the Acoustical Society of Korea / v.40 no.3 / pp.234-240 / 2021
  • In this paper, mask-based speech enhancement is improved for effective speech recognition in noisy environments. In mask-based speech enhancement, the enhanced spectrum is obtained by multiplying the noisy speech spectrum by a mask. The VoiceFilter (VF) model is used for mask estimation, and the Spectrogram Inpainting (SI) technique is used to remove residual noise from the enhanced spectrum. We propose a combined loss to further improve speech enhancement. To effectively remove the residual noise in the speech, the positive part of the Triplet loss is used together with the component loss. For the experiment, the TIMIT database is reconstructed using NOISEX92 noise and background music samples under various Signal to Noise Ratio (SNR) conditions. Source to Distortion Ratio (SDR), Perceptual Evaluation of Speech Quality (PESQ), and Short-Time Objective Intelligibility (STOI) are used as performance metrics. When the VF was trained with the mean squared error and the SI model was trained with the combined loss, SDR, PESQ, and STOI improved by 0.5, 0.06, and 0.002, respectively, compared to the system trained only with the mean squared error.
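
The masking step described in this abstract (enhanced spectrum = noisy spectrum × mask) can be sketched as follows. This is only an illustration under stated assumptions: in the paper the mask is estimated by the VoiceFilter model, whereas here a hypothetical oracle ratio mask (`ideal_ratio_mask`) stands in for it, and the toy magnitudes are assumed to add exactly.

```python
import numpy as np


def ideal_ratio_mask(clean_mag: np.ndarray, noise_mag: np.ndarray,
                     eps: float = 1e-8) -> np.ndarray:
    """Oracle mask in [0, 1]; a stand-in for a learned mask estimator."""
    return clean_mag / (clean_mag + noise_mag + eps)


def apply_mask(noisy_mag: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Element-wise product of the noisy magnitude spectrum and the mask."""
    return noisy_mag * np.clip(mask, 0.0, 1.0)


# Toy magnitude spectrograms with shape (freq_bins, frames). Treating the
# noisy magnitude as clean + noise is a simplifying assumption.
rng = np.random.default_rng(0)
clean_mag = np.abs(rng.normal(size=(257, 50)))
noise_mag = np.abs(rng.normal(size=(257, 50)))
noisy_mag = clean_mag + noise_mag

mask = ideal_ratio_mask(clean_mag, noise_mag)
enhanced_mag = apply_mask(noisy_mag, mask)
```

With an oracle mask the enhanced magnitudes match the clean ones up to `eps`; an estimated mask leaves residual noise, which is the part the abstract's inpainting stage and combined loss are meant to address.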

The Influence of Stimulus Contrast and Color on Target Detection under Multiple Rapid Serial Visual Presentation (다중신속순차제시아래 자극의 명암대비 및 색상이 표적 탐지에 미치는 영향)

  • Park, Jong-Min;Kim, Giyeon;Hyun, Joo-Seok
    • Science of Emotion and Sensibility / v.20 no.2 / pp.137-148 / 2017
  • The present study examined the effect of stimulus contrast and color on the detection of a target embedded in streams of letters. In Experiment 1, each trial displayed four rapid serial visual presentation (RSVP) streams of letters (i.e., multi-RSVP), and each stream occupied one of four different locations. Each frame in the RSVP streams had four white distractors at the four locations, except for one frame in which a dim grey target was displayed at one location with three white distractors at the remaining locations. In the low-visibility condition, the target's grey was only slightly darker than the background grey, whereas in the high-visibility condition it was much dimmer. Participants were asked to report the presence of a predesignated target as quickly and accurately as possible upon detecting it in each trial, and their target detection was more accurate and quicker in the high-visibility than in the low-visibility condition. In Experiment 2, the same RSVP displays and task as in Experiment 1 were used, but the grey target letters in the high-visibility condition were replaced with letters of distinct chromatic colors. Participants detected target presence more accurately in the high-visibility condition, but the reaction time did not differ between the visibility conditions. The results indicate that higher stimulus contrast as well as distinct color can improve perception of a target stimulus displayed against a visually demanding background, but also suggest that stimulus contrast may play a more substantial role in such perceptual improvement.

A Preliminary Report on Perceptual Resolutions of Korean Consonant Cluster Simplification and Their Possible Change over Time

  • Cho, Tae-Hong
    • Phonetics and Speech Sciences / v.2 no.4 / pp.83-92 / 2010
  • The present study examined how listeners of Seoul Korean would recover deleted phonemes in consonant cluster simplification. In a phoneme monitoring experiment, listeners had to monitor for C2 (/k/ or /p/) in C1C2C3 when C2 was deleted (C1 was preserved) or preserved (C1 was deleted). The target consonant (C2) was either /k/ or /p/ (e.g., i$\underline{\mathrm{lk}}$-təlato vs. pa$\underline{\mathrm{lp}}$-təlato), and there were two listener groups, one tested in 2002 and the other in 2009. Several points emerged from the results. First, listeners were able to detect deleted phonemes as accurately and rapidly as preserved phonemes, showing that the physical presence of the acoustic information did not improve the listeners' performance. This suggests that listeners must have relied on language-specific phonological knowledge about consonant cluster simplification, rather than on low-level acoustic-phonetic information. Second, the listener groups (participants in 2002 vs. 2009) differed in processing /p/ versus /k/: listeners in 2009 failed to detect /p/ more frequently than those in 2002, suggesting that the way the consonant cluster sequence is produced and perceived has changed over time. This result was interpreted as coming from statistical patterns of speech production in contemporary Seoul Korean as reported in a recent study by Cho & Kim (2009): /p/ is deleted far more often than /p/ is preserved, which is likely reflected in the way listeners process simplified variants. Finally, listeners processed /k/ more efficiently than /p/, especially when the target was physically present (in the C-preserved condition), indicating that listeners benefited more from the presence of /k/ than of /p/. This was interpreted as supporting the view that velars are perceptually more robust than labials, which constrains the shaping of phonological patterns in the language. These results were then discussed in terms of their implications for theories of spoken word recognition.

Effectiveness of Pavement Marking for Speed Reduction by Using Driving Simulator (시뮬레이터를 이용한 감속유도 노면표시의 효과 연구)

  • Noh, Kwan-Sub;Lee, Jong-Hak;Kim, Jong-Min;Jang, Hye-Ran
    • International Journal of Highway Engineering / v.12 no.3 / pp.9-16 / 2010
  • The purpose of this study is to establish how drivers react to different speed-reduction pavement markings in a driving simulator. Forty-one people between the ages of twenty and seventy took part in the experiment. The study analyzed which patterns are effective in reducing speeding. As a result, 'Peripheral Transverse Lines II' was the most effective treatment for reducing driving speeds in the driving simulator. This study is meaningful in that it presents the speed-reduction effectiveness of perceptual countermeasures based on human factors. Further research will be required to determine the long-term effectiveness of these countermeasures, given concerns that drivers may become too accustomed to the implemented markings. Color contrasts (for example, dark asphalt with white markings) can also be considered.

Realistic and Fast Depth-of-Field Rendering in Direct Volume Rendering (직접 볼륨 렌더링에서 사실적인 고속 피사계 심도 렌더링)

  • Kang, Jiseon;Lee, Jeongjin;Shin, Yeong-Gil;Kim, Bohyoung
    • The Journal of Korean Institute of Next Generation Computing / v.15 no.5 / pp.75-83 / 2019
  • Direct volume rendering is a widely used method for visualizing three-dimensional volume data such as medical images. This paper proposes a method for applying depth-of-field effects to volume ray-casting to enable more realistic depth-of-field rendering in direct volume rendering. The proposed method exploits a camera model based on the human perceptual model and can obtain realistic images with a limited number of rays using jittered lens sampling. It also enables interactive exploration of volume data by calculating depth-of-field on the fly in the GPU pipeline without preprocessing. In experiments with various data, including medical images, we demonstrated that depth-of-field images with better depth perception were generated 2.6 to 4 times faster than with the conventional method.

A study on the site-specific theatre-performance - focused on the Korean performances - (장소특정적 연극-퍼포먼스 연구 - 한국의 공연작품들을 중심으로 -)

  • Shin, Hyun-Sook
    • Journal of Korean Theatre Studies Association / no.49 / pp.171-208 / 2013
  • Site-specific performance is always real, on-site work taking place at the site. Hence, it deals with reality, in other words, with the time of creation and formation. It creates value and meaning through interaction and a continuous process of direct communication between the performers, the audience, and the local residents. In this kind of performance, the audience's status as passive observer changes: they become co-agents who actively lead and complete the performance through their own experience. We have examined the working methods of site-specific performance and the aesthetics of effect through four Korean performances: Marie, an aesthetic experiment of site as the storyteller; Heterotopia and Urban Movement Research or Play: We Will Move Your Sofa, as performances which have revealed the history, politics, and institutions engraved in the site; and A Song of Mandala and Miracle, as ritualistic site-specific performances at a historic site. Some remarks on site-specific performance follow. First, in site-specific performance, the habitus peculiar to the stage art and the mode of reception are changed. Second, a new mode of theatrical communication requires creator and audience to have a sharp aesthetic sensibility and to change their perceptual habits. Third, site-specific performance can act as a demonstration of the viewpoints of political activism through what could be called a dramatic close-up effect. Fourth, site-specific performance also runs the risk of becoming merely an unfocused and scattered performance or degenerating into pseudo-sightseeing. To avoid this, an in-depth study of the site and its socio-cultural context, and a clear sense of what one is trying to reveal and tell from the site, are indispensable. As co-agents, the audience should also be aware of and think about what the given performance signifies today.

Annoyance and sportiness perception of the acceleration sound by the driver and passengers (가속 사운드에 대한 운전자와 탑승객의 성가심과 스포티함 지각)

  • Kim, Seonghyeon;Altinsoy, M. Ercan
    • The Journal of the Acoustical Society of Korea / v.40 no.6 / pp.566-570 / 2021
  • This study presents a perceptual difference between the driver and the passenger in the acceleration sounds of a sporty sedan. Through subjective evaluations, we found a significant difference in annoyance and sportiness perception according to the acceleration sound level. A multimodal reproduction system, which can reproduce the driving image, motion, vibration, and sound, was used for the test. A subjective experiment was conducted to evaluate the perceived intensity of annoyance and sportiness by varying the acceleration sound level in five steps of 3 dB. The experimental results showed that the driver perceives the acceleration sound as less annoying than the passenger does at a relatively low sound level. Meanwhile, the driver perceived the acceleration sound as more sporty than the passenger did at a relatively high sound level. Moreover, it was found that passengers were 35% less sensitive to annoyance than drivers, whereas the driver was 74% more susceptible to sportiness than passengers as the sound level changed. This finding is expected to be applied as a sound design strategy that differentiates the acceleration sound level in active sound design.