• Title/Summary/Keyword: Auditory Information

Search Result 311, Processing Time 0.024 seconds

Speech Enhancement System Using a Model of Auditory Mechanism (청각기강의 모델을 이용한 음성강조 시스템)

  • 최재승
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.6
    • /
    • pp.295-302
    • /
    • 2004
  • On the field of speech processing the treatment of noise is still important problems for speech research. Especially, it has been noticed that the background noise causes remarkable reduction of speech recognition ratio. As the examples of the background noise, there are such various non-stationary noises existing in the real environment as driving noise of automobiles on the road or typing noise of printer. The treatment for these kinds of noises is not so simple as could be eliminated by the former Wiener filter, but needs more skillful techniques. In this paper as one of these trials, we show an algorithm which is a speech enhancement method using a model of mutual inhibition for noise reduction in speech which is contaminated by white noise or background noise mentioned above. It is confirmed that the proposed algorithm is effective for the speech degraded not only by white noise but also by colored noise, judging from the spectral distortion measurement.

A Novel Speech Enhancement Based on Speech/Noise-dominant Decision in Time-frequency Domain (시간-주파수 영역에서 음성/잡음 우세 결정에 의한 새로운 잡음처리)

  • 윤석현;유창동
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.3
    • /
    • pp.48-55
    • /
    • 2001
  • A novel method to reduce additive non-stationary noise is proposed. The method requires neither the information about noise nor the estimate of the noise statistics from any pause regions. The enhancement is performed on a band-by-band basis for each time frame. Based on both the decision on whether a particular band in a frame is speech or noise dominant and the masking property of the human auditory system, an appropriate amount of noise is reduced using spectral subtraction. The proposed method was tested on various noisy conditions (car noise, Fl6 noise, white Gaussian noise, pink noise, tank noise and babble noise) and on the basis of comparing segmental SNR with spectral subtraction method and visually inspecting the enhanced spectrograms and listening to the enhanced speech, the method was able to effectively reduce various noise while minimizing distortion to speech.

  • PDF

Human-Computer Interaction Based Only on Auditory and Visual Information

  • Sha, Hui;Agah, Arvin
    • Transactions on Control, Automation and Systems Engineering
    • /
    • v.2 no.4
    • /
    • pp.285-297
    • /
    • 2000
  • One of the research objectives in the area of multimedia human-computer interaction is the application of artificial intelligence and robotics technologies to the development of computer interfaces. This involves utilizing many forms of media, integrating speed input, natural language, graphics, hand pointing gestures, and other methods for interactive dialogues. Although current human-computer communication methods include computer keyboards, mice, and other traditional devices, the two basic ways by which people communicate with each other are voice and gesture. This paper reports on research focusing on the development of an intelligent multimedia interface system modeled based on the manner in which people communicate. This work explores the interaction between humans and computers based only on the processing of speech(Work uttered by the person) and processing of images(hand pointing gestures). The purpose of the interface is to control a pan/tilt camera to point it to a location specified by the user through utterance of words and pointing of the hand, The systems utilizes another stationary camera to capture images of the users hand and a microphone to capture the users words. Upon processing of the images and sounds, the systems responds by pointing the camera. Initially, the interface uses hand pointing to locate the general position which user is referring to and then the interface uses voice command provided by user to fine-the location, and change the zooming of the camera, if requested. The image of the location is captured by the pan/tilt camera and sent to a color TV monitor to be displayed. This type of system has applications in tele-conferencing and other rmote operations, where the system must respond to users command, in a manner similar to how the user would communicate with another person. The advantage of this approach is the elimination of the traditional input devices that the user must utilize in order to control a pan/tillt camera, replacing them with more "natural" means of interaction. A number of experiments were performed to evaluate the interface system with respect to its accuracy, efficiency, reliability, and limitation.

  • PDF

A Study on VR News - In Recognition of the VR News (VR 뉴스에 관한 연구 - VR 뉴스 인식을 중심으로)

  • Park, Jun Hyung;Yang, Jong Hoon
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.12
    • /
    • pp.50-59
    • /
    • 2016
  • VR refers to Virtual Reality technology that allows experiences of virtual contents as if they are real through visual, auditory and other senses. VR is even affecting the news. This greatly shakes the frame of the news. In other words, the conventional way of the new was to watch it with the passive action of seeing pictures, articles and images while VR News offers an active paradigm of experience and participation. This study analyzed the VR news of each press and investigated how VR news is developing. Furthermore, an experimental study was conducted to examine how users actually perceived the VR news. News made using both the existing method and the VR news method were comparatively shown to users who were following the news, and the interview was carried out through questionnaires. The results obtained through the statistics and analysis are as follows. Users made an assessment that they felt the sense of realism in recognition of the VR news while it was still lacking in terms of the unique information delivery that the news performs. However, the intention of using the VR news again was shown to be high, which demonstrated the expectation of users who have experienced the VR news.

Study on the Sampling of Distributors : Relating Olfactory Cues and Social Density (유통점의 샘플링에 관한 연구 : 후각적 자극과 매장 밀집도를 중심으로)

  • Hwang, Hee-Joong;Youn, Myoung-Kil
    • Journal of Distribution Science
    • /
    • v.16 no.9
    • /
    • pp.59-63
    • /
    • 2018
  • Purpose - It has already been proved that 'mood' as the physical environment of shopping affects consumers' main sensory channels such as sight, hearing, smell, touch. However, there is no consensus on how the olfactory cue influences the customers in the shopping environment. In this study, we examine the previous studies on how the olfactory cue affects the customers in the shopping environment and present a clear direction as a suggestion for progressive research. Research design, data, and methodology - It is not important to use a lot of unconditional fragrance, but it should be exposed to the environment that suits the proper fragrance. In recent years, meaningful research on store fragrance has been slowly increasing. As a result, studies on the fragrance effects of retail stores have been conducted to verify the relevance of fragrance suitability in stores and consumer spending scale. Results - The fragrance appropriate for each store can not be uniformly specified as any fragrance. This is because external variables such as time, season, temperature, lighting, density of shoppers, and music in the store also affect customer evaluation. For example, using an unsuitable fragrance may encourage customers to leave the store quickly by restraining impulsive purchases or by disturbing concentration. The store manager should also be interested in using fragrances that are proven and effective in the store environment, but they should also have the ability to easily manipulate and manage the fragrances very appropriately according to changes in the store environment. Store managers should observe consumer preferences and responses according to their goals and strategies, and then systematically manage and store information about the fragrance appropriate to the store. Conclusions - In the future, the fragrance marketing researcher needs to consider the spatial form and density of the customer. In practice, managers operating a retail store should check the most appropriate store density(congestion) according to the size and spatial characteristics of the store and maintain the ideal conditions. To do this, it is necessary to pay attention to how to select and control sensory elements such as fragrance(olfactory), music(auditory), and lighting(visual).

Verification of Automatic PAR Control System using DEVS Formalism (DEVS 형식론을 이용한 공항 PAR 관제 시스템 자동화 방안 검증)

  • Sung, Chang-ho;Koo, Jung;Kim, Tag-Gon;Kim, Ki-Hyung
    • Journal of the Korea Society for Simulation
    • /
    • v.21 no.3
    • /
    • pp.1-9
    • /
    • 2012
  • This paper proposes automatic precision approach radar (PAR) control system using digital signal to increase the safety of aircraft, and discrete event systems specification (DEVS) methodology is utilized to verify the proposed system. Traditionally, a landing aircraft is controlled by the human voice of a final approach controller. However, the voice information can be missed during transmission, and pilots may also act improperly because of incorrectness of auditory signals. The proposed system enables the stable operation of the aircraft, regardless of the pilot's capability. Communicating DEVS (C-DEVS) is used to analyze and verify the behavior of the proposed system. A composed C-DEVS atomic model has overall composed discrete state sets of models, and the state sequence acquired through full state search is utilized to verify the safeness and the liveness of a system behavior. The C-DEVS model of the proposed system shows the same behavior with the traditional PAR control system.

The Convergence of Literature & Movie in - The Impact of Computer Graphics (<위대한 개츠비>에서 만난 문학과 영화의 융합 - 컴퓨터 그래픽이 미치는 영향)

  • Choi, Sun-Wha
    • Journal of Convergence for Information Technology
    • /
    • v.7 no.4
    • /
    • pp.121-127
    • /
    • 2017
  • In 2013, Baz Luhrmann's movie re-made Fitzgerald's novel, "The Great Gatsby". In Novel, readers keep trace of the plot with their imagination, but in Movie , movie director comes together to create visual and auditory elements of it. Daisy Buchanan is a fashion icon, wearing Prada, Chanel, and Tiffany's jewelry, which reproduce the costume of Jazz Age, and make viewers well understand that of Jazz Age. Symbols like "Ash Valley", "Green light", "East Egg", "West Egg" are presented more directly in movie. Roaring parties held in Gatsby's great mansion was made by computer graphic, and its enormous scale also reflects the mental chaos and the material affluence in those age. Additionally, actors excellent show highlights the theme of the novel. With the adaptation of novel, the film finally achieves more appealing art in front of the public. This thesis investigates these more logistically with the materials of internet.

Reaction Test Platform and Application by Auditory and Visual Stimulus for Language Learning Ability Improvement (언어 학습 능력 향상을 위한 청각 및 시각 자극에 대한 반응속도 측정 플랫폼과 응용)

  • Lee, Hye-Ran;Beak, Seung-Hyun
    • Journal of Internet Computing and Services
    • /
    • v.11 no.1
    • /
    • pp.77-84
    • /
    • 2010
  • Children, who have a language disorder, have difficulty in expressing their reaction about stimulus of sound and vision. So it is very hard to grasp that they recognize external stimulus or not. For solving these problem, we can check response time and make them to choose stimulus by giving stimulus of sound and vision to them through Audio and Visual Stimulus and Reaction Meter System. Additionally, We can help them by improving response time by repeated study based on the results and making them to recognize and choose stimulus faster without aversion about external stimulus. It would make them not to feel uncomfortable and isolated because they are unfamiliar with external stimulus.

Changes of Speech Discrimination Score Depending on Inter-syllable Pause Duration in Normal Hearing Children (정상 청력 아동의 음절 간 쉼 간격에 따른 어음이해도 변화)

  • Park, J.I.;Lee, J.Y.;Heo, S.D.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.8 no.2
    • /
    • pp.139-144
    • /
    • 2014
  • Speech discrimination is affected by the speed of speech. The speed of speech can be adjusted at the pause duration, the pause duration can take the resting time to avoid in overloading information. The study will be examine the effects of aging and audiological rehabilitation, and the auditory processing as basic research to investigate the normative data. 7 boys and 8 girls were participated. They have no problem with speech language pathologically and audiologically. There are 4 sets of test implement, and each test set was made out with 20 3-syllable words. Pause duration of all of these words are adjusted in normal(250 ms), slow(500 ms) and very slow(1000 ms). There are 4 words for a multiple-choice that including one word with written correctly and three words with written 1 phoneme wrong. Participant hear the word, and then have to choose one. Speech discrimination score in 250, 500, 1,000 ms of pause duration were $73{\pm}19.4%$, $84{\pm}12.2%$, $88{\pm}8.8%$, respectively.

  • PDF

The Effect of Voice Therapy for the Treatment of Functional Aphonia: A Preliminary Study (기능적 실성증에 대한 음성치료의 효과 분석: 기초 연구)

  • Kim, No Eul;Kim, Jun Seok;Oh, Jae Hwan;Kim, Dong Young;Woo, Joo Hyun
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.32 no.2
    • /
    • pp.75-80
    • /
    • 2021
  • Background and Objectives Functional aphonia refers to in which by presenting whispering voice and almost producing very high-pitched tensed voices are produced. Voice therapy is the most effective treatment, but there is a lack of consensus for application of voice therapy. The purpose of this study was to examine the vocal characteristics of functional aphonia and the effect of voice therapy applied accordingly. Materials and Method From October 2019 to December 2020, 11 patients with functional aphonia were treated using voice therapy which was processing three stages such as vocal hygiene, trial therapy, and behavioral therapy. Of these, 7 patients who completed the voice evaluation before and after voice therapy was enrolled in this study. By retrospective chart review, clinical information such as sex, age, symptoms, duration, social and medical history, process of voice therapy, subjective and objective findings were analyzed. Voice parameters before and after voice therapy were compared. Results In GRBAS study, grade, rough, and asthenic, and in Consensus Auditory-Perceptual Evaluation of Voice, overall severity, roughness, pitch, and loudness were significantly improved after voice therapy. In Voice handicap index, all of the scores of total and sub-categories were significantly decreased. In objective voice analysis, jitter, cepstral peak prominence, and maximum phonation time were significantly improved. Conclusion The voice therapy was effective for the treatment of functional aphonia by restoring patient's vocalization and improving voice quality, pitch and loudness.