• Title/Summary/Keyword: visual-audio

Search Result 424, Processing Time 0.035 seconds

Abnormal Active Pig Detection System using Audio-visual Multimodal Information (Audio-visual 멀티모달 정보 기반의 비정상 활성 돼지 탐지 시스템)

  • Chae, Heechan;Lee, Junhee;Lee, Jonguk;Chung, Yonghwa;Park, Daihee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.05a
    • /
    • pp.661-664
    • /
    • 2022
  • 양돈을 관리하는 데에 있어 비정상 개체를 식별하고 사전에 추적하거나 격리할 수 있는 양돈업 시스템을 구축하는 것은 효율적인 돈사관리를 위한 필수 요소이다. 그러나 돈사내의 이상 상황을 탐지하는 연구는 보고되었지만, 이상 상황이 발생한 돼지를 특정하여 식별하는 연구는 찾아보기 힘들다. 따라서, 본 연구에서는 소리를 활용하여 이상 상황이 발생함을 탐지한 후 영상을 활용하여 소리를 낸 특정 돼지를 식별할 수 있는 시스템을 제안한다. 해당 시스템의 주요 알고리즘은 활성 화자 탐지 문제에서 착안하여 이를 돈사에 맞게 적용하여, 비정상 소리를 내는 활성 돼지를 식별 가능하도록 구현하였다. 제안한 방법론은 모의 실험을 통해 돈사 내의 이상 상황이 발생한 돼지를 식별할 수 있음을 확인하였다.

A 3D Audio Broadcasting Terminal for Interactive Broadcasting Services (대화형 방송을 위한 3차원 오디오 방송단말)

  • Park Gi Yoon;Lee Taejin;Kang Kyeongok;Hong Jinwoo
    • Journal of Broadcast Engineering
    • /
    • v.10 no.1 s.26
    • /
    • pp.22-30
    • /
    • 2005
  • We implement an interactive 3D audio broadcasting terminal which synthesizes an audio scene according to the request of a user. Audio scene structure is described by the MPEG-4 AudioBIFS specifications. The user updates scene attributes and the terminal synthesizes the corresponding sound images in the 3D space. The terminal supports the MPEG-4 Audio top nodes and some visual nodes. Instead of using sensor nodes and route elements, we predefine node type-specific user interfaces to support BIFS commands for field replacement. We employ sound spatialization, directivity/shape modeling, and reverberation effects for 3D audio rendering and realistic feedback to user inputs. We also introduce a virtual concert program as an application scenario of the interactive broadcasting terminal.

Comparison of McGurk Effect across Three Consonant-Vowel Combinations in Kannada

  • Devaraju, Dhatri S;U, Ajith Kumar;Maruthy, Santosh
    • Journal of Audiology & Otology
    • /
    • v.23 no.1
    • /
    • pp.39-48
    • /
    • 2019
  • Background and Objectives: The influence of visual stimulus on the auditory component in the perception of auditory-visual (AV) consonant-vowel syllables has been demonstrated in different languages. Inherent properties of unimodal stimuli are known to modulate AV integration. The present study investigated how the amount of McGurk effect (an outcome of AV integration) varies across three different consonant combinations in Kannada language. The importance of unimodal syllable identification on the amount of McGurk effect was also seen. Subjects and Methods: Twenty-eight individuals performed an AV identification task with ba/ga, pa/ka and ma/ṇa consonant combinations in AV congruent, AV incongruent (McGurk combination), audio alone and visual alone condition. Cluster analysis was performed using the identification scores for the incongruent stimuli, to classify the individuals into two groups; one with high and the other with low McGurk scores. The differences in the audio alone and visual alone scores between these groups were compared. Results: The results showed significantly higher McGurk scores for ma/ṇa compared to ba/ga and pa/ka combinations in both high and low McGurk score groups. No significant difference was noted between ba/ga and pa/ka combinations in either group. Identification of /ṇa/ presented in the visual alone condition correlated negatively with the higher McGurk scores. Conclusions: The results suggest that the final percept following the AV integration is not exclusively explained by the unimodal identification of the syllables. But there are other factors which may also contribute to making inferences about the final percept.

Comparison of McGurk Effect across Three Consonant-Vowel Combinations in Kannada

  • Devaraju, Dhatri S;U, Ajith Kumar;Maruthy, Santosh
    • Korean Journal of Audiology
    • /
    • v.23 no.1
    • /
    • pp.39-48
    • /
    • 2019
  • Background and Objectives: The influence of visual stimulus on the auditory component in the perception of auditory-visual (AV) consonant-vowel syllables has been demonstrated in different languages. Inherent properties of unimodal stimuli are known to modulate AV integration. The present study investigated how the amount of McGurk effect (an outcome of AV integration) varies across three different consonant combinations in Kannada language. The importance of unimodal syllable identification on the amount of McGurk effect was also seen. Subjects and Methods: Twenty-eight individuals performed an AV identification task with ba/ga, pa/ka and ma/ṇa consonant combinations in AV congruent, AV incongruent (McGurk combination), audio alone and visual alone condition. Cluster analysis was performed using the identification scores for the incongruent stimuli, to classify the individuals into two groups; one with high and the other with low McGurk scores. The differences in the audio alone and visual alone scores between these groups were compared. Results: The results showed significantly higher McGurk scores for ma/ṇa compared to ba/ga and pa/ka combinations in both high and low McGurk score groups. No significant difference was noted between ba/ga and pa/ka combinations in either group. Identification of /ṇa/ presented in the visual alone condition correlated negatively with the higher McGurk scores. Conclusions: The results suggest that the final percept following the AV integration is not exclusively explained by the unimodal identification of the syllables. But there are other factors which may also contribute to making inferences about the final percept.

The Effects of the Presentation Mode of Web Contents on the Children's Information Processing Process (웹 콘텐츠의 정보제시유형이 어린이 뉴스정보처리과정에 미치는 영향)

  • Choi E-Jung
    • The Journal of the Korea Contents Association
    • /
    • v.5 no.3
    • /
    • pp.113-122
    • /
    • 2005
  • The major purpose of this study is to explore the effect of the presentation undo combined by main four media(moving Image, audio, turf image) of web contents on the children's information processing process. So children were assigned to one of five experimental medium conditions: 'moving Image1 (auditory-visual redundancy)', 'moving Image2 (auditory-visual dissonance)', 'text', 'text-with-image', 'audio'. Results indicated that the moving image was found to be the most effective transmitter of internet news information for children's recall. And the recall advantage of moving image was found to be particularly pronounced for verbal information supplemented with redundant visual.

  • PDF

Method of Motion Graphic Design Approach from Postmodern Point of View (포스트모던적 관점에서 본 모션그래픽 디자인 접근 방안)

  • Kim Gyo-Wan;Hong Su-Jung
    • The Journal of the Korea Contents Association
    • /
    • v.6 no.9
    • /
    • pp.124-131
    • /
    • 2006
  • Motion graphic is also developing into its own genre in graphic design by establishing individual industrial fields. images and graphics but also recognize even motions and sounds as communication elements delivering a message. Nevertheless, web communication designers have yet to experimentally test the visual motion priniciples or the audio sound expressions in terms of technology. If the dance are used, they may be new communication approaches in the age of multi-media, which allows for communication of visual and audio information or images. With such conceptions in mind, this study was aimed at reviewing the structural relationship between visual motion principles of the kinetic and the sound images to combine the audio and video effects. To this end, the basic structure of dance and music were substituted into the dance to determine their relevancy, and thereupon, examine the effective sound expression methods and techniques depending on movements of the objects in the monitor. Thus, this study, by inquiring into the uniqueness of choreography in motion graphic, presents the possibility of limitless expression of designer creation and inner world, and the ultimate goal lies in assuring the artistic value of motion graphic and its position as a synthetic art.

  • PDF

A Model for the Use of Middle School Rooms by the Community (지역주민(地域住民)의 중학교(中學校) 실(室) 이용(利用)에 관(關)한 모델)

  • Min, Chang-Kee
    • Journal of the Korean Institute of Educational Facilities
    • /
    • v.6 no.2
    • /
    • pp.13-23
    • /
    • 1999
  • This paper seeks to find out the policies of management and layout of middle school rooms for the community people's use. This paper surveys community's needs with respect to both the use of school rooms before, during, and after classes and preferences of use of school rooms. This paper adopts two experimental case studies to find out the models. It uses t-test analysis of the statistics to find out community people's preferences for the use of school rooms between two communities in an urban area, and uses simple and multiple regression analyses to develop models concerning community people's uses of school rooms before, during, and after classes. It also uses cluster analysis to find out the cluster among community people's preference of school rooms. It found, first, that community people's use of school rooms after class can be influenced by the uses of a play ground, a music classroom, an audio visual classroom, and a gymnasium. The use during regular classes is related to the uses of the fine arts classroom, a general classroom, a home economics classroom, a gymnasium, and a playground. The use before class is affected by the uses of a fine arts classroom, a playground, and a library. It also found that, with respect to community people's preferential use of school rooms, the rooms can be clustered as a cluster of laboratories such as a general classroom, a music room, a fine arts classroom, a science classroom, a home economics classroom, and a technique classroom, a cluster of athletic areas such as a gymnasium and a playground, and a cluster of supporting facilities such as a library, an audio visual classroom, and a computer classroom. Those clusters can also be clustered in more detail, i. e., that both a general classroom and playground can be apart from a cluster of laboratories or a cluster of supporting facilities; that an audio visual classroom can be fostered into a cluster with a home economics affairs classroom and a technique classroom. Finally this paper suggests policies of management and layout of school rooms.

  • PDF

Video Summarization Using Eye Tracking and Electroencephalogram (EEG) Data (시선추적-뇌파 기반의 비디오 요약 생성 방안 연구)

  • Kim, Hyun-Hee;Kim, Yong-Ho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.56 no.1
    • /
    • pp.95-117
    • /
    • 2022
  • This study developed and evaluated audio-visual (AV) semantics-based video summarization methods using eye tracking and electroencephalography (EEG) data. For this study, twenty-seven university students participated in eye tracking and EEG experiments. The evaluation results showed that the average recall rate (0.73) of using both EEG and pupil diameter data for the construction of a video summary was higher than that (0.50) of using EEG data or that (0.68) of using pupil diameter data. In addition, this study reported that the reasons why the average recall (0.57) of the AV semantics-based personalized video summaries was lower than that (0.69) of the AV semantics-based generic video summaries. The differences and characteristics between the AV semantics-based video summarization methods and the text semantics-based video summarization methods were compared and analyzed.

Development of Audio-visual Aids of Death Education for Hospice Patients and Their Families (호스피스 환자와 가족을 위한 임종교육 시청각 자료 개발)

  • Seo, Mi-Suk;Kang, Yu Jung;Yoon, Ji Yoon;Kim, Tae Yeon;Cho, Hye Jun;Park, So Yeon;Lee, Si Yeon;Jang, Ji Hye;Kim, Yu Jin;Kang, Mi Teum
    • Journal of Hospice and Palliative Care
    • /
    • v.19 no.3
    • /
    • pp.240-248
    • /
    • 2016
  • Purpose: Patients and their caretakers need to understand various problems and requirements in the dying process so that they may prepare for death for the rest of their remaining life. Accordingly, a systematic audio-visual resource was developed to educate hospice patients and their families at the palliative care ward about the process of dying. Methods: For the development of an audio-visual resource, a initial education material was produced in the form of simple and accessible Power Point handouts based on literature study. Then, the program was completed through five rounds of a process, including expert advice, revision, update and evaluation. Results: The final version of the program was filmed with cooperation of the medical literature information division. Using the program, patients and families were educated through five phases over three sessions for a total 26 minutes and 34 seconds. Conclusion: The significance of this study lies in the fact that it was conducted after the establishment of the palliative care ward, which made it easier for nurses provide the education. It is expected that the program may be used by hospice specialists as well as nurses as an education resource for hospice patients and their families.

EVALUATION OF PEDIATRIC DENIAL PATIENTS' BEHAVIOR AFTER USING AUDIO-VISUAL AIDS (시청각 기구를 이용한 소아환자의 행동조절에 관한 연구)

  • Yeom, Soon-Joon;Park, Ki-Tae
    • Journal of the korean academy of Pediatric Dentistry
    • /
    • v.29 no.2
    • /
    • pp.189-195
    • /
    • 2002
  • In the area of pediatric dentistry, several behavior modification techniques have been attempted to relieve young patients' dental fear. The use of audio-visual(AV) aids is one of them and is increasing. In this study, several patients' reactions to dental treatment have been investigated after using AV aids, including patients' sleep, movement, crying and overall behavior. The effectiveness of AV aids have also been investigated through patients' age, previous dental experience and daily exposure to TV or video. Thirty healthy children with Frankl behavior rating (+) or (-) were included in this study. The average age of the children was $52.9{\pm}12.7$ months and no statistical difference was found between the two groups. Thirty patients were equally divided into two groups. Group I(control) received dental treatment with the conventional tell-show-do while group II(AV) with tell-show-do and AV aids. All patients received only restorative dental treatment and received no extraction. Houpt behavior rating scale was used to evaluate patients' behavior during the dental treatment. As a result, there was no significant difference between the two groups in movement and crying. However, more patients in the AV group fell asleep during the dental treatment compared to the control group. Within the AV group, patients with previous dental experience, older age and frequent exposure to AV materials showed better overall behavior during the dental treatment as audio-visual aids were used for behavior management.

  • PDF