• Title/Summary/Keyword: Audio-visual interaction

Search Result 29, Processing Time 0.025 seconds

Audio-Based Human-Robot Interaction Technology (오디오 기반 인간로봇 상호작용 기술)

  • Kwak, K.C.;Kim, H.J.;Bae, K.S.;Yoon, H.S.
    • Electronics and Telecommunications Trends
    • /
    • v.22 no.2 s.104
    • /
    • pp.31-37
    • /
    • 2007
  • 인간로봇 상호작용 기술(human-robot interaction)은 다양한 의사소통 채널인 로봇카메라, 마이크로폰, 기타 센서를 통해 인지 및 정서적으로 상호작용할 수 있도록 로봇시스템 및 상호작용 환경을 디자인하고 구현 및 평가하는 지능형 서비스 로봇의 핵심기술이다. 본 고에서는 오디오 기반 인간로봇 상호작용 기술 중에서 음원 추적(sound localization)과 화자인식(speaker recognition) 기술의 국내외 기술동향을 살펴보고 최근 ETRI 지능형로봇연구단에서 상용화를 추진중인 시청각 기반 음원 추적(audio visual sound localization)과 문장독립 화자인식(text-independent speaker recognition)기술들을 다룬다. 또한 이들 기술들을 가정환경에서 효과적으로 사용하기 위해 음성인식, 얼굴검출, 얼굴인식 등을 결합한 시나리오에 대해서 살펴본다.

Effects of Audio-visual Entertainment and Soft Tissue Mobilization on Pressure Pain Thresholds, Psychophysiological parameters, and Brain waves in University Students with Tension-type Headache (긴장성 두통이 있는 대학생들에게 시청각적 엔터테인먼트와 연부조직 가동술이 압력통각역치, 바이오피드백, 뇌파에 미치는 영향)

  • Jung, Dae-In;Lee, Eun-Sang;Kim, Hyun-Joong
    • Journal of Korea Entertainment Industry Association
    • /
    • v.14 no.7
    • /
    • pp.539-548
    • /
    • 2020
  • TTH(tension-type headache) is the most common primary headache among adults. Long-term headaches cause chronic headaches and have a better impact on daily life. The purpose of this study is to compare the contributions to TTH through AVE(audio-visual entertainment) and STM(soft tissue mobilization) suitable for management of pathogenic and psychogenic factors of TTH. The participants of this study were from 30 people who complained of intermittent or persistent headaches for more than 6 months, and 10 participants each in the AVE group, STM group, and AVE plus STM group. In the assigned group, a total of 12 sessions were performed three times a week for 4 weeks after the baseline, followed by post-test. Outcome measures measured PPTs(pressure pain thresholds), psychophysiological parameters, and EEG(electroencephalogram). The measured results were analyzed for interaction between time and group through a two way rmANOVA(repeated measurement variance analysis). As a result of the PPTs, interaction was found in the results of the right trapezius (p<.05), and the more improvement was observed in the AVE group. Therefore, through AVE based on psychological factors rather than direct access to the muscles of pathogenic factors, a positive impact on the PPTs was shown, but the average value of the psychophysiological parameters and brain waves that were not statistically significant. The amount of change was observed. Through this, it is suggested that audio-visual stimulation could be considered in the management of TTH.

Haptic Media Broadcasting (촉각방송)

  • Cha, Jong-Eun;Kim, Yeong-Mi;Seo, Yong-Won;Ryu, Je-Ha
    • Broadcasting and Media Magazine
    • /
    • v.11 no.4
    • /
    • pp.118-131
    • /
    • 2006
  • With rapid development in ultra fast communication and digital multimedia, the realistic broadcasting technology, that can stimulate five human senses beyond the conventional audio-visual service is emerging as a new generation broadcasting technology. In this paper, we introduce a haptic broadcasting system and related core system and component techniques by which we can 'touch and feel' objects in an audio-visual scene. The system is composed of haptic media acquisition and creation, contents authoring, in the haptic broadcasting, the haptic media can be 3-D geometry, dynamic properties, haptic surface properties, movement, tactile information to enable active touch and manipulation and passive movement following and tactile effects. In the proposed system, active haptic exploration and manipulation of a 3-D mesh, active haptic exploration of depth video, passive kinesthetic interaction, and passive tactile interaction can be provided as potential haptic interaction scenarios and a home shopping, a movie with tactile effects, and conducting education scenarios are produced to show the feasibility of the proposed system.

A Study of the spatial perception by audio-visual information (시각과 청각에 의한 공간적 지각에 관한 연구)

  • Lee, Chai-Bong;Kang, Dae-Gee
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.11 no.2
    • /
    • pp.132-136
    • /
    • 2010
  • Psychophysical experiment was performed to investigate how audio-visual spatial disparity affects on perceptual space in peripheral vision. In the experiment, participants were exposed to two stimuli of vision and sound which comes simultaneously from different directions, respectively. The visual stimulus was implemented by 7 white LEDs which were located at an equal distance with 7 different angles of $-70^{\circ}$, $-40^{\circ}$, $-20^{\circ}$, $0^{\circ}$, $20^{\circ}$, $40^{\circ}$, and $70^{\circ}$ from the right front. Those audial stimuli were also implemented by loudspeakers which were placed at 9 different directions equally spaced by $5^{\circ}$ ranged from $-20^{\circ}$ to $20^{\circ}$. Each participant then evaluated spatial disparity between visual and audial stimuli with 5 levels of response, in which the higher level indicates the larger gap. When the visual stimulus is applied from the right, the results show that the response level gets higher for a larger angle between visual and auditory stimuli. A similar tendency for the visual stimulus with $0^{\circ}$ orientation was also be observed. On the other hand, when the visual stimulus is applied from the left, the response level gets lower for the larger angle.

Development of a Real-time Vehicle Driving Simulator

  • Kim, Hyun-Ju;Park, Min-Kyu;Lee, Min-Cheoul;You, Wan-Suk
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.51.2-51
    • /
    • 2001
  • A vehicle driving simulator is a virtual reality device which makes a human being feel as if the one drives a vehicle actually. The driving simulator is effectively used for studying interaction of a driver-vehicle and developing the vehicle system of new concepts. The driving simulator consists of a motion platform, a motion controller, a visual and audio system, a vehicle dynamic analysis system, a vehicle operation system and etc. The vehicle dynamic analysis system supervises overall operation of the simulator and also simulates dynamic motion of a multi-body vehicle model in real-time. In this paper, the main procedures to develop the driving simulator are classified by 4 parts. First, a vehicle motion platform and a motion controller, which generates realistic motion using a six degree of freedom Stewart platform driven hydraulically. Secondly, a visual system generates high fidelity visual scenes which are displayed on a screen ...

  • PDF

A Research of User Experience on Multi-Modal Interactive Digital Art

  • Qianqian Jiang;Jeanhun Chung
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.16 no.1
    • /
    • pp.80-85
    • /
    • 2024
  • The concept of single-modal digital art originated in the 20th century and has evolved through three key stages. Over time, digital art has transformed into multi-modal interaction, representing a new era in art forms. Based on multi-modal theory, this paper aims to explore the characteristics of interactive digital art in innovative art forms and its impact on user experience. Through an analysis of practical application of multi-modal interactive digital art, this study summarises the impact of creative models of digital art on the physical and mental aspects of user experience. In creating audio-visual-based art, multi-modal digital art should seamlessly incorporate sensory elements and leverage computer image processing technology. Focusing on user perception, emotional expression, and cultural communication, it strives to establish an immersive environment with user experience at its core. Future research, particularly with emerging technologies like Artificial Intelligence(AR) and Virtual Reality(VR), should not merely prioritize technology but aim for meaningful interaction. Through multi-modal interaction, digital art is poised to continually innovate, offering new possibilities and expanding the realm of interactive digital art.

How of Improve an identity of mobile device interface and usability? (모바일 디바이스의 인터페이스 아이덴티티 개선 및 사용성 증대방안)

  • Song, Sang-Gon;Kim, Young-Sun;Choo, Hee-Jeong;Kang, Tae-Young;Hong, Noh-Kyung
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02b
    • /
    • pp.140-145
    • /
    • 2008
  • Mobile device has a constraint such as a small physical display size and interaction. One of the most important issues in mobile devices is the express an identity of one's products company. It can be appeal to the user in side of consistency. Thus, we integrate and extract an identity element from user experience including, Graphical User Interface, Information Architecture and Audio User Interface. The study was conducted by a task force team with User Interface practitioners of managing divisions of each product. In this study, methods and processes that were attempted in order to establish consistency principles of user experiences, enhancing the various characteristics of each product, are described. The results and practical experiences obtained through the processes are introduced.

  • PDF

Multimodal audiovisual speech recognition architecture using a three-feature multi-fusion method for noise-robust systems

  • Sanghun Jeon;Jieun Lee;Dohyeon Yeo;Yong-Ju Lee;SeungJun Kim
    • ETRI Journal
    • /
    • v.46 no.1
    • /
    • pp.22-34
    • /
    • 2024
  • Exposure to varied noisy environments impairs the recognition performance of artificial intelligence-based speech recognition technologies. Degraded-performance services can be utilized as limited systems that assure good performance in certain environments, but impair the general quality of speech recognition services. This study introduces an audiovisual speech recognition (AVSR) model robust to various noise settings, mimicking human dialogue recognition elements. The model converts word embeddings and log-Mel spectrograms into feature vectors for audio recognition. A dense spatial-temporal convolutional neural network model extracts features from log-Mel spectrograms, transformed for visual-based recognition. This approach exhibits improved aural and visual recognition capabilities. We assess the signal-to-noise ratio in nine synthesized noise environments, with the proposed model exhibiting lower average error rates. The error rate for the AVSR model using a three-feature multi-fusion method is 1.711%, compared to the general 3.939% rate. This model is applicable in noise-affected environments owing to its enhanced stability and recognition rate.

Development of Interactive Data Broadcasting System Compliant with ATSC Standards

  • Jeong, Jong-Myeon;Lee, Yong-Ju;Park, Min-Sik;Choi, Ji-Hoon;Choi, Jin-Soo;Kim, Jin-Woong
    • ETRI Journal
    • /
    • v.26 no.2
    • /
    • pp.149-160
    • /
    • 2004
  • In this paper, we present an interactive data broadcasting system compliant with the Advanced Television Systems Committee (ATSC) standards. The proposed system provides users not only with various data broadcasting services but also remote interactive services. For various data broadcasting services, we have adopted a synchronized data injector that calculates the transmission time of synchronized data accurately and multiplexes synchronized data with the data of an MPEG-2 audio-visual program according to the calculated transmission time. To support remote interactive services, we designed and implemented a return channel server connected on a bi-directional interaction channel. Test results show that the proposed system provides both an asynchronous and synchronized data broadcasting service and remote interactive service appropriately.

  • PDF

Screen Performance of the Korean Actress Kim Hye-Soo (영화배우 김혜수의 스크린 퍼포먼스)

  • Kim, Jong-Guk
    • Journal of Information Technology Applications and Management
    • /
    • v.28 no.1
    • /
    • pp.43-51
    • /
    • 2021
  • This article explores Kim Hye-soo's film acting from the perspective of performance, which means a socio-cultural action planned and intended for a certain purpose. Through the aspect of screen performance which the identity of the era that the performance study aims for is expressed through acting and reappeared in a system of verbal and non-verbal symbols, it was intended to enhance the academic value of Korean film acting. First, Kim Hye-soo's acting performance transforms by repeating genre acting. The sensuality and sexual attractiveness that evaluates Kim Hye-soo are repeated by the typical vision required by genre films, but the acting performance is not consumed or subordinated as a tool for visual pleasure. Second, Kim Hye-soo's body, face, emotion and audio are engraved with memories of the times, and the sociocultural identity of the performance is expressed through dynamic interaction between actions and reactions. Third, Kim Hye-soo's restored and recreated performance is sensitive to the changes of the times and is still in the process.