• Title/Summary/Keyword: visual/audio system

Design of A/V Communication System for Passenger and Attendant in Train (열차 내 승무원과의 원격대화 시스템 설계에 관한 연구)

  • Chang, Duk-Jin;Kang, Song-Hee;Park, Hyun-Hue;Kang, Dae-Ho;Heo, Jae-Seok;Song, Dahl-Ho
    • Proceedings of the KSR Conference / 2010.06a / pp.448-454 / 2010
  • A 20-car KTX train is currently 388 m long and carries 931 people, including one captain and three crew members, who are too few to cover such a long service area and so many customers. A passenger who wants to talk to an attendant must either wait for one to pass by the seat or walk to an intercom, which is installed in every other car. Both options are inconvenient. This paper therefore presents a system design for an audio/visual communication system between a passenger and an attendant. The system was analyzed and designed using an object-oriented methodology with UML (Unified Modeling Language). From a problem statement, a use-case diagram, sequence diagrams, a class diagram, state charts, and a collaboration diagram were produced. The design will be used to implement the system on a HEMU-400X test train, where it will be tested.

Implementation of Text-to-Audio Visual Speech Synthesis Using Key Frames of Face Images (키프레임 얼굴영상을 이용한 시청각음성합성 시스템 구현)

  • Kim MyoungGon;Kim JinYoung;Baek SeongJoon
    • MALSORI / no.43 / pp.73-88 / 2002
  • This paper presents a lip-sync algorithm for natural facial synthesis, based on a key-frame method using radial basis functions (RBF). For lip synthesis, viseme range parameters are derived from the phonemes and their durations produced by the text-to-speech (TTS) system, and the viseme information matching each phoneme is extracted from the AV DB. A dominance function is applied to model coarticulation, and bilinear interpolation is applied to reduce computation time. Lip-sync is then performed by playing the synthesized images, obtained by interpolating between phonemes, together with the TTS speech.
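
The dominance-function step mentioned in this abstract can be illustrated with a minimal sketch. It assumes the widely used exponential dominance form (a weight that decays with distance from each viseme's center time) and a dominance-weighted average of viseme targets; the abstract does not state which form the authors used. The function names, the parameters `alpha`, `theta`, `c`, and the lip-opening targets are all hypothetical.

```python
import math


def dominance(t, center, alpha=1.0, theta=10.0, c=1.0):
    """Exponential dominance of one viseme at time t (seconds).

    alpha scales the peak, theta sets how fast influence decays,
    c shapes the decay. All three are illustrative assumptions.
    """
    return alpha * math.exp(-theta * abs(t - center) ** c)


def blend_lip_parameter(t, visemes):
    """Dominance-weighted average of viseme targets at time t.

    visemes: list of (center_time, target_value) pairs.
    """
    weights = [dominance(t, center) for center, _ in visemes]
    total = sum(weights)
    weighted = sum(w * target for w, (_, target) in zip(weights, visemes))
    return weighted / total


# Hypothetical lip-opening targets (0 = closed, 1 = open) for three phonemes.
visemes = [(0.05, 0.0), (0.15, 1.0), (0.25, 0.3)]

# Sample the blended trajectory every 50 ms from 0 s to 0.3 s.
track = [blend_lip_parameter(i / 100, visemes) for i in range(0, 31, 5)]
```

Because neighboring visemes always contribute some weight, the blended value never jumps between targets, which is the coarticulation effect the dominance function is meant to capture.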

A Blind Video Watermarking Technique Using Luminance Masking and DC Modulus Algorithm (휘도 마스킹과 DC Modulus 알고리즘을 이용한 비디오 워터마킹)

  • Jang Yong-Won;Kim, In-Taek;Han, Seung-Soo
    • The Transactions of the Korean Institute of Electrical Engineers D / v.51 no.7 / pp.302-307 / 2002
  • Digital watermarking is a technique that embeds an invisible signal, including owner identification and copy control information, into multimedia data such as audio, video, and images for copyright protection. This paper introduces a new MPEG watermark embedding algorithm that uses the complex block effect based on the Human Visual System (HVS). In this algorithm, $8 \times 8$ dark blocks are selected, and the watermark is embedded in the DC component of the discrete cosine transform (DCT) by means of quantization and a modulus calculation. The algorithm uses blind watermark retrieval, which detects the embedded watermark without the original image. The experimental results show that the proposed watermarking technique is robust against MPEG coding, bitrate changes, and various GOP (Group of Pictures) changes.
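
The idea of embedding a bit in a DC coefficient through quantization and a modulus test can be sketched as follows. This is a generic parity-of-quantization-index scheme shown only for illustration; the abstract does not give the paper's exact embedding rule, block selection, or step size. The names `embed_bit`, `extract_bit`, and the default `step=16` are assumptions.

```python
def embed_bit(dc, bit, step=16):
    """Quantize a DC coefficient so the parity of its index encodes bit.

    Illustrative rule only: the coefficient is snapped to a multiple of
    step whose index (coefficient / step) is even for 0 and odd for 1.
    """
    q = round(dc / step)
    if q % 2 != bit:
        # Move to the nearer neighboring index that has the right parity.
        q += 1 if dc >= q * step else -1
    return q * step


def extract_bit(dc, step=16):
    """Blind extraction: needs only the received coefficient and the step."""
    return round(dc / step) % 2


# Hypothetical DC value of a dark 8x8 block.
marked = embed_bit(100.0, 1)
recovered = extract_bit(marked + 5.0)  # survives a distortion below step / 2
```

A larger step makes the mark survive stronger requantization at the cost of a more visible change, which is why schemes of this kind restrict embedding to blocks where the human visual system is less sensitive.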

Beginning of a New Standard: Internet of Media Things

  • Kim, Sang-Kyun;Sahu, Nevadita;Preda, Marius
    • KSII Transactions on Internet and Information Systems (TIIS) / v.11 no.11 / pp.5182-5199 / 2017
  • The Internet of Things (IoT) has recently driven a large variety of research, development, and new types of markets. All types of devices and sensors will be part of the Internet of Things and will be able to communicate not only plain data but also audio-visual, olfactory, and haptic media data. In addition, as devices and sensors get smarter, it is highly probable that they will be able to process acquired media and metadata to extract higher-level information (e.g., semantics). To support such enhanced functionality, ISO/IEC SC29 WG11 (MPEG) has started a new standard project, ISO/IEC 23093, called Internet of Media Things (IoMT), to provide standard data formats and APIs for media things. This paper presents the IoMT standardization activities, focusing on terms, standard scopes, and major media things with their use cases. One of the use cases, an IoT system for navigation assistance for blind pedestrians, is evaluated to demonstrate its effectiveness.

A Study on the State of Sex Education in High Schools and for Improvement in It -in case of Taegu city & Kyungbuk province- (학교 성교육의 현황 및 개선방향에 관한 연구 -대구시와 경북지역을 중심으로-)

  • 김정옥
    • Journal of Korean Home Economics Education Association / v.5 no.1 / pp.121-132 / 1993
  • The purpose of this study is to evaluate the state and problems of sex education in junior and senior high schools in Taegu city and Kyungbuk province, and to look for solutions. The article consists of four parts: the present state of sex education, how to improve sex education, the problems of teacher training, and the actual condition of the educational materials. The questionnaire items were analyzed with the SAS-PC program. The results are reported as frequency, percentage, mean, and $\chi^2$ (chi-square). For more effective sex education in high schools, the first requirements are the development and distribution of various audio-visual teaching materials and a systematic, interrelated arrangement of educational content. Regarding teacher training, standards need to be established for the general management and equipment of the instruction system.

BER DEGRADATION DUE TO THE PHASE NOISE SPECTRAL SHAPE IN LMDS SYSTEMS

  • Kim, Youngsun;Song, Jong-In;Kim, Kiseon
    • Proceedings of the IEEK Conference / 2000.07a / pp.113-116 / 2000
  • Oscillator phase noise significantly degrades performance when a high carrier frequency and a low transmission rate are used. The bit error rate (BER) degradation of QPSK (Quadrature Phase Shift Keying) transmission is analyzed for the oscillator phase noise level specified in the downstream physical interface of LMDS (Local Multipoint Distribution Services), as described by DAVIC (Digital Audio Visual Council). A power-law model is used for the phase noise. We also investigated the effect of various transmission rates on system performance. For transmission rates below 0.5 Mbps, the BER performance is severely degraded, and we verified that a transmission rate of 20 Mbps is adequate for the downstream of LMDS systems.
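
As a rough illustration of why phase error raises the QPSK bit error rate, the sketch below evaluates the textbook BER of Gray-coded coherent QPSK in additive white Gaussian noise with a fixed carrier phase offset. It is a simplification: the paper models phase noise with a power-law spectrum, which would require averaging this expression over the resulting phase-error distribution. The function names and the example values of Eb/N0 and phase offset are assumptions.

```python
import math


def q_function(x):
    """Gaussian tail probability Q(x)."""
    return 0.5 * math.erfc(x / math.sqrt(2))


def qpsk_ber(ebn0_db, phase_error_rad=0.0):
    """BER of Gray-coded coherent QPSK with a static phase offset.

    With zero offset this reduces to Q(sqrt(2 * Eb/N0)).
    """
    ebn0 = 10 ** (ebn0_db / 10)
    amplitude = math.sqrt(2 * ebn0)
    c = math.cos(phase_error_rad)
    s = math.sin(phase_error_rad)
    # The offset rotates the constellation, so one decision boundary
    # moves closer and the other moves farther away.
    return 0.5 * (q_function(amplitude * (c + s)) + q_function(amplitude * (c - s)))


ideal = qpsk_ber(10.0)
degraded = qpsk_ber(10.0, phase_error_rad=0.2)
```

Comparing `ideal` with `degraded` shows that even a modest static offset raises the error rate sharply, because the bit whose decision margin shrinks dominates the average.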

Audio-Visual Integration based Multi-modal Speech Recognition System (오디오-비디오 정보 융합을 통한 멀티 모달 음성 인식 시스템)

  • Lee, Sahng-Woon;Lee, Yeon-Chul;Hong, Hun-Sop;Yun, Bo-Hyun;Han, Mun-Sung
    • Proceedings of the Korea Information Processing Society Conference / 2002.11a / pp.707-710 / 2002
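  • This paper proposes a multi-modal speech recognition system that fuses audio and video information. By fusing speech features with visual features, the system recognizes human speech efficiently in noisy environments. The speech features are Mel Frequency Cepstrum Coefficients (MFCC), and the visual features are feature vectors obtained through principal component analysis. To improve the recognition rate of the visual information itself, the face region is located using a skin color model and face shape information, and the lip region is then detected with a robust lip region extraction method. Audio-visual fusion is performed as early fusion using a modified time-delay neural network. Experiments show that fusing audio and visual information improves performance by roughly 5% to 20% over using audio information alone.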
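
Early (feature-level) fusion, as named in this abstract, can be sketched as concatenating the audio and video feature vectors for each time step before they reach the recognizer. The example below assumes the audio stream has a higher frame rate than the video stream and aligns them by repeating video frames; the abstract does not describe the authors' alignment, and the function name, feature sizes, and values are hypothetical placeholders for real MFCC and PCA features.

```python
def early_fusion(audio_frames, video_frames):
    """Concatenate per-frame audio and video features into joint vectors.

    audio_frames: list of MFCC-like vectors (higher frame rate)
    video_frames: list of PCA-projected lip vectors (lower frame rate)
    """
    if not audio_frames or not video_frames:
        raise ValueError("both streams must contain at least one frame")
    fused = []
    for i, audio in enumerate(audio_frames):
        # Map the audio frame index to the video frame covering the same time.
        j = min(i * len(video_frames) // len(audio_frames), len(video_frames) - 1)
        fused.append(list(audio) + list(video_frames[j]))
    return fused


# Placeholder features: 8 audio frames of 3 values, 2 video frames of 2 values.
audio = [[0.1 * i, 0.2 * i, 0.3 * i] for i in range(8)]
video = [[1.0, 2.0], [3.0, 4.0]]
joint = early_fusion(audio, video)
```

The resulting joint vectors would then be fed to a single classifier, in contrast to late fusion, where separate audio and video recognizers are combined at the decision level.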

A Study on Audio-visual Stimulation Based Unconstrained Stress Analysis using Chair-type BCG Measurement System (의자형 심탄도 측정시스템을 이용한 시청각 자극 기반의 무구속 스트레스 분석 연구)

  • Kim, Byeong-Ju;Noh, Yun-hong;Jeong, Do-Un
    • Proceedings of the Korea Information Processing Society Conference / 2014.04a / pp.1012-1013 / 2014
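  • In this paper, we developed an unconstrained chair-type ballistocardiogram (BCG) measurement system that can continuously monitor cardiac state during daily life. We also used the biosignals measured by the implemented system to analyze the level of stress caused by subjective emotional stimuli. In the experiment, subjects sat on the system while audio-visual stimulation was presented in real time, and the heart rate and the time-domain and frequency-domain parameters of heart rate variability (HRV) were examined. The HRV parameters were compared with a stress index for subjective emotional stimuli, derived from James Russell's emotion model, which systematizes the emotions reported during the audio-visual stimulation and represents the relationships among emotions in a two-dimensional space. The experimental results show that the RMSSD and LF/HF parameters have potential for classifying stress levels.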
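
RMSSD, one of the two HRV parameters highlighted in this abstract, is the root mean square of successive differences between beat-to-beat intervals. The sketch below computes it, together with mean heart rate, from a list of intervals in milliseconds. The interval values are hypothetical, and the LF/HF ratio is omitted because it requires spectral estimation choices that the abstract does not specify.

```python
import math


def rmssd(intervals_ms):
    """Root mean square of successive beat-to-beat interval differences."""
    diffs = [b - a for a, b in zip(intervals_ms, intervals_ms[1:])]
    if not diffs:
        raise ValueError("need at least two intervals")
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


def mean_heart_rate(intervals_ms):
    """Mean heart rate in beats per minute from intervals in milliseconds."""
    return 60000.0 / (sum(intervals_ms) / len(intervals_ms))


# Hypothetical beat-to-beat intervals taken from a BCG recording.
intervals = [800, 810, 790, 800]
hrv_rmssd = rmssd(intervals)
heart_rate = mean_heart_rate(intervals)
```

Lower RMSSD values are generally associated with reduced short-term variability, which is why the parameter is often examined as a candidate stress indicator.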

A Fundamental Study on the Marine Leisure - focus on the Psychology of Emotion for Seashore Relaxation - (해양레저에 관한 기초적인 연구 - 해변휴양의 정서심리를 중심으로 -)

  • Yoon, Soon-Dong
    • Proceedings of KOSOMES biannual meeting / 2008.05a / pp.75-80 / 2008
  • There is considerable interest in and research on the practical side of marine leisure, but little research on its fundamentals. A theoretical basis for the merits of marine leisure is needed. The author analyzed the visual and audio information of the seashore environment aesthetically and musically, based on the psychology of emotion. As a result, people could gain positive emotions by participating in seashore relaxation and turn their negative emotions into positive ones.

Screen Performance of the Korean Actress Kim Hye-Soo (영화배우 김혜수의 스크린 퍼포먼스)

  • Kim, Jong-Guk
    • Journal of Information Technology Applications and Management / v.28 no.1 / pp.43-51 / 2021
  • This article explores Kim Hye-soo's film acting from the perspective of performance, understood as a socio-cultural action planned and intended for a certain purpose. By treating screen performance as the way the identity of an era, the object of performance studies, is expressed through acting and represented in a system of verbal and non-verbal symbols, the article aims to raise the academic value of Korean film acting. First, Kim Hye-soo's acting performance transforms through the repetition of genre acting. The sensuality and sexual attractiveness by which Kim Hye-soo is judged are repeated through the typical images that genre films demand, but her acting performance is not consumed or subordinated as a tool for visual pleasure. Second, Kim Hye-soo's body, face, emotion, and voice are engraved with memories of the times, and the sociocultural identity of the performance is expressed through dynamic interaction between actions and reactions. Third, Kim Hye-soo's restored and recreated performance is sensitive to the changes of the times and is still in progress.