• Title/Summary/Keyword: visual-audio

Search Result 424, Processing Time 0.151 seconds

BER DEGRADATION DUE TO THE PHASE NOISE SPECTRAL SHAPE IN LMDS SYSTEMS

  • Kim, Youngsun;Song, Jong-In;Kim, Kiseon
    • Proceedings of the IEEK Conference
    • /
    • 2000.07a
    • /
    • pp.113-116
    • /
    • 2000
  • Phase noise of oscillator gives the performance degradation significantly when a high carrier frequency and low transmission rate are used. The BER(Bit Error Rates) degradation of QPSK(Quadrature Phase Shift Keying) transmission is analyzed with the oscillator phase noise level specified in downstream physical interface of LMDS(Local Multipoint Distribution Services) which is described in DAVIC(Digital Audio Visual Council). The model used for the phase noise is a power-law model. We also investigated the effects of the various transmission rates on system performance. For the transmission rate below 0.5 Mbps, the BER performance is severely degraded and we verified that the transmission rate, 20 Mbps, is adequate for the downstream of LMDS systems.

  • PDF

An Augmented Refrigerator with the Awareness of Wasteful Electricity Usage

  • Fujinami, Kaori;Kagatsume, Shota;Murata, Satoshi;Alasalmi, Tuomo;Suutala, Jaakko;Roning, Juha
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.6 no.1
    • /
    • pp.1-4
    • /
    • 2014
  • In this paper, an augmented refrigerator is proposed that presents information to increase the awareness of electric power consumption of a household fridge. The key idea is to reflect wasteful behavior on the feedback to a user, rather than mere amount of consumption or duration of opening a door of a fridge.

Virtual displays and virtual environments

  • Gilkey, R.H.;Isabelle, S.K.;Simpson, B.B.
    • Journal of the Ergonomics Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.101-122
    • /
    • 1997
  • Our recent work on virtual environments and virtual displays is reviewed, including our efforts to establish the Virtual Environment Research, Interactive Technology, And Simulation (VERITAS) facility and our research on spatial hearing. VERITAS is a state-of -the-art multisensory facility, built around the ${CAVE}^{TM}$ technology. High-quality 3D audio is included and haptic interfaces are planned. The facility will support technical and non-technical users working in a wide variety of application areas. Our own research emphasizes the importance of auditory stimulation in virtual environments and complex display systems. Experiments on auditory-aided visual target acquistion, sensory conflict, sound localization in noise, and loxalization of speech stimuli are discussed.

  • PDF

A Study on Implementation of Objective Quality Assurance System for Mobile Multimedia Video (이동 멀티미디어 영상의 객관적인 품질측정 시스템 구현에 관한 연구)

  • Paek, Seung-Eun;Ohn, Jin-Ho;Joo, Hae-Jong;Hong, Bong-Wha;Kim, Eun-Won;Park, Young-Bae
    • Proceedings of the IEEK Conference
    • /
    • 2007.07a
    • /
    • pp.487-488
    • /
    • 2007
  • This Paper provides perceptual metrics for video quality based on properties of human visual system, and audio quality based on human audition. All metrics work without reference signals, allowing non-intrusive, in-service measurements. A simple and easy-to-learn user interface displays the metrics and saves them in popular file formats like CSV.

  • PDF

Subword-based Lip Reading Using State-tied HMM (상태공유 HMM을 이용한 서브워드 단위 기반 립리딩)

  • Kim, Jin-Young;Shin, Do-Sung
    • Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.123-132
    • /
    • 2001
  • In recent years research on HCI technology has been very active and speech recognition is being used as its typical method. Its recognition, however, is deteriorated with the increase of surrounding noise. To solve this problem, studies concerning the multimodal HCI are being briskly made. This paper describes automated lipreading for bimodal speech recognition on the basis of image- and speech information. It employs audio-visual DB containing 1,074 words from 70 voice and tri-viseme as a recognition unit, and state tied HMM as a recognition model. Performance of automated recognition of 22 to 1,000 words are evaluated to achieve word recognition of 60.5% in terms of 22word recognizer.

  • PDF

A Comparative Case Study on the Environment Lesson in a Middle School (중학교 환경교육 학습방안에 관한 사례 비교 연구)

  • Kim, Mi-Jeong;Jo, Yeong-Min
    • Proceedings of the Korean Society for Environmental Edudation Conference
    • /
    • 2005.12a
    • /
    • pp.138-147
    • /
    • 2005
  • In the present study, the middle school students' perception on the environment course was surveyed before and after three different lessons. It was found that most students were taking environment related issues mainly from mass media including internet and broadcasting. The young students were satisfied at practical experiments, but at the same time a few old fashioned experimental programs would not be preferred. Utilization of multimedia such as audio-visual tools was one of the effective tools for the environment class, because it could help to understand even the profound principles of the chemical and physical processes. However, some students did not concentrate on the display, causing frequent disturbance of the class. Since the school for this investigation did not choose the environment as a selective course, practical tools and materials for the experiment should not be sufficient, and thereby a further detailed work must be followed with the students who are taking environment lessons.

  • PDF

Audio-Visual Integration based Multi-modal Speech Recognition System (오디오-비디오 정보 융합을 통한 멀티 모달 음성 인식 시스템)

  • Lee, Sahng-Woon;Lee, Yeon-Chul;Hong, Hun-Sop;Yun, Bo-Hyun;Han, Mun-Sung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.11a
    • /
    • pp.707-710
    • /
    • 2002
  • 본 논문은 오디오와 비디오 정보의 융합을 통한 멀티 모달 음성 인식 시스템을 제안한다. 음성 특징 정보와 영상 정보 특징의 융합을 통하여 잡음이 많은 환경에서 효율적으로 사람의 음성을 인식하는 시스템을 제안한다. 음성 특징 정보는 멜 필터 캡스트럼 계수(Mel Frequency Cepstrum Coefficients: MFCC)를 사용하며, 영상 특징 정보는 주성분 분석을 통해 얻어진 특징 벡터를 사용한다. 또한, 영상 정보 자체의 인식률 향상을 위해 피부 색깔 모델과 얼굴의 형태 정보를 이용하여 얼굴 영역을 찾은 후 강력한 입술 영역 추출 방법을 통해 입술 영역을 검출한다. 음성-영상 융합은 변형된 시간 지연 신경 회로망을 사용하여 초기 융합을 통해 이루어진다. 실험을 통해 음성과 영상의 정보 융합이 음성 정보만을 사용한 것 보다 대략 5%-20%의 성능 향상을 보여주고 있다.

  • PDF

Designing Education Contents for Chinese Character Utilizing Internet of Things (IoT)

  • Jung, Sugkyu
    • Smart Media Journal
    • /
    • v.5 no.2
    • /
    • pp.24-32
    • /
    • 2016
  • Recently, the development of electronic teaching materials and the demand of digital learners have led the needs on the education contents that replace learning from character information and the change of an information design method for this. Chinese character education in the traditional schooling mainly focuses on writing and memorization (semantic memory). This way that the stories do not exist has brought the learners' recognition that Chinese character is difficult to learn. Meanwhile, for a language study such as English, cross-media development between printed materials and audio-visual materials has been actively introduced. The method that extends episode memories along with memorization through a story is widely used. Therefore, this content suggests a prototype, which is broken away from an existing way of learning Chinese character that mainly focuses on writing, one sided instruction and information cramming. This makes learners learn through a story from printed materials and animation. Furthermore, it suggests a method that extends episode memories through Chinese education contents based on IoT explaining the principle of Chinese character by combining IT technology (information and communications, IoT) and education contents on block toys.

A Design of Automatic Translation System for Military English Abbreviation Including Phonetic and Educational Function (음성출력/학습기능을 지원하는 군사영어약어 자동번역 시스템 설계)

  • Kim Hong-Seop;Lee Hyeon-Geol
    • Journal of the military operations research society of Korea
    • /
    • v.18 no.1
    • /
    • pp.32-46
    • /
    • 1992
  • One of the problems we frequently face during the ROK and US Combined operations is the English Military abbreviations because they often causes a lot of confusion. Many military abbreviations we generated, changed, and disappeared, so it is very hard to figure out their meaning sometimes. This system is designed to make it easier to register, alter, and find out English abbreviations through hypermedia techniques, which is utilizing nonsequential and direct search system similar to human sensory organs. So this enables us to keep up with the latest abbreviations. It is also designed to overcome mutual communications barriers by audio-visual aids through the graphic and phonetic functions of the program, and to test users via a random selection of questions.

  • PDF

A Study on Audio-visual Stimulation Based Unconstrained Stress Analysis using Chair-type BCG Measurement System (의자형 심탄도 측정시스템을 이용한 시청각 자극 기반의 무구속 스트레스 분석 연구)

  • Kim, Byeong-Ju;Noh, Yun-hong;Jeong, Do-Un
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.04a
    • /
    • pp.1012-1013
    • /
    • 2014
  • 본 논문에서는 일상생활 중 지속적으로 심장 상태를 모니터링 할 수 있는 무구속 의자형 심탄도 측정시스템을 개발하였다. 또한 구현된 시스템에서 측정된 생체신호를 이용하여 주관적인 감정자극의 스트레스를 분석하기 위한 연구를 수행하였다. 수준을 분석하고자 하였다. 실험은 시스템에 착석하여 실시간으로 시청각 자극 실험을 수행하였고, 심박수와 심박변이도의 시간영역 및 주파수영역 파라미터를 확인하였다. 확인된 심박변이도의 파라미터는 시청각 도중 기술한 인간의 감정들을 체계화하여 2차원 공간에 여러 감정들의 관계를 나타낸 제임스 러셀(J. Russell)의 감정모델을 주관적인 감정 자극에 의한 스트레스 지표 나타내어 비교 분석하였다. 실험결과는 RMSSD, LF/HF 파라미터가 스트레스 수준 분류에 사용될 수 있는 잠재력을 가지고 있음을 증명한다.