• Title/Summary/Keyword: Audio-Visual Information

Search Result 207, Processing Time 0.063 seconds

A Preliminary Study on the Method of Media Investigation Using a Hypertext Technique (하이퍼텍스트 기법을 이용한 미디어 조사 방법에 관한 기초 연구)

  • 김종현;김석태
    • Proceedings of the Korean Institute of Interior Design Conference
    • /
    • 2003.05a
    • /
    • pp.35-38
    • /
    • 2003
  • According to developing digital media, the delivery and the collection of information are changing with a variety of ways. Especially, digital, as it is, is easy for its sound, video and letter to be mixed. Therefore, media character, Audio-Visual will appear in total. Hypertext must be a basic form in the traveling way of media information which includes such digital Quality in originality. Taking advantage of digital media trait of hypertext , most users can make a research on a variety of studying method. On this thesis, a style of made-up hypertext is taking a part of recent digital culture, and I want to present a different method in research, using two hypertext skills, adventurous choice and joint origination

  • PDF

A Study on the Utilization of Electronic Publication in School Library (학교도서관에서의 전자출판물 활용방안 연구)

  • 황금숙
    • Journal of Korean Library and Information Science Society
    • /
    • v.33 no.4
    • /
    • pp.85-100
    • /
    • 2002
  • According to increase number of electronic publications, teacher ind student demand them in their teaching md teaming activities. So school libraries must provide various materials(ex : printed, audio-visual and electronic materials). The purpose of this study is to identify the use of electronic publications in school libraries md to suggest the methods of utilization of them.

  • PDF

Design of Video Quality Assurance and Integrated Quality Management System using No Reference QoE (비 참조 QoE를 이용한 영상품질 측정 및 통합품질 관리 시스템의 설계)

  • Kim, Sang-Soo;Park, Dong-Soo
    • The Journal of Information Technology
    • /
    • v.12 no.3
    • /
    • pp.49-57
    • /
    • 2009
  • This Paper provides perceptual metrics for video quality based on properties of human visual system, and audio quality based on human audition. All metrics work without reference signals, allowing non-intrusive, in-service measurements. A simple and easy-to-learn user interface displays the metrics and saves them in popular file formats like CSV. In this paper, proposed method was able to various and corrective measurement for the multimedia service video quality. As that it was able to application to set up service guide line and the methode of measurement and system for the set up standardization of the high quality video service.

  • PDF

Audio-Visual Integration based Multi-modal Speech Recognition System (오디오-비디오 정보 융합을 통한 멀티 모달 음성 인식 시스템)

  • Lee, Sahng-Woon;Lee, Yeon-Chul;Hong, Hun-Sop;Yun, Bo-Hyun;Han, Mun-Sung
    • Annual Conference of KIPS
    • /
    • 2002.11a
    • /
    • pp.707-710
    • /
    • 2002
  • 본 논문은 오디오와 비디오 정보의 융합을 통한 멀티 모달 음성 인식 시스템을 제안한다. 음성 특징 정보와 영상 정보 특징의 융합을 통하여 잡음이 많은 환경에서 효율적으로 사람의 음성을 인식하는 시스템을 제안한다. 음성 특징 정보는 멜 필터 캡스트럼 계수(Mel Frequency Cepstrum Coefficients: MFCC)를 사용하며, 영상 특징 정보는 주성분 분석을 통해 얻어진 특징 벡터를 사용한다. 또한, 영상 정보 자체의 인식률 향상을 위해 피부 색깔 모델과 얼굴의 형태 정보를 이용하여 얼굴 영역을 찾은 후 강력한 입술 영역 추출 방법을 통해 입술 영역을 검출한다. 음성-영상 융합은 변형된 시간 지연 신경 회로망을 사용하여 초기 융합을 통해 이루어진다. 실험을 통해 음성과 영상의 정보 융합이 음성 정보만을 사용한 것 보다 대략 5%-20%의 성능 향상을 보여주고 있다.

  • PDF

A Study on Audio-visual Stimulation Based Unconstrained Stress Analysis using Chair-type BCG Measurement System (의자형 심탄도 측정시스템을 이용한 시청각 자극 기반의 무구속 스트레스 분석 연구)

  • Kim, Byeong-Ju;Noh, Yun-hong;Jeong, Do-Un
    • Annual Conference of KIPS
    • /
    • 2014.04a
    • /
    • pp.1012-1013
    • /
    • 2014
  • 본 논문에서는 일상생활 중 지속적으로 심장 상태를 모니터링 할 수 있는 무구속 의자형 심탄도 측정시스템을 개발하였다. 또한 구현된 시스템에서 측정된 생체신호를 이용하여 주관적인 감정자극의 스트레스를 분석하기 위한 연구를 수행하였다. 수준을 분석하고자 하였다. 실험은 시스템에 착석하여 실시간으로 시청각 자극 실험을 수행하였고, 심박수와 심박변이도의 시간영역 및 주파수영역 파라미터를 확인하였다. 확인된 심박변이도의 파라미터는 시청각 도중 기술한 인간의 감정들을 체계화하여 2차원 공간에 여러 감정들의 관계를 나타낸 제임스 러셀(J. Russell)의 감정모델을 주관적인 감정 자극에 의한 스트레스 지표 나타내어 비교 분석하였다. 실험결과는 RMSSD, LF/HF 파라미터가 스트레스 수준 분류에 사용될 수 있는 잠재력을 가지고 있음을 증명한다.

Incomplete Cholesky Decomposition based Kernel Cross Modal Factor Analysis for Audiovisual Continuous Dimensional Emotion Recognition

  • Li, Xia;Lu, Guanming;Yan, Jingjie;Li, Haibo;Zhang, Zhengyan;Sun, Ning;Xie, Shipeng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.2
    • /
    • pp.810-831
    • /
    • 2019
  • Recently, continuous dimensional emotion recognition from audiovisual clues has attracted increasing attention in both theory and in practice. The large amount of data involved in the recognition processing decreases the efficiency of most bimodal information fusion algorithms. A novel algorithm, namely the incomplete Cholesky decomposition based kernel cross factor analysis (ICDKCFA), is presented and employed for continuous dimensional audiovisual emotion recognition, in this paper. After the ICDKCFA feature transformation, two basic fusion strategies, namely feature-level fusion and decision-level fusion, are explored to combine the transformed visual and audio features for emotion recognition. Finally, extensive experiments are conducted to evaluate the ICDKCFA approach on the AVEC 2016 Multimodal Affect Recognition Sub-Challenge dataset. The experimental results show that the ICDKCFA method has a higher speed than the original kernel cross factor analysis with the comparable performance. Moreover, the ICDKCFA method achieves a better performance than other common information fusion methods, such as the Canonical correlation analysis, kernel canonical correlation analysis and cross-modal factor analysis based fusion methods.

Development Directions for the Agricultural Technical Information Systems (농업기술정보 전달체계의 발전 방향)

  • Kim, Seong-Il;Choi, Min-Ho
    • Journal of Agricultural Extension & Community Development
    • /
    • v.2 no.2
    • /
    • pp.191-203
    • /
    • 1995
  • One of the major functions of rural extension services is to transfer agricultural technologies and information, and advanced new agricultural techniques developed by research institutes, which are meaningful when they are transferred to farmers for practicl application. Information materials can be transferred in the form of newspapers, radio and television broadcasting, printed materials, audio-visual aids, and public communication networks. Agricultural information systems in the era of localization should be oriented toward county extension services, and the following points should be emphasized for more effective dissemination of agricultural technologies : 1) Central organization of the Rural Development Administration should put more emphasis on the production and dissemination of agricultural information to support activities of extension agents at the county level. 2) An Agricultural information center should be established for more effective collection, analysis, processing, production and dissemination of various agri-related information. 3) An advanced and unified network system should be adopted for more accuate and rapid information flow throughout the country, and reinforcement of manpower and facility at the county level should be emphasized for more effective dissemination of agricultural information.

  • PDF

Crossmodal Perception of Mismatched Emotional Expressions by Embodied Agents (에이전트의 표정과 목소리 정서의 교차양상지각)

  • Cho, Yu-Suk;Suk, Ji-He;Han, Kwang-Hee
    • Science of Emotion and Sensibility
    • /
    • v.12 no.3
    • /
    • pp.267-278
    • /
    • 2009
  • Today an embodied agent generates a large amount of interest because of its vital role for human-human interactions and human-computer interactions in virtual world. A number of researchers have found that we can recognize and distinguish between various emotions expressed by an embodied agent. In addition many studies found that we respond to simulated emotions in a similar way to human emotion. This study investigates interpretation of mismatched emotions expressed by an embodied agent (e.g. a happy face with a sad voice); whether audio-visual channel integration occurs or one channel dominates when participants judge the emotion. The study employed a 4 (visual: happy, sad, warm, cold) $\times$ 4 (audio: happy, sad, warm, cold) within-subjects repeated measure design. The results suggest that people perceive emotions not depending on just one channel but depending on both channels. Additionally facial expression (happy face vs. sad face) makes a difference in influence of two channels; Audio channel has more influence in interpretation of emotions when facial expression is happy. People were able to feel other emotion which was not expressed by face or voice from mismatched emotional expressions, so there is a possibility that we may express various and delicate emotions with embodied agent by using only several kinds of emotions.

  • PDF

A DATABASE FOR HUMAN PERFORMANCE UNDER SIMULATED EMERGENCIES OF NUCLEAR POWER PLANTS

  • Park, Jin-Kyun;Jung, Won-Dea
    • Nuclear Engineering and Technology
    • /
    • v.37 no.5
    • /
    • pp.491-502
    • /
    • 2005
  • Reliable human performance is a prerequisite in securing the safety of complicated process systems such as nuclear power plants. However, the amount of available knowledge that can explain why operators deviate from an expected performance level is so small because of the infrequency of real accidents. Therefore, in this study, a database that contains a set of useful information extracted from simulated emergencies was developed in order to provide important clues for understanding the change of operators' performance under stressful conditions (i.e., real accidents). The database was developed under Microsoft Windows TM environment using Microsoft Access $97^{TM}$ and Microsoft Visual Basic $6.0^{TM}$. In the database, operators' performance data obtained from the analysis of over 100 audio-visual records for simulated emergencies were stored using twenty kinds of distinctive data fields. A total of ten kinds of operators' performance data are available from the developed database. Although it is still difficult to predict operators' performance under stressful conditions based on the results of simulated emergencies, simulation studies remain the most feasible way to scrutinize performance. Accordingly, it is expected that the performance data of this study will provide a concrete foundation for understanding the change of operators' performance in emergency situations.

Design and Implementation of Korean Voice Web Browser (한국어 음성 웹브라우저 설계 및 구현)

  • Jang, Young-Gun;Jo, Kyoung-Hwan
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.7 no.5
    • /
    • pp.458-466
    • /
    • 2001
  • This paper is addressed to a design and implementation of Korean voice web browser using voice technologies for controling web browser and selecting contents in the web document, and converting them to voice after HTML analysis. Main feature of this web browser is universal design which considers both of normal person and visual disabled, allows multi-modal interface. As voice interface for visual disabled, it supports tree structure which allows to recognize web document structure easily by only voice guidance regardless of frame usage, can handle all elements described as tag in the web document, identify them as predefined different voice property according to element property. This method gets rid of additional guidance voice for element property without audio style sheet or additional programming effort.

  • PDF