• Title/Abstract/Keywords: Video-audio media

203 results (search time: 0.028 s)

XCRAB :내용 및 주석 기반의 멀티미디어 인덱싱과 검색 시스템 (XCRAB : A Content and Annotation-based Multimedia Indexing and Retrieval System)

  • 이수철;노승민;황인준
    • The KIPS Transactions: Part B / Vol. 11B, No. 5 / pp. 587-596 / 2004
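  • Recently, new kinds of systems have been developed for indexing, browsing, and querying diverse digital multimedia data such as audio, video, and images. Such systems divide each media stream into small units according to actual physical events and index those events efficiently for retrieval. This paper uses a new method that exploits the audio, image, and video features of each item to analyze and segment audio-visual data, which can overcome the limitations of earlier methods that analyzed only images or video. Using this method, we implemented a web-based multimedia retrieval system called XCRAB and, to evaluate its performance, ran experiments with various combinations of queries.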

A Beamforming-Based Video-Zoom Driven Audio-Zoom Algorithm for Portable Digital Imaging Devices

  • Park, Nam In;Kim, Seon Man;Kim, Hong Kook;Kim, Myeong Bo;Kim, Sang Ryong
    • IEIE Transactions on Smart Processing and Computing / Vol. 2, No. 1 / pp. 11-19 / 2013
  • A video-zoom driven audio-zoom algorithm is proposed to provide audio zooming effects according to the degree of video-zoom. The proposed algorithm is designed based on a super-directive beamformer operating with a 4-channel microphone array in conjunction with a soft masking process that uses the phase differences between microphones. The audio-zoom processed signal is obtained by multiplying the audio gain derived from the video-zoom level by the masked signal. The proposed algorithm is then implemented on a portable digital imaging device with a clock speed of 600 MHz after different levels of optimization, such as algorithmic level, C-code and memory optimization. As a result, the processing time of the proposed audio-zoom algorithm occupies 14.6% or less of the clock speed of the device. The performance evaluation conducted in a semi-anechoic chamber shows that the signals from the front direction can be amplified by approximately 10 dB compared to the other directions.
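
The final step described in this abstract, multiplying the masked signal by a gain derived from the video-zoom level, can be illustrated with a minimal sketch. The mapping from zoom level to gain is not given in the abstract, so the linear-in-decibel mapping, the function names, and the `max_zoom` and `max_gain_db` parameters below are all assumptions made for illustration only.

```python
def zoom_to_gain(zoom_level, max_zoom=4.0, max_gain_db=10.0):
    """Map a video-zoom level in [1, max_zoom] to a linear amplitude gain.

    Assumption: gain grows linearly in dB with the zoom level. The
    abstract does not specify the actual mapping.
    """
    # Clamp the zoom level to the supported range.
    zoom_level = min(max(zoom_level, 1.0), max_zoom)
    gain_db = max_gain_db * (zoom_level - 1.0) / (max_zoom - 1.0)
    # Convert decibels to a linear amplitude factor.
    return 10.0 ** (gain_db / 20.0)


def apply_audio_zoom(masked_signal, zoom_level):
    """Scale an already beamformed and soft-masked signal by the zoom gain."""
    gain = zoom_to_gain(zoom_level)
    return [gain * sample for sample in masked_signal]
```

With these assumed defaults, a zoom level of 1 leaves the signal unchanged and the maximum zoom level applies the full 10 dB of amplification.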

디지털 미디어 콘텐츠 방송 시스템 구현 (Implementation of the Broadcasting System for Digital Media Contents)

  • 신재흥;김홍열;이상철
    • The Transactions of the Korean Institute of Electrical Engineers / Vol. 57, No. 10 / pp. 1883-1887 / 2008
  • Most digital media content consists of video, audio, pictures, and animation. The quality with which video and audio information is perceived can vary with the receiver's characteristics or level of understanding, whereas visual information presented as text gives people the clearest and most accurate way to recognize information. In this paper, we propose a new broadcasting system (BSDMC) that conveys the meaning of digital media content clearly and accurately. We implement general-purpose components that display video, pictures, text, and symbols simultaneously. A multimedia content broadcasting system can be developed easily by plugging these components into an application development tool and calling them with the proper parameters. The components are built on an object-oriented framework with a modular structure, which increases reusability and allows other applications to be developed quickly and reliably.

동영상 정보제공이 위내시경 대상자의 신체적 불편감, 불안 및 간호 만족도에 미치는 효과 (The Effects of Video-audio Information Provision on Physical Discomfort, Anxiety, and Nursing Satisfaction of the Clients for Gastroscopy)

  • 권영은;김분한
    • Korean Journal of Adult Nursing / Vol. 25, No. 2 / pp. 231-239 / 2013
  • Purpose: This study was conducted to identify the effects of video-audio information provision on the physical discomfort, anxiety, and nursing satisfaction of clients undergoing gastroscopy. Methods: The study used a nonequivalent control group pretest-posttest design. The subjects were 50 patients who visited the health examination center of H hospital for gastroscopy. Video-audio information developed by the authors was used as educational material for the treatment group. The data were collected between September 15 and November 15, 2010. The study instruments were the State-Trait Anxiety Inventory, the Physical Discomfort Scale, and the Nursing Satisfaction Scale. Results: The levels of anxiety and physical discomfort in the treatment group were not significantly different from those in the comparison group (t=-0.28, p=.781; t=-0.34, p=.741). The level of clients' satisfaction with nursing care in the treatment group was significantly higher than in the comparison group (t=-4.12, p<.001). Conclusion: The use of video-audio information was effective in increasing satisfaction with care. It could therefore be useful in nursing practice as a nursing intervention to improve nursing satisfaction.

사용자 취향, 감성 및 상황인지 기반 음원 추천 서비스 구현 (A Design of real sound recommendation service based-on User's preference, emotion and circumstance)

  • 정종진;임태범;이석필
    • Proceedings of the Korea Information Processing Society Conference / KIPS 2011 Spring Conference / pp. 689-691 / 2011
  • With the rapid development of information and communication technology, multimedia presentation technology is evolving from passive services into services that users can enjoy actively and realistically according to their own preferences and tastes. In particular, the industry for realistic multimedia services that target human emotion through the properties of human hearing is expected to form a high value-added premium market. Audio technology is more closely tied to human emotion and the surrounding viewing environment than video technology is. Compared with video technology, audio technology is also a research area that appeals to human emotion and emphasizes psychological aspects. From this viewpoint, developing intelligent and realistic audio technology requires a high degree of specialization. This study introduces an "intelligent real-sound presentation technology" that supports high-quality, realistic audio, together with the "core technologies" that compose it.

씬클라이언트 컴퓨팅에서 스트리밍 미디어의 QoS를 보장하는 지능형 미디어 플레이어 (An Intelligent Media Player for Guaranteeing QoS Streaming Media on Thin-Client Computing)

  • 김병길;이좌형;정인범
    • The KIPS Transactions: Part B / Vol. 12B, No. 5 / pp. 607-616 / 2005
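  • In a thin-client environment with limited resources, it is difficult to decode computationally demanding MPEG media at a level that guarantees QoS to the user. To overcome this problem, existing approaches perform the decoding on the resources of central terminal servers and let the thin client handle only screen updates. However, the streaming media played back by these existing methods has poor picture quality. Moreover, because the servers bear the entire decoding process, they easily reach saturation even under light load. This paper identifies the causes of picture-quality degradation in wired and wireless thin-client environments. Based on the problems found in existing thin-client methods, we propose an intelligent media player that improves picture quality and synchronizes video and audio to provide users with a QoS-guaranteed streaming media service.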

Multimedia Conferencing System with Intramedia and Intermedia Synchronization Support

  • Yoo, Sang-Shin;Kim, Duck-Jin
    • Journal of Electrical Engineering and information Science / Vol. 2, No. 3 / pp. 41-50 / 1997
  • In this paper, we describe the design, implementation, and evaluation of a multimedia conferencing system with intramedia and intermedia synchronization support between audio and video. The proposed synchronization mechanism adapts dynamically to varying network conditions and thus provides an optimized QoS. In realizing the system, NeVoT on the MBone is used for audio and VIC for video. A synchronization controller is designed and implemented as a dedicated process to support intermedia synchronization. Each media agent, which handles its own media stream, is modified to include an intramedia synchronization function, and a communication function between the media agents and the synchronization controller is added for intermedia synchronization. Each media agent reports its buffering status to the synchronization control process, which in turn sends out an optimized buffering delay value, thereby supporting intermedia synchronization. The system was configured and tested on Ethernet and ATM networks, where performance measurements confirmed its effective synchronization support.
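
The report-and-reply exchange between the media agents and the synchronization controller can be sketched as follows. The abstract does not say how the optimized buffering delay is computed, so this sketch assumes the simplest plausible rule: every stream is asked to buffer as long as the slowest stream needs. The class name, method names, and millisecond units are hypothetical.

```python
class SyncController:
    """Hypothetical intermedia synchronization controller.

    Assumption: the common buffering delay is the largest delay any
    agent currently reports needing. The abstract does not state the
    actual optimization rule.
    """

    def __init__(self):
        # Latest required buffering delay per media agent, in milliseconds.
        self.reports = {}

    def report(self, agent_name, needed_delay_ms):
        """Record the buffering status sent by a media agent."""
        self.reports[agent_name] = needed_delay_ms

    def target_delay_ms(self):
        """Return the delay all agents should apply to stay in sync."""
        return max(self.reports.values(), default=0)


controller = SyncController()
controller.report("audio", 80)    # e.g. an audio agent
controller.report("video", 120)   # e.g. a video agent
common_delay = controller.target_delay_ms()
```

Under this assumed rule, the audio agent would add extra buffering so that both streams play out with the same delay.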

BI-DIRECTIONAL TRANSPORT AND NETWORKED DISPLAY INTERFACE OF UNCOMPRESSED HD VIDEO

  • Park, Jong-Churl;Jo, Jin-Yong;Goo, Bon-Cheol;Kim, Jong-Won
    • Proceedings of the Korean Institute of Broadcast and Media Engineers Conference / Korean Society of Broadcast Engineers, IWAIT 2009 / pp. 184-188 / 2009
  • To interactively share High Definition (HD)-quality visualization over emerging ultra-high-speed network infrastructure, several lossless and low-delay real-time media (i.e., uncompressed HD video and audio) transport systems are being designed and prototyped. However, most of them still rely on expensive hardware components. To reduce the cost of building such a system, in this paper we propose integrating the transmitter and receiver machines into a single bi-directional transport system. After detailed bottleneck analysis and subsequent refinement of the embedded software components, the proposed integration can provide Real-time Transport Protocol (RTP)-based bi-directional transport of uncompressed HD video and audio from a single machine. We also explain how to interface the Gbps-bandwidth display output of the uncompressed HD media system to a super-high-resolution (10240 $\times$ 3200) networked tiled display. Finally, to verify the feasibility of the proposed integration, several prototype systems are built and evaluated in several different experimental scenarios.

DTV 화질향상을 위한 자막데이터 전송방법 (Caption Data Transmission Method for HDTV Picture Quality Improvement)

  • 한찬호
    • Journal of Korea Multimedia Society / Vol. 20, No. 10 / pp. 1628-1636 / 2017
  • The growing volume of data carried for service convenience, such as closed captions, ancillary data, the electronic program guide (EPG), and data broadcasting, degrades the video quality of high-definition content. This article proposes a method for transferring the closed-caption data of video content without degrading video quality. Because the caption data is inserted as block images in the essential hidden area of the DTV picture, it causes no quality degradation in video compression. The proposed method also has the advantage of synchronizing video, audio, and captions from a pre-inserted script without time delay.

시선에 따른 영상 음향 정위 일치에 관한 연구 (Study on the Localization Concordance of Video and Audio)

  • 이규원;최해근;박소연;박구만;김성권
    • The Journal of the Korea Institute of Electronic Communication Sciences / Vol. 13, No. 6 / pp. 1293-1300 / 2018
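  • $360^{\circ}$ video is useful because it carries a large amount of visual information, but when the direction of a visible object differs from the direction from which its sound arrives, viewers experience fatigue and a heightened sense of sensory incongruity, which limits its use. This paper proposes a criterion that expresses, as a percentage, how well sound localization matches the viewer's gaze in $360^{\circ}$ video, and uses the proposed video-audio localization concordance rate to show that $360^{\circ}$ video with greater immersion can be produced. The proposed concordance rate is expected to be useful for measuring and evaluating the localization performance of solutions for producing and playing back stereophonic sound content, and to contribute to building more realistic systems.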
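
A percentage-style concordance measure of the kind this abstract describes could look like the sketch below. The abstract does not define the paper's actual formula, so this is only one plausible reading: the share of samples in which the direction of the visible object and the direction of its sound agree within a tolerance. The function names and the `tolerance_deg` parameter are assumptions.

```python
def angular_difference(a_deg, b_deg):
    """Smallest absolute angle between two azimuths, in degrees (0 to 180)."""
    diff = abs(a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)


def concordance_rate(visual_deg, audio_deg, tolerance_deg=10.0):
    """Percentage of samples whose audio direction matches the visual one.

    Assumption: a sample counts as matching when the two azimuths differ
    by at most tolerance_deg. This is not the paper's stated definition.
    """
    if not visual_deg or len(visual_deg) != len(audio_deg):
        raise ValueError("inputs must be non-empty and of equal length")
    matches = sum(
        1
        for v, a in zip(visual_deg, audio_deg)
        if angular_difference(v, a) <= tolerance_deg
    )
    return 100.0 * matches / len(visual_deg)
```

The wrap-around handling in `angular_difference` matters for $360^{\circ}$ content, because azimuths of 350 and 5 degrees are only 15 degrees apart.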