• Title/Summary/Keyword: object-based audio content

Search Result 11, Processing Time 0.023 seconds

Non-uniform Linear Microphone Array Based Source Separation for Conversion from Channel-based to Object-based Audio Content (채널 기반에서 객체 기반의 오디오 콘텐츠로의 변환을 위한 비균등 선형 마이크로폰 어레이 기반의 음원분리 방법)

  • Chun, Chan Jun;Kim, Hong Kook
    • Journal of Broadcast Engineering
    • /
    • v.21 no.2
    • /
    • pp.169-179
    • /
    • 2016
  • Recently, MPEG-H has been standardizing for a multimedia coder in UHDTV (Ultra-High-Definition TV). Thus, the demand for not only channel-based audio contents but also object-based audio contents is more increasing, which results in developing a new technique of converting channel-based audio contents to object-based ones. In this paper, a non-uniform linear microphone array based source separation method is proposed for realizing such conversion. The proposed method first analyzes the arrival time differences of input audio sources to each of the microphones, and the spectral magnitudes of each sound source are estimated at the horizontal directions based on the analyzed time differences. In order to demonstrate the effectiveness of the proposed method, objective performance measures of the proposed method are compared with those of conventional methods such as an MVDR (Minimum Variance Distortionless Response) beamformer and an ICA (Independent Component Analysis) method. As a result, it is shown that the proposed separation method has better separation performance than the conventional separation methods.

Adaptation for Object-based MPEG-4 Content with Multiple Streams (다중 스트림을 이용한 객체기반 MPEG-4 컨텐트의 적응 기법)

  • Cha Kyung-Ae
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.11 no.3
    • /
    • pp.69-81
    • /
    • 2006
  • In this paper, an adaptive algorithm is proposed in streaming MPEG-4 contents with fluctuating resource amount such as throughput of network conditions. In the area of adaptive streaming issue, a lot of researches have been made on how to represent encoded media(such as video) bitstream in scalable way. By contrast, MPEG-4 supports object-based multimedia content which is composed of various types of media streams such as audio, video, image and other graphical elements. Thus, it can be more effective to provide individual media streams in scalable way for streaming object-based content to heterogeneous environment. The proposed method provides the multiple media streams corresponding to an object with different qualities and bit rate in order to support object based scalability to the MPEG-4 content. In addition, an optimal selection of the multiple streams for each object to meet a given constraint is proposed. The selection process is adopted a multiple choice knapsack problem with multi-step selection for the MPEG-4 objects with different scalability levels. The proposed algorithm enforces the optimal selection process to maintain the perceptual qualities of more important objects at the best effort. The experimental results show that the set of selected media stream for presenting objects meets a current transmission condition with more high perceptual quality.

  • PDF

A Study on Realistic Sound Reproduction for UHDTV (UHDTV를 위한 실감 오디오 재현 기술)

  • Jang, Daeyoung;Seo, Jeongil;Lee, Yong Ju;Yoo, Jae-Hyoun;Park, Taejin;Lee, Taejin
    • Journal of Broadcast Engineering
    • /
    • v.20 no.1
    • /
    • pp.68-81
    • /
    • 2015
  • Owing to the latest development of component and media processing technologies, UHDTV as a successor of the HDTV is expected that this will be coming soon realization. Accordingly, an audio technology that provides a 5.1-channel surround sound in home should be contemplating on what services should be provided with the advent of UHDTV era. In fact, however, the market of 5.1-channel audio is struggling, due to the difficulty of installation and maintenance of the multi speakers in a home. Meanwhile, the movie sound market for a long time been used in 5.1 and 7.1-channel sound formats, have changed as Dolby ATMOS, IOSONO, AURO3D etc. are launched one after another with the introduction of hybrid audio technologies that include the ceiling and object-based sounds. This very object-based audio technology is assured to be introduced in the home theater and broadcast audio market, and this change in audio technology is expected to be a breath of pioneering technological advances and market growth from the channel-based audio market that lacks flexibility. In this paper, we will investigate a suitable realistic audio solution for UHDTV, and introduce hybrid audio technologies, which is expected to be an audio technology for UHDTV, and we will describe the hybrid audio content format and reproduction methods in a home and consider the future prospects of realistic audio.

A study of effective contents construction for AR based English learning (AR기반 영어학습을 위한 효과적 콘텐츠 구성 방향에 대한 연구)

  • Kim, Young-Seop;Jeon, Soo-Jin;Lim, Sang-Min
    • Journal of The Institute of Information and Telecommunication Facilities Engineering
    • /
    • v.10 no.4
    • /
    • pp.143-147
    • /
    • 2011
  • The system using augmented reality can save the time and cost. It is verified in various fields under the possibility of a technology by solving unrealistic feeling in the virtual space. Therefore, augmented reality has a variety of the potential to be used. Generally, multimodal senses such as visual/auditory/tactile feed back are well known as a method for enhancing the immersion in case of interaction with virtual object. By adapting tangible object we can provide touch sensation to users. a 3D model of the same scale overlays the whole area of the tangible object; thus, the marker area is invisible. This contributes to enhancing immersive and natural images to users. Finally, multimodal feedback also creates better immersion. In this paper, sound feedback is considered. By further improving immersion learning augmented reality for children with the initial step learning content is presented. Augmented reality is in the intermediate stages between future world and real world as well as its adaptability is estimated more than virtual reality.

  • PDF

Broadband Content Insertion Technology based on Terrestrial UHD Broadcasting MMT/ROUTE (지상파 UHD 방송 MMT/ROUTE기반 브로드밴드 콘텐츠 삽입 기술)

  • Kim, Doohwan;Lee, Dongkwan;Kim, Kyuheon
    • Journal of Broadcast Engineering
    • /
    • v.24 no.2
    • /
    • pp.329-340
    • /
    • 2019
  • Recently, broadcasting technologies have evolved as high-quality AV services such as domestic terrestrial UHD(Ultra-High Definition) broadcasting have been increasing, and broadcasting standards have been newly defined. Also, as network technology develops, contents are consumed not only in the country but also the world. Accordingly, content insertion technology, which is a method of providing suitable contents in accordance with the national and local environments, will be needed. This paper proposes a content insertion service system model and synchronization scheme using ATSC(Advanced Television Systems Committee) 3.0 Event Signaling standard under heterogeneous network environment of broadcasting network and internet network based on transmission standard DASH(Dynamic Adaptive Streaming over HTTP)/ROUTE(Real time Object delivery Over Unidirectional Transport) and MMT(MPEG Media Transport) of terrestrial UHD broadcasting. It also verifies that the service operates in an environment that meets the broadcast standard.

Implementation of A Multimedia Streaming System using MPEG-4 (MPEG-4 표준을 이용한 멀티미디어 스트리밍 시스템 구현)

  • 임동근;이정우;김선태;마평수;호요성
    • Journal of Broadcast Engineering
    • /
    • v.6 no.3
    • /
    • pp.215-224
    • /
    • 2001
  • In recent days, research activities on multimedia services mainly focus on the multiplexing system with timing synchromization for media components, such as video, audio and text. The MPEG-4 standard emphasizes object-based coding which includes analysis and understanding of the Image content. Since in MPEG-4 we can define objects and encode them independently, we can manipulate and display each object for different applications. This feature of MPEG-4 is also vero useful for multimedia services, such as video streaming cia different network channels, digital versatile disc, internet TV, video E-mail, and so on. In this Paper, we implement a multimedia streaming system which is compliant with the MPEG-4 system and the MP4 file format.

  • PDF

CNN-based Visual/Auditory Feature Fusion Method with Frame Selection for Classifying Video Events

  • Choe, Giseok;Lee, Seungbin;Nang, Jongho
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.3
    • /
    • pp.1689-1701
    • /
    • 2019
  • In recent years, personal videos have been shared online due to the popular uses of portable devices, such as smartphones and action cameras. A recent report predicted that 80% of the Internet traffic will be video content by the year 2021. Several studies have been conducted on the detection of main video events to manage a large scale of videos. These studies show fairly good performance in certain genres. However, the methods used in previous studies have difficulty in detecting events of personal video. This is because the characteristics and genres of personal videos vary widely. In a research, we found that adding a dataset with the right perspective in the study improved performance. It has also been shown that performance improves depending on how you extract keyframes from the video. we selected frame segments that can represent video considering the characteristics of this personal video. In each frame segment, object, location, food and audio features were extracted, and representative vectors were generated through a CNN-based recurrent model and a fusion module. The proposed method showed mAP 78.4% performance through experiments using LSVC data.

Exploring Practices of Interpretation and Communication in Art Museums (미술관의 해석과 소통의 모색)

  • Kim, Elm-Yeong
    • The Journal of Art Theory & Practice
    • /
    • no.2
    • /
    • pp.147-168
    • /
    • 2004
  • This study examined the role of interpretation with various practices in art museums to seek a new meaning and a concept of art museum today. The exploration of interpretation would he a starting point to discuss about on art museums with professionals in each art-related field. While museums recognize the concept of interpretation and the scope of the functions in different levels, the study focused on the practices of collecting and exhibiting that will entrust the museum new realms of activities toward the audience. In particular, its emphases are set force on the information on the collections via the museum's web sites, interpretation policies, and theories and methodologies in exhibition development. Art museum websites well reflect how museums utilize the new medium to enhance the understanding of art works by providing in-depth art historical information, comprehensive contexts, and subject/concept based search methods. In recent decades, these have enacted changes to expand dimensions of interpretive functions in most museums, particularly in the United States and others. In an administrative perspective, Tate Gallery Interpretation Policy became an good example how an art museum put its interpretation philosophy as the basis of interpreting collection and public programs. Tate established functions of intrepretation and education not only within a task-based team but also as an intrer-divisional coorperation to provide an interpretation scheme of information provisions such as guide brochure, audio tour, multimedia content, and library. New environment and trends of museum exhibition, and its development processes stem from communication theories, object interpretation philosophy, display strategies, and various evaluation techniques through audiences, with the communication theories of Shannon and Weaver, Berlo's SMCR(Source-Message-Channel-Receiver) models were perceived as to understand the mechanism to communicate museum exhibits to visitors Suzan vogel's insight into object display strategy helped to conceive the mechanism of object recontextualization. She emphasized that the museum's practice to construe opinions and impressions through object display should be discreet and critical, therefore, the professionals to plan the exhibition should reveal the intention and their practices. For a prevailing new methodology from the field, the interpretive exhibition development processes are articulated as the front-end, formative, and summative evaluation, futhermore the team process in industrial product management models was adapted. These have turned out to be more interactive with visitors and effective to communicate the exhibition concepts and messages, hence resulting in enriched museum experiences. Finally the study concluded that understanding the aspects of interpretation should help art museums to set a framework for current practices to expand its public dimension. It can provide curators with a critical view to website planning and its content. And obviously, the interpretive exhibition development methodology will lead museum exhibition developers to be skilled in its current approaches to thematic exhibition concerning diverse subjects and topics.

  • PDF

Content Based Video Retrieval by Example Considering Context (문맥을 고려한 예제 기반 동영상 검색 알고리즘)

  • 박주현;낭종호;김경수;하명환;정병희
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.30 no.12
    • /
    • pp.756-771
    • /
    • 2003
  • Digital Video Library System which manages a large amount of multimedia information requires efficient and effective retrieval methods. In this paper, we propose and implement a new video search and retrieval algorithm that compares the query video shot with the video shots in the archives in terms of foreground object, background image, audio, and its context. The foreground object is the region of the video image that has been changed in the successive frames of the shot, the background image is the remaining region of the video image, and the context is the relationship between the low-level features of the adjacent shots. Comparing these features is a result of reflecting the process of filming a moving picture, and it helps the user to submit a query focused on the desired features of the target video clips easily by adjusting their weights in the comparing process. Although the proposed search and retrieval algorithm could not totally reflect the high level semantics of the submitted query video, it tries to reflect the users' requirements as much as possible by considering the context of video clips and by adjusting its weight in the comparing process.

Image Enhancement Techniques for MPEG-4 (MPEG-4 영상의 화질 개선에 관한 연구)

  • 김태근;신정호;백준기
    • Journal of Broadcast Engineering
    • /
    • v.2 no.2
    • /
    • pp.169-181
    • /
    • 1997
  • In this paper, we propose and discuss about image enhancement techniques for MPEG-4. which represents very low bit-rate, content-based. and object-based hierarchical audio-visual coding standard. The proposed enhancement technique removes undesired artifacts arising in the compression procedure and increase resolution in both spatial and temporal domains. In order to remove undesired artifacts. we divide the MPEG-4 video algorithm in two parts: MPEG-2 like part and the new part. For removing artifacts caused by the first part. we adopt the conventional blocking artifacts algorithm developed for MPEG-2. On the other hand for removing artifacts caused by the second part. we provide a new degradation model. and propose the corresponding image restoration method. For increasing resolution of the MPEG-4 images, we propose a general framework of multichannel image interpolation process. which includes both spatial and temporal interpolations. As the MPEG-4 standard is under development. various sophisticated techniques are considered. but research on image enhancement techniques is relatively underestimated. By this reason. additional image enhancement techniques will become very important issue in realization phase of MPEG-4.

  • PDF