• 제목/요약/키워드: Video-audio media

검색결과 203건 처리시간 0.032초

Automatic Generation of Video Metadata for the Super-personalized Recommendation of Media

  • Yong, Sung Jung;Park, Hyo Gyeong;You, Yeon Hwi;Moon, Il-Young
    • Journal of information and communication convergence engineering
    • /
    • 제20권4호
    • /
    • pp.288-294
    • /
    • 2022
  • The media content market has been growing, as various types of content are being mass-produced owing to the recent proliferation of the Internet and digital media. In addition, platforms that provide personalized services for content consumption are emerging and competing with each other to recommend personalized content. Existing platforms use a method in which a user directly inputs video metadata. Consequently, significant amounts of time and cost are consumed in processing large amounts of data. In this study, keyframes and audio spectra based on the YCbCr color model of a movie trailer were extracted for the automatic generation of metadata. The extracted audio spectra and image keyframes were used as learning data for genre recognition in deep learning. Deep learning was implemented to determine genres among the video metadata, and suggestions for utilization were proposed. A system that can automatically generate metadata established through the results of this study will be helpful for studying recommendation systems for media super-personalization.

멀티미디어 통신을 위한 동기 프로토콜의 설계에 관한 연구 (A Study on the Design of Synchronization Protocol for Multimedia Communication)

  • 우희곤;김대영
    • 한국통신학회논문지
    • /
    • 제19권8호
    • /
    • pp.1612-1627
    • /
    • 1994
  • 기존 OSI 세션계층 동기기능은 문자 위주의 단일 미디어 동기만을 다루고 있기 때문에 audio, video, graphic 등의 멀티미디어 정보통신 서비스를 위해서는 새로운 동기 방식과 프로토콜이 필요하다. 본 논문은 이러한 멀티미디어 동기 서비스를 위해서 개념적인 동기층 환경을 설정하고 이 계층에서 사용하는 '멀티채널, 기준미디어 동기' 기법의 동기층 프리미티브와 동기층 프로토콜을 설계, 제안 하였다. 본 멀티미디어 동기층(MS layer)은 상대편 동기층과의 연결을 설정한 후 미디어별로 별도의 채널을 관리하여 미디어별 특성을 효과적으로 이용할 수 있게 하고, 미디어 프레임 번호를 time stamp처럼 이용함으로써 특정 동기점의 삽입 없이도 손쉽게 동기점을 찾아내고 동기 서비스를 제공한다.

  • PDF

Implementation of Audio Equalization in Video-on-Demand Broadcast Content

  • Kwon, Myung-Kyu
    • 한국컴퓨터정보학회논문지
    • /
    • 제22권10호
    • /
    • pp.63-71
    • /
    • 2017
  • In this paper, we develop the system for audio volume equalization of video on demand(VoD) content and propose the solution for it. In recent years, there has been a steady increase in the number of VoD users in addition to linear channels. However, viewers ought to sit in an uncomfortable way, adjusting the volume intermittently while they are broadcasted. Sudden changes of volume occur between the broadcasting channels, the programs from the co-channel, or the linear channels and the VoDs. Especially, upsurged dissatisfaction from the televiewers has been found due to the unequalized volume when shifting between the linear channel and the VoD. In order to solve this problem, multilateral efforts were put forth, such as a system for keeping the volume at a certain level in digital broadcasting program has been legislated domestically. It leads success in equalizing linear channel volume. On contrary, too little notice has been taken for distorted volume problem of video on demand(VoD) content. In this paper, we developed and applied the volume equalization system into VoD content to achieve uniformization, a similar condition with linear channel(-24LKFS). This suggestion helped uneven current of volume which was in the stage -16 ~ -20LKFS to stable condition by lowering into the stage of -24LKFS. It also brought 20% increase in perspective of volume quality satisfaction level.

Social Media Fake News in India

  • Al-Zaman, Md. Sayeed
    • Asian Journal for Public Opinion Research
    • /
    • 제9권1호
    • /
    • pp.25-47
    • /
    • 2021
  • This study analyzes 419 fake news items published in India, a fake-news-prone country, to identify the major themes, content types, and sources of social media fake news. The results show that fake news shared on social media has six major themes: health, religion, politics, crime, entertainment, and miscellaneous; eight types of content: text, photo, audio, and video, text & photo, text & video, photo & video, and text & photo & video; and two main sources: online sources and the mainstream media. Health-related fake news is more common only during a health crisis, whereas fake news related to religion and politics seems more prevalent, emerging from online media. Text & photo and text & video have three-fourths of the total share of fake news, and most of them are from online media: online media is the main source of fake news on social media as well. On the other hand, mainstream media mostly produces political fake news. This study, presenting some novel findings that may help researchers to understand and policymakers to control fake news on social media, invites more academic investigations of religious and political fake news in India. Two important limitations of this study are related to the data source and data collection period, which may have an impact on the results.

화면해설방송 저작을 위한 비 대사 구간 검출 (Non-Dialog Section Detection for the Descriptive Video Service Contents Authoring)

  • 장인선;안충현;장윤선
    • 방송공학회논문지
    • /
    • 제19권3호
    • /
    • pp.296-306
    • /
    • 2014
  • 본 논문에서는 방송 오디오에서로부터 화면해설 삽입을 위한 비 대사 구간 검출 방법을 제시한다. 방송 오디오에서의 대사와 비 대사 구간을 분류하기 위해서는 대사와 배경 음악 등 다양한 종류의 소리가 혼합되어 있는 스테레오 신호로부터 음성 활성 여부의 검출이 우선되어야 한다. 본 논문에서는 방송 오디오 제작과정을 파악함으로써 신호의 채널 특성 분석 결과를 대사 음성 활성 여부 검출에 적용한다. 본 논문에서 제안하는 비 대사 구간 검출 방법은 방송 오디오의 센터채널과 서라운드 성분 간의 에너지 비율을 추가적인 오디오 특징으로 이용하여 센터채널의 음성 활성도와의 결합을 통해 성능 향상을 이루어 낸다. 또한, 실제 화면해설 방송물의 분석을 통해 생성한 규칙 기반의 후처리를 통해 화면해설 삽입이 가능한 비 대사 구간을 검출한다. 이를 실제 방송 컨텐츠를 대상으로 한 실험을 통하여 검증한다.

지상파 DMB 컨텐츠의 MPEG-4 BIFS 최적화 기법 (MPEG-4 BIFS Optimization for Interactive T-DMB Content)

  • 차경애
    • 한국산업정보학회논문지
    • /
    • 제12권1호
    • /
    • pp.54-60
    • /
    • 2007
  • The Digital Multimedia Broadcasting(DMB) system is developed to offer high quality multimedia content to the mobile environment. The system adopts the MPEG-4 standard for the main video, audio and other media format. For providing interactive contents, it also adopts the MPEG-4 scene description that refers to the spatio-temporal specifications and behaviors of individual objects. With more interactive contents, the scene description also needs higher bitrate. However, the bandwidth for allocating meta data, such as scene description is restrictive in the mobile environment. On one hand, the DMB terminal renders each media stream according to the scene description. Thus the binary format for scene(BIFS) stream corresponding to the scene description should be decoded and parsed in advance when presenting media data. With this reasoning, the transmission delay of the BIFS stream would cause the delay in transmitting whole audio-visual scene presentations, although the audio or video streams are encoded in very low bitrate. This paper presents the effective optimization technique in adapting the BIFS stream into the expected bitrate without any waste in bandwidth and avoiding transmission delays inthe initial scene description for interactive DMB content.

  • PDF

Multimodal Approach for Summarizing and Indexing News Video

  • Kim, Jae-Gon;Chang, Hyun-Sung;Kim, Young-Tae;Kang, Kyeong-Ok;Kim, Mun-Churl;Kim, Jin-Woong;Kim, Hyung-Myung
    • ETRI Journal
    • /
    • 제24권1호
    • /
    • pp.1-11
    • /
    • 2002
  • A video summary abstracts the gist from an entire video and also enables efficient access to the desired content. In this paper, we propose a novel method for summarizing news video based on multimodal analysis of the content. The proposed method exploits the closed caption data to locate semantically meaningful highlights in a news video and speech signals in an audio stream to align the closed caption data with the video in a time-line. Then, the detected highlights are described using MPEG-7 Summarization Description Scheme, which allows efficient browsing of the content through such functionalities as multi-level abstracts and navigation guidance. Multimodal search and retrieval are also within the proposed framework. By indexing synchronized closed caption data, the video clips are searchable by inputting a text query. Intensive experiments with prototypical systems are presented to demonstrate the validity and reliability of the proposed method in real applications.

  • PDF

다중모드 특징을 사용한 뉴스 동영상의 앵커 장면 검출 기법 (Multi-modal Detection of Anchor Shot in News Video)

  • 유성열;강동욱;김기두;정경훈
    • 방송공학회논문지
    • /
    • 제12권4호
    • /
    • pp.311-320
    • /
    • 2007
  • 본 논문에서는 뉴스 동영상 정보의 생성을 위해 뉴스 단위의 기준이 되는 앵커 장면을 효과적으로 검출하는 기법을 제안한다. 우선 뉴스 동영상의 오디오 및 비디오 구성 요소에 대한 관찰을 통하여 앵커 장면 검출에 적합한 기본적인 특징들을 선택하였다. 제안 알고리듬에서는 색인의 정확도를 높이기 위해 몇몇 오디오 특징과 함께 비디오 특징으로서 움직임 특징을 함께 이용하였으며, 전체적인 구조는 '오디오 정지 구간 검출', '오디오 클러스터 분류', 그리고 '움직임 활동도와의 매칭'의 3단계로 구성된다. MPEG-2 방식으로 부호화된 뉴스 동영상에 대한 실험을 통해 제안 알고리듬의 성능이 만족스러움을 확인하였다.

체감형 미디어 서비스를 위한 공간음향 기술 동향 (Spatial Audio Technologies for Immersive Media Services)

  • 이용주;유재현;장대영;이미숙;이태진
    • 전자통신동향분석
    • /
    • 제34권3호
    • /
    • pp.13-22
    • /
    • 2019
  • Although virtual reality technology may not be deemed as having a satisfactory quality for all users, it tends to incite interest because of the expectation that the technology can allow one to experience something that they may never experience in real life. The most important aspect of this indirect experience is the provision of immersive 3D audio and video, which interacts naturally with every action of the user. The immersive audio faithfully reproduces an acoustic scene in a space corresponding to the position and movement of the listener, and this technology is also called spatial audio. In this paper, we briefly introduce the trend of spatial audio technology in view of acquisition, analysis, reproduction, and the concept of MPEG-I audio standard technology, which is being promoted for spatial audio services.

An Implementation of Highly Integrated Signal Processing IC for HDTV

  • Hahm Cheul-Hee;Park Kon-Kyu;Kim Hyoung-Gil;Jung Choon-Sik;Lee Sang-keun;Jang Jae-Young;Park Sung-Uk;Chon Byung-Hoan;Chun Kang-Wook;Jo Jae-Moon;Song Dong-il
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2003년도 정기총회 및 학술대회
    • /
    • pp.69-72
    • /
    • 2003
  • This paper presents a signal processing IC for digital HDTV, which is designed to operate in bunt-in HDW or in HD-set-top Box. The chip supports de-multiplexing an ISO/IEC 13818-1 MPEG-2 TS stream. It decodes MPEG-2 MP@HL video bitstream, and provides high-quality scaled video for display on HDTV monitor. The chip consists of ARM7TDMI for TS-Demux, PCI interface, Audio interface, MPEG2 MP@HL video decoder Display processor, Graphic processor, Memory controller, Audio int3face, Smart Card interface and UART. It is fabricated using Sam sung's 0.18-um and the package of 492-pin BGA is used.

  • PDF