• Title/Summary/Keyword: Object-based Audio

Search Result 63, Processing Time 0.033 seconds

Method of extracting context from media data by using video sharing site

  • Kondoh, Satoshi;Ogawa, Takeshi
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2009.01a / pp.709-713 / 2009
  • Recently, much research in the fields of life-logging and sensor networks has applied data acquired from devices such as cameras and RFID tags to context-aware services. Because video and audio data contain far more information than other sensor data, a variety of analytical techniques have been proposed to recognize information in the raw data. However, these techniques generally rely on supervised learning, so updating a class or adding a new one has required manually reviewing a huge amount of media data to create supervised data. As a result, applications could in most cases use only recognition functions based on fixed supervised data. We therefore propose a method of acquiring supervised data from a video sharing site where users comment on individual video scenes; such sites are remarkably popular and generate many comments. In the first step of the method, words with a high utility value are extracted by filtering the comments on a video. Second, a time series of feature data is calculated by applying various feature-extraction functions to the media data. Finally, the learning system calculates correlation coefficients between these two kinds of data and stores them in the system's database. By applying these correlation coefficients to new media data, other applications can incorporate a recognition function that draws on the collective intelligence of Web comments. In addition, regularly acquiring and learning both media data and comments from a video sharing site enables flexible recognition that adapts to new objects while reducing manual work. As a result, the system can recognize not only the name of an object seen in the video but also indirect information, such as the impression of or action toward the object.

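The correlation step described in the abstract above can be illustrated with a minimal sketch. It assumes that comment words have already been counted per time bin and that one value per bin has been extracted for each media feature. The abstract does not say which correlation coefficient is used, so Pearson's coefficient is an assumption here, and the function name `pearson`, the word counts, and the feature series are all hypothetical.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("series must have equal length >= 2")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0  # a constant series carries no correlation information
    return cov / sqrt(vx * vy)

# Hypothetical data: per-time-bin counts of one comment word, and two
# feature series extracted from the same video over the same bins.
word_counts = {"goal": [0, 1, 5, 4, 0, 0]}
features = {
    "audio_energy": [0.1, 0.3, 0.9, 0.8, 0.2, 0.1],
    "brightness": [0.5, 0.5, 0.4, 0.5, 0.6, 0.5],
}

# One coefficient per (word, feature) pair; a dict stands in for the DB.
correlation_db = {
    (word, name): pearson(counts, series)
    for word, counts in word_counts.items()
    for name, series in features.items()
}
```

In this toy data the word "goal" correlates strongly with audio energy and not with brightness, which is the kind of word-to-feature association the stored coefficients would let a recognizer apply to new media.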

The development of the WEB-Based Virtual Reality for the Treatment of the Alcoholism (알코올중독자 치료를 위한 WEB 기반 가상현실 제작)

  • Paek, Seung-Eun;Beack, Seung-Hwa;Ryu, Jong-Hyun;Kim, Dong-Wan
    • Proceedings of the KIEE Conference / 2004.07d / pp.2690-2692 / 2004
  • Medication and cognitive-behavioral methods have been the main treatments for alcoholism. Lately, virtual reality technology has been applied to this kind of alcohol-related disorder. A virtual environment helps the patient develop the ability to overcome the urge to drink. In this study, we built the environment from panorama images and 3D object modules using 3D Studio MAX, VRML, and Java applets. We also propose a BAR stimulator composed of a position sensor, a head-mounted display, and an audio system. To illustrate the physiological difference in a person with alcoholism with and without a liquor bottle, heart rate (HR) was measured during the experiment and again after the virtual reality training. A clinical experiment demonstrated the subjective effectiveness of virtual reality psychotherapy.


Object-based Audio Player using a User Information (사용자 정보를 반영한 객체 기반 오디오 재생 기술)

  • Moon, Jae-Won;Jung, Jong-Jin;Kim, Kyung-Won;Lim, Tae-Beom;Lee, Seok-Pil
    • Proceedings of the Korean Information Science Society Conference / 2010.06b / pp.197-200 / 2010
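  • Driven by advances in signal processing technology and improvements in the transmission environment, multimedia services are evolving beyond their traditional role of delivering information into customized services that reflect users' diverse needs and playback environments. In this paper, we propose an active playback platform in which changes in the user's listening environment, preferences, and emotions are transmitted through input devices on the network, and object-based audio processed on the basis of this information is output through multichannel speakers. Because each listener's emotion and environment information, along with other data relevant to audio processing, is stored in a database in real time, the same audio source can be played back on a single platform with a variety of listening experiences.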


Comparisons between Distributed Connections and Centralized Connections of Multimedia Streams for Computer-based Audio-Video Teleconferences (컴퓨터 영상회의 시스템을 위한 분산형과 집중형 스트림 연결 구조 비교)

  • Lee, Gyeong-Hui;Kim, Du-Hyeon;Im, Heon-Gyu;Im, Yeong-Hwan
    • The Transactions of the Korea Information Processing Society / v.3 no.3 / pp.591-607 / 1996
  • To support various multimedia applications, the MuX server provides object-oriented and consistent interfaces for creating, copying, splitting, mixing, and interleaving streams. In this paper, we describe distributed and centralized connection structures that can be used to build a teleconferencing system from the basic objects of MuX, and compare the merits and demerits of each structure in terms of multimedia-related performance such as delay and synchronization.


Classification of General Sound with Non-negativity Constraints (비음수 제약을 통한 일반 소리 분류)

  • 조용춘;최승진;방승양
    • Journal of KIISE:Software and Applications / v.31 no.10 / pp.1412-1417 / 2004
  • Sparse coding or independent component analysis (ICA), which is a holistic representation, has been successfully applied to elucidate early auditory processing and to the task of sound classification. In contrast, parts-based representation is an alternative way of understanding object recognition in the brain. In this paper, we employ non-negative matrix factorization (NMF), which learns a parts-based representation, in the task of sound classification. We explain methods of extracting features from spectro-temporal sounds using NMF in the absence or presence of noise. Experimental results show that NMF-based features improve sound classification performance over ICA-based features.
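As a rough illustration of how NMF learns a parts-based representation, the sketch below factors a small non-negative matrix V into a basis W and activations H. The abstract does not state which NMF algorithm the authors use; the multiplicative update rules for squared error (commonly attributed to Lee and Seung) are assumed here, and the toy "spectrogram", the rank, and all function names are hypothetical.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def transpose(a):
    return [list(col) for col in zip(*a)]

def nmf(v, rank, iterations=200, seed=0, eps=1e-9):
    """Factor non-negative V (n x m) into W (n x rank) and H (rank x m)."""
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    # Strictly positive random start so the updates never divide by zero.
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    for _ in range(iterations):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(rank)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(rank)]
             for i in range(n)]
    return w, h

def frobenius_error(v, w, h):
    """Frobenius norm of V - WH."""
    approx = matmul(w, h)
    return sum((a - b) ** 2
               for ra, rb in zip(v, approx) for a, b in zip(ra, rb)) ** 0.5

# Hypothetical 4-bin x 4-frame magnitude spectrogram built from two "parts".
spectrogram = [
    [1.0, 0.0, 1.0, 2.0],
    [2.0, 0.0, 2.0, 4.0],
    [0.0, 3.0, 3.0, 0.0],
    [0.0, 1.0, 1.0, 0.0],
]
basis, activations = nmf(spectrogram, rank=2)
```

Because the updates only multiply by non-negative ratios, W and H stay non-negative throughout; the rows of `activations` (or projections onto `basis`) could then serve as features for a classifier.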

A Study on Unmanned Image Tracking System based on Smart Phone (스마트폰 기반의 무인 영상 추적 시스템 연구)

  • Ahn, Byeong-tae
    • Journal of Convergence for Information Technology / v.9 no.3 / pp.30-35 / 2019
  • Unattended recording systems based on smartphone image tracking are developing rapidly. Among existing products, systems that automatically track the subject and rotate toward it using an infrared signal are very expensive for general users. Therefore, this paper proposes a mobile unattended recording system that enables automatic recording by anyone who uses a smartphone. The system consists of a commercial mobile camera, a servomotor that moves the camera from side to side, a microcontroller that controls the motor, and a commercial wireless Bluetooth earset for the video's audio input. In this paper, we designed a system that enables unattended recording through image tracking on a smartphone.

Hierarchical QoS Architecture for Virtual Dancing Environment (분산 가상현실을 위한 계층적 QoS 지원 기법)

  • 김진용;원유집;김범은;박종일;박용진
    • Journal of KIISE:Computer Systems and Theory / v.30 no.11 / pp.675-690 / 2003
  • In this paper, we present a virtual dancing studio for a distributed virtual environment. In this system, geographically distributed users share a virtual dancing hall and interact with each other. A participating object can be a graphical avatar or a live video stream, so graphic objects and real images coexist in the shared virtual space. One of the main technical challenges in developing a distributed virtual environment is handling excessive network traffic. To reduce network traffic effectively, we propose a scheme that adjusts the QoS of each object according to its distance from the observer in the virtual space. The server maintains a QoS vector for each client's shared space and controls the packet traffic to individual clients based on these QoS vectors. We develop a prototype virtual dancing environment. Java-based development makes the client platform independent. Experimental results show that adopting hierarchical QoS management significantly reduces overall network traffic.
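The distance-based QoS idea in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the three tiers, their distance thresholds and frame rates, and the names `QoSLevel`, `qos_for`, and `qos_vector` are hypothetical, since the abstract does not give the actual levels or the form of the QoS vector.

```python
from dataclasses import dataclass
from math import dist

@dataclass(frozen=True)
class QoSLevel:
    name: str
    max_distance: float  # upper bound of the distance band (virtual units)
    frame_rate: int      # updates per second sent to the client

# Hypothetical tiers, nearest first; the paper's actual levels are not given.
LEVELS = (
    QoSLevel("high", 10.0, 30),
    QoSLevel("medium", 30.0, 10),
    QoSLevel("low", float("inf"), 2),
)

def qos_for(observer, obj):
    """Pick the first tier whose distance band contains the object."""
    d = dist(observer, obj)
    for level in LEVELS:
        if d <= level.max_distance:
            return level
    return LEVELS[-1]

def qos_vector(observer, objects):
    """Map each object id to the QoS level seen from one observer."""
    return {obj_id: qos_for(observer, pos) for obj_id, pos in objects.items()}

# Hypothetical scene: two avatars and one live video stream.
objects = {
    "avatar_a": (3.0, 4.0),
    "video_b": (20.0, 0.0),
    "avatar_c": (100.0, 100.0),
}
vector = qos_vector((0.0, 0.0), objects)
```

A server could keep one such vector per client and throttle each object's packets to the listed rate, so distant objects cost less bandwidth than nearby ones.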

Broadband Content Insertion Technology based on Terrestrial UHD Broadcasting MMT/ROUTE (지상파 UHD 방송 MMT/ROUTE기반 브로드밴드 콘텐츠 삽입 기술)

  • Kim, Doohwan;Lee, Dongkwan;Kim, Kyuheon
    • Journal of Broadcast Engineering / v.24 no.2 / pp.329-340 / 2019
  • Recently, broadcasting technologies have evolved as high-quality AV services such as domestic terrestrial UHD (Ultra-High Definition) broadcasting have expanded, and new broadcasting standards have been defined. In addition, as network technology develops, content is consumed not only domestically but also worldwide. Accordingly, content insertion technology, which provides content suited to national and local environments, will be needed. This paper proposes a content insertion service system model and a synchronization scheme that use the ATSC (Advanced Television Systems Committee) 3.0 Event Signaling standard in a heterogeneous environment combining broadcast and internet networks, based on the terrestrial UHD broadcasting transport standards DASH (Dynamic Adaptive Streaming over HTTP)/ROUTE (Real-time Object delivery over Unidirectional Transport) and MMT (MPEG Media Transport). It also verifies that the service operates in an environment that conforms to the broadcast standard.

CNN-based Visual/Auditory Feature Fusion Method with Frame Selection for Classifying Video Events

  • Choe, Giseok;Lee, Seungbin;Nang, Jongho
    • KSII Transactions on Internet and Information Systems (TIIS) / v.13 no.3 / pp.1689-1701 / 2019
  • In recent years, personal videos have been widely shared online owing to the popularity of portable devices such as smartphones and action cameras. A recent report predicted that 80% of Internet traffic will be video content by the year 2021. Several studies have addressed the detection of main video events in order to manage large collections of videos. These studies show fairly good performance in certain genres. However, the methods used in previous studies have difficulty detecting events in personal videos, because the characteristics and genres of personal videos vary widely. In our research, we found that adding a dataset with an appropriate perspective improved performance. Performance also depends on how keyframes are extracted from the video. We therefore selected frame segments that represent a video, taking the characteristics of personal videos into account. From each frame segment, object, location, food, and audio features were extracted, and representative vectors were generated through a CNN-based recurrent model and a fusion module. The proposed method achieved a mAP of 78.4% in experiments on the LSVC data.

Similar sub-Trajectory Retrieval Technique based on Grid for Video Data (비디오 데이타를 위한 그리드 기반의 유사 부분 궤적 검색 기법)

  • Lee, Ki-Young;Lim, Myung-Jae;Kim, Kyu-Ho;Kim, Joung-Joon
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.9 no.5 / pp.183-189 / 2009
  • Recently, with the spread of PCS, PDA, and other mobile devices, the use of GPS (Global Positioning System), and the rapid development of wireless networks, even ordinary users are making increasing use of multimedia data such as images, audio, and video. In particular, video data, unlike text or image data, contains information about moving objects and has spatio-temporal attributes that change over time. The continuous path traced as the spatial location of a moving object changes over time is called a trajectory, and finding data trajectories in a database that are similar to a query trajectory given by the user is called similar sub-trajectory retrieval. Such retrieval must support approximate matching, which finds trajectories similar to the user's query within a given tolerance. In addition, unlike existing research, a time-efficient search method is required that quickly finds only the desired data in a large multimedia database. To this end, this paper proposes a new grid-based retrieval technique that partitions the trajectories of moving objects into a grid to support similar sub-trajectory search effectively.

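To make the idea of grid-based approximate matching in the last abstract concrete, here is a naive sketch. It is not the paper's technique: it assumes trajectories are lists of (x, y) points, maps them to grid cells, and slides the query over the data with a per-cell tolerance. The function names, the Chebyshev cell distance, and the sample coordinates are all hypothetical.

```python
def to_cells(trajectory, cell_size):
    """Map (x, y) points to grid cells, collapsing consecutive repeats."""
    cells = []
    for x, y in trajectory:
        cell = (int(x // cell_size), int(y // cell_size))
        if not cells or cells[-1] != cell:
            cells.append(cell)
    return cells

def cell_distance(a, b):
    """Chebyshev distance between two grid cells."""
    return max(abs(a[0] - b[0]), abs(a[1] - b[1]))

def find_similar_subtrajectories(data, query, cell_size, tolerance):
    """Return start indices in the data cell sequence where every query cell
    lies within `tolerance` cells of the aligned data cell."""
    d, q = to_cells(data, cell_size), to_cells(query, cell_size)
    if not q:
        return []
    hits = []
    for start in range(len(d) - len(q) + 1):
        window = d[start:start + len(q)]
        if all(cell_distance(a, b) <= tolerance for a, b in zip(window, q)):
            hits.append(start)
    return hits

# Hypothetical trajectories on a grid with unit-sized cells.
data = [(0.5, 0.5), (1.5, 0.5), (2.5, 1.5), (3.5, 2.5), (4.5, 2.5)]
query = [(2.2, 1.8), (3.1, 2.9)]

exact_hits = find_similar_subtrajectories(data, query, cell_size=1.0, tolerance=0)
loose_hits = find_similar_subtrajectories(data, query, cell_size=1.0, tolerance=1)
```

With zero tolerance only the exactly overlapping cell window matches; raising the tolerance admits neighboring windows, which is the trade-off an approximate-matching search has to control.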