• Title/Summary/Keyword: object audio

A Digital Library Prototype for Access to Diverse Collections (다양한 장서 접근을 위한 디지털 도서관의 프로토타입 구축)

  • Choi Won-Tae
    • Journal of the Korean Society for Library and Information Science / v.32 no.2 / pp.295-307 / 1998
  • This article gives an overview of the digital library project and indicates what roles Korea's diverse digital collections may play. Our digital library prototype has a simple architecture consisting of digital repositories, filters, indexing and searching, and clients. Digital repositories include various types of materials and databases. The role of filters is to recognize the format of a document collection and mark the structural components of each of its documents. We use a database management system (ORACLE and ConText) that supports user-defined functions and access methods, which allows us to easily incorporate new object analysis, structuring, and indexing technology into a repository. Clients can be considered browsers or viewers designed for different document data types, such as image, audio, video, SGML, PDF, and KORMARC. The combination of navigational tools supports a variety of approaches to identifying collections and browsing or searching for individual items. The search interface was implemented using HTML forms and the World Wide Web's CGI mechanism.

Image Enhancement Techniques for MPEG-4 (MPEG-4 영상의 화질 개선에 관한 연구)

  • 김태근;신정호;백준기
    • Journal of Broadcast Engineering / v.2 no.2 / pp.169-181 / 1997
  • In this paper, we propose and discuss image enhancement techniques for MPEG-4, a very low bit-rate, content-based, and object-based hierarchical audio-visual coding standard. The proposed enhancement technique removes undesired artifacts arising in the compression procedure and increases resolution in both the spatial and temporal domains. To remove undesired artifacts, we divide the MPEG-4 video algorithm into two parts: the MPEG-2-like part and the new part. To remove artifacts caused by the first part, we adopt the conventional blocking-artifact algorithm developed for MPEG-2. To remove artifacts caused by the second part, we provide a new degradation model and propose the corresponding image restoration method. To increase the resolution of MPEG-4 images, we propose a general framework for multichannel image interpolation that includes both spatial and temporal interpolation. As the MPEG-4 standard is under development, various sophisticated techniques are being considered, but research on image enhancement techniques has received relatively little attention. For this reason, additional image enhancement techniques will become a very important issue in the realization phase of MPEG-4.

A Study on Multi-function Implementation using Single Sensor (단일 센서를 사용한 다기능 구현에 관한 연구)

  • Choi, Su-Yeol;Lee, Chang-Hee
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.16 no.4 / pp.133-137 / 2016
  • Video and audio information occupies a large portion of IoT information. Various sensors can be used for more accurate situation awareness and to compensate when the main information is absent. However, the resource management burden grows as more sensors are used. As a way to reduce the resources required for communication among the various sensors, this study explores whether the information from one sensor can take the place of another sensor. In this paper, the LIS302DL MEMS motion sensor is used to measure data for a ping-pong ball, a shuttlecock, and a tennis ball falling onto a table-tennis table. The data measured for the three objects were confirmed to be proportional to the amount of impact, showing that an accelerometer can detect changes in the amount of impact. The results show that implementing multiple functions with a single sensor is feasible. The study also takes into account that multi-function sensors are at an early stage of development.

Online Monitoring System based notifications on Mobile devices with Kinect V2 (키넥트와 모바일 장치 알림 기반 온라인 모니터링 시스템)

  • Niyonsaba, Eric;Jang, Jong-Wook
    • Journal of the Korea Institute of Information and Communication Engineering / v.20 no.6 / pp.1183-1188 / 2016
  • Kinect sensor version 2 is a camera released by Microsoft as a computer vision device and natural user interface for game consoles such as the Xbox One. It acquires color images, depth images, audio input, and skeletal data at a high frame rate. In this paper, we use depth images to build a surveillance system for a certain area within the Kinect's field of view. Using a computer vision library (Emgu CV), when an object is detected in the target area, it is tracked and the Kinect camera takes an RGB image and sends it to a database server. A mobile application on the Android platform was developed to notify the user that the Kinect has sensed unusual motion in the target region and to display the RGB image of the scene. The user receives the notification in real time and can react appropriately, for example when valuables are kept in the monitored area or in other cases involving a restricted zone.

MPEG-4 based XMT APIs for Scene Description (장면 기술을 위한 MPEG-4 기반 XMT API 구현)

  • 정예선;김규헌;기명석
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2001.11b / pp.91-94 / 2001
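  • Unlike conventional systems that treat a scene itself as a single component, the MPEG-4 system recognizes the encoded or decoded A/V objects (audio/visual objects) that make up the scene as individual units, and is characterized by composing (scene composition) and presenting scenes of diverse multimedia content. This object-based nature of the MPEG-4 system enables varied interactivity with users and makes content editing and reuse convenient, so it is expected to play an important role in producing next-generation digital broadcast content. The object-based A/V editing tool is an authoring/editing tool intended to ease the production of next-generation digital broadcast content based on MPEG-4, and it uses both the BIFS (Binary Format for Scene description) and XMT (eXtensible MPEG-4 Textual format) formats to represent scenes. Because BIFS represents the authored result in binary form, it is well suited to transmitting the result, but the result is hard to inspect partway through authoring, and interoperability and exchange with other existing applications are also difficult. In contrast, XMT is based on XML, which is attracting attention as the next-generation markup language, so authors can easily understand the authored result, and interoperability and exchange with other applications such as SMIL and X3D are also easy. XMT comes in two forms depending on the description method, XMT-A and XMT-O. The XMT-A format is built on X3D (Extensible 3D), which evolved from VRML, incorporates the features of the MPEG-4 system, and maps one-to-one to BIFS. XMT-O, on the other hand, is based on SMIL 2.0, which represents multimedia documents as web documents, and so was developed with a focus on the content author rather than on the features of the MPEG-4 system. Authoring content with XMT requires an API that can easily store and manipulate the authoring information entered through the user interface and output it as an XMT file. In this paper, we therefore define an API that conforms to the XMT grammar, based on DOM (Document Object Model), the standard interface used in XML, for storing and manipulating an intermediate XMT data representation, and we implement an API for generating XMT files. The API presented in this paper was applied to an object-based authoring/editing tool and used to produce diverse multimedia content.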

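The DOM-based store, manipulate, and write-out cycle that the abstract describes can be sketched roughly as follows. This is only an illustration using Python's standard `xml.dom.minidom`, not the authors' API: the function names are hypothetical, the `MediaObject` element and its attributes are invented for the example, and the output is not validated against the real XMT schema.

```python
from xml.dom.minidom import getDOMImplementation


def build_scene(objects):
    """Build a DOM tree for a toy scene; `objects` is a list of (name, url) pairs."""
    doc = getDOMImplementation().createDocument(None, "XMT-A", None)
    body = doc.createElement("Body")
    doc.documentElement.appendChild(body)
    scene = doc.createElement("Scene")
    body.appendChild(scene)
    for name, url in objects:
        node = doc.createElement("MediaObject")  # hypothetical element name
        node.setAttribute("DEF", name)
        node.setAttribute("url", url)
        scene.appendChild(node)
    return doc


def rename_object(doc, old, new):
    """Manipulate the intermediate DOM form: change one object's DEF name."""
    for node in doc.getElementsByTagName("MediaObject"):
        if node.getAttribute("DEF") == old:
            node.setAttribute("DEF", new)


def to_xmt_text(doc):
    """Serialize the DOM tree to text, as a file-writing API would."""
    return doc.documentElement.toxml()
```

Keeping the scene as a DOM tree until the last step means edits made in a user interface only touch nodes and attributes, and the textual file is produced once, at save time.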

A study on searching image by cluster indexing and sequential I/O (연속적 I/O와 클러스터 인덱싱 구조를 이용한 이미지 데이타 검색 연구)

  • Kim, Jin-Ok;Hwang, Dae-Joon
    • The KIPS Transactions:PartD / v.9D no.5 / pp.779-788 / 2002
  • Searching multimedia data such as images, video, and audio raises many technically difficult issues because the data are massive and more complex than simple text-based data. As a method of searching multimedia data, similarity retrieval has been studied: basic features of the multimedia data are extracted automatically and the search is performed over those features, because exact matching is not suited to the feature matrices of multimedia data. In this paper, data clustering and cluster indexing are proposed as a fast similarity-retrieval method for multimedia data. This approach clusters similar images on adjacent disk cylinders and then builds indexes to access the clusters. To minimize the search cost, hashing is used to index the clusters. In addition, to reduce I/O time, the proposed search takes just one I/O to look up the location of the cluster containing similar objects and one sequential file I/O to read in that cluster. The proposed scheme addresses the multi-dimensionality problem by using clustering together with indexing, and it achieves higher search efficiency than content-based image retrieval that uses only a clustering or only an indexing structure.
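The "one index lookup, one sequential read" access pattern described above can be illustrated with a small sketch. Everything specific here is an assumption made for the example and is not taken from the paper: features are fixed at two dimensions, clusters are formed by a simple grid quantization with a hypothetical cell width `CELL`, and an in-memory `io.BytesIO` stands in for the disk.

```python
import io
import struct

CELL = 0.25  # hypothetical grid width used to quantize feature vectors
RECORD = struct.Struct("<i2f")  # image id plus two float32 features


def cluster_key(features):
    """Quantize a feature vector to a grid cell; the cell tuple is the hash key."""
    return tuple(int(x // CELL) for x in features)


def build_store(images):
    """Write each cluster contiguously and return (storage, hash index)."""
    clusters = {}
    for image_id, features in images:
        clusters.setdefault(cluster_key(features), []).append((image_id, features))
    storage = io.BytesIO()
    index = {}  # cluster key -> (byte offset, record count)
    for key, members in clusters.items():
        index[key] = (storage.tell(), len(members))
        for image_id, features in members:
            storage.write(RECORD.pack(image_id, *features))
    return storage, index


def search(storage, index, query):
    """One hash lookup, then one sequential read of the matching cluster."""
    entry = index.get(cluster_key(query))
    if entry is None:
        return []
    offset, count = entry
    storage.seek(offset)
    block = storage.read(RECORD.size * count)  # single contiguous read
    hits = [RECORD.unpack_from(block, i * RECORD.size) for i in range(count)]
    # Rank the cluster members by squared Euclidean distance to the query.
    hits.sort(key=lambda r: sum((a - b) ** 2 for a, b in zip(r[1:], query)))
    return [r[0] for r in hits]
```

A grid cell is a crude stand-in for a real cluster: a near neighbour that falls just across a cell boundary is missed, which a proper clustering step would be expected to handle better.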

An XMT Authoring System supporting Multiple Presentation Environments (다양한 재생 환경을 지원하는 XMT 저작 시스템)

  • 김희선;임영순
    • Journal of KIISE:Computing Practices and Letters / v.10 no.3 / pp.251-258 / 2004
  • The XMT standard is the textual format of MPEG-4 Scene Description. It can be used to edit audio/video media for broadcasting and to develop user-oriented media content. This paper proposes an XMT authoring system that supports the exchange of content among various presentation environments. The XMT authoring system creates two levels of textual syntax and semantics: the XMT-$\alpha$ format and the XMT-$\Omega$ format. Because XMT-$\alpha$ and XMT-$\Omega$ describe an object in different ways, the authoring tool offers an interface for each of them. It also defines an internal data structure that can support both file formats, and offers functions that transform XMT-$\alpha$ into BIFS and transform XMT-$\Omega$ into SMIL or XMT-$\alpha$. It thereby offers interoperability among multimedia data in various environments, which is a characteristic of XMT.

A Study on Design and Implementation of Speech Recognition System Using ART2 Algorithm

  • Kim, Joeng Hoon;Kim, Dong Han;Jang, Won Il;Lee, Sang Bae
    • International Journal of Fuzzy Logic and Intelligent Systems / v.4 no.2 / pp.149-154 / 2004
  • In this research, we chose speech recognition as the method for controlling an electric wheelchair system by voice alone, and used DTW (Dynamic Time Warping), which is speaker-dependent and has a relatively high recognition rate among speech recognition methods. However, a real-time system requires a small memory footprint and fast processing. Thus, we introduced VQ (Vector Quantization), which is widely used as a compression algorithm in speaker-independent recognition, to secure fast recognition and small memory use. However, we found that the recognition rate decreased after introducing VQ. To improve the recognition rate, we applied the ART2 (Adaptive Resonance Theory 2) algorithm as a post-processing step and obtained an improvement of about 5% in the recognition rate. To use ART2, an error range has to be applied: the error range is applied when, among the distances obtained by DTW, the second distance minus the first distance is 20 or more. With ART2 applied in this way, we obtained fast processing and a high recognition rate. Moreover, since the system is a moving object, it should be implemented as an embedded system. We therefore selected the TMS320C32 chip, which can perform a large number of calculations relatively quickly, to implement the embedded system. Because the stored data is speech, we used 128 KB of RAM and 64 KB of ROM to hold the large amount of data. For speech input, we used a 16-bit stereo audio codec, which secures relatively accurate data through its high resolution.
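The DTW matching step and the distance-gap check mentioned in the abstract can be sketched as below. This is a minimal illustration rather than the authors' implementation: feature sequences are plain lists of floats, the function names are hypothetical, and the threshold check follows the abstract's wording literally (second-best distance minus best distance is 20 or more). The VQ and ART2 stages are not shown.

```python
from math import inf


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two 1-D sequences."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def rank_templates(utterance, templates):
    """Return (word, distance) pairs sorted by ascending DTW distance."""
    scored = [(word, dtw_distance(utterance, ref)) for word, ref in templates.items()]
    return sorted(scored, key=lambda pair: pair[1])


def gap_at_least(ranked, threshold=20.0):
    """True when second-best minus best distance is at least `threshold`."""
    return len(ranked) >= 2 and ranked[1][1] - ranked[0][1] >= threshold
```

In a real recognizer the sequences would be frames of spectral feature vectors rather than scalars, and the local cost `d` would be a vector distance; the dynamic-programming recurrence stays the same.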

Investigation of Indicator Kriging for Evaluating Proper Rock Mass Classification based on Electrical Resistivity and RMR Correlation Analysis (RMR과 전기비저항의 상관성 해석에 기초하여 지시크리깅을 적용한 최적 암반 분류 기법 고찰)

  • Lee, Kyung-Ju;Ha, Hee-Sang;Ko, Kwang-Buem;Kim, Ji-Soo
    • Tunnel and Underground Space / v.19 no.5 / pp.407-420 / 2009
  • In this study, a geostatistical technique using indicator kriging was applied to evaluate the optimal rock mass classification by integrating various sources of information, such as borehole data and geophysical data. To obtain the optimal kriging result, a suitable technique is needed to integrate the hard (borehole) and soft (geophysical) data effectively. In addition, the model parameters of the variogram must be determined beforehand. An iterative non-linear inversion method was implemented to determine the model parameters of the theoretical variogram. To verify the algorithm, the behaviour of the objective function and the precision of convergence were investigated, revealing that the gradient with respect to the range is extremely small. The algorithm was then applied to field data from a mountainous area where large-scale tunnel construction is planned. As soft data, resistivity information from an AMT survey was combined with RMR information from borehole data, which serves as hard data. Finally, RMR profiles were constructed and interpreted at the tunnel elevation and at the upper 1D level.
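For readers unfamiliar with the method, the quantities involved can be written out in their textbook form. The abstract does not say which variogram model or misfit function the authors used, so the spherical model and the least-squares objective below are assumptions chosen only for illustration; $z_k$ is a class threshold (for example an RMR cutoff), $c_0$ the nugget, $c$ the partial sill, and $a$ the range.

```latex
% Indicator transform of the variable z at location x for threshold z_k
i(\mathbf{x}; z_k) =
  \begin{cases}
    1, & z(\mathbf{x}) \le z_k \\
    0, & \text{otherwise}
  \end{cases}

% Experimental indicator semivariogram over the N(h) data pairs separated by lag h
\hat{\gamma}_I(h; z_k) = \frac{1}{2N(h)} \sum_{\alpha=1}^{N(h)}
  \left[ i(\mathbf{x}_\alpha; z_k) - i(\mathbf{x}_\alpha + h; z_k) \right]^2

% Spherical model (assumed here) with nugget c_0, partial sill c, range a
\gamma(h; c_0, c, a) =
  \begin{cases}
    c_0 + c \left[ \dfrac{3h}{2a} - \dfrac{1}{2} \left( \dfrac{h}{a} \right)^{3} \right], & 0 < h \le a \\
    c_0 + c, & h > a
  \end{cases}

% Least-squares objective (assumed here) minimized by the iterative inversion
\Phi(c_0, c, a) = \sum_{j} \left[ \hat{\gamma}_I(h_j; z_k) - \gamma(h_j; c_0, c, a) \right]^2
```

Under these assumptions, a very small $\partial \Phi / \partial a$ would mean the misfit changes little as the range varies, so the range is the parameter the inversion constrains least.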

Environmental Changes in Multimedia Services and the COSMOS Multimedia Operating System (멀티미디어 서비스의 환경변화 및 COSMOS 멀티미디어 운영체제)

  • 송동호;임영환
    • Information and Communications Magazine / v.11 no.6 / pp.37-54 / 1994
  • Technical innovation in multimedia data processing brings us new multimedia services. Multimedia services are classified into five groups: TVs, computers, telecommunications, peripherals, and software. This paper surveys the services from various aspects and discusses the computer area in particular detail. The major subsystems needed to provide the services, such as high-speed networks, operating systems, and intelligent-agent-based user interfaces, are discussed. Multimedia operating systems in particular are the most actively investigated research area, as the infrastructure that lets multimedia computer systems provide integrated multimedia services. The trends in new multimedia operating systems are therefore analyzed, and the COSMOS (Collaborative Object Sharing for Multimedia Operating System) multimedia group presentation is discussed. The characteristics, model, and abstract data structure of COSMOS are described. A performance analysis of a three-person conference system using audio, video, and a shared graphic editor on COSMOS shows that taking an integrated multimedia operating system approach leads to changes in the multimedia service environment.
