Search | Korea Science

Segmentation and Classification Using Audio and Image Information (오디오와 영상 정보를 이용한 비디오 세그먼테이션 및 크래시피케이션)

Jung, Hae-Jun;Jung, Sung-Hwan
- Proceedings of the Korea Information Processing Society Conference
- /
- 2000.10b
- /
- pp.891-894
- /
- 2000
본 논문에서는 효과적인 내용기반 비디오 검색을 위한 샷 경계 검출, 장면 경계 검출, 그리고 비디오 크래시피케이션 방법을 연구하였다. 먼저, 샷 경계 검출을 위해 칼라 히스토그램과 DCT 변환 계수를 통합하여 사용했다. 그리고 장면 경계 검출을 위해서는 영상 정보뿐만 아니라 오디오 정보를 함께 사용하여 장면 경계를 검출하였다. 또한 비디오 크래시피케이션에서는 장면 경계검출시 추출한 오디오 정보를 이용해 비디오를 내용별로 분류하는 연구를 제안하였다. 뉴스, 광고, 스포츠 등 다양한 3개 분야의 TV 프로그램으로 구성된 약 8,500개 영상 프레임과 약 50,000개의 오디오 프레임을 가진 실험 비디오 데이터베이스를 구성하여 제안된 시스템을 실험하였다. 실험한 결과, 약 88%의 정확도(Precision)를 가지는 장면 경계 검출과 약 85%의 평균 분류율을 보였다.
PDF

Performance Analysis of the Time-series Pattern Index File for Content-based Music Genre Retrieval (내용기반 음악장르 검색에서 시계열 패턴 인덱스 화일의 성능 분석)

Kim, Young-In;Kim, Seon-Jong
- Journal of Korea Society of Industrial Information Systems
- /
- v.11 no.5
- /
- pp.18-27
- /
- 2006
Rapid increase of the amount of music data demands for a new method that allows efficient similarity retrieval of music genre using audio features in music databases. To build this similarity retrieval, an indexing techniques that support audio features as a time-series pattern and data mining technologies are needed. In this paper, we address the development of a system that retrieves similar genre music based on the indexing techniques. We first propose the structure of content-based music genre retrieval system based on the time-series pattern index file and data mining technologies. In addition, we implement the time-series pattern index file using audio features and present performance analysis of the time-series pattern index file for similar genre retrieval. The experiments are performed on real data to verify the performance of the proposed method.
PDF

Extension of SHORE storage system for multimedia applications (멀티미디어 응용을 위한 SHORE 하부저장 시스템의 확장)

정재욱;장재욱
- Proceedings of the Korean Information Science Society Conference
- /
- 1999.10a
- /
- pp.6-8
- /
- 1999
컴퓨터 통신 기술의 급속한 발달로 인해 정지영상, 오디오, 비디오와 같은 다양한 미디어로 구성된 대용량의 멀티미디어 자료를 효율적으로 저장하고 관리할 수 있는 하부 저장 시스템이 필요하다. 이러한 멀티미디어 자료에 대한 내용-기반 검색을 위해 텍스트 기반 검색과 색상 또는 질감과 같은 특징 벡터에 기반한 검색이 이루어져야 한다. 본 논문에서는 멀티미디어 응용을 위한 하부저장 시스템을 구현하기 위해 미국 위스콘신 대학에서 개발한 지속성 객체 시스템인 SHORE를 확장하고자 한다. 텍스트 기반 검색을 위해 역화일 구조를 구현하였으며, 고차원의 특징 벡터의 검색을 위해 X-트리를 통합하였다.
PDF

Search speed improved minimum audio fingerprinting using the difference of Gaussian (가우시안의 차를 이용하여 검색속도를 향상한 최소 오디오 핑거프린팅)

Kwon, Jin-Man;Ko, Il-Ju;Jang, Dae-Sik
- Journal of the Korea Society of Computer and Information
- /
- v.14 no.12
- /
- pp.75-87
- /
- 2009
This paper, which is about the method of creating the audio fingerprint and comparing with the audio data, presents how to distinguish music using the characteristics of audio data. It is a process of applying the Difference of Gaussian (DoG: generally used for recognizing images) to the audio data, and to extract the music that changes radically, and to define the location of fingerprint. This fingerprint is made insensitive to the changes of sound, and is possible to extract the same location of original fingerprint with just a portion of music data. By reducing the data and calculation of fingerprint, this system indicates more efficiency than the pre-system which uses pre-frequency domain. Adopting this, it is possible to indicate the copyrighted music distributed in internet, or meta information of music to users.
https://doi.org/10.9708/jksci.2009.14.12.075 인용 PDF

Audio Fingerprinting Based Spatial Audio Reproduction System (오디오 핑거프린팅기반 입체음향 재현 시스템)

Ryu, Sang Hyeon;Kim, Hyoung-Gook
- Journal of the Institute of Electronics and Information Engineers
- /
- v.50 no.12
- /
- pp.217-223
- /
- 2013
This paper proposes a spatial audio reproduction system based on audio fingerprinting that combines the audio fingerprinting and the spatial audio processing. In the proposed system, a salient audio peak pair fingerprint based on modulation spectrum improves the accuracy of the audio fingerprinting system in real noisy environments and spatial audio information as metadata gives a listener a sensation of being listening to the sound in the space, where the sound is actually recorded.
https://doi.org/10.5573/ieek.2013.50.12.217 인용 PDF KSCI

Scope and Status of Audio Visual Interactive Services Standardization (상호대화형 오디오비주얼 서비스의 표준화 현황과 전망)

Hyun, D.W.;Lee, B.H.
- Electronics and Telecommunications Trends
- /
- v.9 no.3
- /
- pp.97-102
- /
- 1994
상호대화형 오디오비주얼 서비스는 텍스트, 도형, 사진, 오디오, 비디오 등과 같은 다양한 형태의 표현 요소로 구성되는 입출력 정보를 사용자의 단말이나 워크스테이션에 제공하는 서비스이다. 이러한 기능의 범위는 간단한 검색에서부터 상호대화적인 문의, 구성요소들의 재배치, 그들 요소들의 수정등의 서비스를 사용자에게 제공 할 수 있다. 이와 관련하여 ITU-T SG8/Q.11에서는 AVI 서비스를 위해 요구되는, 시스템, 데이터 교환형식, 그리고 프로토콜과 같은 일련의 기술적 사항을 표준화하는 작업을 하고 있다. 본고에서는 AVI 서비스의 기술적인 사항에 대하여 논하고, 현재 진행되고 있는 표준화 동향에 대하여 알아본다.
https://doi.org/10.22648/ETRI.1994.J.090307 인용 PDF

Audio fingerprint matching based on a power weight (파워 가중치를 이용한 오디오 핑거프린트 정합)

Seo, Jin Soo;Kim, Junghyun;Kim, Hyemi
- The Journal of the Acoustical Society of Korea
- /
- v.38 no.6
- /
- pp.716-723
- /
- 2019
Fingerprint matching accuracy is essential in deploying a music search service. This paper deals with a method to improve fingerprint matching accuracy by utilizing an auxiliary information which is called power weight. Power weight is an expected robustness of each hash bit. While the previous power mask binarizes the expected robustness into strong and weak bits, the proposed method utilizes a real-valued function of the expected robustness as weights for fingerprint matching. As a countermeasure to the increased storage cost, we propose a compression method for the power weight which has strong temporal correlation. Experiments on the publicly-available music datasets confirmed that the proposed power weight is effective in improving fingerprint matching performance.
https://doi.org/10.7776/ASK.2019.38.6.716 인용 PDF KSCI

XML Based Multimedia Retrieval System supporting Scene Search (장면 검색을 지원하는 XML 기반 멀티미디어 검색 시스템)

Joung, Mi-Ra;Hwang, Bu-Hyun
- Proceedings of the Korea Information Processing Society Conference
- /
- 2001.10a
- /
- pp.133-136
- /
- 2001
오디오 비디오 데이터의 활용이 증가함에 따라 멀티미디어 데이터의 내용에 대해 표현하려는 연구와 함께 멀티미디어 데이터의 내용이나 메타데이터를 저장하고, 검색하고, 조작하는 연구의 필요성이 증가하였다. 멀티미디어 데이터의 표현은 사용자가 원하는 내용만을 쉽게 검색하고, 접근한 수 있도록 표현되고 저장되어야 한다. 그러나 기존의 멀티미디어 검색 시스템들은 특정 객체에 중점을 두고 색상, 위치, 모양 등의 정보를 가지고 유사 객체를 찾는 방식을 취하고 있으므로 특정 사건이나 구체적인 인물 정보나 에피소드의 정보를 검색하고자 한 때는 키워드에 의한 검색을 해야하므로 불필요한 정보가 다량으로 검색되며 여러 번의 검색이 이루어져야 하는 단점이 있다. 또한 일반 사용자들은 주로 특정 장면에서 특정 객체의 특징이나 행동, 장소, 사건 등의 정보에 대해 관심을 갖고, 이에 따른 질의를 하는 경향이 있다. 따라서 본 논문에서는 "장면"이라는 계층 구조에 중점을 두고 멀티미디어 데이터의 내용 정보와 구조 정보를 표현 및 저장을 하며, 사용자는 특정 사건이나 객체들의 특징 정보를 가지고 장면이나 전체 구조를 검색찬 수 있는 시스템을 설계하고 구현한다. 멀티미디어 데이터의 표현 및 저장 검색의 모든 과정은 데이터의 재사용성과 접근 용이성을 위해 XML을 기반으로 하여 처리된다. 이렇게 XML로 표현된 데이터는 사용자들에게 구조 정보나 내용 정보에 있어서 다양한 검색 결과를 제공할 수 있는 장점이 있다.
PDF

The Implementation of Personal Audio Recorder Service based on Embedded Linux (임베디드 리눅스 기반의 개인 오디오 레코더 서비스 구현)

Kim, Do-Hyung;Lee, Kyung-Hee;Lee, Cheol-Hoon
- The KIPS Transactions:PartD
- /
- v.15D no.2
- /
- pp.257-262
- /
- 2008
This paper describes the implementations of the application service based on embedded Linux; Personal Audio Recorder (PAR) which uses WiBro network for data communications and CDMA network for voice communications. At PAR, when PAR client starts voice recording on a dual-mode terminal, the CDMA voice data of caller and callee is transmitted to storage server located in the Internet through WiBro network. Then, PAR server stores voice data on storage server according to the call number and call time. In case of shortage of storage space on terminal, PAR makes user to store voice data. And, PAR can search a catalog of stored data on server and play the specific content.
https://doi.org/10.3745/KIPSTD.2008.15-D.2.257 인용 PDF KSCI

Implementation of Musical Note Generation System using Rhythm Information (리듬정보를 이용한 악보생성 시스템 구현)

소두석;최재원;이종혁
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.7 no.6
- /
- pp.1210-1216
- /
- 2003
Traditional indexing mechanism are based on the song's metadata such as the title and the composer and so on. However, these system have a major limitation that users have to know the metadata of the songs they want to retrieve. In order to solve these limitation, we proposed a rhythm extraction system that allows users to retrieve music information efficiently from a large music database using the rhythm that is defined as the parts of the music.
PDF KSCI

Search Result 98, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)