• 제목/요약/키워드: MP3 Files

검색결과 19건 처리시간 0.022초

MP3 태그의 XML 확장을 이용한 동기화된 재생 시스템 (Synchronized MP3 Playing System Using XML Extension of MP3 Tag)

  • 곽미라;조동섭
    • 정보처리학회논문지B
    • /
    • 제9B권1호
    • /
    • pp.67-76
    • /
    • 2002
  • 고품질의 오디오 표준인 MP3포맷의 사용이 증가하면서, 오디오 데이터 외에 작곡가, 가사 등의 관련정보를 함께 저장하려는 요구가 나타났고 이를 만족하는 태깅 시스템들이 등장했다. 특히 ID3 vl 태그와 Lyrics3 v2 태그를 함께 사용하는 태깅 방법이 많이 사용되고 있다. 그러나 이 태그들은 MP3 파일 내에서 오디오 스트림의 뒷부분에 기록되므로, 이러한 태깅 방법이 적용된 MP3 파일이 스트리밍 방식으로 전달되는 경우 사용자는 전체 스트림이 로컬 시스템에 전송되기 전까지 태그 정보를 볼 수 없다. 또한 태그 정보들 중 오디오 스트림에 시간적으로 동기화된 정보들은 동기화의 기능을 잃는다. 본 논문에서는 원격지로부터 전달되는 MP3 파일의 재생시 태그 정보가 무시되는 문제를 해결하였다. XML을 사용하여 MP3 오디오 객체를 모델링하였고, 그 요소들의 시간관계성과 동기성을 HTML+TIME 방식으로 표현하는 XSL 문서를 설계하여 오디오 데이터가 시간성과 동기성을 가지고 웹 상에서 재생되도록 하였다.

A Novel Query-by-Singing/Humming Method by Estimating Matching Positions Based on Multi-layered Perceptron

  • Pham, Tuyen Danh;Nam, Gi Pyo;Shin, Kwang Yong;Park, Kang Ryoung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제7권7호
    • /
    • pp.1657-1670
    • /
    • 2013
  • The increase in the number of music files in smart phone and MP3 player makes it difficult to find the music files which people want. So, Query-by-Singing/Humming (QbSH) systems have been developed to retrieve music from a user's humming or singing without having to know detailed information about the title or singer of song. Most previous researches on QbSH have been conducted using musical instrument digital interface (MIDI) files as reference songs. However, the production of MIDI files is a time-consuming process. In addition, more and more music files are newly published with the development of music market. Consequently, the method of using the more common MPEG-1 audio layer 3 (MP3) files for reference songs is considered as an alternative. However, there is little previous research on QbSH with MP3 files because an MP3 file has a different waveform due to background music and multiple (polyphonic) melodies compared to the humming/singing query. To overcome these problems, we propose a new QbSH method using MP3 files on mobile device. This research is novel in four ways. First, this is the first research on QbSH using MP3 files as reference songs. Second, the start and end positions on the MP3 file to be matched are estimated by using multi-layered perceptron (MLP) prior to performing the matching with humming/singing query file. Third, for more accurate results, four MLPs are used, which produce the start and end positions for dynamic time warping (DTW) matching algorithm, and those for chroma-based DTW algorithm, respectively. Fourth, two matching scores by the DTW and chroma-based DTW algorithms are combined by using PRODUCT rule, through which a higher matching accuracy is obtained. Experimental results with AFA MP3 database show that the accuracy (Top 1 accuracy of 98%, with an MRR of 0.989) of the proposed method is much higher than that of other methods. We also showed the effectiveness of the proposed system on consumer mobile device.

오디오 바이너리 파일을 컬러 QR코드로 표현하는 방법과 그 응용 (A Method to Express Audio Binary Files by Color QR Codes and Its Application)

  • 이충호
    • 융합신호처리학회논문지
    • /
    • 제19권2호
    • /
    • pp.47-53
    • /
    • 2018
  • 본 논문은 MP3 오디오 바이너리 파일을 일련의 컬러 QR 코드로 생성하여 종이에 인쇄할 수 있는 방법을 제안한다. 또한 이 방법이 상당한 압축효과를 가져올 수 있음을 기술한다. 이 방법은 먼저, 한 개의 MP3 파일을 QR코드가 바이너리로 표현할 수 있는 최대용량으로 나눈다. 그런 다음 각각의 분할된 파일들을 흑백 QR코드들로 변환한다. 최종적으로, 분할된 파일을 3개씩 중첩하여 1개의 컬러 QR코드를 만든다. 중첩 시에 3개의 흑백 QR 코드는 각각 적색, 녹색, 청색으로 간주된다. 이 방법에서 한 개의 컬러 QR코드는 2개의 흑백 QR코드 영역이 겹쳐지는 부분은 시안(Cyan), 마젠타(Magenta), 노란색(Yellow)로 표현되며, 3개의 흑백 QR코드가 겹쳐지는 부분은 흑색, 전혀 겹쳐지지 않는 부분은 백색으로 표현한다. 실험결과 약8.5Mb의 MP3파일은 A4용지 9페이지에 인쇄될 수 있다. 부수적인 효과로서 인쇄하지 않은 컬러 QR코드의 크기는 원래의 MP3파일보다 약 15.7배의 압축효과를 가질 수 있음을 보였다. 제안된 방법은 인터넷 액세스가 불가능한 환경에서 사용될 수 있는 장점이 있다.

저작권 보호를 위한 HMM기반의 음악 식별 시스템 (HMM-based Music Identification System for Copyright Protection)

  • 김희동;김도현;김지환
    • 말소리와 음성과학
    • /
    • 제1권1호
    • /
    • pp.63-67
    • /
    • 2009
  • In this paper, in order to protect music copyrights, we propose a music identification system which is scalable to the number of pieces of registered music and robust to signal-level variations of registered music. For its implementation, we define the new concepts of 'music word' and 'music phoneme' as recognition units to construct 'music acoustic models'. Then, with these concepts, we apply the HMM-based framework used in continuous speech recognition to identify the music. Each music file is transformed to a sequence of 39-dimensional vectors. This sequence of vectors is represented as ordered states with Gaussian mixtures. These ordered states are trained using Baum-Welch re-estimation method. Music files with a suspicious copyright are also transformed to a sequence of vectors. Then, the most probable music file is identified using Viterbi algorithm through the music identification network. We implemented a music identification system for 1,000 MP3 music files and tested this system with variations in terms of MP3 bit rate and music speed rate. Our proposed music identification system demonstrates robust performance to signal variations. In addition, scalability of this system is independent of the number of registered music files, since our system is based on HMM method.

  • PDF

안골격형과 교합과의 상호관계에 대한 연구 (A STUDY ON RELATONS BETWEEN FACIAL SKELETAL PATTERNS AND DENTAL OCCLUSION)

  • 장영일
    • 대한치과교정학회지
    • /
    • 제12권1호
    • /
    • pp.21-26
    • /
    • 1982
  • This study was undertaken to document relations between facial skeletal pattern and dental occlusion. The data in .this study were collected from pretreatment cephalometric radiographs and study models of patients' records present in the files of Orthodontic Department, Seoul National University Hospital. Patients were selected on the basis of a mandibular plane-sella nasion angle equal to or greater than $38^{\circ}$ (high SN-MP angle) or equal to or less than $26^{\circ}$ (low SN-MP angle). Patients in the mixed dentition and with missing permanent teeth were excluded for ease of assessing tooth size / arch circumference relationships and then 30 high SN-MP and 11 low SN-MP patients were selected among them. The mean age of these two groups of patients was high SN-MP, $12.8{\pm}1.23$ years and low SN-MP, $13.0{\pm}1.48$ years. The following conclusions were obtained. 1. In the maxilla and mandible the mean tooth size of high SN-MP patients was nearlly identical to the low SN-MP patients. 2. The mean maxillary arch circumference was increased in low SN-MP group compared with high SN-MP group and a smilar, but smaller, mean increase was present in mandible. 3. The difference between the mean maxillary circumference required and the mean maillary circumference present ranged from -4.8mm in the high SN-MP group to -1.3mm in the low SN-MP group. A small range of means occurred in the mandible (high SN-MP: -4.0mm to low SN-MP: -1.8mm). 4. In the maxilla and mandible the mean arch length was nearly identical in the high and low SN-MP groups. 5. The mean incisor inclination was increased as the SN-MP angle decreased in the maxilla and mandible. 6. The men distance of the maxillary first molar from anterior border of the pterygomaxillary fissure was nearly similar between high and low groups. 7. The mean mandibular intermolar width was increased from high SN-MP to low SN-MP patients.

  • PDF

Design and Implementation of Damaged Video File Recovery Tool using Container Format Structure

  • Choi, Yun-Seok;Lee, Wan Yeon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제11권3호
    • /
    • pp.64-70
    • /
    • 2019
  • Video files of video devices such as black box and CCTV may be damaged due to repetitive file read / write and physical environment factors. Even though there are available parts of video information, it may happen that playback can't be performed due to damage of some information. To playback the remaining video information normally, it is necessary to recover damaged areas of the files. For this, it is necessary to accurately check the damage range of the files. In this paper, we propose the design and implementation of a tool which detects damaged areas of a video file and recovers the usable area of the file to playback. The proposed tool can analyze and recover without additional information by analyzing common information of video container format and can check detailed damaged ranges with chunks. It is possible to perform recovery just only with the target file and reference file without any other information such as codec specification.

임베디드 리눅스 시스템을 이용한 디지털 사진 액자 구현 (Implementation of Digital Photo Frame using Embedded Linux System)

  • 현경석;이명의
    • 한국산학기술학회논문지
    • /
    • 제7권5호
    • /
    • pp.901-906
    • /
    • 2006
  • 본 논문에서는 디지털 카메라의 사진을 메모리 카드를 통해 입력받고 디스플레이하며 각 사진에 대한 음성 레코딩과 MP3 플레이가 가능한 디지털 사진 액자 시스템 구현에 대하여 기술한다. Intel PXA255 보드의 시스템 제어를 위한 부트로더와 리눅스 커널을 포팅하며 외부 장치들을 위한 디바이스 드라이버를 작성한다. 리눅스 시스템 상에서 이미지 출력 및 음성 레코딩, MP3 플레어 기능을 구현하기 위해 마이크로윈도우즈 시스템의 구성 파일을 수정하고 응용 프로그램을 작성한다. 본 논문 연구를 통해서 저 전력, 고성능의 임베디드 프로세서와 리눅스 시스템을 이용한 디지털 사진 액자 개발에 쉽게 접근할 수 있으며 구현된 디바이스 드라이버와 응용 프로그램 개발 절차를 통해 임베디드 시스템 개발과 관련한 분야에 기초 자료로 사용할 수 있을 것이다.

  • PDF

MPEG-2 오디오를 위한 MDCT 설계에 관한 연구 (A Study on the MDCT Design for MPEG-2 Audio)

  • 김정태;구대성;이강현
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 추계종합학술대회 논문집(3)
    • /
    • pp.97-100
    • /
    • 2000
  • The most important technology is the compression methods in the multimedia society. Audio files are rapidly propagated through internet. MP-3(MPEG-1 Layer3) is offered to CD tone quality in 128kbps, but 64kbps below tone-quality is abruptly down. On the other hand, MPEG-II AAC (Advanced Audio Coding) is not compatible with MPEG-I, but AAC has a high compression ratio 1.4 times better than MP-3 and it has max. 7.1 channel and 96KHz sampling rate. In this paper, we designed the optimized MDCT (Modified Discrete Cosine Transform) that could decrease the capacity of enormous computation and could increase the processing speed in the MPEG-2 AAC encoder.

  • PDF

Audio Steganography Method Using Least Significant Bit (LSB) Encoding Technique

  • Alarood, Alaa Abdulsalm;Alghamdi, Ahmed Mohammed;Alzahrani, Ahmed Omar;Alzahrani, Abdulrahman;Alsolami, Eesa
    • International Journal of Computer Science & Network Security
    • /
    • 제22권7호
    • /
    • pp.427-442
    • /
    • 2022
  • MP3 is one of the most widely used file formats for encoding and representing audio data. One of the reasons for this popularity is their significant ability to reduce audio file sizes in comparison to other encoding techniques. Additionally, other reasons also include ease of implementation, its availability and good technical support. Steganography is the art of shielding the communication between two parties from the eyes of attackers. In steganography, a secret message in the form of a copyright mark, concealed communication, or serial number can be embedded in an innocuous file (e.g., computer code, video film, or audio recording), making it impossible for the wrong party to access the hidden message during the exchange of data. This paper describes a new steganography algorithm for encoding secret messages in MP3 audio files using an improved least significant bit (LSB) technique with high embedding capacity. Test results obtained shows that the efficiency of this technique is higher compared to other LSB techniques.

MPEG-II AAC Encoder의 perceptual Model에 관한 연구 (A study on the Perceptual Model for MPEG II AAC Encoder)

  • 구대성;김정태;이강현
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 하계종합학술대회 논문집(3)
    • /
    • pp.93-96
    • /
    • 2000
  • Currently, the most important technology is the compression methods in the multimedia society. Audio files are rapidly propagated through internet. MP-3 is offered to CD tone quality in 128Kbps, but 64Kbps below tone quality is abruptly down and high bitrate. on the other hand, MPEG-II AAC (Advanced Audio Coding) is not compatible with MPEG-I, but AAC has a high compression ratio 1.4 better than MP-3. Especially, AAC has max. 7.1 channel and 96KHz sampling rate. In this paper, the perceptual model is dealt with 44.1KHz sampling rate for SMR(Signal to Masking Ratio)

  • PDF