• Title/Summary/Keyword: 음향데이터

Search Result 943, Processing Time 0.032 seconds

Automatic Vowel Onset Point Detection Based on Auditory Frequency Response (청각 주파수 응답에 기반한 자동 모음 개시 지점 탐지)

  • Zang, Xian;Kim, Hag-Tae;Chong, Kil-To
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.13 no.1
    • /
    • pp.333-342
    • /
    • 2012
  • This paper presents a vowel onset point (VOP) detection method based on the human auditory system. This method maps the "perceptual" frequency scale, i.e. Mel scale onto a linear acoustic frequency, and then establishes a series of Triangular Mel-weighted Filter Bank simulate the function of band pass filtering in human ear. This nonlinear critical-band filter bank helps greatly reduce the data dimensionality, and eliminate the effect of harmonic waves to make the formants more prominent in the nonlinear spaced Mel spectrum. The sum of mel spectrum peaks energy is extracted as feature for each frame, and the instinct at which the energy amplitude starts rising sharply is detected as VOP, by convolving with Gabor window. For the single-word database which contains 12 vowels articulated with different kinds of consonants, the experimental results showed a good average detection rate of 72.73%, higher than other vowel detection methods based on short-time energy and zero-crossing rate.

Knowledge-based Video Retrieval System Using Korean Closed-caption (한국어 폐쇄자막을 이용한 지식기반 비디오 검색 시스템)

  • 조정원;정승도;최병욱
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.41 no.3
    • /
    • pp.115-124
    • /
    • 2004
  • The content-based retrieval using low-level features can hardly provide the retrieval result that corresponds with conceptual demand of user for intelligent retrieval. Video includes not only moving picture data, but also audio or closed-caption data. Knowledge-based video retrieval is able to provide the retrieval result that corresponds with conceptual demand of user because of performing automatic indexing with such a variety data. In this paper, we present the knowledge-based video retrieval system using Korean closed-caption. The closed-caption is indexed by Korean keyword extraction system including the morphological analysis process. As a result, we are able to retrieve the video by using keyword from the indexing database. In the experiment, we have applied the proposed method to news video with closed-caption generated by Korean stenographic system, and have empirically confirmed that the proposed method provides the retrieval result that corresponds with more meaningful conceptual demand of user.

A Study on Improving the Direction of Moving Image Material Descriptions (영상기록물 기술의 개선 방향 연구)

  • Shim, Bomee;Chang, Yunkeum
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.29 no.1
    • /
    • pp.325-344
    • /
    • 2018
  • Since the year 2000, the need for an improvement of archival descriptions has been an increasing issue, due to the growing usage and amount of archival materials. Unlike the development of descriptions for paper records, however, the technological development and research for moving image descriptions has been limited due to its diversity and specificity. This research investigated the current status and the specificity of the moving image descriptions and also examined major international archival description cases. In-depth interviews with archival professionals were also conducted. Based on the findings, this study suggested the need for redefinition of and continuous research on the fundamental values of moving image information, moving image description and management based on digilog view points, the development of user-centric description and search aides, the creation of moving image values using a relevant information management system, and the improvement of moving image description elements throughout the life-cycle of the material.

Performance analysis of subjective Loudness meter with ITU-R BS. 1387-1 algorithm for digital audio (디지털 오디오 주관적 음향레벨 계측기 구현을 위한 ITU-R BS. 1387-1의 알고리즘 특성 분석)

  • Ngan, Nguyen Vo Bao;Park, Seonggyoon;Ro, Soonghwan;Han, Chankyu
    • Journal of IKEEE
    • /
    • v.16 no.4
    • /
    • pp.395-404
    • /
    • 2012
  • In this paper, the perceived loudness metering algorithm based on ITU-R BS.1387-1 was investigated and implemented, and its performance was evaluated by applying to 23 pure tones and 9 digital audio samples. Error of the tone test results compared with ISO226:2003 was below 5%, and sample test results, in comparison with Moore's algorithm, showed deviation of less than 4.7% and correlation of 0.96. On the other hand, it was investigated how the implemented algorithm's performance was subject to auditory pitch scale. Its result showed that the algorithm with 37 auditory filters, through correcting a bias effect, has a good performance of less than 2% in comparison with the one with 109 auditory filters.

Analysis on Thermal Structural Characteristics of Thermal Protection System Panel for a High-speed Vehicle (초고속 비행체 열방어 시스템 패널의 열구조 특성 분석)

  • Lee, Heesoo;Kim, Yongha;Park, Jungsun;Goo, Namseo;Kim, Jaeyoung
    • Proceedings of the Korean Society of Propulsion Engineers Conference
    • /
    • 2017.05a
    • /
    • pp.942-944
    • /
    • 2017
  • High-speed vehicles are subjected to complex loads, such as acoustic pressure from the engine at launch and aerodynamic heating and aerodynamic pressure during flight. A thermal protection system panel is required to protect internal systems such as the fuel tank of the vehicle from the external environment. This study defines analytical models for heat transfer and thermal structure characteristics of the thermal protection system panel. Furthermore, the study performed parameters analysis to achieve the thermal structural integrity and to make it lighter.

  • PDF

Adaptive Contention Window Mechanism for Enhancing Throughput in HomePlug AV Networks (HomePlug AV 네트워크에서의 성능 향상을 위한 적응적 Contention Window 조절 방식)

  • Yoon, Sung-Guk;Yun, Jeong-Kyun;Kim, Byung-Seung;Bahk, Sae-Woong
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.33 no.5B
    • /
    • pp.318-325
    • /
    • 2008
  • HomePlug AV(HPAV) is the standard for distribution of Audio/video content as well as data within the home by using the power line. It uses a hybrid access mechanism that combines TDMA with CSMA/CA for MAC technology. The CSMA/CA protocol in HPAV has two main control blobs that can be used for access control: contention window(CW) size and deferral counter(DC). In this paper, we extensively investigate the impacts of CW and DC on performance through simulations, and propose an adaptive mechanism that adjusts the CW size to enhance the throughput in HPAV MAC. We find that the CW size is more influential on performance than the DC. Therefore, to make controlling the network easier, our proposal uses a default value of DC and adjusts the CW size. Our scheme simply increases or decreases the CW size if the network is too busy or too idle, respectively, We compare the performance of our proposal with those of the standard and other competitive schemes in terms of throughput and fairness. Our simulation and analysis results show that our adaptive CW mechanism performs very well under various scenarios.

Implementation of MP3 decoder with TMS320C541 DSP (TMS320C541 DSP를 이용한 MP3 디코더 구현)

  • 윤병우
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.4 no.3
    • /
    • pp.7-14
    • /
    • 2003
  • MPEG-1 audio standard is the algorithm for the compression of high-qualify digital audio signals. The standard dictates the functions of encoder and decoder pair, and includes three different layers as the complexity and the performance of the encoder and decoder. In this paper, we implemented the real-time system of MPEG-1 audio layer III decoder(MP3) with the TMS320C541 fixed point DSP chip. MP3 algorithm uses psycho-acoustic characteristic of human hearing system, and it reduces the amount of data with eliminating the signals hard to be heard to the hearing system of human being. It is difficult to implement MP3 decoder with fixed Point DSP because of it's broad dynamic range. We implemented realtime system with fixed DSP chip by using weighted look-up tables to reduce the amount of calculation and solve the problem of broad dynamic range.

  • PDF

Development and Application of IoT-based Contactless Ultraosonic System (IoT 기반 비접촉 초음파 측정 시스템 개발 및 적용)

  • Kim, Jihwan;Hong, Jinyoung;Kim, Rrulri;Woo, Ukyong;Choi, Hajin
    • Journal of the Korea institute for structural maintenance and inspection
    • /
    • v.24 no.3
    • /
    • pp.70-79
    • /
    • 2020
  • The main objective of this research to develop an IoT based wireless contactless ultrasonic system (ICUS) and its application to concrete structure. The developed system consists of 16 mems, 2Mhz digitizer, amplifying circuit, FPGA, and wifi module, enabling to measure leaky surface waves from concrete specimens without physical coupling process and wires. Multi-channel analysis is performed to improve the accuracy of data analysis, and the velocity of leaky surface waves and acoustics are derived. Field inspection of railroad concrete sleepers is conducted to evaluate the performance of the system and to compare the results with conventional ultrasonic pulse velocity (UPV). As a result of the field inspection, UPV was limited to evaluate damages. This is because crack pattern of railroad sleepers is parallel to ultrasonic ray path and accessibility of the railroad at the field is disadvantageous to contact-based UPV. On the other hand, ICUS possibly detect the damages as reduction of dynamic modulus by up to 59% compared to non-damaged specimen.

An Objective Speech Quality Measure using Masking Effect under Digital Mobile Telephone Network Environment (디지털 이동통신망 환경 하에서 마스킹 효과를 이용한 객관적 음질 평가 척도)

  • 김광수;김민정;석수영;정호열;정현일
    • Journal of Korea Multimedia Society
    • /
    • v.5 no.4
    • /
    • pp.405-414
    • /
    • 2002
  • In this paper, we propose a new objective speech quality measure using noise masking threshold for speech quality assessment of mobile telephone network environments, and verify the effectiveness of the proposed method through the experiments. For such a purpose, well known objective speech quality measures such as BSD and PSQM are first evaluated for digital mobile telephone network environments. However, these conventional methods does not have good performance under mobile networks environments compared to literary results. To be mote effective objective speech quality measure under mobile telephone environments, the proposed method employs human psychoacoustic masking effect. The DMOS, instead of MOS, is used as a subjective speech quality measure for performance evaluation. The performance comparison are carried out with speech data collected from digital mobile telephone environments. As results, the proposed measure have and average 4% higher performance, in terms of correlation, than existing objective speech quality measures such as BSD and PSQM.

  • PDF

Applications of the improved Hilbert-Huang transform method to the detection of thermo-acoustic instabilities (열음향학적 불안정성 검출에 대한 개선된 힐버트-후앙 변환의 적용)

  • Cha, Ji-Hyeong;Kim, Young-Seok;Ko, Sang-Ho
    • Proceedings of the Korean Society of Propulsion Engineers Conference
    • /
    • 2012.05a
    • /
    • pp.555-561
    • /
    • 2012
  • The Hilbert Huang Transform (HHT) technigue with Empirical Mode Decomposition (EMD) is one of the time-frequency domain analysis methods and it has several advantages such that analyzing non-stationary and nonlinear signal is possible. However, there are shortcomings in detecting near-range of frequencies and added noise signals. In this paper, to analyze characteristics of each method, HHT and Short-Time Fourier Transform (STFT) effective in dealing with stationary signals are compared. And with thermoacoustic instabilities signals from a Rijke tube test, HHT and the improved HHT with Ensemble Empirical Mode Decomposition (EEMD) are compared. The results show that the improved HHT is more appropriate than the original HHT due to the relative insensitivity to noise. Therefore it will result in more accurate analysis.

  • PDF