• 제목/요약/키워드: Audio Signal Processing

검색결과 157건 처리시간 0.02초

오디오 신호처리용 DAC디지털 단의 설계기법 (Design methodology of digital circuits for an audio-signal-processing DAC)

  • 김선호;손영철;김상호;이지행;김대정;김동명
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(2)
    • /
    • pp.157-160
    • /
    • 2002
  • This paper proposed a guideline for selecting the arithmetic circuit architecture. The guideline incorpo-rates the new concept of PDSP (power-delay-size product) and the weighting method. HSPICE simulations havc been performed to several full adders in order to prove the validity of the proposed guideline. We applied this guideline to select an optimized FA (full adder) architecture and successfully implemented the DAC's digital blocks.

  • PDF

디지털전관방송을 위한 통합믹서컨트롤러 개발 (Development of Integrated Mixer Controller for Digital Public Address)

  • 조주필;김관웅;김대익
    • 한국인터넷방송통신학회논문지
    • /
    • 제17권1호
    • /
    • pp.19-24
    • /
    • 2017
  • 최근 IT기술의 발전을 근간으로 PA시스템에 IT기술이 접목된 혁신적인 제품들이 개발되고 있다. 본 논문에서는 디지털 전관방송을 위한 통합 믹서 컨트롤러를 제안하였다. 기존 디지털 전관방송을 구성하는 디지털믹서와 디지털 통합 컨트롤러의 기능을 포함한 통합믹서컨트롤러를 개발하였다. 개발된 통합믹서컨트롤러는 16개의 오디오입력채널과 8개의 출력채널을 가진 다채널 믹서 기능을 가진다. 그리고, 디지털오디오신호를 처리하기 위한 EQ, Matrix, Limiter을 가지고 있다. 또한, 개발된 컨트롤러는 믹서의 동작상태 모니터링과 전체 PA시스템을 제어하기 위한 인터넷 연결기능을 가진다.

Prediction of Closed Quotient During Vocal Phonation using GRU-type Neural Network with Audio Signals

  • Hyeonbin Han;Keun Young Lee;Seong-Yoon Shin;Yoseup Kim;Gwanghyun Jo;Jihoon Park;Young-Min Kim
    • Journal of information and communication convergence engineering
    • /
    • 제22권2호
    • /
    • pp.145-152
    • /
    • 2024
  • Closed quotient (CQ) represents the time ratio for which the vocal folds remain in contact during voice production. Because analyzing CQ values serves as an important reference point in vocal training for professional singers, these values have been measured mechanically or electrically by either inverse filtering of airflows captured by a circumferentially vented mask or post-processing of electroglottography waveforms. In this study, we introduced a novel algorithm to predict the CQ values only from audio signals. This has eliminated the need for mechanical or electrical measurement techniques. Our algorithm is based on a gated recurrent unit (GRU)-type neural network. To enhance the efficiency, we pre-processed an audio signal using the pitch feature extraction algorithm. Then, GRU-type neural networks were employed to extract the features. This was followed by a dense layer for the final prediction. The Results section reports the mean square error between the predicted and real CQ. It shows the capability of the proposed algorithm to predict CQ values.

2차원적 음원추적에 관한 연구 (A Study on Acoustic Sound Tracking System on 2-Dimensional Plain)

  • 문성배;전승환
    • 한국항해항만학회:학술대회논문집
    • /
    • 한국항해항만학회 1996년도 The Korean Institute of Navigation 1996년도 한·중 국제학술 심포지움 및 추계학술발표회 논문집
    • /
    • pp.117-124
    • /
    • 1996
  • When navigating in or near an area of restricted visibility it is necessary to be heard the whistle bell and/or the siren of lighthouses or ships at times. Even though we can get the brief informations about the property of sound the direction and range of a sound radiator it is not easy to get the accurate informations for decision making. generally the audio frequency is known as 16-20,000Hz but the earshot is shorten and discrimination of sound is more difficult when there is some noise. The sound pressure is 60dB at the moment when human speaks 1 meter away. Usually the noise pressure in a silent room is 40dB and 60dB on the quiet street. In this study we suggest the basic algorithm to trace the direction and range of the source radiator using the signal received through not a physical sense but the microphone sensors and a series of signal of signal processing.

  • PDF

Area-wise relational knowledge distillation

  • Sungchul Cho;Sangje Park;Changwon Lim
    • Communications for Statistical Applications and Methods
    • /
    • 제30권5호
    • /
    • pp.501-516
    • /
    • 2023
  • Knowledge distillation (KD) refers to extracting knowledge from a large and complex model (teacher) and transferring it to a relatively small model (student). This can be done by training the teacher model to obtain the activation function values of the hidden or the output layers and then retraining the student model using the same training data with the obtained values. Recently, relational KD (RKD) has been proposed to extract knowledge about relative differences in training data. This method improved the performance of the student model compared to conventional KDs. In this paper, we propose a new method for RKD by introducing a new loss function for RKD. The proposed loss function is defined using the area difference between the teacher model and the student model in a specific hidden layer, and it is shown that the model can be successfully compressed, and the generalization performance of the model can be improved. We demonstrate that the accuracy of the model applying the method proposed in the study of model compression of audio data is up to 1.8% higher than that of the existing method. For the study of model generalization, we demonstrate that the model has up to 0.5% better performance in accuracy when introducing the RKD method to self-KD using image data.

청각을 이용한 시각 재현 시스템의 개발 (Development of Processing System for Audio-vision System Based on Auditory Input)

  • 김정훈;김덕규;원철호;이종민;이희중;이나희;윤수영
    • 대한의용생체공학회:의공학회지
    • /
    • 제33권1호
    • /
    • pp.25-31
    • /
    • 2012
  • The audio vision system was developed for visually impaired people and usability was verified. In this study ten normal volunteers were included in the subject group and their mean age was 28.8 years old. Male and female ratio was 7:3. The usability of audio vision system was verified by as follows. First, volunteers learned distance of obstacles and up-down discrimination. After learning of audio vision system, indoor and outdoor walking examination was performed. The test was scored by ability of up-down and lateral discrimination, distance recognition and walking without collision. Each parameter was scored by 1 to 5. The results were 93.5 +- SD(ranges, 86 to 100) of 100. In this study, we could convert visual information to auditory information by audio-vision system and verified possibility of applying to daily life for visually impaired people.

오디오의 Peak 특징을 이용한 동일 영화 콘텐츠 검색 (Similar Movie Contents Retrieval Using Peak Features from Audio)

  • 정명범;성보경;고일주
    • 한국멀티미디어학회논문지
    • /
    • 제12권11호
    • /
    • pp.1572-1580
    • /
    • 2009
  • 검색을 위해 동영상 데이터 전체를 이용하면 많은 시간과 저장 공간이 필요하다. 이를 보완하고자 기존의 동일 영화 검색은 영상 정보의 일부를 이용하여 동일한 영상 검색에 사용해 왔다. 그러나 이 방법은 같은 영상임에도 비디오 부호화기이나 해상도가 다른 경우 전혀 다른 영상으로 인식한다. 따라서 본 논문에서는 동영상의 오디오 정보를 이용하여 동일한 동영상을 찾는 알고리즘을 제안한다. 제안 방법은 부호화율, 부호화기, 샘플링 수의 변화에도 유사한 파형을 형성하는 Peak 정보를 바탕으로 데이터베이스에 색인하고, 검색한다. 논문에서는 제안 방법의 성능을 확인하기 위해 1,000개의 동영상 데이터를 검색 실험하였으며, 92.1%의 성공률을 나타내었다.

  • PDF

Implementation of Public Address System Using Anchor Technology

  • Seungwon Lee;Soonchul Kwon;Seunghyun Lee
    • International journal of advanced smart convergence
    • /
    • 제12권3호
    • /
    • pp.1-12
    • /
    • 2023
  • A public address (PA) system installed in a building is a system that delivers alerts, announcements, instructions, etc. in an emergency or disaster situation. As for the products used in PA systems, with the development of information and communication technology, PA products with various functions have been introduced to the market. PA systems recently launched in the market may be connected through a single network to enable efficient management and operation, or use voice recognition technology to deliver quick information in case of an emergency. In addition, a system capable of locating a user inside a building using a location-based service and guiding or responding to a safe area in the event of an emergency is being launched on the market. However, the new PA systems currently on the market add some functions to the existing PA system configuration to make system operation more convenient, but they do not change the complex PA system configuration to reduce facility costs, maintenance, and management costs. In this paper, we propose a novel PA system configuration for buildings using audio networks and control hierarchy over peer-to-peer (Anchor) technology based on audio over IP (AoIP), which simplifies the complex PA system configuration and enables convenient operation and management. As a result of the study, through the emergency signal processing algorithm, fire broadcasting was made possible according to the detection of the existence of a fire signal in the Anchor system. In addition, the control device of the PA system was replaced with software to reduce the equipment installation cost, and the PA system configuration was simplified. In the future, it is expected that the PA system using Anchor technology will become the standard for PA facilities.

서브밴드 적응신호처리를 이용한 음향 에코제거기의 모델링 (Modeling of Acoustic Echo Canceller Using Subband Adaptive Signal Processing)

  • 김천덕;심동연;정호문;이준구;차경환
    • 한국음향학회지
    • /
    • 제16권5호
    • /
    • pp.43-49
    • /
    • 1997
  • TV 회의 시스템 또는 확성회의 시스템에 응용되는 반향제거기에 있어서, 긴 잔향시간을 갖는 실내 공간에서는 환경변화에 따른 필터계수의 갱신에 많은 시간이 필요하며 실시간 처리에 장애요인이 되고 있다. 따라서 본 논문에서는 MPEG 오디오 시스템에서 이용하고 있는 폴리페이즈 필터 뱅크를 사용한 서브밴드 적응 신호처리법을 제안한다. 이 방법은 입력과 출력의 스펙트럼을 몇 개의 주파수 밴드로 분할하여, 각 밴드를 ES-NLMS 알고리즘을 이용하여 적응처리하는 것이다. 계산기상의 시뮬레이션을 통하여 최적의 서브밴드 수를 구하였으며, 기존의 풀밴드 방식에 대하여 수렴속도 및 제특성이 약 2dB 정도 작을때 서브밴드로 분할하는 방법이 연산량에 있어서 약 88% 정도 감소하여 풀밴드보다 우수한 것으로 나타났다.

  • PDF

An Implementation of Highly Integrated Signal Processing IC for HDTV

  • Hahm Cheul-Hee;Park Kon-Kyu;Kim Hyoung-Gil;Jung Choon-Sik;Lee Sang-keun;Jang Jae-Young;Park Sung-Uk;Chon Byung-Hoan;Chun Kang-Wook;Jo Jae-Moon;Song Dong-il
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2003년도 정기총회 및 학술대회
    • /
    • pp.69-72
    • /
    • 2003
  • This paper presents a signal processing IC for digital HDTV, which is designed to operate in bunt-in HDW or in HD-set-top Box. The chip supports de-multiplexing an ISO/IEC 13818-1 MPEG-2 TS stream. It decodes MPEG-2 MP@HL video bitstream, and provides high-quality scaled video for display on HDTV monitor. The chip consists of ARM7TDMI for TS-Demux, PCI interface, Audio interface, MPEG2 MP@HL video decoder Display processor, Graphic processor, Memory controller, Audio int3face, Smart Card interface and UART. It is fabricated using Sam sung's 0.18-um and the package of 492-pin BGA is used.

  • PDF