• Title/Summary/Keyword: 청취 모델

Search Result 51, Processing Time 0.027 seconds

Energy-Efficient Routing Protocol for Hybrid Ad Hoc Networks (하이브리드 애드 혹 네트워크에서의 에너지 효율성을 고려한 라우팅 알고리즘)

  • Park, Hye-Mee;Park, Kwang-Jin;Choo, Hyun-Seung
    • Journal of Internet Computing and Services
    • /
    • v.8 no.5
    • /
    • pp.133-140
    • /
    • 2007
  • Currently, as the requirement for high quality Internet access from anywhere at anytime is consistently increasing, the interconnection of pure ad hoc networks to fixed IP networks becomes increasingly important. Such integrated network, referred to as hybrid ad hoc networks, can be extended to many applications, including Sensor Networks, Home Networks, Telematics, and so on. We focus on some data communication problems of hybrid ad hoc networks, such as broadcasting and routing. In particular. power failure of mobile terminals is the most important factor since it affects the overall network lifetime. We propose an energy-efficient routing protocol based on clustering for hybrid ad hoc networks. By applying the index-based data broadcasting and selective tuning methods, the infra system performs the major operations related to clustering and routing on behalf of ad hoc nodes. The proposed scheme reduces power consumption as well as the cost of path discovery and maintenance, and the delay required to configure the route.

  • PDF

Simulation Software for Instrument Placement on Stage Based on the Acoustic Properties of Concert Halls (연주홀 특성을 적용한 악기 무대 배치 시뮬레이션 소프트웨어 제작)

  • Kim, Wan-Jung;Yoo, Won-Dae;Kim, Keun-Hyung;Lee, Ki-Beom;Yeo, Woon-Seung
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.7
    • /
    • pp.960-972
    • /
    • 2010
  • In this paper, we present a software for placing instruments on stage based on the acoustic properties of the concert hall. In order to simulate the changes in sound depending on the positions of the instruments, we incorporated the idea of location-based reverberation effect which can be realized through the convolution of instrument sounds with the impulse responses from the respective instrument positions. And we developed a software with a real-time convolution engine which enables the user to conveniently simulate the resulting sound of various instrument placements. The software was tested with the impulse response data measured at two concert halls of the National Center for Korean Traditional Performing Arts and Korean traditional instrument sounds. Results of these experiments show that simulated reverberation effects properly represent the spatial placement of instruments on stage.

Voice-to-voice conversion using transformer network (Transformer 네트워크를 이용한 음성신호 변환)

  • Kim, June-Woo;Jung, Ho-Young
    • Phonetics and Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.55-63
    • /
    • 2020
  • Voice conversion can be applied to various voice processing applications. It can also play an important role in data augmentation for speech recognition. The conventional method uses the architecture of voice conversion with speech synthesis, with Mel filter bank as the main parameter. Mel filter bank is well-suited for quick computation of neural networks but cannot be converted into a high-quality waveform without the aid of a vocoder. Further, it is not effective in terms of obtaining data for speech recognition. In this paper, we focus on performing voice-to-voice conversion using only the raw spectrum. We propose a deep learning model based on the transformer network, which quickly learns the voice conversion properties using an attention mechanism between source and target spectral components. The experiments were performed on TIDIGITS data, a series of numbers spoken by an English speaker. The conversion voices were evaluated for naturalness and similarity using mean opinion score (MOS) obtained from 30 participants. Our final results yielded 3.52±0.22 for naturalness and 3.89±0.19 for similarity.

An Application of the Kalman Filter for Attenuation of Colored Noise Superimposed on Speech Signal (칼만필터를 이용한 음성신호에 중첩된 유색잡음의 감쇠)

  • Gu, Bon-Eung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.2
    • /
    • pp.76-85
    • /
    • 1994
  • A speech enhancement algorithm which attenuates nonstationary colored noise is presented In this paper. The algorithm consists of a stationary Kalman filter and the simple speech/nonspeech detector. While the conventional enhancement systems are focused on a stationary and/or white background noise, this study Is focused on the mort realistic nonstationary and nonwhite noise. An AR model-based vector Kalman filter is used as a noise suppression system and a short-time energy threshold logic is used as a speech/nonspeech classifier. For Kalman filtering. noise coefficients are estimated in the nonspeech frame, and speech coefficients are estimated by applying the EM iteration algorithm. Simulation results using the car noise are presented based on the signal-to-noise ratio and informal listening tests. According to the experimental results, background noises in the nonspeech frames are eliminated almost completely, while some distortions are noticed in the speech frames. The distortion becomes severer as the SNR is reduced to 0dB and -5dB. Intelligibility, however, is not degraded significantly.

  • PDF

A Basic Study on Development of VTS Control Guideline based on Ship's Operator's Consciousness (선박운항자 의식 기반 적정 관제시기 분석에 관한 기초 연구)

  • Park, Sang-Won;Park, Young-Soo
    • Journal of Navigation and Port Research
    • /
    • v.40 no.3
    • /
    • pp.105-111
    • /
    • 2016
  • In ports of Korea, the marine traffic flow is congested due to a large number of vessels coming in and going out. In order to improve the safety and efficiency of these vesse's movement, South Korea is operating with a Vessel Traffic System, which is monitoring its flow 24-7. However despite these efforts of the VTS (Vessel Traffic System) officers, marine accidents are occurring continuously in their control area. VTS Officers are controlling subjectively based on their experience due to no VTS control guideline of dangerous situation among vessels. On this paper, we listened to Busan VHF channel for 3days and analyzed the message. With collision risk model, We analyzed a moment of risk which officers advise or recommend to vessel in encounter situation, VTSO's career, and day&night.

Digital Filter Model for Analog Helical Coil Spring Reverberator (헬리컬 코일 스프링 잔향기의 디지털 필터 모델)

  • Park Joon;Chon Sang-Bae;Lee Jong-Hoon;Sung Koeng-Mo;Song Sang-Seob
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.6
    • /
    • pp.291-297
    • /
    • 2006
  • This paper proposes a new Digital Reverberator that models Analog Helical Coil Spring Reverberator for guitar amplifiers. While the conventional digital reverberators are proposed to provide better sound field mainly based on room acoustics, no algorithm or analysis of digital reverberators those model Helical Coil Spring Reverberator was proposed. Considering the fact that approximately $70{\sim}80$ percent of guitar amplifiers are still with Helical Coil Spring Reverberator, research was performed based not on Room Acoustics but on Helical Coil Spring Reverberator itself as an effector. After performing simulations with proposed algorithm, it was confirmed that the Digital Reverberator by proposed algorithm provides perceptually equivalent response to the conventional Analog Helical Coil Spring Reverberators.

Evaluation of a signal segregation by FDBM (FDBM의 음원분리 성능평가)

  • Lee, Chai-Bong
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.12
    • /
    • pp.1793-1802
    • /
    • 2013
  • Various approaches for sound source segregation have been proposed. Among these approaches, frequency domain binaural model(FDBM) has the advantages of low computational load and effective howling cancellation. A binaural hearing assistance system based on FDBM has been proposed. This system can enhance desired signal based on the directivity information. Although FDBM has been evaluated in terms of signal-to-noise ratio (SNR) and coherence function, the evaluation results do not always agree with the human impressions. These evaluation methods provide physical measures, and do not take account of perceptual aspect of human being. Considering a binaural hearing assistance system as a one of major applications, the quality of segregated sound should keep level enough. In the paper, signal segregation performance by means of FDBM is evaluated by three objective methods, i.e., SNR, coherence and Perceptual Evaluation of Speech Quality(PESQ), to discuss the characteristic of FDBM on the sound source segregation performance. The simulation's evaluation results show that FDBM improves the quality of the left and right channel signals to an equivalent level. And the results suggest the possibility that PESQ provides a more useful measure than SNR and coherence in terms of the segregation performance of FDBM. The evaluation results by PESQ show the effects from segregation parameters and indicate appropriate parameters under the conditions. In the paper, signal segregation performance by means of FDBM is evaluated by three objective methods, i.e., SNR, coherence and PESQ, to discuss the characteristic of FDBM on the sound source segregation performance. The simulation's evaluation results show that FDBM improves the quality of the left and right channel signals to an equivalent level. And the results suggest the possibility that PESQ provides a more useful measure than SNR and coherence in terms of the segregation performance of FDBM. The evaluation results by PESQ show the effects from segregation parameters and indicate appropriate parameters under the conditions.

A Study on the Effective VTS Communications Analysis by the Method of VCDF in Busan Port (VCDF 방식을 통한 효율적인 VTS 통신 데이터 분석에 관한 연구 - 부산항을 대상으로 -)

  • Kim, Bong-Hyun;Park, Young-Soo
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.22 no.4
    • /
    • pp.311-318
    • /
    • 2016
  • The VTS concept was located as a principal methods of maritime safety administration in world's major harbors and expected to become the pivotal role for the future of the maritime and harbor society with e-Navigation epoch. If recent limelight concept of big-data has been included in aspect of information gathering and analysis with various studies, it's required advanced studies to improve the information analysis capability and application range of the data that can be mining by the VTS. In this study, contrast to other studies that aimed quantitative analysis as communication number, it can be mining the time information and each of the communication VTS for the target vessel, including qualitative analysis, such as the purpose or the type of communication. This comparison across multiple items of the collected information, and presenting the VTS data mining model (VCDF) that can be analyzed for the purpose of analyzing way, type and number of communication by ship's type, also number of violations through VTS communication. First, In Busan port case, it shows frequently information service and shows frequently communicating with particular types of vessels. Second, Passive VTS carried out notwithstanding many kinds of traffic violations due to communication congestion. This arranged information can be used as data for the analysis, as possible the level of traffic for VTSO situational awareness, which pointed to the 'workloads' in 'IALA Guideline' and could be used as a database for future research of e-Navigation.

Design and Implementation of a Real-Time Lipreading System Using PCA & HMM (PCA와 HMM을 이용한 실시간 립리딩 시스템의 설계 및 구현)

  • Lee chi-geun;Lee eun-suk;Jung sung-tae;Lee sang-seol
    • Journal of Korea Multimedia Society
    • /
    • v.7 no.11
    • /
    • pp.1597-1609
    • /
    • 2004
  • A lot of lipreading system has been proposed to compensate the rate of speech recognition dropped in a noisy environment. Previous lipreading systems work on some specific conditions such as artificial lighting and predefined background color. In this paper, we propose a real-time lipreading system which allows the motion of a speaker and relaxes the restriction on the condition for color and lighting. The proposed system extracts face and lip region from input video sequence captured with a common PC camera and essential visual information in real-time. It recognizes utterance words by using the visual information in real-time. It uses the hue histogram model to extract face and lip region. It uses mean shift algorithm to track the face of a moving speaker. It uses PCA(Principal Component Analysis) to extract the visual information for learning and testing. Also, it uses HMM(Hidden Markov Model) as a recognition algorithm. The experimental results show that our system could get the recognition rate of 90% in case of speaker dependent lipreading and increase the rate of speech recognition up to 40~85% according to the noise level when it is combined with audio speech recognition.

  • PDF

A METHOD OF CAPABILITY EVALUATION FOR KOREAN PADDY SOILS -Part 2. The rice yield prediction by soil fertility constituents and other characters (한국(韓國) 답토양(畓土壤)의 생산력(生産力) 평가방법에 관한 연구 -2 보(報)·비옥도(肥沃度) 구성인자(構成因子) 및 기타(其他) 특성(特性)에 의(依)한 쌀수확량(收穫量)의 추정(推定))

  • Hong, Ki-Chang;Maeng, Do-Won;Kazutake, Kyuma;Hisao, Furukawa;Suh, Yoon-Soo
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.12 no.1
    • /
    • pp.15-23
    • /
    • 1979
  • In the first paper of the series the five soil fertility factors were evaluated by means of principal component analysis and varimax method. They are interpreted as representing, 1) skeletal available phosporus status, 2) organnic matter status, 3) salt status 4) base status, and 5) free oxide status. In order to resynthesize such fragmented information for the overall soil fertility evaluation, the method of multiple regression analysis was adopted, using the five factor scores and yield data for Korean paddy soils as independent and dependent variables respectively. As test of linear models with different combinations of independent variables the results of t-test of regression coefficient were revealed that the organic matter status (FII) has no relevance to the yield of paddy and that the free oxides and salt supply has by it self only an insignificant contribution to the yield. The multiple correlation coefficient (R) revealed its multiple regression analysis was as low as 0.43. Introduction of quadratic terms to the linear model bettered the result. Thus multiple correlation coefficient (R) was increased as 0.59. Therefore, a coefficient of determination 0.35 was obtained by a quadratic model with interaction terms among the five fertility constituents. Generally we think that the fertility factor has more contribution to raise the rice yield in paddy and that the failure of yield prediction by fertility factor scores was caused by one of follows; 1) the roughness of the yield inspection, and 2) missextraction of fertility constituents. The second step in this study, assuming that the residuals by multiple regression analysis were due to factors other than soil fertility, we can now proceed to predicting the yield from the field characters with the classified fertility groups by means of Hayashi's theory of quantification No. 1. Such variables as fertility groups (FTYG), water availability (WATER), soil drainage (DRNG), climatic zone (CLIZ), surface soil's stickiness (STCKT), surface soil's dry consistence (DCNST), and surface soil's texture (FTEXT) are taken up as the explanatory variables. The quantification appears reasonable; the well to extremely well in soil drainage, very sticky of surface soil, inefficiency in water availability, coarse texture, and very hard to extremely hard dry consistence in soil are detrimental to the rice yield. The R was as high as 0.90 for the set of variables. But the given explanatory variables in this study were not quite effective in explaining rice yield. The method developed seems to be promising only if properly collected data are available. Conditions that should be satisfied in the yield inspection obtained from common cultivator for the purpose of deriving a prediction equation were put forward.

  • PDF