• Title/Summary/Keyword: Voice enhancement


Active Control of Isolation Table Using $H_\infty$ Control ($H_\infty$ 제어를 이용한 방진대의 능동제어)

  • Kim, Kyu-Young;Yang, Hyun-seok;Park, Young-Pil
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.20 no.10
    • /
    • pp.3079-3094
    • /
    • 1996
  • High-precision vibration attenuation technology has recently become essential to the successful development of highly integrated, ultra-precision industries, and it is expected to continue playing a key role in the enhancement of manufacturing technology. Vibration isolation systems using an air spring are widely employed owing to their excellent isolation characteristics over a wide frequency range. They have, however, drawbacks such as low stiffness and low damping: they are easily excited by exogenous disturbances, and the resulting vibration of the table persists for a long time. Consequently, active vibration control becomes necessary for an air-spring vibration isolation system. Furthermore, for an air-spring isolation table to be employed successfully in a variety of manufacturing sites, it should have guaranteed robust performance not only against exogenous disturbances but also against uncertainties due to the various equipment that might be placed on the table. In this study, an active vibration suppression control system is designed using $H_\infty$ theory, and experiments are performed to verify its robust performance. An air-spring vibration isolation table with voice-coil motors as its actuators is designed and built. The table is modeled as a 3-degree-of-freedom system. An active control system is designed based on $H_\infty$ control theory using frequency-shaped weighting functions. Its performance and frequency response properties are analyzed through numerical simulations. The robustness of $H_\infty$ control to disturbances and model uncertainties is experimentally verified through (i) the transient response to impact excitation of the table, (ii) the steady-state response to harmonic excitation, and (iii) the response to a change in the mass of the table itself. An LQG controller is also designed, and its performance is compared with that of the $H_\infty$ controller.

Design of a Three Dimensional Audio System for Multicast Conferencing (멀티캐스트 화상회의를 위한 3-D 음향시스템 설계)

  • 김영오;고대식
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.25 no.1B
    • /
    • pp.71-76
    • /
    • 2000
  • In a multimedia teleconferencing system with a number of participants, the faces of the participants can be perceived through the visual image. However, it is very hard to differentiate each participant's voice and to convey a sense of spaciousness, since the voices of all participants are processed as one-dimensional data. In this paper, we implemented a three-dimensional audio rendering system using the HRTF (Head Related Transfer Function) and a distance sense reproduction method, and determined the optimal location of the participants for a teleconferencing system. In the results of a listening test using elevation and azimuth angles, we showed that directional perception of the azimuth angles was better than that of the elevation angles. In particular, we showed that locating participants using the HRTFs of the azimuth angles 10°, 90°, 270°, and 350° was efficient in a teleconferencing system with four participants. We also proposed using a distance cue to enhance realism and to locate more than five participants.
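The HRTF-based rendering described in this abstract can be illustrated with a minimal sketch. It assumes the time-domain form of the HRTF (a head-related impulse response, HRIR) for each ear and an inverse-distance gain as the distance cue. The function names, the toy impulse responses in the test, and the 1/r gain law are illustrative assumptions, not the authors' implementation.

```python
def convolve(signal, kernel):
    """Full linear convolution of two finite sequences."""
    if not signal or not kernel:
        return []
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def render_binaural(mono, hrir_left, hrir_right, distance=1.0):
    """Place a mono voice at the direction encoded by an HRIR pair.

    hrir_left / hrir_right are assumed to be measured impulse responses
    for one azimuth and elevation (hypothetical inputs here).
    The distance cue is a simple 1/r gain, clamped so that sources
    closer than the reference distance of 1.0 are not amplified.
    """
    gain = 1.0 / max(distance, 1.0)
    left = [gain * v for v in convolve(mono, hrir_left)]
    right = [gain * v for v in convolve(mono, hrir_right)]
    return left, right
```

In a conference with four participants, each voice would be rendered with the HRIR pair for its assigned azimuth, and the resulting left and right channels summed across participants.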


Enhancement of Authentication Performance based on Multimodal Biometrics for Android Platform (안드로이드 환경의 다중생체인식 기술을 응용한 인증 성능 개선 연구)

  • Choi, Sungpil;Jeong, Kanghun;Moon, Hyeonjoon
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.3
    • /
    • pp.302-308
    • /
    • 2013
  • In this research, we explore a personal authentication system based on multimodal biometrics for the mobile computing environment. We selected face and speaker recognition to implement the multimodal biometrics system. For face recognition, we detect the face with the Modified Census Transform (MCT). The detected face is pre-processed by an eye detection module based on the k-means algorithm, and the face is then recognized with the Principal Component Analysis (PCA) algorithm. For speaker recognition, we extract features using the end-point of the voice and the Mel Frequency Cepstral Coefficient (MFCC), and then verify the speaker with the Dynamic Time Warping (DTW) algorithm. The proposed multimodal biometrics system shows an improved verification rate by combining the two biometrics described above. We implemented the proposed system on the Android environment using a Galaxy S Hoppin. The proposed system achieves a reduced false acceptance ratio (FAR) of 1.8%, an improvement over single-biometric systems using the face and the voice (4.6% and 6.7%, respectively).
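The speaker verification step rests on Dynamic Time Warping, which can be sketched as follows. This is the textbook DTW recursion with a Euclidean frame distance, applied to sequences of feature vectors such as MFCC frames. The function name and the choice of local distance are assumptions for illustration; the abstract does not specify the authors' variant.

```python
import math


def dtw_distance(a, b):
    """Textbook DTW cost between two sequences of feature vectors.

    a and b are lists of equal-dimension tuples (for example MFCC frames).
    The local distance is Euclidean, which is an assumption here.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # a advances alone
                                 cost[i][j - 1],      # b advances alone
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]
```

A verifier would typically accept a claimed identity when the DTW cost between the test utterance and the enrolled template falls below a threshold; raising that threshold trades a lower false rejection rate for a higher FAR.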

Speech Enhancement Based on Modified IMCRA Using Spectral Minima Tracking with Weighted Subband Selection (서브밴드 가중치를 적용한 스펙트럼 최소값 추적을 이용하는 수정된 IMCRA 기반의 음성 향상 기법)

  • Park, Yun-Sik;Park, Gyu-Seok;Lee, Sang-Min
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.49 no.3
    • /
    • pp.89-97
    • /
    • 2012
  • In this paper, we propose a novel approach to noise power estimation for speech enhancement in noisy environments. The widely used IMCRA (improved minima controlled recursive averaging) method employs a rough VAD (voice activity detection) algorithm that excludes speech components during speech periods. This improves noise power estimation by reducing the speech distortion caused by the conventional algorithm, which relies on the minimum power spectrum derived from the noisy speech. However, since the VAD algorithm cannot adequately distinguish speech from noise under non-stationary noise and at low SNRs (signal-to-noise ratios), the speech distortion resulting from minimum tracking during speech periods still remains. In the proposed method, the minimum power estimate obtained by IMCRA is modified by SMT (spectral minima tracking) to reduce the speech distortion caused by the bias of the estimated minimum power. In addition, to estimate the minimum power effectively while accounting for the distribution characteristics of the speech and noise spectra, the method combines the minimum estimates provided by IMCRA and SMT according to a subband-based weighting factor. The proposed algorithm is evaluated with subjective and objective quality tests under various environments and yields better results than the conventional method.

New Speech Enhancement Method using Psychoacoustic Criteria (심리 음향 기준을 이용한 새로운 음질 개선 방법)

  • 김대경;박장식;손경식
    • Journal of Korea Multimedia Society
    • /
    • v.4 no.1
    • /
    • pp.56-66
    • /
    • 2001
  • A spectral subtraction algorithm using a criterion based on human perception has recently been developed. Speech processed with Virag's algorithm sounds more pleasant to a human listener than speech obtained by the classical methods. However, Virag's algorithm requires a robust voice activity detector (VAD). In the ESS (extended spectral subtraction) algorithm, which works without a VAD, the residual noise becomes more noticeable as the SNR decreases. In this paper, we propose a new speech enhancement method that combines a Wiener filter with spectral subtraction based on the noise masking characteristics of the human auditory system. No VAD is needed, because the noise estimate can be updated continually, even during speech activity, using the Wiener filter. Adjusting the subtraction parameter according to the masking threshold makes the residual noise inaudible. The proposed method has been compared with conventional spectral subtraction algorithms. Objective and subjective evaluation of the proposed system is performed with several noise types having different time-frequency distributions. Objective measures, the study of speech spectrograms, and subjective listening tests confirm that speech enhanced with the proposed algorithm is more pleasant to a human listener.
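For orientation, the conventional spectral subtraction that this abstract builds on can be sketched per frequency bin. The sketch below is the generic over-subtraction form with a spectral floor, not the combined Wiener and masking-threshold method proposed in the paper. The function name and the fixed values of `alpha` and `beta` are assumptions; perceptually motivated methods adapt such parameters from the masking threshold instead of fixing them.

```python
def spectral_subtract(noisy_power, noise_power, alpha=2.0, beta=0.01):
    """Power spectral subtraction with over-subtraction and a spectral floor.

    noisy_power: power spectrum of one noisy frame, one value per bin
    noise_power: noise power estimate for the same bins
    alpha: over-subtraction factor (assumed fixed in this sketch)
    beta: spectral floor factor that limits musical noise (assumed fixed)
    """
    enhanced = []
    for y, n in zip(noisy_power, noise_power):
        subtracted = y - alpha * n
        # Never drop below a small fraction of the noise estimate.
        enhanced.append(max(subtracted, beta * n))
    return enhanced
```

A larger `alpha` removes more residual noise at the cost of more speech distortion, which is why tying it to audibility, as the abstract describes, is attractive.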


Collaborative Governance in Philippine Science and Technology Parks: A closer look at the UP - Ayala Land Technohub

  • Sale, Jonathan P.
    • World Technopolis Review
    • /
    • v.4 no.1
    • /
    • pp.23-32
    • /
    • 2015
  • Public-private partnerships (PPPs) are very popular governance practices, as they enable the private partner to engage in business and earn profits while the public partner improves the provision of public services. PPPs are organizational arrangements with a sector-crossing or sector-blurring nature, and are modes of governance: governance by partnerships or collaborative governance (Schuppert 2011). New models and applications of PPPs have been developed over time. Collaborative governance entails information exchange, action or movement harmonization, resource sharing, and capacity enhancement among the partners (Sale 2011; 2012a). As the national university, the University of the Philippines (UP) serves as a research university in various fields of expertise and specialization by conducting basic and applied research and development, promoting research in various colleges and universities, and contributing to the dissemination and application of knowledge, among other purposes (Republic Act 9500). It is the site of two (2) science and technology parks (Sale 2012b), one of which is the UP - Ayala Land Technohub. A collaboration between industry and the academe, the Technohub is envisioned as an integrated community of science and technology companies building a dynamic learning and entrepreneurial laboratory (UP-AyalaLand Technohub). This paper takes a closer look at the UP - Ayala Land Technohub as an example of a PPP or collaborative governance in science and technology parks. Have information exchange, action or movement harmonization, resource sharing, and capacity enhancement taken place in the Technohub? What are some significant outcomes of, and issues arising from, the PPP? What assessment indicators may be used? Is a governance instrument needed?

Speech Enhancement based on Minima Controlled Recursive Averaging Technique Incorporating Second-order Conditional Maximum a posteriori Criterion (2차 조건 사후 최대 확률 기반 최소값 제어 재귀평균기법을 이용한 음성향상)

  • Kum, Jong-Mo;Chang, Joon-Hyuk
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.4
    • /
    • pp.132-138
    • /
    • 2009
  • In this paper, we propose a novel approach to improve the performance of minima controlled recursive averaging (MCRA) which is based on the second-order conditional maximum a posteriori (CMAP). From an investigation of the MCRA scheme, it is discovered that the MCRA method cannot take full consideration of the inter-frame correlation of voice activity since the noise power estimate is adjusted by the speech presence probability depending on an observation of the current frame. To avoid this phenomenon, the proposed MCRA approach incorporates the second-order CMAP criterion in which the noise power estimate is obtained using the speech presence probability conditioned on both the current observation and the speech activity decisions in the previous two frames. Experimental results show that the proposed MCRA technique based on second-order conditional MAP yields better results compared to the conventional MCRA method.
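The baseline that this abstract modifies can be sketched as the standard MCRA noise update, in which the smoothing factor for each frequency bin is pushed toward 1 as the speech presence probability rises, so that the noise estimate is frozen during speech. This is a minimal sketch of that conventional recursion only. The second-order CMAP conditioning on previous frames is not reproduced, and the function name and the default `alpha_d` are assumptions.

```python
def mcra_noise_update(noise_est, noisy_power, speech_prob, alpha_d=0.95):
    """One recursive-averaging step for a single frequency bin.

    noise_est: current noise power estimate
    noisy_power: power of the noisy observation in the current frame
    speech_prob: speech presence probability in [0, 1], supplied by the caller
    alpha_d: base smoothing factor (assumed value)
    """
    # Time-varying smoothing factor: equals 1 when speech is surely present.
    alpha_tilde = alpha_d + (1.0 - alpha_d) * speech_prob
    return alpha_tilde * noise_est + (1.0 - alpha_tilde) * noisy_power
```

Because `speech_prob` here depends only on the current frame, this baseline ignores inter-frame correlation of voice activity, which is the limitation the abstract sets out to address.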

A Preprocessing Approach to Improving the Quality of the Music Produced by the EVRC (EVRC 코덱으로 재생하는 음악의 품질을 개선하기 위한 전처리 기법)

  • 남영한;하태균;전윤호;김재수;박섭형
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.5C
    • /
    • pp.476-485
    • /
    • 2003
  • This paper proposes a preprocessing approach to improving the quality of the music produced by the EVRC (enhanced variable rate codec), which is one of the CDMA (Code Division Multiple Access) voice codecs. Since the EVRC is optimized only for speech signals, it can deteriorate the quality of music passed through it. One of the problems with EVRC-coded music is time-clipping, which usually occurs when successive frames are encoded at Rate 1/8. Since the EVRC determines the bit rate for an input frame based on the long-term prediction gain, we increase the long-term prediction gain so that most of the frames are encoded at Rate 1 or Rate 1/2. Experimental results show that the approach works well on music signals and that the number of time-clipped frames is considerably reduced.

A Study on Intelligent Mobility Enhancement System for the Mobility Handicapped (첨단 교통약자 보호시스템에 대한 연구)

  • Han, Woong-Gu;Shin, Kang-Won;Choi, Kee-Choo;Kim, Nam-Sun;Sohn, Sang-Hyun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.9 no.5
    • /
    • pp.25-37
    • /
    • 2010
  • This study aims to enhance the mobility rights of the transportation disadvantaged, who have received relatively little attention compared with the general public. To this end, we propose building an ITS (Intelligent Traffic System) and improving user satisfaction through test operation of its main system. The existing sound signal device for the visually handicapped is difficult to manage, because the staff in charge have to visit each site in person to maintain and repair it whenever it is out of order. Moreover, it cannot provide continuous voice guidance, and visually handicapped users have to operate it themselves, either by locating and pressing a button installed on the traffic light pole or by using a remote controller. To address these inconveniences, we developed a new type of sound signal device for the visually handicapped that applies cutting-edge wireless technology, designed on ergonomic principles and with actual road conditions in mind. The technology allows the device to report the status of the signal device and the traffic light automatically through its voice guidance system whenever a user approaches it. We introduced the device in a couple of test areas and found that users recognized the traffic situation more conveniently and safely than with the existing sound signal device, with above-average satisfaction. In addition, we provided an LTS (Location Tracking System, a location-based service intended for elementary school students) using the existing wireless infrastructure and found that about 87% of the students' parents were satisfied with the LTS-based service.

Privacy Protection and RFID(Radio Frequency IDentification) (RFID와 프라이버시 보호)

  • Lee, Cheol-Ho
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2006.11a
    • /
    • pp.443-446
    • /
    • 2006
  • RFID is at the core of realizing a ubiquitous computing environment. It is expected to bring economic benefits through the revitalization of related industries, job creation, and so on, and to contribute to greater social transparency through changes in everyday life. However, unchecked use of RFID lets retailers collect unprecedented amounts of information and link it to customer information databases, so the concern that consumer privacy will be trampled is far from negligible. Although RFID systems are spreading through society, laws and institutions have not yet been put in place to protect individuals from threats to their personal information. At the same time, to protect personal information effectively in a ubiquitous computing environment, constitutional law, norms, the basic legal rights of the people, and so forth should be considered from the stage of planning and deciding on the technology system. The objective of this research is to present privacy protection with respect to RFID from the viewpoint of law.
