• Title/Summary/Keyword: 합성 알고리즘

Search Result 1,057, Processing Time 0.035 seconds

A DB Pruning Method in a Large Corpus-Based TTS with Multiple Candidate Speech Segments (대용량 복수후보 TTS 방식에서 합성용 DB의 감량 방법)

  • Lee, Jung-Chul;Kang, Tae-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.6
    • /
    • pp.572-577
    • /
    • 2009
  • Large corpus-based concatenating Text-to-Speech (TTS) systems can generate natural synthetic speech without additional signal processing. To prune the redundant speech segments in a large speech segment DB, we can utilize a decision-tree based triphone clustering algorithm widely used in speech recognition area. But, the conventional methods have problems in representing the acoustic transitional characteristics of the phones and in applying context questions with hierarchic priority. In this paper, we propose a new clustering algorithm to downsize the speech DB. Firstly, three 13th order MFCC vectors from first, medial, and final frame of a phone are combined into a 39 dimensional vector to represent the transitional characteristics of a phone. And then the hierarchically grouped three question sets are used to construct the triphone trees. For the performance test, we used DTW algorithm to calculate the acoustic similarity between the target triphone and the triphone from the tree search result. Experimental results show that the proposed method can reduce the size of speech DB by 23% and select better phones with higher acoustic similarity. Therefore the proposed method can be applied to make a small sized TTS.

Derivation of Inherent Optical Properties Based on Deep Neural Network (심층신경망 기반의 해수 고유광특성 도출)

  • Hyeong-Tak Lee;Hey-Min Choi;Min-Kyu Kim;Suk Yoon;Kwang-Seok Kim;Jeong-Eon Moon;Hee-Jeong Han;Young-Je Park
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_1
    • /
    • pp.695-713
    • /
    • 2023
  • In coastal waters, phytoplankton,suspended particulate matter, and dissolved organic matter intricately and nonlinearly alter the reflectivity of seawater. Neural network technology, which has been rapidly advancing recently, offers the advantage of effectively representing complex nonlinear relationships. In previous studies, a three-stage neural network was constructed to extract the inherent optical properties of each component. However, this study proposes an algorithm that directly employs a deep neural network. The dataset used in this study consists of synthetic data provided by the International Ocean Color Coordination Group, with the input data comprising above-surface remote-sensing reflectance at nine different wavelengths. We derived inherent optical properties using this dataset based on a deep neural network. To evaluate performance, we compared it with a quasi-analytical algorithm and analyzed the impact of log transformation on the performance of the deep neural network algorithm in relation to data distribution. As a result, we found that the deep neural network algorithm accurately estimated the inherent optical properties except for the absorption coefficient of suspended particulate matter (R2 greater than or equal to 0.9) and successfully separated the sum of the absorption coefficient of suspended particulate matter and dissolved organic matter into the absorption coefficient of suspended particulate matter and dissolved organic matter, respectively. We also observed that the algorithm, when directly applied without log transformation of the data, showed little difference in performance. To effectively apply the findings of this study to ocean color data processing, further research is needed to perform learning using field data and additional datasets from various marine regions, compare and analyze empirical and semi-analytical methods, and appropriately assess the strengths and weaknesses of each algorithm.

A VLSI Efficient Design and Implementation of EBCOT for JPEG2000 (JPEG2000을 위한 효율적인 EBCOT의 VLSI 설계 및 구현)

  • Yang, Sang-Hoon;Yoo, Hyuck-Min;Park, Dong-Sun;Yoon, Sook
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.3
    • /
    • pp.37-43
    • /
    • 2009
  • The new still image compression standard JPEG2000 is consisted of DWT and EBCOT. In this paper, proposed and designed new algorithm in efficient EBCOT. BPC based on the contort. Proposed BPC Algorithm is forecasted coding pass using Sigstage, column, mpass value. BAC design apply 4-pipeline stage. EBCOT designed using Verilog HDL. Verification and Synthesis using Xillinx FPGA technology.

Hardware Implementation of Integer Transform and Quantization for H.264 (하드웨어 기반의 H.264 정수 변환 및 양자화 구현)

  • 임영훈;정용진
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.12C
    • /
    • pp.1182-1191
    • /
    • 2003
  • In this paper, we propose a new hardware architecture for integer transform, quantizer, inverse quantizer, and inverse integer transform of a new video coding standard H.264/JVT. We describe the algorithm and derive hardware architecture emphasizing the importance of area for low cost and low power consumption. The proposed architecture has been verified by PCI-interfaced emulation board using APEX-II Alters FPGA and also by ASIC synthesis using Samsung 0.18 um CMOS cell library. The ASIC synthesis result shows that the proposed hardware can operate at 100 MHz, processing more than 1,300 QCIF video frames per second. The hardware is going to be used as a core module when implementing a complete H.264 video encoder/decoder ASIC for real-time multimedia application.

A Mixed Reality Based Interface for Planing Layouts (공간 배치를 위한 혼합현실 기반의 인터페이스)

  • Kang, Hyun;Lee, Gun A.;Son, Wook-Ho
    • Journal of the HCI Society of Korea
    • /
    • v.2 no.2
    • /
    • pp.45-51
    • /
    • 2007
  • Space planning is one of the popular applications of VR technology including interior design, architecture design, and factory layout. In order to provide easier methods to accommodate physical objects into virtual space planning task, we suggest applying mixed reality (MR) interface. We describe our hardware and software of our MR system designed according to requirements of the application domain. In brief, our system hardware consists of a video see-through display with a touch screen interface, mounted on a mobile platform, and we use screen space 3D manipulations to arrange virtual objects within the MR scene. Investigating the interface with our prototype implementation, we are convinced that our system will help users to design spaces in more easy and effective way.

  • PDF

Improvement of front-back sound localization characteristics in headphone-based 3D sound generation (헤드폰 기반의 입체음향 생성에서 앞/뒤 음상정위 특성 개선)

  • 김경훈;김시호;배건성;최송인;박만호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.8C
    • /
    • pp.1142-1148
    • /
    • 2004
  • A binaural filtering method using HRTF DB is generally used to make the headphone-based 3D sound. But it can make some confusion between front and back directions or between up and down directions due to the non-individual HRTF depending on each listener. To reduce the confusion of sound image localization, we propose a new method to boost the spectral cue by modifying HRTF spectra with spectrum difference between front and back directions. Informal listening tests show that the proposed method improves the front-back sound localization characteristics much better than the conventional methods

Development of Design Program for CFFT Structure (CFFT구조의 설계프로그램 개발)

  • Choi, Young-Min;Hwang, Yoon-Koog;Lee, Young-Ho;Lee, Jung-Howan;Kim, Dong-Chan
    • Proceedings of the Computational Structural Engineering Institute Conference
    • /
    • 2009.04a
    • /
    • pp.549-552
    • /
    • 2009
  • 본 논문에서는 재료적 성능이 우수하며 경량재료로서 최근 건설구조물에 활용하고자 하는 연구가 활발히 진행되고 있는 섬유보강재료(FRP : Fiber Reinforced Polymer)를 이용한 합성구조인 CFFT(Concrete Filled FRP Tube)의 설계프로그램을 개발하여 제안하고자 하였다. 먼저, CFFT구조는 FRP관에 의해 철근콘크리트가 구속되는 구조로서 기둥과 같이 축력이 도입되는 경우 포아송효과에 의한 변형을 FRP관이 구속효과를 줌으로써 콘크리트의 역학적 거동을 개선하게 되는데 본 연구에서는 실험에 의해 검증된 식을 제시하였으며 이를 바탕으로 CFFT구조를 설계하는 알고리즘을 제안하였다. 또한 CFFT구조는 FRP관의 구속으로 인해 고강도콘크리트와 긴장재의 도입이 가능한 구조로서 이에 대한 설계도 포함하였다. 그러나 이방성재료인 FRP의 설계와 동시에 FRP관에 의한 구속효과를 고려하는 CFFT구조의 설계는 일반 실무설계자들에게는 다소 난해한 작업으로써 전산화 설계프로그램의 필요성이 대두되어 본 연구에서 CFFT구조의 설계프로그램을 개발하였다. 개발된 설계프로그램의 검증을 위해 일반 철근콘크리트기둥, CFFT기둥, 고강도콘크리트와 PS긴장재를 도입한 CFFT기둥을 설계한 결과, 매우 실용적이며 타당한 설계가 수행될 수 있음을 확인하였다.

  • PDF

An Implementation of HBC System for Capsule Endoscope (캡슐내시경을 위한 HBC시스템 구현)

  • Kim, Ki-Yun
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.18 no.3
    • /
    • pp.215-221
    • /
    • 2018
  • In this paper, a comprehensive design of HBC(Human Body Communication) system for capsule endoscope is presented. First, we propose a method of combining the signals received from multiple patches attached to the body of patient through differential operation and derive the signal SNR mathematically. To synchronize HBC transmission signal sent from capsule, we analyzed coarse timing synchronization method using PN code and fine timing synchronization performance among Manchester, NRZ and RZ modulation method using ZCD(Zero Crossing Detector). In addition, we evaluated the equalization performance of HBC signal frame in Rician and Rayleigh channel environments by applying LMS and RLS algorithm.

Design of Bit Selectable and Bi-directional Interface Device using Interrupt Generator (인터럽트 발생기를 사용한 접속 비트 전환식 양방향 접속장치의 설계)

  • Lim, Tae-Young;Yi, Cheon-Hee
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.36C no.7
    • /
    • pp.17-26
    • /
    • 1999
  • In this paper, Bit selectable and Bi-directional Interface Device is described, which can communicate data with the peripheral devices. Specially, an algorithm of truth-table comparison that synthesizes the pulse-type sequential circuit pulse has been proposed to design the Interrupt Generator, and implemented in designing the Interrupt Register. Also, a description of the asynchronous design method is given to remove the clock skew phenomenon, and the output asynchronous control method which finds the optimal clock and controls all the enable signal of the output pins at the same time is presented. Using this technique interface ports have delay time of less-than 0.7ns.

  • PDF

Multi-Stage Adaptive Noise Cancellation Technique for Synthetic $Hard-{\alpha}$ Inclusion (합성 $Hard-{\alpha}$ Inclusion의 다단계 적응형 노이즈 제거기법 연구)

  • Kim, Jae-Joon
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.23 no.5
    • /
    • pp.455-463
    • /
    • 2003
  • Adaptive noise cancellation techniques are ideally suitable for reducing spatially varying noise due to the grain structure of material in ultrasonic nondestructive evaluation. Grain noises have an un-correlation property, while flaw echoes are correlated. Thus, adaptive filtering algorithms use the correlation properties of signals to enhance the signal-to-noise ratio (SNR) of the output signal. In this paper, a multi-stage adaptive noise cancellation (MANC) method using adaptive least mean square error (LMSE) filter for enhancing flaw detection in ultrasonic signals is proposed.