• 제목/요약/키워드: pitch

검색결과 4,224건 처리시간 0.032초

시간지연 회귀 신경회로망을 이용한 피치 악센트 인식 (Automatic Recognition of Pitch Accents Using Time-Delay Recurrent Neural Network)

  • Kim, Sung-Suk;Kim, Chul;Lee, Wan-Joo
    • The Journal of the Acoustical Society of Korea
    • /
    • 제23권4E호
    • /
    • pp.112-119
    • /
    • 2004
  • This paper presents a method for the automatic recognition of pitch accents with no prior knowledge about the phonetic content of the signal (no knowledge of word or phoneme boundaries or of phoneme labels). The recognition algorithm used in this paper is a time-delay recurrent neural network (TDRNN). A TDRNN is a neural network classier with two different representations of dynamic context: delayed input nodes allow the representation of an explicit trajectory F0(t), while recurrent nodes provide long-term context information that can be used to normalize the input F0 trajectory. Performance of the TDRNN is compared to the performance of a MLP (multi-layer perceptron) and an HMM (Hidden Markov Model) on the same task. The TDRNN shows the correct recognition of $91.9{\%}\;of\;pitch\;events\;and\;91.0{\%}$ of pitch non-events, for an average accuracy of $91.5{\%}$ over both pitch events and non-events. The MLP with contextual input exhibits $85.8{\%},\;85.5{\%},\;and\;85.6{\%}$ recognition accuracy respectively, while the HMM shows the correct recognition of $36.8{\%}\;of\;pitch\;events\;and\;87.3{\%}$ of pitch non-events, for an average accuracy of $62.2{\%}$ over both pitch events and non-events. These results suggest that the TDRNN architecture is useful for the automatic recognition of pitch accents.

Preparation of pitch-coated $TiO_2$ and their photocatalytic performance

  • Chen, Ming-Liang;Oh, Won-Chun
    • 한국결정성장학회지
    • /
    • 제17권1호
    • /
    • pp.23-29
    • /
    • 2007
  • Pitch-coated anatase $TiO_2$ typed was prepared by $CCl_4$ solvent mixing method with different mixing ratios. Since the carbon layers derived from pitch on the $TiO_2$ particles were porous, the pitch-coated $TiO_2$ sample series showed a good adsorptivity and photo decomposition activity. The BET surface area for the carbon layer in the sample increases to increasing with pitch contents. The SEM results present to the characterization of porous texture on the pitch-coated $TiO_2$ sample and pitch distributions on the surfaces for all the materials used. From XRD data a weak and broad carbon peak of graphene with pristine anatase peaks were observed in the X-ray diffraction patterns for the pitch-coated $TiO_2$. The EDX spectra show the presence of C, O and S with strong Ti peaks. Most of these samples are richer in carbon and major Ti metal than any other elements. Finally, the excellent photocatalytic activity of pitch-coated $TiO_2$ with slope relationship between relative concentration of MB ($c/c_o$) and t could be attributed to the homogeneous coated pitch on the external surface by $CCl_4$ solvent method.

음성 하모닉스 스펙트럼의 피크-피팅을 이용한 피치검출에 관한 연구 (A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting)

  • 김종국;조왕래;배명진
    • 음성과학
    • /
    • 제10권2호
    • /
    • pp.85-95
    • /
    • 2003
  • In speech signal processing, it is very important to detect the pitch exactly in speech recognition, synthesis and analysis. If we exactly pitch detect in speech signal, in the analysis, we can use the pitch to obtain properly the vocal tract parameter. It can be used to easily change or to maintain the naturalness and intelligibility of quality in speech synthesis and to eliminate the personality for speaker-independence in speech recognition. In this paper, we proposed a new pitch detection algorithm. First, positive center clipping is process by using the incline of speech in order to emphasize pitch period with a glottal component of removed vocal tract characteristic in time domain. And rough formant envelope is computed through peak-fitting spectrum of original speech signal infrequence domain. Using the roughed formant envelope, obtain the smoothed formant envelope through calculate the linear interpolation. As well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. Inverse fast fourier transform (IFFT) compute this flattened harmonics. After all, we obtain Residual signal which is removed vocal tract element. The performance was compared with LPC and Cepstrum, ACF. Owing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.

  • PDF

관군 배열에서의 종간 간격이 열전달에 미치는 영향에 대한 수치 해석적 연구 (NUMERICAL ANALYSIS FOR LONGITUDINAL PITCH EFFECT ON TUBE BANK HEAT TRANSFER)

  • 이동균;안준;신승원
    • 한국전산유체공학회지
    • /
    • 제17권3호
    • /
    • pp.39-44
    • /
    • 2012
  • In this study, a longitudinal pitch effect on in-line tube bank heat transfer has been analyzed numerically. To verify the accuracy of the solver model and boundary conditions, global Nusselt number(Nu) and pressure drop across the 2 row tube bank are compared with the existing experimental correlations under 500 ~ 2,000 Reynolds number(Re) range. By changing transverse pitch($S_T$) or longitudinal pitch($S_L$) separately in tube bank, we're trying to identify the each effect on heat transfer. We found that the effect of transverse pitch can be accounted for Reynolds number evaluated with maximum velocity($V_{max}$) at the smallest flow area similar to most existing correlations. Variation of the longitudinal pitch($S_L$) has a greater impact on the heat transfer compared to the transverse pitch($S_T$). Overall Nusselt number increases with larger longitudinal pitch($S_L$), however individual Nusselt number of the tube row has significant difference after the first row.

피치각 조정형 송풍-역풍 겸용 축류팬에서 배연용 피치각 선정을 위한 실험적 연구 (An Experimental Study on Selection Pitch Angle on backward flow of an Axial Fan with Adjustable Pitch Angle Blades)

  • 장택순;허진혁;문승재;이재헌;유호선;임윤철
    • 대한설비공학회:학술대회논문집
    • /
    • 대한설비공학회 2008년도 동계학술발표대회 논문집
    • /
    • pp.145-150
    • /
    • 2008
  • In this study, the experimental study has carried out to select pitch angle on the backward flow in an axial fan that has adjustable pitch blades. With the change of pitch angle of axial fan with adjustable blade, air flow rate, pressure and air flow direction can be changed. Because of this merit, adjustable axial fan can be used in the backward flow. For the selection of the backward flow pitch angle, fan performance test method is selected by KS B 6311. Dynamic pressure, static pressure, electric current and voltage are measured in each pitch angles of axial fan that are $36^{\circ}C$, $-16^{\circ}C$, $-21^{\circ}C$, $-26^{\circ}C$, $-31^{\circ}C$ and $-36^{\circ}C$. In the result of test, fan performance curves at several pitch angle has been investigated. Finally, pitch angle of $-26^{\circ}C$ has been selected to get largest flow rate at backward flow situation.

  • PDF

음성신호의기본주파수 검출 (On a Detection for the Fundamental Frequency of Speech Signals)

  • 배명진
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994년도 제11회 음성통신 및 신호처리 워크샵 논문집 (SCAS 11권 1호)
    • /
    • pp.42-47
    • /
    • 1994
  • A pitch detector is an essential component in a variety of speech processing systems. Besides providing valuable insights into the nature of the exciation source for speech production, the pitch contour of an utterance is useful for recognizing speakers, aids-to-the handicapped, and is required in almost all speech analysis-synthesis system. Because of the importance of the pitch detection, a wide variety algorithms for pitch detection have been proposed in speech procesing literature. Thus, in this paper we discuss th evarious type of pitch detection algorithms which have been proposed until now. Then we provide th eperformance measurements for seven pitch detection algorithms.

  • PDF

피치 알고리즘 수정 및 소음에의 적용 (Modification of Pitch Algorithm and Its Application to Noise)

  • Shin, Sung-Hwan;Ih, Jeong-Guon
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2002년도 추계학술대회논문초록집
    • /
    • pp.354.1-354
    • /
    • 2002
  • Pitch is a perception related to frequency, one of the psychological aspects or attributes of tones, and an important factor to determine sound quality of sound together with loudness and timber. while a study on pitch has been actively achieved In the part of speech recognition and speech separation, that for analysis and improvement of product sound quality is not yet enough. (omitted)

  • PDF

CELP 보코더에서 델타 피치 검색 방법 개선에 대한 연구 (An Algorithm to Reduce the Pitch Computational amount using Modified Delta Searching in CELP Vocoders)

  • 주상규
    • 한국산학기술학회:학술대회논문집
    • /
    • 한국산학기술학회 2010년도 춘계학술발표논문집 1부
    • /
    • pp.269-272
    • /
    • 2010
  • In this paper, we propose the computation reduction methods of delta pitch search that is used in G.723.1 vocoder. In order to decrease the computational amount in delta pitch search the characteristic of proposed algorithms is as the following. First, scheme to reduce the computation amount in delta pitch search uses NAMDF. Developed the second scheme is the skipping technique of lags in pitch searching by using the threshold value. By doing so, we can reduce the computational amount of pitch searching more than 64% with negligible quality degradation.

  • PDF

상관관계 특성을 이용한 CELP 보코더의 피치검색시간 단축법의 비교 (On a Performance Comparison of Pitch Search Algorithms with the Correlation Properties for the CELP Vocoder)

  • 김대식
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994년도 제11회 음성통신 및 신호처리 워크샵 논문집 (SCAS 11권 1호)
    • /
    • pp.188-194
    • /
    • 1994
  • Code excited linear prediction speech coders exhibit good performance at data rates as low as 4800bps. But the major drawback to CELP type coders is their large computational requirements. Therefore, in this paper a comparative performance study of three pitch searching algorithms for the CELP vocoder was conducted. For each of the algorithms, a standard pitch searching algorithm was used by the full pitch searching algorithm that was implimented in the QCELP vocoder. The algorithms used in this study is to reduce the pitch searching time 1) using the skip table, 2) using the symmetrical property of the autocorrelation , and 3) using the preprocessing autocorrelation, 4) using the positive autocorrelation, 5) using the preliminary pitch. Performance scores are presented for each of the five pitch searching algorithms based on computation speed and on pitch prediction error.

  • PDF

한국어 운율구 기반의 피치궤적 변환의 통계적 접근 (Statistical Approaches to Convert Pitch Contour Based on Korean Prosodic Phrases)

  • Lee, Ki-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • 제23권1E호
    • /
    • pp.10-15
    • /
    • 2004
  • In performing speech conversion from a source speaker to a target speaker, it is important that the pitch contour of the source speakers utterance be converted into that of the target speaker, because pitch contour of a speech utterance plays an important role in expressing speaker's individuality and meaning of the utterance. This paper describes statistical algorithms of pitch contour conversion for Korean language. Pitch contour conversions are investigated at two 1 evels of prosodic phrases: intonational phrase and accentual phrase. The basic algorithm is a Gaussian normalization [7] in intonational phrase. The first presented algorithm is combined with a declination-line of pitch contour in an intonational phrase. The second one is Gaussian normalization within accentual phrases to compensate for local pitch variations. Experimental results show that the algorithm of Gaussian normalization within accentual phrases is significantly more accurate than the other two algorithms in intonational phrase.