• Title/Summary/Keyword: pitch-matching

An Experimental Study of Comfortable Pitch and Loudness with Target Matching: Effects on Electroglottographic and Acoustic Measures

  • Choi, Seong Hee
    • Phonetics and Speech Sciences / v.4 no.4 / pp.139-146 / 2012
  • This study examined comfort levels of pitch and loudness with target matching and their effects on electroglottographic (EGG) and acoustic measures. Twelve speakers (six males and six females) were instructed to produce the sustained vowel /a/ for three seconds at a comfortable pitch and loudness level, both without any instruction and with a target-matching procedure for either a specified f0 or SPL, presented separately with visual and auditory feedback. The pitch targets were presented by stepping up and down randomly at intervals of 5 Hz, from 150 Hz to 310 Hz for females (33 frequency targets in total) and from 85 Hz to 190 Hz for males (22 frequency targets in total). The loudness targets were 65, 75, 85, and 95 dB (four intensity targets in total) for both males and females. Subjective estimates of comfort were obtained on a 10-point equal-appearing interval rating scale following each phonation. The results showed that males and females demonstrated similar trends in loudness, with greatest comfort at 75 dB, whereas pitch comfort ratings showed greater variability, with females having a wider range under target matching. At the individual level, most male and female speakers rated soft phonations as more comfortable than loud ones. Most male speakers perceived the highest comfort at pitches below the comfortable pitch they produced under natural conditions. In most female speakers, however, higher frequency ranges were perceived as more comfortable than those of the natural condition, although the comfortable pitch levels in spontaneous phonations fell within the comfort ranges determined by targeted phonations. When acoustic measures (%jitter, %shimmer, SNR) and EGG measures (CQ%) were compared between spontaneous comfortable phonations and targeted phonations produced by the same subject at similar f0 and intensity, no significant differences were observed (p>0.05). Thus, target-matching procedures may be considered a compatible, alternative method for reducing the variability of comfortable pitch and loudness levels by eliciting consistent comfortable phonations.
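The target counts stated in this abstract can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function name `target_frequencies` is hypothetical, and the only inputs taken from the abstract are the range endpoints and the 5 Hz step.

```python
def target_frequencies(low_hz, high_hz, step_hz=5):
    """Return every target from low_hz to high_hz inclusive, in step_hz increments."""
    return list(range(low_hz, high_hz + 1, step_hz))


# Ranges as stated in the abstract.
female_targets = target_frequencies(150, 310)
male_targets = target_frequencies(85, 190)

print(len(female_targets), len(male_targets))  # 33 22
```

Both counts agree with the totals reported in the abstract (33 and 22 frequency targets).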

Thai Classical Music Matching Using t-Distribution on Instantaneous Robust Algorithm for Pitch Tracking Framework

  • Boonmatham, Pheerasut;Pongpinigpinyo, Sunee;Soonklang, Tasanawan
    • Journal of Information Processing Systems / v.13 no.5 / pp.1213-1228 / 2017
  • The pitch tracking of music has been researched for several decades. Several possible improvements are available for creating a good t-distribution, using the instantaneous robust algorithm for pitch tracking framework to perfectly detect pitch. This article shows how to detect the pitch of music utilizing an improved detection method which applies a statistical method; this approach uses a pitch track, or a sequence of frequency bin numbers. This sequence is used to create an index that offers useful features for comparing similar songs. The pitch frequency spectrum is extracted using a modified instantaneous robust algorithm for pitch tracking (IRAPT) as a base combined with the statistical method. The pitch detection algorithm was implemented, and the percentage of performance matching in Thai classical music was assessed in order to test the accuracy of the algorithm. We used the longest common subsequence to compare the similarities in pitch sequence alignments in the music. The experimental results of this research show that the accuracy of retrieval of Thai classical music using the t-distribution of instantaneous robust algorithm for pitch tracking (t-IRAPT) is 99.01%, and is in the top five ranking, with the shortest query sample being five seconds long.
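The longest common subsequence (LCS) comparison mentioned in this abstract can be sketched as follows. This is a generic textbook LCS over two sequences of frequency bin numbers, not the authors' implementation; the normalization in `lcs_similarity` and the sample sequences are assumptions made for illustration.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of sequences a and b."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_similarity(query, reference):
    """Fraction of the query found, in order, in the reference (assumed normalization)."""
    if not query:
        return 0.0
    return lcs_length(query, reference) / len(query)


# Hypothetical pitch tracks expressed as frequency bin numbers.
query_bins = [12, 14, 15, 14, 12]
song_bins = [10, 12, 14, 14, 15, 14, 12, 9]
print(lcs_similarity(query_bins, song_bins))  # 1.0
```

Because LCS tolerates inserted elements, a short query can still match a longer reference track that contains extra or repeated bins.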

Tone Deafness and Implications for Music Therapy Strategies for Treatment

  • Chong, Hyun Ju
    • Journal of Music and Human Behavior / v.2 no.2 / pp.69-79 / 2005
  • This study examined the definition of tone deafness and its possible causes through a review of research findings, and considered the therapeutic application of music to the treatment of tone deafness. The review identified three kinds of tone deafness: amusia, agnosia, and asonia. Tone deafness has frequently been studied in order to verify causal factors such as gender, age, and environment. Over time, the research trend has shifted toward a neurological approach that closely examines brain activity and suggests that the brain's capacity to perceive modest pitch changes may be congenitally impaired. Physiological factors also contribute to tone deafness, for example diplacusis, a phenomenon in which a given tone is heard as different pitches by the two ears, resulting in conflicting bilateral perception of pitch. Music can be used to treat various factors that cause tone deafness. The most efficient intervention was a singing program. Pitch-matching training using operant conditioning procedures can be effective. Successive approximation or reinforcement of correct responses alone was a more efficient procedure for helping uncertain singers to sing on pitch. Progressive breathing exercises also helped pitch-matching training, in which one has to coordinate hearing and voice.

A Study on Number sounds Speaker recognition using the Pitch detection and the Fuzzified pattern (피치 검출과 퍼지화 패턴을 이용한 숫자음 화자 인식에 관한 연구)

  • 김연숙;김희주;김경재
    • Journal of the Korea Society of Computer and Information / v.8 no.3 / pp.73-79 / 2003
  • This paper proposes a speaker recognition algorithm that combines pitch detection with fuzzified pattern matching. The pitch pattern is derived from the detected pitch, and a binary spectrum is used as the speech parameter. A reference pattern is built with a fuzzy membership function in order to accommodate the time variation width for non-utterance time, and vocal tract recognition of common characters is performed using fuzzified pattern matching.

A Design of Matching Engine for a Practical Query-by-Singing/Humming System with Polyphonic Recordings

  • Lee, Seok-Pil;Yoo, Hoon;Jang, Dalwon
    • KSII Transactions on Internet and Information Systems (TIIS) / v.8 no.2 / pp.723-736 / 2014
  • This paper proposes a matching engine for a query-by-singing/humming (QbSH) system that works with polyphonic music files such as MP3 files. Because the pitch sequences extracted from polyphonic recordings may be distorted, we use chroma-scale representation, pre-processing, compensation, and asymmetric dynamic time warping to reduce the influence of the distortions. In an experiment with a 28-hour music database, our QbSH system based on a polyphonic database achieved a mean reciprocal rank (MRR) of 0.725, which is very promising in comparison with published QbSH systems based on monophonic databases. Our matching engine can also be used for QbSH systems based on a MIDI database, and that performance was verified in MIREX 2011.
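For readers unfamiliar with the MRR figure quoted in this abstract, the metric can be sketched as below. This is the standard definition of mean reciprocal rank; the function name and the sample ranks are hypothetical and are not data from the paper.

```python
def mean_reciprocal_rank(ranks):
    """Average of 1/rank over all queries.

    ranks holds the 1-based position of the correct song in each result list,
    or None when the correct song was not retrieved (contributing 0).
    """
    if not ranks:
        raise ValueError("at least one query is required")
    return sum(1.0 / r if r else 0.0 for r in ranks) / len(ranks)


# Hypothetical ranks for four hummed queries.
example_ranks = [1, 2, 4, None]
print(mean_reciprocal_rank(example_ranks))  # 0.4375
```

An MRR of 1.0 would mean the correct song is always ranked first, so a score of 0.725 indicates that the correct song typically appears at or near the top of the list.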

Effects of Coronal Thread Pitch in Scalloped Implant with 2 Different Connections on Loading Stress using 3 Dimensional Finite Element Analysis (연결부 형태가 다른 두 가지 scallop 임플란트에서 경부 나사선 피치가 응력 분포에 미치는 영향 : 삼차원적유한요소분석)

  • Choi, Kyung-Soo;Park, Seong-Hun;Lee, Jae-Hoon;Huh, Jung-Bo;Yun, Mi-Jung;Jeon, Young-Chan;Jeong, Chang-Mo
    • Journal of Dental Rehabilitation and Applied Science / v.29 no.2 / pp.111-118 / 2013
  • The purpose of the present study was to investigate the effects of thread pitch in the coronal portion of a scalloped implant with two different connections on loading stress, using three-dimensional finite element analysis. A scalloped implant with four different thread pitches (0.4 mm, 0.5 mm, 0.6 mm, and 0.7 mm) in the coronal part was modeled with two different implant-abutment connections. The platform matching connection had the same implant and abutment diameter, so that they were in flush contact at the periphery, while the platform mismatching connection had a smaller abutment diameter than the implant, so that the connection was made away from the periphery of the implant-bone interface. An occlusal load of 100 N was applied vertically and at 30 degrees obliquely to all eight models, and the maximum von Mises bone stress was identified. Loading stress was highly concentrated in cortical bone. The platform mismatching scalloped implant model with a small thread pitch (0.4 mm) consistently had the lowest maximum von Mises bone stress under vertical and oblique loads. The platform matching model had the lowest maximum von Mises bone stress with a 0.6 mm thread pitch under vertical load and with a 0.4 mm thread pitch under oblique load. The platform mismatching connection played an important role in reducing maximum von Mises bone stress. Scalloped implants with a smaller coronal thread pitch showed a trend toward reduced maximum von Mises bone stress under load.

Matching Pursuit Estimation and Quantizer Design for Sinusoidal Model-based Coder (정현파 모델 부호화기를 위한 MP(Matching Pursuit) 알고리즘과 파라미터 양자화기)

  • Ahn Yeong-Uk;Jeong Gyu-Hyeok;Kim Jong-Hak;Yang Yong-Ho;Lee In-Sung
    • The Journal of the Acoustical Society of Korea / v.24 no.7 / pp.402-409 / 2005
  • In this paper, we propose a coding method that uses a matching pursuit (MP) algorithm for a strongly periodic highband signal. We also propose an efficient quantizer for the estimated parameters: spectral magnitude and phase. Based on the error concealment principle and the sinusoidal model, the MP algorithm requires high-precision pitch period estimation. To estimate the pitch period more accurately, the refined pitch obtained from lowband speech is used, which increases the efficiency of bit allocation. The spectral magnitude parameters are quantized by a method that combines the MDCT (Modified Discrete Cosine Transform) with a multi-stage structure. The spectral phase quantizer uses the $2\pi$ modular characteristic of phases and a weighting function based on spectral magnitudes. To evaluate the efficiency of the proposed method, we applied it to an analysis-by-synthesis system. Furthermore, we suggest the possibility of scalable wideband speech codecs based on a band-split structure.
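The $2\pi$ modular characteristic of phase mentioned in this abstract means that phases differing by a multiple of $2\pi$ are equivalent, so a quantizer can measure error on the wrapped difference. The sketch below shows only that general idea; it is not the paper's quantizer, and the function names are hypothetical.

```python
import math


def wrap_phase(phase):
    """Map any phase in radians to the equivalent value in [-pi, pi)."""
    return (phase + math.pi) % (2.0 * math.pi) - math.pi


def phase_error(original, quantized):
    """Wrapped difference between two phases, ignoring multiples of 2*pi."""
    return wrap_phase(original - quantized)


# 0.1 rad and 0.1 + 2*pi rad describe the same phase, so the error is about 0.
print(abs(phase_error(0.1 + 2.0 * math.pi, 0.1)) < 1e-9)  # True
```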

RECOGNIZING SIX EMOTIONAL STATES USING SPEECH SIGNALS

  • Kang, Bong-Seok;Han, Chul-Hee;Youn, Dae-Hee;Lee, Chungyong
    • Proceedings of the Korean Society for Emotion and Sensibility Conference / 2000.04a / pp.366-369 / 2000
  • This paper examines three algorithms for recognizing a speaker's emotion from speech signals. The target emotions are happiness, sadness, anger, fear, boredom, and the neutral state. MLB (Maximum-Likelihood Bayes), NN (Nearest Neighbor), and HMM (Hidden Markov Model) algorithms are used as the pattern matching techniques. In all cases, pitch and energy are used as the features. The feature vectors for MLB and NN are composed of pitch mean, pitch standard deviation, energy mean, energy standard deviation, etc. For HMM, vectors of delta pitch with delta-delta pitch and delta energy with delta-delta energy are used. We recorded a corpus of emotional speech data and performed a subjective evaluation of the data. The subjective recognition rate was 56% and was compared with the classifiers' recognition rates. The MLB, NN, and HMM classifiers achieved recognition rates of 68.9%, 69.3%, and 89.1%, respectively, for speaker-dependent, context-independent classification.
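The nearest-neighbor (NN) setup described in this abstract can be sketched with a four-element feature vector of pitch mean, pitch standard deviation, energy mean, and energy standard deviation. Everything below is an assumption for illustration: the function names, the toy pitch and energy contours, the use of population standard deviation, and the unscaled Euclidean distance. The paper's actual feature set ("etc.") and distance measure are not specified in the abstract.

```python
import math
import statistics


def utterance_features(pitch_hz, energy):
    """Summarize one utterance as (pitch mean, pitch std, energy mean, energy std)."""
    return (
        statistics.mean(pitch_hz),
        statistics.pstdev(pitch_hz),
        statistics.mean(energy),
        statistics.pstdev(energy),
    )


def nearest_neighbor(query, labeled_examples):
    """Return the label of the training vector closest to query (Euclidean distance)."""
    closest = min(labeled_examples, key=lambda item: math.dist(query, item[0]))
    return closest[1]


# Toy training data (made-up contours, not from the paper's corpus).
training = [
    (utterance_features([220, 260, 300], [0.8, 0.9, 1.0]), "anger"),
    (utterance_features([110, 112, 114], [0.2, 0.2, 0.3]), "sadness"),
]
query = utterance_features([115, 118, 112], [0.25, 0.2, 0.3])
print(nearest_neighbor(query, training))  # sadness
```

In practice the features would need scaling before computing distances, since pitch in Hz otherwise dominates the energy terms.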

A Study on Speaker Recognition using the Peak and valley pitch detection and the Fuzzy (국부 봉우리와 골에 의한 피치 검출과 퍼지를 이용한 화자 인식에 관한 연구)

  • 김연숙;김희주;김경재
    • Journal of the Korea Institute of Information and Communication Engineering / v.8 no.1 / pp.213-219 / 2004
  • This paper proposes a speaker recognition algorithm that includes a pitch parameter based on peaks and valleys. The time-frequency hybrid method for pitch extraction is valuable in that it can improve resolution in the time domain and accuracy in the frequency domain at the same time. The method builds a reference pattern using a membership function and performs vocal tract recognition of common characters using fuzzy pattern matching, in order to accommodate the time variation width for non-linear utterances. For the proposed method, speaker recognition experiments are carried out using vowels and number sounds.

A Study on Korean, English and Japanese Speaker Recognitions Using the Peak and Valley Pitch Detection and the Fuzzy Theory (PVPF방법과 퍼지 이론을 이용한 한국어, 영어 및 일본어 화자 인식에 관한 연구)

  • Kim, Yeon-Suk
    • The Transactions of the Korea Information Processing Society / v.6 no.2 / pp.522-533 / 1999
  • This paper proposes a speaker recognition algorithm that includes both a pitch parameter and fuzzy inference. The study proposes a pitch detection method, PVPF (peak and valley pitch detection function), which compares spectra by exploiting the transform characteristics between the time and frequency domains. A reference pattern is built using a membership function, and vocal tract recognition of common characters is performed using fuzzy pattern matching in order to accommodate the time variation width for non-linear utterance time.
