• Title/Summary/Keyword: spectrogram

Search Result 239, Processing Time 0.028 seconds

Coding History Detection of Speech Signal using Deep Neural Network (심층 신경망을 이용한 음성 신호의 부호화 이력 검출)

  • Cho, Hyo-Jin;Jang, Won;Shin, Seong-Hyeon;Park, Hochong
    • Journal of Broadcast Engineering
    • /
    • v.23 no.1
    • /
    • pp.86-92
    • /
    • 2018
  • In this paper, we propose a method for coding history detection of digital speech signal. In digital speech communication and storage, the signal is encoded to reduce the number of bits. Therefore, when a speech signal waveform is given, we need to detect its coding history so that we can determine whether the signal is an original or an coded one, and if coded, determine the number of times of coding. In this paper, we propose a coding history detection method for 12.2kbps AMR codec in terms of original, single coding, and double coding. The proposed method extracts a speech-specific feature vector from the given speech, and models the feature vector using a deep neural network. We confirm that the proposed feature vector provides better performance in coding history detection than the feature vector computed from the general spectrogram.

A Study on the Effects of Speech Training for Adults Focusing on the Analysis of Voices Before and After Speech Training (성인 스피치교육 전후 효과에 관한 목소리변화스펙트로그램 비교 연구)

  • Chung, Eun-Ee;Lee, Sang-Ho
    • Journal of Digital Contents Society
    • /
    • v.18 no.6
    • /
    • pp.1049-1056
    • /
    • 2017
  • This study focused on the changes in the voices in determining the effects of speech training. This study aimed to make more visible and scientific evaluation of the changes in the voices among the substantial effects obtained from speech training. As a result, some objective differences from before the speech training could be found in the voice of every learner. Each learner showed gradual technical improvement in a variety of vocal elements, including resonance and timbre, accuracy of pronunciation, pause; that is, the voice became more powerful, more accurate pronounced, more pausing and more stable than before the speech training. This study determined if speech training could change a voice and the results are expected to help speech learners participate actively in speech training and see their speech ability improved.

Noise-Robust Anomaly Detection of Railway Point Machine using Modulation Technique (모듈레이션 기법을 이용한 잡음에 강인한 선로 전환기의 이상 상황 탐지)

  • Lee, Jonguk;Kim, A-Yong;Park, Daihee;Chung, Yongwha
    • Smart Media Journal
    • /
    • v.6 no.4
    • /
    • pp.9-16
    • /
    • 2017
  • The railway point machine is an especially important component that changes the traveling direction of a train. Failure of the point machine may cause a serious railway accident. Therefore, early detection of failures is important for the management of railway condition monitoring systems. In this paper, we propose a noise-robust anomaly detection method in railway condition monitoring systems using sound data. First, we extract feature vectors from the spectrogram image of sound signals and convert it into modulation feature to ensure robust performance, and lastly, use the support vector machine (SVM) as an early anomaly detector of railway point machines. By the experimental results, we confirmed that the proposed method could detect the anomaly conditions of railway point machines with acceptable accuracy even under noisy conditions.

Characteristics of Dairy Cow's Vocalization in Postpartum Related with Calf Isolation (출산 후 새끼와의 분리에 따른 유우의 발성음 특성)

  • Kim, Min-Jin;Son, Seung-Hun;Rhim, Shin-Jae;Chang, Moon-Baek
    • Journal of Animal Science and Technology
    • /
    • v.52 no.1
    • /
    • pp.51-56
    • /
    • 2010
  • This study was conducted to clarify the characteristics of Holstein dairy cow's vocalization in postpartum related with calf isolation. Vocalizations of 16 individuals of cows were recorded 6 hours per day (1:00am~4:00am and 1:00pm~4:00pm) using digital recorder and microphone during October 2008 and May 2009. Vocalizations were divided into 4 types. Characteristics of frequency, intensity and duration were analyzed by GLM (general linear model) and Duncan's multi-test. There were significant differences in frequency and intensity based on analyses of spectrogram and spectrum among 4 types of vocalizations. Frequencies of vocalizations were dramatically decreased on 2nd and 3rd day. Vocalization would be important factor affecting the motheryoung bond in Holstein dairy cattle.

Korean isolated word recognizer using new time alignment method of speech signal (새로운 시간축 정규화 방법을 이용한 한국어 고립단어 인식기)

  • Nam, Myeong-U;Park, Gyu-Hong;No, Seung-Yong
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.38 no.5
    • /
    • pp.567-575
    • /
    • 2001
  • This paper suggests new method to get fixed size parameter from different length of voice signals. The efficiency of speech recognizer is determined by how to compare the similarity(distance of each pattern) of the parameter from voice signal. But the variation of voice signal and the difference of speech speed make it difficult to extract the fixed size parameter from the voice signal. The method suggested in this paper is to normalize the parameter at fixed size by using the 2 dimension DCT(Discrete Cosine Transform) after representing the parameter by spectrogram. To prove validity of the suggested method, parameter extracted from 32 auditory filter-bank(it estimates auditory nerve firing probabilities) is used for the input of neural network after being processed by 2 dimension DCT. And to compare with conventional methods, we used one of conventional methods which solve time alignment problem. The result shows more efficient performance and faster recognition speed in the speaker dependent and independent isolated word recognition than conventional method.

  • PDF

Data Analysis of Inertial Sensors for Train Positioning Detection System (열차위치검지 시스템을 위한 관성센서 데이터 분석 연구)

  • Kim, Seong Jin;Park, Sungsoo;Lee, Jae-Ho;Kang, Donghoon
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.35 no.1
    • /
    • pp.18-24
    • /
    • 2015
  • Train positioning detection information is fundamental for high-speed railroad inspection, making it possible to simultaneously determine the status and evaluate the integrity of railroad equipment. This paper presents the results of measurements and an analysis of an inertial measurement unit (IMU) used as a positioning detection sensors. Acceleration and angular rate measurements from the IMU were analyzed in the amplitude and frequency domains, with a discussion on vibration and train motions. Using these results and GPS information, the positioning detection of a Korean tilting train express was performed from Naju station to Illo station on the Honam-line. The results of a synchronized analysis of sensor measurements and train motion can help in the design of a train location detection system and improve the positioning detection performance.

Design of the Noise Suppressor Using Wavelet Transform (웨이블릿 변환을 이용한 잡음제거기 설계)

  • 원호진;김종학;이인성
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.7
    • /
    • pp.37-46
    • /
    • 2001
  • This paper proposes a new noise suppression method using the Wavelet transform analysis. The noise suppressor using the Wavelet transform shows the more effective advantages in a babble noise than one using the short-time Fourier transform. We designed a new channel structure based on spectral subtraction of Wavelet transform coefficients and used the Wavelet mask pattern with more higher time resolution in high frequency. It showed a good adaptation capability for babble noise with a non-stationary property. To evaluate the performance of proposed noise canceller, the informal subjective listening tests (Mos tests) were performed in background noise environments (car noise, street noise, babble noise) of mobile communication. The proposed noise suppression algorithm showed about MOS 0.2 performance improvements than the suppression algorithm of EVRC in informal listening tests. The noise reduction by the proposed method was shown in spectrogram of speech signal.

  • PDF

Acoustic Characteristics of Korean Spoken by the Women Immigrants from Japan and Philippine (여성 결혼이민자들의 한국어 조음에 나타나는 음향음성학 특성 연구 - 일본과 필리핀 출신 여성 결혼이민자들을 대상으로)

  • Jo, Seon-Hui;Kim, Hyun-Gi;Kim, Sun-Jun
    • Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.203-217
    • /
    • 2008
  • The number of Asian women immigrants in Korea is getting bigger and it's important to note that their communication problem in Korean causes not only the difficulty of adapting to Korean society but their children's speech-language disorder. To date there is little research on their acoustics characters and articulatory errors. Therefore, this study focuses on acoustic characters and articulatory error patterns of the women immigrants from Japan and Philippine based on the theory of "contrastive analysis". The subjects were 16 Japanese women immigrants(age: 42.5$\pm$4.4) and 14 Philippine women immigrants(age: 31.64$\pm$6.7) and control group consisted of 10 Korean women(age: 28.3$\pm$1.2). Speech and hearing of all subjects and control group were within normal limits. Speech samples were analyzed in a computer using CSL and data analysis was done on FFT widow for F1, F2, F3 of vowels and on wideband spectrogram for VOT of plosives and africatives. The results of this study were like this; For Japanese women immigrants, they had different articulatory patterns of /e/, /a/, /u/, /o/, /$\varepsilon$/, /m/ from those of Koreans and showed articulatory errors on the fortis and aspirated sounds. The reason is Japanese has only two distinctive characters for plosives and affricates; voicing and voiceless. The Philippine women immigrants also showed the same error patterns as the Japanese women immigrants. Especially the errors on aspirated sounds were prominent because their mother tongue has no distinctive characters about aspirated sounds. For vowels, they showed errors of /a/, /o/, /c/.

  • PDF

Acoustic Evaluation of acupuncture therapy effects on post-stroke dysarthria (중풍으로 인한 마비성 조음장애 환자의 침술 후 말소리의 음향학적 평가 연구)

  • Moon, B.S.;Yun, J.M.;Shin, Y.I.;Kim, H.G.
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.211-212
    • /
    • 2007
  • Stroke makes several physical deficits. Dysarthria is one of the most difficult problems in conventional medicine because of the weakness of neuromotor control. The purpose of this study is to find the acoustic characteristics of acupuncture therapy effects on post-stroke dysarthria. Seven patients with stroke(infarction or hemorrhage) were selected by CT or MR imaging. The authors applied acupuncture therapy by inserting needles into 8 acupuncture points, ipsilateral ST4, ST6 and contralateral LI4, ST36 on facial palsy side, and CV23, CV24, bilateral "Sheyu" for 4 weeks. Speech sample were composed of five simple vowels /a,e,i,o,u/ and meaningless polysyllabic words CVCVC(C: stops, affricated, fricative sounds, v: /e/). .VOT, total duration of each speech samples and vowel formant (F1&F2) were analyzed on Spectrogram. The results are as follows: 1. VOT of bilabial and velar stops was decreased post treatment. The VOT of bilabial glottalized pre and post treatment were statistically significant (p < 0.05). 2. Total duration of polysyllabic words was decreased post treatment. Decrement of total duration containing the bilabial was statistically significant (p<0.05). 3. First formant of round vowel /o/ pre and post treatment was statistically significant (p<0.05).

  • PDF

The comparative Study of the Acoustic Representation between Pansori singer's and Spasmodic dysphonia patient's Voice (병적인 소리 떨림증과 소리꾼 떨림증의 음향학적인 비교연구)

  • Hong, K.H.;Kim, H.G.;Lee, J.K.;Choi, J.S.
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.143-145
    • /
    • 2007
  • Muscle groups that are located in and around the vocal tract can produce audible changes in frequency and/or intensity of the voice. Vocal vibrato is a characteristic feature in the singing of performers trained in the western classical tradition and vibrato is generally considered to result from modulation in frequency amplitude and timbre. Vocal tremor is also characterized by periodic fluctuations in the voice frequency or intensity and vocal tremor is symptom of a neurological disease as Spasmodic dysphonia , Parkinson's disease. Vocal vibrato and Vocal tremor may have many of the same origins and mechanisms in the voice production systems. The purpose of this study is to find acostic character of Korean traditional song Pansori singer's vibrato and Spasmodic dysphonia patient's vocal tremor. twelve Pansori singers and seven Spasmodic dysponia patients participated to this study. Power spectrum and Real time Spectrogram are used to analyze the acoustic characteristics of Pansori singing and Spasmodic dysphonia patient's voice The results are as follows; First, vowel formant differences between Pansori singing and Spasmodic dysphonia patient's voice are higher F1, F3. Second, The vibrato rate show differences between Pansori singing and Spasmodic dysphonia patients;$4^{\sim}6/sec$ and $5{\sim}6/sec$ Vibrato rate of pitch is 5.7 Hz ${\sim}$ 42.4 Hz for Pansori singing , 3.8 Hz ${\sim}$ 27.9 Hz for Spasmodic dysphonia patients ;Vibrato rate of intensity range is 0.07 dB ${\sim}$ 8.26 dB for Pansori singing and 0.07 dB ${\sim}$ 4.81 dB for Spasmodic dysphonia patients

  • PDF