• Title/Summary/Keyword: Vocal

Search Result 1,186, Processing Time 0.028 seconds

Investigation of Timbre-related Music Feature Learning using Separated Vocal Signals (분리된 보컬을 활용한 음색기반 음악 특성 탐색 연구)

  • Lee, Seungjin
    • Journal of Broadcast Engineering
    • /
    • v.24 no.6
    • /
    • pp.1024-1034
    • /
    • 2019
  • Preference for music is determined by a variety of factors, and identifying characteristics that reflect specific factors is important for music recommendations. In this paper, we propose a method to extract the singing voice related music features reflecting various musical characteristics by using a model learned for singer identification. The model can be trained using a music source containing a background accompaniment, but it may provide degraded singer identification performance. In order to mitigate this problem, this study performs a preliminary work to separate the background accompaniment, and creates a data set composed of separated vocals by using the proven model structure that appeared in SiSEC, Signal Separation and Evaluation Campaign. Finally, we use the separated vocals to discover the singing voice related music features that reflect the singer's voice. We compare the effects of source separation against existing methods that use music source without source separation.

The Flattening Algorithm of Speech Spectrum by Quadrature Mirror Filter (QMF에 의한 음성스펙트럼의 평탄화 알고리즘)

  • Min, So-Yeon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.7 no.5
    • /
    • pp.907-912
    • /
    • 2006
  • Pre-emphasizing the speech compensates for falloff at high frequencies. The most common form of pre-emphasis is y(n)=s(n)-A${\cdot}$s(n-1), where A typically lies between 0.9 and 1.0 in voiced signal. And, this value reflects the degree of pre-emphasis and equals R(1)/R(0) in conventional method. This paper proposes a new flattening method to compensate the weaked high frequency components that occur by vocal cord characteristic. We used QMF(Quardrature Mirror Filter) to minimize the output signal distortion. After using the QMF to compensate high frequency components, flattening process is followed by R(1)/R(0) at each frame. Experimental results show that the proposed method flattened the weaked high frequency components effectively than auto correlation method. Therefore, the flattening algorithm will apply in speech signal processing like speech recognition, speech analysis and synthesis.

  • PDF

Effects of Tonsillectomy on Oral and Nasal Spectral Outputs for Sustained Vowel (편도적출술이 구강 및 비강 음향스팩트럼에 미치는 영향)

  • Choi, Dong-Il;Kong, Il-Seung;Lee, Eun-Jung;So, Sang-Soo;Yang, Yoon-Soo;Hong, Ki-Hwan
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.18 no.1
    • /
    • pp.33-38
    • /
    • 2007
  • Background and Objectives: It has been suggested that tonsillectomy possibly causes changes of voice because the morphology of the vocal tract is altered. This may cause serious problems for professional voice users. Materials and Method: Subjects were 26 patients. The oral and nasal sound spectrum of oral vowel /a/, /e/ and /i/ were measured before and after tonsillectomy. The formant frequencies and intensities for oral and nasal spectra were compared. The nasality and fundamental frequencies for oral vowel were measured. Results: The first formant frequencies for oral spectra of all vowels were not changed after surgery, but the second formant frequencies were increased significantly after surgery in the vowel /e/ and /i/. The first and second formant intensities for oral spectra were increased significantly after surgery in the all vowels. The first and second formant frequencies for nasal spectra of all vowels were not changed after surgery, but their intensities for nasal spectra were increased after surgery. The nasalities for oral vowel were not changed after surgery. Conclusion : Tonsillectomy appeared to change the spectral features of oral and nasal components of oral vowel, especially spectral intensities.

  • PDF

A Case of Foreign Body Laryngeal Granuloma Mimicking Contact Granuloma (접촉성 육아종으로 오인된 후두 이물 육아종 1예)

  • Kim, Hye soo;Kim, Sun woo;Lee, Jin;Lee, Sang Hyuk
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.31 no.1
    • /
    • pp.27-30
    • /
    • 2020
  • Among lesions in the larynx, laryngeal contact granuloma due to persistent tissue irritation can typically be attributed to endotracheal intubation, vocal abuse, or gastro-esophageal reflux disease. Treatment typically includes voice therapy, lifestyle changes and use of anti-reflux medication. Microsurgical removal is only indicated in cases of severe dyspnea due to mass size. Foreign body granuloma is a response of to any foreign material in the tissue. Foreign body granulomas are sometimes misdiagnosed as soft tissue tumors when the causative foreign body is not initially found. Delayed treatment of these foreign bodies may cause complications. We present a case of larynx granuloma due to impacted foreign body, probably fish bone, in the larynx that mimicked contact granuloma. We initially used anti-reflux medication, but to no avail. The laryngeal mass, observed through laryngoscopy, showed no improvement and therefore necessitated a proper pathologic diagnosis. We were able to successfully treat it via trans-oral laser CO2 microsurgery before any complications developed.

Stevie Wonder's music has had on the K-POP (Stevie Wonder의 음악이 K-POP에 끼친 영향)

  • Yun, Byung-Jin;Cho, Tae-Seon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.10
    • /
    • pp.104-108
    • /
    • 2016
  • African-American music heavily influences a variety of musical genres, including folk music, jazz, rhythm and blues, soul, and funk music. The music derived from traditional African music with its unique syncopated rhythm using a five-note pentatonic scale, and has developed through significant influences from gospel music. African-American music has also evolved through historical social movements, such as eliminating the racial discrimination, and has been influenced by the personalities of different cities in the United States. Modern music features fusion with elements of Western music. One of the most influential and respected artists of the 20th century was Stevie Wonder, who was known as "Father of African-American music." He was an accomplished artist, winning numerous awards despite being disabled. He has become one of the most famous and respected artists worldwide. This study of Stevie Wonder's life, music, and spiritual strength, aims to highlight his significant achievements and contributions to pop music. This is a study based on an analysis of the work of Stevie Wonder and describes how elements of African-American music influences current Korean pop music and musicians.

A Training Method for Emotionally Robust Speech Recognition using Frequency Warping (주파수 와핑을 이용한 감정에 강인한 음성 인식 학습 방법)

  • Kim, Weon-Goo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.20 no.4
    • /
    • pp.528-533
    • /
    • 2010
  • This paper studied the training methods less affected by the emotional variation for the development of the robust speech recognition system. For this purpose, the effect of emotional variation on the speech signal and the speech recognition system were studied using speech database containing various emotions. The performance of the speech recognition system trained by using the speech signal containing no emotion is deteriorated if the test speech signal contains the emotions because of the emotional difference between the test and training data. In this study, it is observed that vocal tract length of the speaker is affected by the emotional variation and this effect is one of the reasons that makes the performance of the speech recognition system worse. In this paper, a training method that cover the speech variations is proposed to develop the emotionally robust speech recognition system. Experimental results from the isolated word recognition using HMM showed that propose method reduced the error rate of the conventional recognition system by 28.4% when emotional test data was used.

Speech Dereverberation using Improved Linear Prediction Residual (개선된 선형예측 잔여를 이용한 음성의 잔향음 제거)

  • Park, Chan-Sub;Kim, Ki-Man;Kang, Suk-Youb
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.10
    • /
    • pp.1845-1851
    • /
    • 2007
  • Background noise and room reverberation are two causes of degradation in speech in listening situations. Many algorithms developed to enhance reverberant speech. In this paper we propose a dereverberation method for enhancement of speech using modified the linear prediction(LP) residual in reverberant room condition. The proposed dereberberation method based on the fact that the signification excitation of the vocal tract system takes place at the instant of glottal closure in voiced speech. Our method used delay information form each sensor, and we need reverberant signals from 3 sensors. We obtain a new LP residual signal using modified IP residual combination which derived form weighting of the LP residual and the Hilbert transform of LP residual. The nature of the coherently added Hilbert envelop has several large amplitude spikes because of the effects of noise and reverberation. This residual of the clean speech is used to excite the time-varying all-pole filter to obtain the enhanced speech. We achieved simulation of proposed algorithm for performance analysis in reverberation environment. The proposed algorithm improves substantially the quality of reverberant speech.

Cho Yong-Pil's 50 years of Music and the Korean Popular Music History (조용필 음악 50년의 한국 대중음악사적 의의 연구)

  • Choi, Hyeon-Woo;Yang, Eun-Young
    • Journal of Convergence for Information Technology
    • /
    • v.8 no.4
    • /
    • pp.199-204
    • /
    • 2018
  • This study analyzes Cho, Yong- Pil's contribution on K-Pop history as he celebrates the 50th anniversary of his debut. The history of K-Pop, when Cho has been active in, is divided into three periods. From the late 1960s to the early 1980s was a period when trots and rocks were popular. Soft rock, heavy metal, and ballads came in from the mid-1980s to the early 2000s. From the late 2000s to the present, dance music, mostly hook songs, has dominated. In every inflection point of K-Pop history, he contributed to the development throughout various musical genres. In the first period, he introduced rock music with the unique emotion of Korea by combining the trot and the rock genre. In the second period, he contributed to the development of both rock and ballad genres. In the third period, his hook songs were released to the controversy, but it contributed to creating an atmosphere in which the vocal ability of the vocalist in idol groups was emphasized in the K-pop scene.

Reconstruction of Tracheal Defect by Sternocleidomastoid Muscle Flap Covered with Skin Graft: A Case Report (피부이식과 흉쇄유돌근 피판을 이용한 기관 결손의 재건 1례)

  • Jang, Soo Kyung;Seo, Gang Hyeon;Choi, Sun;Park, Seok Hyun;Kim, Jin Hwan;Lee, Dong Jin
    • Korean Journal of Head & Neck Oncology
    • /
    • v.37 no.1
    • /
    • pp.63-66
    • /
    • 2021
  • Supracricoid partial laryngectomy (SCPL) with cricohyoidoepiglottopexy (CHEP) or cricohyoidopexy (CHP) involves the removal of the whole thyroid cartilage, both true and false vocal cords, the ventricles, and the paraglottic spaces, sparing the cricoid cartilage, hyoid bone, and at least one functional and mobile cricoarytenoid unit. Reconstruction is performed by suturing of the cricoid cartilage up tightly to the hyoid bone, so trachea-releasing procedures are needed to prevent leakage at anastomosis site. In case of advanced tranglottic cancer invading tracheal tracheal wall, we need to perform additional circumferentrial circumferential tracheal wall resection. However, when we perform SCPL, circumferential resection of tracheal wall is limited because SCPL procedure itself needs releasing of tracheal length. We report a case of advanced transglottic cancer involving tracheal wall treated with induction chemotherapy and SCPL including tracheal wall resection with reconstruction of tracheal defect by sternocleidomastoid muscle flap covered with skin graft.

Acoustic analysis of wet voice among patients with swallowing disorders (삼킴장애 환자의 wet voice 관련 음향학적 분석)

  • Kang, Young Ae;Koo, Bon Seok;Kwon, In Sun;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.147-154
    • /
    • 2018
  • Wet voice quality (WVQ) is a characteristic that appears after swallowing. Although the concept is accepted by many clinicians worldwide, it is nevertheless ambiguous. In this study, we investigated WVQ in patients with swallowing disorders using acoustic analysis. A total of 106 patients diagnosed with penetration-aspiration by the videofluoroscopic swallowing study (VFSS) were recruited. A voice recording of vowel /a/ was conducted before and after the VFSS, and an acoustic analysis was then performed using PRAAT. Voice after VFSS was used for a perceptual judgment and divided into two groups: the Wet group (48 patients) and the Non-wet group (58 patients). At the post-VFSS stage, the two groups displayed significant differences in many acoustic parameters including F0_SD, Jitter, RAP, Shimmer, APQ, HNR, NHR, FUF, DVB, and CPP. The parameter affecting judging wetness resulted into Jitter and NHR by the logistic regression test. At the pre-VFSS stage, the two groups differed significantly in many acoustic parameters including Intensity, Jitter, RAP, Shimmer, NHR, FUF, DVB, and CPP. Both pre-and post-VFSS, the mean values of all significant parameters, except Intensity, HNR, and CPP, were higher in the Wet group. According to pre-and post-VFSS, the two groups displayed interactions in many parameters (Intensity, F0_SD, Jitter, RAP, Shimmer, APQ, HNR, NHR, FUF, DVB, and CPP). In particular, Intensity increased in both groups after the VFSS, although the increase in the Non-wet group was greater. Based on these results, it was conjectured that the WVQ after swallowing resulted from the secretion effect of the mucous membrane due to the dry laryngeal characteristic of elderly patients, rather than aspiration resulting in food on the vocal cords.