• Title/Summary/Keyword: vocal process

Search Result 75, Processing Time 0.021 seconds

Application of Standardized North American Marsh Bird Monitoring Protocols to Survey Inconspicuous Marsh Birds in Korea (은둔형 습지 조류의 효과적인 조사 방법 탐색을 위한 국외 프로토콜의 시범 적용)

  • Lee, Sang-Yeon;Sung, Ha-Cheol
    • Korean Journal of Ecology and Environment
    • /
    • v.52 no.2
    • /
    • pp.143-150
    • /
    • 2019
  • Although inconspicuous marsh birds are an indicator of marsh health, there is little understanding of their status and population trends due to their behavioral characteristics and lack of reliable survey methods in Korea. We applied the Standardized North American Marsh Bird Monitoring Protocols(SNAMBMP) already validated in North America for effective survey of the marsh birds. We selected 29 sites with emergent marshes, rice fields and riparian forests in Seocheon-gun, Buyeo-gun and Gunsan-si. We conducted the survey with a combination of passive 5 minute point-count and vocal survey method (30 seconds call-broadcasting+30 seconds silence) that was targeted eight species 2~7 times/site from March to July 2017. Four species, Brown-cheeked Rail(Rallus indicus), Ruddy-breasted Crake (Porzana fusca), Watercock (Gallicrex cinerea) and Greater Painted-snipe (Rostatula benghalensis), were detected at one site respectively (naïve occupancy rate=0.035). Vocal survey method with conspecific call-broadcasting provided better on Brown-cheeked Rail and Watercock than the others. We suggest a combination of passive point-count and vocal survey method like SNAMBMP to monitor inconspicuous marsh birds at nationwide scale and collection of sound files through recording of the entire process during the survey.

Personal Credit Evaluation System through Telephone Voice Analysis: By Support Vector Machine

  • Park, Hyungwoo
    • Journal of Internet Computing and Services
    • /
    • v.19 no.6
    • /
    • pp.63-72
    • /
    • 2018
  • The human voice is one of the easiest methods for the information transmission between human beings. The characteristics of voice can vary from person to person and include the speed of speech, the form and function of the vocal organ, the pitch tone, speech habits, and gender. The human voice is a key element of human communication. In the days of the Fourth Industrial Revolution, voices are also a major means of communication between humans and humans, between humans and machines, machines and machines. And for that reason, people are trying to communicate their intentions to others clearly. And in the process, it contains various additional information along with the linguistic information. The Information such as emotional status, health status, part of trust, presence of a lie, change due to drinking, etc. These linguistic and non-linguistic information can be used as a device for evaluating the individual's credit worthiness by appearing in various parameters through voice analysis. Especially, it can be obtained by analyzing the relationship between the characteristics of the fundamental frequency(basic tonality) of the vocal cords, and the characteristics of the resonance frequency of the vocal track.In the previous research, the necessity of various methods of credit evaluation and the characteristic change of the voice according to the change of credit status were studied. In this study, we propose a personal credit discriminator by machine learning through parameters extracted through voice.

The Clinical Analysis of Sulcus Vocalis (성대구증에 관한 임상적 고찰)

  • 김광문;서장수;오혜경;최홍식;김기령
    • Proceedings of the KOR-BRONCHOESO Conference
    • /
    • 1982.05a
    • /
    • pp.11.2-12
    • /
    • 1982
  • The major advancement in phonosurgery due to recent development of laryngomicrosurgery enabled more accurate diagnosis and treatment of patient with voice disorders. Among large proportion of voice disordered patients, prominent linear furrow running parallel along the free edge of vocal cord extending from the vocal process to anterior commissure can be seen as well as incomplete closure during phonation. These cases were illustrated and coined as sulcus vocalis by Salvi in 1901, since then other similar paper was reported in Europe and Japan, but has not been reported in Korea. The exact etiology and therapeutic methods of sulcus vocalis has not been elaborated. At Department of Otolaryngology of Yonsei University College of Medicine a series of voice analysis were performed among those 35 patients with sulcus vocalis visited to Vocal Dynamics Laboratory from May, 1981 to March, 1982. Following is the result of clinical statistical investgation and therapeutic modality. 1) The incidance of sulcus vocalis among 290 patients with voice disorder visited to Vocal Dynamics Laboratory was approximately 12%(35 cases). 2) Onset of this voice disorder was most frequent among patient under 10 year-old groups; 19 cases (54%) followed by second decade, third decade groups in decreasing frequency respectably. 3) The etiology of sulcus vocalis was mostly unknown. The sequelae after measle (4 cases) and severe upper respiratory infection (3 cases) and congenital deformity (2 cases) were the possible causes of sulcus vocalis. 4) These patients were involved bilaterally in 25 cases (71%), left side only in 8 cases (23%) and right side only in 2 cases (6%). 5) Almost all patients complained hoarseness and 7 patients were suffering from chronic laryngitis. 6) In aerodynamic analysis, Maximal Phonation Time was decreased in 20 cases (57%), Phonation Quotient was increased in 22 cases (63%) and Mean Air Flow Rate was increased in 23 cases (66%). 7) Among them, 33 cases were analyzed with stroboscopy. The findings were as follows; incomplete glottic closure during phonation in 31 cases (93%), regular vocal cord movement in whole cases, asymmetric cord movement in 4 cases (12%), decreased amplitude in 5 cases (21%) and small mucosal wave in 24 cases (73%). 8) Intracordal Teflon injection in 5 cases and Sulcusectomy in 1 cases were performed as therapeutic management, however, the therapeutic results were not effective except one case with Teflon injection.

  • PDF

Clinical Characeristics of Intracordal Cysts (성대낭종의 임상적 특성)

  • Hong, Ki-Hwan;Park, Jung-Hoon;Kim, Won;Kim, Chang-Hyun
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.10 no.2
    • /
    • pp.164-169
    • /
    • 1999
  • Background and Objectives : The intracordal cysts are more increasingly diagnosed and treated due to advanced laryngeal stroboscopy and laryngeal microsurgical technique. The intracordal cysts are frequently misdiagnosed as vocal polyp or nodule The purpose of this study is to evaluate clinical features of intracordal cysts. Materials and Methods : In the present series, 83 cases of the intracordal cysts treated with laryngeal microsurgery are reported. The intracordal cysts are diagnosed preoperatively with indirect laryngoscopy, laryngeal endoscopy, laryngeal stroboscopy and confirmed with laryngeal microsurgical findings and biopsies. Results : Intracordal cysts are 83 of 1900 patients treated with laryngeal microsurgery(4.4%)-ductal cysts are 56 cases and epidermoid cysts are 27 cases. Intracordal cysts are more frequent in women, forties and the frequent site is an anterior third of the true vocal cord. With the indirect laryngoscopic examination, the ductal cysts are frequently misdiagnosed as vocal polyps or nodules but the epidermoid cysts are relatively easily diagnosed. The etiologic factors of the intracordal cysts are suspected as voice abuse and upper respiratory infection. The degree of postoperative voice satisfaction is similar to that of the vocal polyps. Conclusion : Intracordal cysts are frequently misdiagnosed as polyps or nodules, therefore preoperative stroboscopic findings and laryngeal microsurgical findings is important. An ideal treatment is to enucleate the cysts avoiding rupture of cyst and injury of lamina propria of the vocal cord.

  • PDF

Stereo Vision Neural Networks with Competition and Cooperation for Phoneme Recognition

  • Kim, Sung-Ill;Chung, Hyun-Yeol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.1E
    • /
    • pp.3-10
    • /
    • 2003
  • This paper describes two kinds of neural networks for stereoscopic vision, which have been applied to an identification of human speech. In speech recognition based on the stereoscopic vision neural networks (SVNN), the similarities are first obtained by comparing input vocal signals with standard models. They are then given to a dynamic process in which both competitive and cooperative processes are conducted among neighboring similarities. Through the dynamic processes, only one winner neuron is finally detected. In a comparative study, with, the average phoneme recognition accuracy on the two-layered SVNN was 7.7% higher than the Hidden Markov Model (HMM) recognizer with the structure of a single mixture and three states, and the three-layered was 6.6% higher. Therefore, it was noticed that SVNN outperformed the existing HMM recognizer in phoneme recognition.

Explaining Avian Vocalizations: a Review of Song Learning and Song Communication in Male-Male Interactions

  • Sung, Ha-Cheol;Park, Shi-Ryong
    • Animal cells and systems
    • /
    • v.9 no.2
    • /
    • pp.47-55
    • /
    • 2005
  • Avian vocalization has been main topics in studying animal communication. The structure and usage as well as development and function of vocalization vary enormously among species and even among populations, and thus we reviewed the general patterns of song learning and the consequences of song communication in birds at the behavioural level: first, we compared the different learning phenomena between non-songbird and songbird, and we investigated the learning process of songbird both in the field and in the lab, which are needed to fully understand vocal communication. Second, we discussed a recent trend of sexual selection hypothesis explaining the structural and functional diversity of song in songbirds with repertoire and presented how the repertoire is actually used between neighbours based on individual recognition.

Implementation of Human and Computer Interface for Detecting Human Emotion Using Neural Network (인간의 감정 인식을 위한 신경회로망 기반의 휴먼과 컴퓨터 인터페이스 구현)

  • Cho, Ki-Ho;Choi, Ho-Jin;Jung, Seul
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.13 no.9
    • /
    • pp.825-831
    • /
    • 2007
  • In this paper, an interface between a human and a computer is presented. The human and computer interface(HCI) serves as another area of human and machine interfaces. Methods for the HCI we used are voice recognition and image recognition for detecting human's emotional feelings. The idea is that the computer can recognize the present emotional state of the human operator, and amuses him/her in various ways such as turning on musics, searching webs, and talking. For the image recognition process, the human face is captured, and eye and mouth are selected from the facial image for recognition. To train images of the mouth, we use the Hopfield Net. The results show 88%$\sim$92% recognition of the emotion. For the vocal recognition, neural network shows 80%$\sim$98% recognition of voice.

On a Pitch Alteration Method Compensated with the Spectrum for High Quality Speech Synthesis (스펙트럼 보상된 고음질 합성용 피치 변경법)

  • 문효정
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1995.06a
    • /
    • pp.123-126
    • /
    • 1995
  • The waveform coding are concerned with simply preserving the wave shape of speech signal through a redundancy reduction process. In the case of speech synthesis, the wave form coding with high quality are mainly used to the synthesis by analysis. However, because the parameters of this coding are not classified as either excitation and vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In this paper, we proposed a new pitch alteration method that can change the pitch period in waveform coding by using scaling the time-axis and compensating the spectrum. This is a time-frequency domain method that is preserved in the phase components of the waveform and that has a little spectrum distortion with 2.5% and less for 50% pitch change.

  • PDF

A Case of Laryngeal Pleomorphic Adenoma (후두에 발생한 다형성 선종 1례)

  • Lee, Sang Hun;Choi, Seung-Ho
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.28 no.2
    • /
    • pp.141-143
    • /
    • 2017
  • Pleomorphic adenoma is the most common salivary gland neoplasm and most of them arise in the parotid gland. Pleomorphic adenomas at other sites than salivary glands have rarely been reported. We experienced a patient with pleomorphic adenoma of larynx. A 59 year-old female patient visited outpatient clinic complaining of voice change and foreign body sensation. Round mass at right vocal process was found in laryngoscopic exam. We performed laryngoscopic microsurgery to remove the tumor. Histologically, it was diagnosed as pleomorphic adenoma. Recurrence or complication did not occur during the follow up period of 3 years.

  • PDF

Listener's Age Estimation by Prosody Manipulation (운율 변조 양상에 따른 청자의 연령 지각)

  • Kim, Jiyoun;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.6 no.2
    • /
    • pp.81-88
    • /
    • 2014
  • The normal aging process on speech production and these changes are perceived by listeners. This study examined whether age perception changed under various conditions of prosodic manipulations in normal listeners, comparing the prosodic changes according to age and sex in adulthood. The older and younger voices were resynthesized by manipulation of the speaking rate and pitch to shift the perceived age of the groups toward each other. Two-way repeated ANOVA were conducted to determine if the prosodic type of resynthesized cue resulted in a significant shift in perceived age of young and old voices. The manipulation of the speaking rate resulted in a significant shift in perceived age for the older and younger groups. A significant shift in age estimates was not observed for the younger male group when pitch was manipulated. There were significant gender-by-age group interactions for prosodic manipulation type. Age-related changes in the prosodic properties of speech may ultimately influence speech perception.