• Title/Summary/Keyword: Voice training

NUI/NUX of the Virtual Monitor Concept using the Concentration Indicator and the User's Physical Features (사용자의 신체적 특징과 뇌파 집중 지수를 이용한 가상 모니터 개념의 NUI/NUX)

  • Jeon, Chang-hyun;Ahn, So-young;Shin, Dong-il;Shin, Dong-kyoo
    • Journal of Internet Computing and Services / v.16 no.6 / pp.11-21 / 2015
  • As interest in Human-Computer Interaction (HCI) grows, research on HCI has been actively conducted. Alongside it, research on Natural User Interface/Natural User eXperience (NUI/NUX), which uses the user's gestures and voice, has also been active. NUI/NUX requires recognition algorithms such as gesture recognition or voice recognition. However, these algorithms are complex to implement and take a long time to train, because they must go through steps including preprocessing, normalization, and feature extraction. Recently, Microsoft launched Kinect as an NUI/NUX development tool; it has attracted attention, and studies using Kinect have been conducted. In a previous study, the authors implemented a highly intuitive hand-mouse interface using the physical features of the user. However, it had weaknesses such as unnatural mouse movement and low accuracy of mouse functions. In this study, we designed and implemented a hand-mouse interface that introduces a new concept called the 'virtual monitor', extracting the user's physical features through Kinect in real time. The virtual monitor is a virtual space that can be controlled by the hand mouse. Coordinates on the virtual monitor can be accurately mapped onto coordinates on the real monitor. The hand-mouse interface based on the virtual monitor concept retains the intuitiveness that was the strength of the previous study and improves the accuracy of mouse functions. Further, we increased the accuracy of the interface by recognizing the user's unnecessary actions using a concentration indicator derived from electroencephalogram (EEG) data. To evaluate the intuitiveness and accuracy of the interface, we tested it on 50 people aged from their teens to their 50s. In the intuitiveness experiment, 84% of subjects learned how to use it within 1 minute. In the accuracy experiment, the accuracy of mouse functions was 80.4% for drag, 80% for click, and 76.7% for double-click. The intuitiveness and accuracy of the proposed hand-mouse interface were confirmed through experiment, and it is expected to be a good example of an interface for controlling a system by hand in the future.
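
The mapping from virtual-monitor coordinates to real-monitor coordinates mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not give the authors' formulas, so everything below is an assumption: the virtual monitor is modelled as an axis-aligned rectangle in sensor space (the hypothetical `VirtualMonitor` class, with bounds that would in practice be derived from the user's physical features), and the hand position is mapped linearly and clamped to the screen.

```python
from dataclasses import dataclass


@dataclass
class VirtualMonitor:
    """Hypothetical rectangle in sensor space (metres); not from the paper."""
    left: float    # x of the left edge
    top: float     # y of the top edge (sensor y grows upward)
    width: float
    height: float


def to_screen(vm, hand_x, hand_y, screen_w, screen_h):
    """Linearly map a hand position inside the virtual monitor to a pixel."""
    u = (hand_x - vm.left) / vm.width      # 0.0 at left edge, 1.0 at right edge
    v = (vm.top - hand_y) / vm.height      # flip: screen y grows downward
    u = min(max(u, 0.0), 1.0)              # clamp so the cursor stays on screen
    v = min(max(v, 0.0), 1.0)
    return round(u * (screen_w - 1)), round(v * (screen_h - 1))


# Example: a 0.4 m x 0.3 m virtual monitor in front of the user.
vm = VirtualMonitor(left=-0.2, top=0.5, width=0.4, height=0.3)
cursor = to_screen(vm, 0.1, 0.4, 1920, 1080)
```

Clamping is one possible design choice; a real interface might instead ignore hand positions outside the virtual monitor.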

Clinical Characteristics of Intracordal Cysts (성대낭종의 임상적 특성)

  • Hong, Ki-Hwan;Park, Jung-Hoon;Kim, Won;Kim, Chang-Hyun
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.10 no.2 / pp.164-169 / 1999
  • Background and Objectives: Intracordal cysts are increasingly diagnosed and treated owing to advances in laryngeal stroboscopy and laryngeal microsurgical technique. Intracordal cysts are frequently misdiagnosed as vocal polyps or nodules. The purpose of this study is to evaluate the clinical features of intracordal cysts. Materials and Methods: In the present series, 83 cases of intracordal cysts treated with laryngeal microsurgery are reported. The cysts were diagnosed preoperatively with indirect laryngoscopy, laryngeal endoscopy, and laryngeal stroboscopy, and confirmed by laryngeal microsurgical findings and biopsies. Results: Intracordal cysts accounted for 83 of 1,900 patients treated with laryngeal microsurgery (4.4%); 56 cases were ductal cysts and 27 were epidermoid cysts. Intracordal cysts were more frequent in women and in patients in their forties, and the most frequent site was the anterior third of the true vocal cord. On indirect laryngoscopic examination, ductal cysts were frequently misdiagnosed as vocal polyps or nodules, whereas epidermoid cysts were relatively easily diagnosed. Voice abuse and upper respiratory infection are suspected etiologic factors. The degree of postoperative voice satisfaction was similar to that for vocal polyps. Conclusion: Intracordal cysts are frequently misdiagnosed as polyps or nodules; therefore, preoperative stroboscopic findings and laryngeal microsurgical findings are important. The ideal treatment is to enucleate the cyst while avoiding rupture of the cyst and injury to the lamina propria of the vocal cord.

Investigation of aerodynamic evaluation in female patients undergoing thyroidectomy (갑상선절제술을 받은 여성 환자의 공기역학 검사변수 조사)

  • Kang, Young Ae;Kwon, In Sun;Won, Ho-Ryun;Chang, Jae Won;Koo, Bon Seok
    • Phonetics and Speech Sciences / v.12 no.2 / pp.73-80 / 2020
  • Breathing is the voice's driving force and also acts as a regulator of larynx function and efficiency. Respiratory distress is a side effect of general anesthesia in thyroid surgery. Therefore, this study's objective was to provide practical and complementary information for voice recovery after thyroid surgery, based on aerodynamic evaluation pre- and post-thyroidectomy. From May 2014 to July 2015, aerodynamic evaluations were performed on 34 female patients diagnosed with thyroid papillary cancer one week before surgery (PRE), one month after surgery (P1), and three months after surgery (P3). The Phonatory Aerodynamic System (model 6600, KayPENTAX, USA) was employed for this purpose, and a total of 29 analysis parameters were selected. The results showed statistically significant differences in peak expiratory airflow (p=0.004), mean pitch (p<0.01), expiration airflow duration (p=0.001), and expiratory volume (p=0.018), based on time factors. In the comparison of time factors, peak expiratory airflow and mean pitch parameters were different in PRE-P1 and PRE-P3. Expiration airflow duration and expiratory volume parameters were different in PRE-P3 and P1-P3. The interaction effect of time and surgical range was significant only for expiratory volume (p=0.024). Female patients who undergo thyroidectomy require post-operative breathing training, and exhalation improvement is considered to reflect a positive lifestyle after surgery.

A Study on the Educational Necessity and Activation Plan of Image Making Program for Life Care (라이프케어를 위한 이미지메이킹 프로그램 교육의 필요성과 활성화 방안)

  • Yoon, Hee
    • Journal of Korea Entertainment Industry Association / v.14 no.7 / pp.429-437 / 2020
  • This study explores the current state, necessity, and activation of curricula related to image making programs in domestic colleges. To this end, an empirical survey of 400 college students in the Gwangju and Jeonnam areas was carried out to provide basic data for developing an image making education program within the college curriculum, as a way to guide students through job interviews and improve the interpersonal skills of employees-to-be. The collected data were coded, cleaned, and analyzed using SPSS v. 21.0. The results are as follows. First, regarding the necessity of an image making curriculum, students reported that they needed an image making program in the college curriculum, that they needed it to get a job and manage their image as employees-to-be after graduation, and that they needed other people's help to assess their image objectively. Second, regarding educational importance, attitude (behavior) ranked highest, followed by manners & greeting, look, speech, relationship, clothes, hairstyle, and makeup. Regarding the important educational factors of the program, look ranked highest, followed by makeup, hairstyle, attitude (behavior), relationship, speech, clothes, and manners & greeting. Third, regarding educational influence, the influence on employment ranked highest, followed by the influence on relationships and the influence on life. Fourth, regarding educational activation, students wanted the program to start from the second year, to run once a week for one semester, and to be offered as a liberal arts course or a liberal arts elective. For image making curriculum contents, manners & greeting ranked highest, followed by makeup & coordination, job fairs, education to acquire a skill qualification, and training for domestic companies. Students wanted their major professors to lead the program. For image making education, speech or voice ranked highest, followed by education to analyze communication, education to analyze and practice matching hairstyles and makeup, education on corporate interviews, and education on walking or posture correction.

RPCA-GMM for Speaker Identification (화자식별을 위한 강인한 주성분 분석 가우시안 혼합 모델)

  • 이윤정;서창우;강상기;이기용
    • The Journal of the Acoustical Society of Korea / v.22 no.7 / pp.519-527 / 2003
  • Speech is strongly affected by outliers introduced by unexpected events such as additive background noise, changes in the speaker's utterance pattern, and voice detection errors. Such outliers can severely degrade speaker recognition performance. In this paper, we propose a GMM based on robust principal component analysis (RPCA-GMM) using M-estimation to address both outliers and the high dimensionality of training feature vectors in speaker identification. First, a new feature vector of reduced dimension is obtained by robust PCA based on M-estimation. The robust PCA projects the original feature vector onto the reduced-dimensional linear subspace spanned by the leading eigenvectors of the covariance matrix of the feature vectors. Second, a GMM with diagonal covariance matrices is trained on these transformed feature vectors. We performed speaker identification experiments to show the effectiveness of the proposed method, comparing RPCA-GMM with PCA and with the conventional GMM with diagonal covariance matrices. For every 2% increase in the proportion of outliers, the proposed method maintains almost the same speaker identification rate, degrading by only 0.03%, while the conventional GMM and PCA degrade by 0.65% and 0.55%, respectively. This means that our method is more robust to outliers.
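
The first step of this abstract (robust PCA via M-estimation, then projection onto the leading eigenvectors) can be sketched as follows. This is an illustration under stated assumptions, not the authors' algorithm: the abstract does not name the M-estimator, so a Tukey biweight is assumed; residuals are Euclidean distances scaled by their median rather than Mahalanobis distances; and the helper names (`tukey_weight`, `robust_mean_cov`, `leading_eigenvector`, `project`) are hypothetical. The GMM training step is not shown.

```python
import math
from statistics import median


def tukey_weight(r, c=4.685):
    """Tukey biweight: down-weights large residuals, zero weight beyond c."""
    return (1.0 - (r / c) ** 2) ** 2 if r < c else 0.0


def weighted_mean_cov(X, w):
    """Weighted mean vector and covariance matrix of the rows of X."""
    d, sw = len(X[0]), sum(w)
    mu = [sum(wi * x[j] for wi, x in zip(w, X)) / sw for j in range(d)]
    cov = [[sum(wi * (x[a] - mu[a]) * (x[b] - mu[b]) for wi, x in zip(w, X)) / sw
            for b in range(d)] for a in range(d)]
    return mu, cov


def robust_mean_cov(X, iters=10):
    """Iteratively reweighted (M-estimation style) mean and covariance."""
    d = len(X[0])
    mu = [median(x[j] for x in X) for j in range(d)]  # robust starting point
    cov = None
    for _ in range(iters):
        dist = [math.sqrt(sum((xi - mi) ** 2 for xi, mi in zip(x, mu))) for x in X]
        scale = median(dist) or 1.0                   # robust residual scale
        w = [tukey_weight(r / scale) for r in dist]
        mu, cov = weighted_mean_cov(X, w)
    return mu, cov


def leading_eigenvector(cov, iters=100):
    """Power iteration for the dominant eigenvector of a covariance matrix."""
    d = len(cov)
    v = [1.0 / math.sqrt(d)] * d
    for _ in range(iters):
        u = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(x * x for x in u)) or 1.0
        v = [x / norm for x in u]
    return v


def project(x, mu, v):
    """Reduced (1-D) feature: coordinate of x along direction v."""
    return sum((xi - mi) * vi for xi, mi, vi in zip(x, mu, v))
```

On toy data lying along one axis with a single distant outlier, the plain covariance (all weights equal to 1) is dominated by the outlier, while the reweighted estimate recovers the direction of the inliers.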

Prevalence of Laryngo-pharyngeal Reflux(LPR) Related Symptoms at the Out Patient Department in Korea : One Week Survey (우리나라 이비인후과 외래환자의 인.후두 역류증상 발병빈도 조사(One Week Survey 결과))

  • Choi, Hong-Sik;Kim, Hyung-Tae;Seo, Jang-Soo;Wang, Soo-Gun;Cho, Jae-Sik;Choi, Gun;Hong, Ki-Hwan;Kim, Seok-Il;Lee, Won-Chul
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.11 no.1 / pp.87-97 / 2000
  • A one-week survey was conducted to investigate the prevalence and clinical characteristics of laryngopharyngeal reflux (LPR) symptoms in Korea. The subjects (n=7,704) were patients newly enrolled at the outpatient clinics of 90 ENT departments of resident training hospitals and 11 local clinics that voluntarily participated in the study. 1) Twenty-five percent of all enrolled patients had LPR-related symptoms or clinical findings on examination by ENT specialists. 2) Among LPR-related diagnoses, globus syndrome was the most common, followed by reflux laryngitis and chronic laryngitis. 3) LPR was more prevalent in women than in men and was common in the 5th, 6th, and 7th decades, which seems to be related to the aging process. 4) The most common symptoms of LPR were globus sensation, chronic throat clearing, and hoarseness of unknown origin. 5) Aggravating factors of LPR-related symptoms were tiredness, mental stress, alcohol drinking, cigarette smoking, spicy food, and coffee drinking. 6) LPR-related symptoms were more common in professional voice users. 7) In past medical history, diseases of the stomach and tonsillitis were most common.

Comparison of the Singing Pitch Characteristics in Adults with Intellectual Disabilities Based on Their Choir Experience (성인지적장애인의 노래부르기 시 음도산출 특성: 합창경험 유무에 따른 비교)

  • Kim, Eun Jin;Kim, Soo Ji
    • 재활복지 / v.21 no.3 / pp.165-186 / 2017
  • The purpose of this study was to compare the voice pitch of adults with intellectual disabilities who have choir experience with that of those who do not. Participants were 21 male adults with intellectual disabilities (12 choir group members and 9 non-choir group members). Praat was used to analyze the characteristics of the pitch produced by the participants while they sang in their comfortable pitch range. The results showed that the range of the melodic contour in the choir group was broader and higher than that of the non-choir group. Participants in the choir group produced a lower pitch on the beginning note, and a higher pitch than the non-choir group on the highest and lowest notes of the song. An analysis of the pitch of the individual notes revealed a gap between the expected pitches and the pitches actually produced while singing. Across all syllables of the song, participants in the choir group showed higher pitch accuracy, and were significantly more accurate on the perfect fifth and perfect eighth intervals. With regard to relative pitch, participants in the choir group produced significantly more accurate notes on perfect fifth, perfect fourth, and perfect eighth intervals. The findings suggest that sustained singing experience provides pitch training. They also point to the need for further studies on the singing abilities of adults with intellectual disabilities.
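
The gap between expected and produced pitch described in this abstract is commonly expressed in cents (100 cents per equal-tempered semitone). The abstract does not state which accuracy measure the authors used, so the sketch below is only an assumed illustration; `cents` and `interval_error_cents` are hypothetical helper names, and the frequencies would come from a pitch tracker such as Praat.

```python
import math


def cents(f_produced, f_reference):
    """Signed pitch difference in cents between two frequencies in Hz."""
    return 1200.0 * math.log2(f_produced / f_reference)


def interval_error_cents(f_low, f_high, target_semitones):
    """Deviation of a sung interval from its equal-tempered target size."""
    return cents(f_high, f_low) - 100.0 * target_semitones


# Example: absolute error of one note, and error of a sung perfect fifth
# (7 semitones), using made-up frequencies.
note_error = cents(225.0, 220.0)
fifth_error = interval_error_cents(220.0, 330.0, 7)
```

The first function corresponds to absolute pitch accuracy for a single note; the second corresponds to relative pitch, where only the size of the interval between two sung notes matters.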

Comparative study of data augmentation methods for fake audio detection (음성위조 탐지에 있어서 데이터 증강 기법의 성능에 관한 비교 연구)

  • KwanYeol Park;Il-Youp Kwak
    • The Korean Journal of Applied Statistics / v.36 no.2 / pp.101-114 / 2023
  • Data augmentation is effective against model overfitting because it lets the training dataset be viewed from various perspectives. In addition to image augmentation techniques such as rotation, cropping, horizontal flip, and vertical flip, occlusion-based data augmentation methods such as Cutmix and Cutout have been proposed. For models based on speech data, occlusion-based augmentation can be applied after converting the 1D speech signal into a 2D spectrogram. In particular, SpecAugment is an occlusion-based augmentation technique for speech spectrograms. In this study, we compare data augmentation techniques that can be used for fake audio detection. Using data from the ASVspoof2017 and ASVspoof2019 competitions, which were held to detect fake audio, an LCNN model was trained on datasets augmented with the occlusion-based methods Cutout, Cutmix, and SpecAugment. All three augmentation techniques generally improved model performance. The best performance was shown by Cutmix on ASVspoof2017, Mixup on ASVspoof2019 LA, and SpecAugment on ASVspoof2019 PA. In addition, increasing the number of masks for SpecAugment helped to improve performance. In conclusion, the appropriate augmentation technique differs depending on the situation and data.
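
The occlusion-style masking that this abstract attributes to SpecAugment can be sketched on a spectrogram stored as a list of rows (frequency bins) by columns (time frames). This is a simplified illustration, not the configuration used in the study: the function name `spec_augment`, the mask widths, and the choice to fill masked cells with zero are assumptions, and the time-warping step of the original SpecAugment is omitted.

```python
import random


def spec_augment(spec, num_masks=1, max_freq_width=2, max_time_width=2, rng=None):
    """Return a copy of spec with random frequency and time bands zeroed out."""
    rng = rng or random.Random()
    n_freq, n_time = len(spec), len(spec[0])
    out = [row[:] for row in spec]            # leave the input untouched
    for _ in range(num_masks):
        # Frequency mask: zero f consecutive frequency bins.
        f = rng.randint(0, max_freq_width)
        f0 = rng.randint(0, n_freq - f)
        for i in range(f0, f0 + f):
            out[i] = [0.0] * n_time
        # Time mask: zero t consecutive time frames in every bin.
        t = rng.randint(0, max_time_width)
        t0 = rng.randint(0, n_time - t)
        for row in out:
            for j in range(t0, t0 + t):
                row[j] = 0.0
    return out


# Example: an 8-bin x 10-frame dummy spectrogram with two masks of each kind.
spec = [[1.0] * 10 for _ in range(8)]
augmented = spec_augment(spec, num_masks=2, rng=random.Random(0))
```

Raising `num_masks` corresponds to the "number of masks" setting that the abstract reports as helpful; the mask widths must not exceed the spectrogram dimensions.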

Screen Performance and Social Attitude of Song Gang-Ho (송강호의 스크린 퍼포먼스와 사회적 태도)

  • Kim, Jong-Guk
    • Journal of Korea Entertainment Industry Association / v.13 no.2 / pp.123-132 / 2019
  • This paper analyzes the performances of actor Song Kang-Ho against the background of interdisciplinary and integrated film acting, using 'performance' rather than 'acting' as the general term. If acting is a concept limited to acting training or acting skills, performance is a broad concept that includes expressions, movements, and emotions. Performance on the screen can be explained in the context of film and can be extended to the social attitude of acting. In addition, I use the term 'screen', in the sense of representation, rather than 'film', which refers to the medium. Song Kang-Ho has performed various characters in more than 30 films. Although his facial expressions, gestures, and voice are adapted to individual characters across genres, the personality inherent in the actor Song Kang-Ho integrates persona with character. What drives this is the social attitude of screen performance. As a sign, acting is an ideological construct and foregrounds a character who describes a certain social and historical moment. Song Kang-Ho as actor, persona, and character, who commands popularity, speaks to society and creates discourse. His comic performance always confronts the tragedy of life; his face is the spirit of the times and expands into social meaning. The face in close-up does not laugh at all, the gesture symbolized by the curved rear view is exaggerated in a disorderly and disturbing way, and the voice, with its dialect accent, does not follow vocal standards.

A Study on The Adoption of Drama for Improving Early Childhood Teacher's Artistic Competence (유아교사의 예술적 역량 함양을 위한 교육연극 활용에 관한 고찰)

  • Kim, Ji-Youn;Kim, Su-youn
    • (The) Research of the performance art and culture / no.41 / pp.69-92 / 2020
  • This study describes the impact of early childhood teachers' artistic competence on art education pedagogy and improved curriculum design. It also explains the effect of drama as a way of improving early childhood teachers' artistic competence. Many researchers have noted that early childhood is a period of sensitivity and potential. It is therefore helpful for children to meet a teacher who understands them and inspires their innate artistic sense at their eye level. The study explains which aspects of artistic competence teacher training should focus on. There are many approaches to developing early childhood teachers' artistic competence, and adopting drama is one of them. The strengths of drama for improving artistic competence are as follows. First, human movement and voice are the main artistic channels in drama. What we do in daily life is found in the world of drama. This means that if early childhood teachers experience drama activity, they will feel more comfortable and familiar with it. In addition, early childhood teachers tend to be familiar with dramatic play, so they can more easily enter the world of drama. Second, drama helps people understand different feelings and broadens and deepens their understanding of others' standpoints. For early childhood teachers, drama activity helps them understand how the dramatic art form works and lead children's play in diverse and sincere ways. In addition, drama activity is useful for building horizontal and democratic relationships between children and the teacher, which is one of the main emphases of the 2019 revised Nori national curriculum. To sum up, drama is an excellent method for developing early childhood teachers' artistic competence. It is therefore hoped that they will have more opportunities to experience drama as an art form.