• Title/Summary/Keyword: Voice problem

Search Result 339, Processing Time 0.027 seconds

Implementation of Analysis System for H.323 Traffic (H.323 트래픽 분석 시스템의 개발)

  • Lee Sun-Hun;Chung Kwang-Sue
    • The KIPS Transactions:PartC
    • /
    • v.13C no.4 s.107
    • /
    • pp.471-480
    • /
    • 2006
  • Recently, multimedia communication services, such as video conferencing and voice over IP, have been rapidly spread. H.323 is an international standard that specifies the components, protocols and procedures that provide multimedia communication services of real-time audio, video, and data communications over packet networks, including IP based networks. H.323 is applied to many commercial services because it supports various network environments and has a good performance. But communication services based on H.323 may have some problem because of current network trouble or mis-implementation of H.323. The understanding of this problem is a critical issue because it improves the quality of service and is easy to service maintenance. In this paper, we implement the analysis system for H.323 protocol wihch includes H.245, H.225.0, RTP, RTCP, and so on. Tills system is able to capture, parse, and present the H.323 protocol in real-time. Through the operation test and performance evaluation, we prove that our system is a useful to analyze and understand the problems for communication services based on H.323.

Comparative study of data augmentation methods for fake audio detection (음성위조 탐지에 있어서 데이터 증강 기법의 성능에 관한 비교 연구)

  • KwanYeol Park;Il-Youp Kwak
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.2
    • /
    • pp.101-114
    • /
    • 2023
  • The data augmentation technique is effectively used to solve the problem of overfitting the model by allowing the training dataset to be viewed from various perspectives. In addition to image augmentation techniques such as rotation, cropping, horizontal flip, and vertical flip, occlusion-based data augmentation methods such as Cutmix and Cutout have been proposed. For models based on speech data, it is possible to use an occlusion-based data-based augmentation technique after converting a 1D speech signal into a 2D spectrogram. In particular, SpecAugment is an occlusion-based augmentation technique for speech spectrograms. In this study, we intend to compare and study data augmentation techniques that can be used in the problem of false-voice detection. Using data from the ASVspoof2017 and ASVspoof2019 competitions held to detect fake audio, a dataset applied with Cutout, Cutmix, and SpecAugment, an occlusion-based data augmentation method, was trained through an LCNN model. All three augmentation techniques, Cutout, Cutmix, and SpecAugment, generally improved the performance of the model. In ASVspoof2017, Cutmix, in ASVspoof2019 LA, Mixup, and in ASVspoof2019 PA, SpecAugment showed the best performance. In addition, increasing the number of masks for SpecAugment helps to improve performance. In conclusion, it is understood that the appropriate augmentation technique differs depending on the situation and data.

Automatic detection and severity prediction of chronic kidney disease using machine learning classifiers (머신러닝 분류기를 사용한 만성콩팥병 자동 진단 및 중증도 예측 연구)

  • Jihyun Mun;Sunhee Kim;Myeong Ju Kim;Jiwon Ryu;Sejoong Kim;Minhwa Chung
    • Phonetics and Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.45-56
    • /
    • 2022
  • This paper proposes an optimal methodology for automatically diagnosing and predicting the severity of the chronic kidney disease (CKD) using patients' utterances. In patients with CKD, the voice changes due to the weakening of respiratory and laryngeal muscles and vocal fold edema. Previous studies have phonetically analyzed the voices of patients with CKD, but no studies have been conducted to classify the voices of patients. In this paper, the utterances of patients with CKD were classified using the variety of utterance types (sustained vowel, sentence, general sentence), the feature sets [handcrafted features, extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS), CNN extracted features], and the classifiers (SVM, XGBoost). Total of 1,523 utterances which are 3 hours, 26 minutes, and 25 seconds long, are used. F1-score of 0.93 for automatically diagnosing a disease, 0.89 for a 3-classes problem, and 0.84 for a 5-classes problem were achieved. The highest performance was obtained when the combination of general sentence utterances, handcrafted feature set, and XGBoost was used. The result suggests that a general sentence utterance that can reflect all speakers' speech characteristics and an appropriate feature set extracted from there are adequate for the automatic classification of CKD patients' utterances.

Literary Text and the Cultural Interpretation - A Study of the Model of 「History of Spanish Literature」 (문학텍스트와 문학적 해석 -「스페인 문학사」를 통한 모델 연구)

  • Na, Songjoo
    • Cross-Cultural Studies
    • /
    • v.26
    • /
    • pp.465-485
    • /
    • 2012
  • Instructing "History of Spanish Literature" class faces various types of limits and obstacles, just as other foreign language literature history classes do. Majority of students enter the university without having any previous spanish learning experience, which means, for them, even the interpretation of the text itself can be difficult. Moreover, the fact that "History of Spanish Literature" is traced all the way back to the Middle Age, students encounter even more difficulties and find factors that make them feel the class is not interesting. To list several, such factors include the embarrassment felt by the students, antiquated expressions, literature texts filled with deliberately broken grammars, explanations written in pretentious vocabularies, disorderly introduction of many different literary works that ignores the big picture, in which in return, reduces academic interest in students, and finally general lack of interest in literate itself due to the fact that the following generation is used to visual media. Although recognizing such problem that causes the distortion of the value of our lives and literature is a very imminent problem, there has not even been a primary discussion on such matter. Thus, the problem of what to teach in "History of Spanish Literature" class remains unsolved so far. Such problem includes wether to teach the history of authors and literature works, or the chronology of the text, the correlations, and what style of writing to teach first among many, and how to teach to read with criticism, and how to effectively utilize the limited class time to teach. However, unfortunately, there has not been any sorts of discussion among the insructors. I, as well, am not so proud of myself either when I question myself of how little and insufficiently did I contemplate about such problems. Living in the era so called the visual media era or the crisis of humanity studies, now there is a strong need to bring some change in the education of literature history. To suggest a solution to make such necessary change, I recommended to incorporate the visual media, the culture or custom that students are accustomed to, to the class. This solution is not only an attempt to introduce various fields to students, superseding the mere literature reserch area, but also the result that reflects the voice of students who come from a different cultural background and generation. Thus, what not to forget is that the bottom line of adopting a new teaching method is to increase the class participation of students and broaden the horizon of the Spanish literature. However, the ultimate goal of "History of Spanish Literature" class is the contemplation about humanity, not the progress in linguistic ability. Similarly, the ultimate goal of university education is to train students to become a successful member of the society. To achieve such goal, cultural approach to the literature text helps not only Spanish learning but also pragmatic education. Moreover, it helps to go beyond of what a mere functional person does. However, despite such optimistic expectations, foreign literature class has to face limits of eclecticism. As for the solution, as mentioned above, the method of teaching that mainly incorporates cultural text is a approach that fulfills the students with sensibility who live in the visual era. Second, it is a three-dimensional and sensible approach for the visual era, not an annotation that searches for any ambiguous vocabularies or metaphors. Third, it is the method that reduces the burdensome amount of reading. Fourth, it triggers interest in students including philosophical, sociocultural, and political ones. Such experience is expected to stimulate the intellectual curiosity in students and moreover motivates them to continues their study in graduate school, because it itself can be an interesting area of study.

An Efficient IPTV Distribution Network by Packet Transport System (Packet Transport System에 의한 효율적인 IPTV 분배망 구축 방안)

  • Jang, Jin-Hee;Park, Seung-Kwon;Roh, Jin-Young;Noh, Francis Tai
    • Journal of Broadcast Engineering
    • /
    • v.12 no.2
    • /
    • pp.80-92
    • /
    • 2007
  • IPTV Services that is representative union service of broadcasting and telecommunication need guarantee of QoS, efficiency of multicasting, and hish bandwidth on the network. Because typical TDM based metro transport network was designed by transporting fixed voice traffic with stable and recovering method, it has a defect of bottleneck and a waste of bandwidth for acceptance of data traffic with burst feature and then all of data are treated equally at the transport network because it cannot classify between advanced high end service and best effort low end service. for completely resolving this kind of problem about increasing burst traffic and QoS issues, firstly we need to new design for transport network. This paper presents transformation method from TDM based metro transport network to packet based transport network and advantage and effectiveness of packet based transport network and also indicates technical factor and characters about method of packet transport system. As a result of research, the Packet Transport System, which is a transmission network for packet delivery, take in not only a specific character of legacy TDM but QoS, Multicast and high bandwidth, then, it is able to keep an effective bandwidth and a stabilized performance of packet transmissions. Additionally, if a fault be occurred on an optical link, the system is able to guarantee a differential QoS by an each service class using an algorithm to make certain of a traffic existence and contain a protective mechanism.

DFT-spread OFDM Communication System for the Power Efficiency and Nonlinear Distortion in Underwater Communication (수중통신에서 비선형 왜곡과 전력효율을 위한 DFT-spread OFDM 통신 시스템)

  • Lee, Woo-Min;Ryn, Heung-Gyoon
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.8A
    • /
    • pp.777-784
    • /
    • 2010
  • Recently, the necessity of underwater communication and demand for transmitting and receiving various data such as voice or high resolution image data are increasing as well. The performance of underwater acoustic communication system is influenced by characteristics of the underwater communication channels. Especially, ISI(inter symbol interference) occurs because of delay spread according to multi-path and communication performance is degraded. In this paper, we study the OFDM technique to overcome the delay spread in underwater channel and by using CP, we compensate for delay spread. But PAPR which OFDM system has problem is very high. Therefore, we use DFT-spread OFDM method to avoid nonlinear distortion by high PAPR and to improve efficiency of amplifier. DFT-spread OFDM technique obtains high PAPR reduction effect because of each parallel data loads to all subcarrier by DFT spread processing before IFFT. In this paper, we show performance about delay spread through OFDM system and verify method that DFT spread OFDM is more suitable than OFDM for underwater communication. And we analyze performance according to two subcarrier mapping methods(Interleaved, Localized). Through the simulation results, performance of DFT spread OFDM is better about 5~6dB at $10^{-4}$ than OFDM. When compared to BER according to subcarrier mapping, Interleaved method is better about 3.5dB at $10^{-4}$ than Localized method.

Packet Loss Concealment Algorithm Based on Speech Characteristics (음성신호의 특성을 고려한 패킷 손실 은닉 알고리즘)

  • Yoon Sung-Wan;Kang Hong-Goo;Youn Dae-Hee
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.7C
    • /
    • pp.691-699
    • /
    • 2006
  • Despite of the in-depth effort to cantrol the variability in IP networks, quality of service (QoS) is still not guaranteed in the IP networks. Thus, it is necessary to deal with the audible artifacts caused by packet lasses. To overcame the packet loss problem, most speech coding standard have their own embedded packet loss concealment (PLC) algorithms which adapt extrapolation methods utilizing the dependency on adjacent frames. Since many low bit rate CELP coders use predictive schemes for increasing coding efficiency, however, error propagation occurs even if single packet is lost. In this paper, we propose an efficient PLC algorithm with consideration about the speech characteristics of lost frames. To design an efficient PLC algorithm, we perform several experiments on investigating the error propagation effect of lost frames of a predictive coder. And then, we summarize the impact of packet loss to the speech characteristics and analyze the importance of the encoded parameters depending on each speech classes. From the result of the experiments, we propose a new PLC algorithm that mainly focuses on reducing the error propagation time. Experimental results show that the performance is much higher than conventional extrapolation methods over various frame erasure rate (FER) conditions. Especially the difference is remarkable in high FER condition.

Patch Reconstruction with Radial Forearm Free Flap of Hypopharyngeal Cancer Using the Narrow Strip Pharynageal Wall (소폭의 잔존 하인두벽을 이용한 첩포형 전완유리 피판 인두 재건술)

  • Jeong, Hii Sun;Lee, Won Jai;Lew, Dae Hyun;Rah, Dong Kyun;Tark, Kwan Chul
    • Archives of Plastic Surgery
    • /
    • v.33 no.4
    • /
    • pp.407-412
    • /
    • 2006
  • Purpose: Various attempts of reconstruction for pharyngoesophageal defects after ablative surgery have been made to restore the function of the pharyngoesophagus. A fabricated tubed radial forearm free flap or free jejunal free flap was used when the width of remnant pharyngeal wall was less than 50% of the normal width. However there are many disadvantages such as stricture, saliva leakage and fistula formation on tubed radial forearm free flap. The jejunal free flap has the problem such as short pedicle, poor tolerance of ischemic time, wet voice and delayed transit of swallowed food due to the uncoordinated contraction. The authors studied the utility of patch-type radial forearm free flap using the remnant posterior pharyngeal wall of the hypopharynx. Methods: Retrospective reviews in Severance Hospital were made on 25 patients who underwent reconstruction surgery with patched radial forearm free flap because of the hypopharyngeal cancer between 1996 and 2005. The patients of Group I had the narrow posterior pharyngeal wall and its width was less than 3centimeters after the tumor was resected. Those of Group II had the partial pharyngectomy and the width of the remnant pharynx was larger than 3 centimeters. Results: Seven patients belonged to the group I and the flap of this group had 100% survival rate. One case of fistula and no swallowing discomfort due to stricture was reported. The Group II including 18 patients also had the 100% flap survival rate. Neither fistula nor stricture was seen but the lower diet grade was checked. Conclusion: The patch type radial forearm free flap using the remnant pharyngeal wall have the advantage of the radial forearm free flap, and furthermore this flap is the safe reconstructive method even if the width of the remnant pharyngeal wall is less than 30% of that of normal pharynx.

Manufacture of 3-Dimensional Image and Virtual Dissection Program of the Human Brain (사람 뇌의 3차원 영상과 가상해부 풀그림 만들기)

  • Chung, M.S.;Lee, J.M.;Park, S.K.;Kim, M.K.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1998 no.11
    • /
    • pp.57-59
    • /
    • 1998
  • For medical students and doctors, knowledge of the three-dimensional (3D) structure of brain is very important in diagnosis and treatment of brain diseases. Two-dimensional (2D) tools (ex: anatomy book) or traditional 3D tools (ex: plastic model) are not sufficient to understand the complex structures of the brain. However, it is not always guaranteed to dissect the brain of cadaver when it is necessary. To overcome this problem, the virtual dissection programs of the brain have been developed. However, most programs include only 2D images that do not permit free dissection and free rotation. Many programs are made of radiographs that are not as realistic as sectioned cadaver because radiographs do not reveal true color and have limited resolution. It is also necessary to make the virtual dissection programs of each race and ethnic group. We attempted to make a virtual dissection program using a 3D image of the brain from a Korean cadaver. The purpose of this study is to present an educational tool for those interested in the anatomy of the brain. The procedures to make this program were as follows. A brain extracted from a 58-years old male Korean cadaver was embedded with gelatin solution, and serially sectioned into 1.4 mm-thickness using a meat slicer. 130 sectioned specimens were inputted to the computer using a scanner ($420\times456$ resolution, true color), and the 2D images were aligned on the alignment program composed using IDL language. Outlines of the brain components (cerebrum, cerebellum, brain stem, lentiform nucleus, caudate nucleus, thalamus, optic nerve, fornix, cerebral artery, and ventricle) were manually drawn from the 2D images on the CorelDRAW program. Multimedia data, including text and voice comments, were inputted to help the user to learn about the brain components. 3D images of the brain were reconstructed through the volume-based rendering of the 2D images. Using the 3D image of the brain as the main feature, virtual dissection program was composed using IDL language. Various dissection functions, such as dissecting 3D image of the brain at free angle to show its plane, presenting multimedia data of brain components, and rotating 3D image of the whole brain or selected brain components at free angle were established. This virtual dissection program is expected to become more advanced, and to be used widely through Internet or CD-title as an educational tool for medical students and doctors.

  • PDF

A Development of Communication and Relationship Enrichment Program for Multicultural Couples (다문화가정 부부의 의사소통 및 관계 증진 프로그램 개발)

  • Kim, GumHee;Min, Ki-yeon;Lee, Youngsun
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.11
    • /
    • pp.202-214
    • /
    • 2015
  • This study aims to develop and apply communication and relationship enrichment programs for married multicultural couples and verify its effects. This preliminary program was developed based on results from reviews of existing intervention programs, participants' interview for their needs and professionals' feedback on program. The program includes 11 sessions(once a week; 2 hours per session) focusing on communication strategies and related activities using marital counseling. Four multicultural couples participated in the intervention program. Qualitative data was collected during the program, including voice recording files, activity sheets, and filed notes conducted by researchers. Based on the results from the data analysis, we could find the intervention program had an positive effect on the communication between multicultural couples. Specifically, the intervention program could (1) enhanced intimacy in the couples, (2) started conversation and improved communication patterns between the couples. And also (3) modes of communication were changed to more collaborative communication through this program. The marital communication program developed in this study is significant in that it played a catalytic role for marital conversation in multicultural couples and used professional counseling techniques and strategies in solving and managing marital conflict.