• Title/Summary/Keyword: Voice Quality

Search Result 769, Processing Time 0.029 seconds

Anura Call Monitoring Data Collection and Quality Management through Citizen Participation (시민참여형 무미목 양서류 음성신호 수집 및 품질관리 방안)

  • Kyeong-Tae Kim;Hyun-Jung Lee;Won-Kyong Song
    • Korean Journal of Environment and Ecology
    • /
    • v.38 no.3
    • /
    • pp.230-245
    • /
    • 2024
  • Amphibians, sensitive to external environmental changes, serve as bioindicator species for assessing alterations or disturbances in local ecosystems. It is known that one-third of amphibian species within the order Anura are at risk of extinction due to anthropogenic threats such as habitat destruction and fragmentation caused by urbanization. To develop effective protection and conservation strategies for anuran amphibians, species surveys that account for population characteristics are essential. This study aimed to investigate the potential for citizen participation in ecological monitoring using the mating calls of anura species. We also proposed suitable quality control measures to mitigate errors and biases, ensuring the extraction of reliable species occurrence data. The Citizen Science project was carried out nationwide from April 1 to August 31, 2022, targeting 12 species of anura amphibians in Korea. Citizens voluntarily participated in voice signal monitoring, where they listened to anura species' mating calls and recorded them using a mobile application. Additionally, we established a quality control process to extract reliable species occurrence data, categorizing errors and biases from citizen-collected data into three levels: omission, commission, and incorrect identification. A total of 6,808 observations were collected during the citizen participation in anura species vocalization monitoring. Through the quality control process, errors and biases were identified in 1,944 (28.55%) of the 6,808 data. The most common type of error was omission, accounting for 922 cases (47.43%), followed by incorrect identification with 540 cases (27.78%), and commission with 482 cases (24.79%). During the Citizen Science project, we successfully recorded the mating calls of 10 out of the 12 anuran amphibian species in Korea, excluding the Asian toads (Bufo gargarizans Cantor), Korean brown frog (Rana coreana). Difficulties in collecting mating calls were primarily attributed to challenges in observing due to population decline or discrepancies between the breeding season of non-emergent individuals and the timing of the citizen science project. This study represents the first investigation of distribution status and species emergence data collection through mating calls of anura species in Korea based on citizen participation. It can serve as a foundation for designing future bioacoustic monitoring that incorporates citizen science and quality control measures for citizen science data.

Intelligent VOC Analyzing System Using Opinion Mining (오피니언 마이닝을 이용한 지능형 VOC 분석시스템)

  • Kim, Yoosin;Jeong, Seung Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.113-125
    • /
    • 2013
  • Every company wants to know customer's requirement and makes an effort to meet them. Cause that, communication between customer and company became core competition of business and that important is increasing continuously. There are several strategies to find customer's needs, but VOC (Voice of customer) is one of most powerful communication tools and VOC gathering by several channels as telephone, post, e-mail, website and so on is so meaningful. So, almost company is gathering VOC and operating VOC system. VOC is important not only to business organization but also public organization such as government, education institute, and medical center that should drive up public service quality and customer satisfaction. Accordingly, they make a VOC gathering and analyzing System and then use for making a new product and service, and upgrade. In recent years, innovations in internet and ICT have made diverse channels such as SNS, mobile, website and call-center to collect VOC data. Although a lot of VOC data is collected through diverse channel, the proper utilization is still difficult. It is because the VOC data is made of very emotional contents by voice or text of informal style and the volume of the VOC data are so big. These unstructured big data make a difficult to store and analyze for use by human. So that, the organization need to automatic collecting, storing, classifying and analyzing system for unstructured big VOC data. This study propose an intelligent VOC analyzing system based on opinion mining to classify the unstructured VOC data automatically and determine the polarity as well as the type of VOC. And then, the basis of the VOC opinion analyzing system, called domain-oriented sentiment dictionary is created and corresponding stages are presented in detail. The experiment is conducted with 4,300 VOC data collected from a medical website to measure the effectiveness of the proposed system and utilized them to develop the sensitive data dictionary by determining the special sentiment vocabulary and their polarity value in a medical domain. Through the experiment, it comes out that positive terms such as "칭찬, 친절함, 감사, 무사히, 잘해, 감동, 미소" have high positive opinion value, and negative terms such as "퉁명, 뭡니까, 말하더군요, 무시하는" have strong negative opinion. These terms are in general use and the experiment result seems to be a high probability of opinion polarity. Furthermore, the accuracy of proposed VOC classification model has been compared and the highest classification accuracy of 77.8% is conformed at threshold with -0.50 of opinion classification of VOC. Through the proposed intelligent VOC analyzing system, the real time opinion classification and response priority of VOC can be predicted. Ultimately the positive effectiveness is expected to catch the customer complains at early stage and deal with it quickly with the lower number of staff to operate the VOC system. It can be made available human resource and time of customer service part. Above all, this study is new try to automatic analyzing the unstructured VOC data using opinion mining, and shows that the system could be used as variable to classify the positive or negative polarity of VOC opinion. It is expected to suggest practical framework of the VOC analysis to diverse use and the model can be used as real VOC analyzing system if it is implemented as system. Despite experiment results and expectation, this study has several limits. First of all, the sample data is only collected from a hospital web-site. It means that the sentimental dictionary made by sample data can be lean too much towards on that hospital and web-site. Therefore, next research has to take several channels such as call-center and SNS, and other domain like government, financial company, and education institute.

A Comparative Study on the Aesthetic Aspect of Design Preferred Between Countries Centering Around the Analysis on the Aesthetic Aspect of Mobile Phone Preferred by Korean and Chinese Consumers - (국가 간 선호 디자인의 심미성요소 비교연구 - 한.중 소비자 선호휴대폰의 심미성요소 분석을 중심으로 -)

  • Jeong Su-Kyoung;Hong Jung-Pyo
    • Science of Emotion and Sensibility
    • /
    • v.9 no.1
    • /
    • pp.49-61
    • /
    • 2006
  • The present mobile phone industry has significant effect on the domestic economy and has taken root as the core item that has the responsibility to lead the Korean economy for a considerable period of time. As the mobile phone market becomes gigantic, the mobile phone is being used by people in broader age bracket, and functions or designs preferred by people of various age are getting more diverse. Like that, as the mobile phone has greater effect on and meaning in our daily lives, consumers of mobile phone have growing expectation of the mobile phone Now, the core function of voice communication via the mobile phone is not a great concern to consumers. But the function, such as more convenient and friendly information input and output, processing and storage, and the design, which is more sophisticated and optimized for the user environment, are being demanded, not just the simple voice communication. And as the modern design is getting more similar to the objects of traditional high art consumed by consumers every day, the aesthetic aspect of design can play an important role, as the factor that differentiates the product, in creating new value which forms the spiritual and emotional value of human beings to improve the quality of living, and in addition, the willingness of consumers to buy is determined by the design that they prefer the most. Like that, a new design of mobile phone based on a new dimension and preferred by the consumers the most is urgently required to be developed by shedding light on the factors related to the preference of consumers on the basis of the analysis on the aesthetic aspect, which can be said to be the most critical factor in the design process. Therefore, this study aims to identity the common preference and different factors of aesthetic aspects through the analysis on the aesthetic aspects of the mobile phone preferred by users among countries, and figure out the formative artistic factors of aesthetic aspects that are considered to be important, in order to propose the guideline on the aesthetic aspect of mobile phone that can be applied to the design of mobile phone practically.

  • PDF

A comparison of acoustic measures among the microphone types for smartphone recordings in normal adults (정상 성인에서 스마트폰 녹음을 위한 마이크 유형 간 음향학적 측정치 비교)

  • Jeong In Park;Seung Jin Lee
    • Phonetics and Speech Sciences
    • /
    • v.16 no.2
    • /
    • pp.49-58
    • /
    • 2024
  • This study aimed to compare the acoustic measurements of speech samples recorded from individuals with normal voices using various devices: the Computerized Speech Lab (CSL), a unidirectional wired pin-microphone (WIRED) suitable for smartphones, the built-in omnidirectional microphone (SMART) of smartphones, and Bluetooth-connected wireless earphones, specifically the Galaxy Buds2 Pro (WIRELESS). This study included 40 normal adults (12 males and 28 females) who had not visited an otolaryngologist for respiratory diseases within the past three months. Participants performed sustained vowel /a/ phonation for four seconds and reading tasks with sentences ("Walk") and paragraphs ("Autumn") in a sound-treated booth. Recordings were simultaneously conducted using the four different devices and synchronized based on the CSL-recorded samples for analysis using the MDVP, ADSV, and VOXplot programs. Compared with CSL, the Cepstral Spectral Index of Dysphonia (CSIDV, CSIDS) and Acoustic Voice Quality Index (AVQI) values were lower in the WIRED and higher in the SMART. The opposite trend was observed for the L/H spectral ratios (SRV and SRS), and the WIRELESS demonstrated task-specific discrepancies. Furthermore, both the fundamental frequency (F0) and the cepstral peak prominence of the vowel samples (CPPV) had intraclass correlation coefficient (ICC) values above 0.9, indicating high reliability. These variables, F0 and CPPV were considered highly reliable for voice recordings across different microphone types. However, caution should be exercised when analyzing and interpreting variables such as the SR, CSID, and AVQI, which may be influenced by the type of microphone used.

Speech Enhancement Based on Modified IMCRA Using Spectral Minima Tracking with Weighted Subband Selection (서브밴드 가중치를 적용한 스펙트럼 최소값 추적을 이용하는 수정된 IMCRA 기반의 음성 향상 기법)

  • Park, Yun-Sik;Park, Gyu-Seok;Lee, Sang-Min
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.49 no.3
    • /
    • pp.89-97
    • /
    • 2012
  • In this paper, we propose a novel approach to noise power estimation for speech enhancement in noisy environments. The method based on IMCRA (improved minima controlled recursive averaging) which is widely used in speech enhancement utilizes a rough VAD (voice activity detection) algorithm which excludes speech components during speech periods in order to improves the performance of the noise power estimation by reducing the speech distortion caused by the conventional algorithm based on the minimum power spectrum derived from the noisy speech. However, since the VAD algorithm is not sufficient to distinguish speech from noise at non-stationary noise and low SNRs (signal-to-noise ratios), the speech distortion resulted from the minimum tracking during speech periods still remained. In the proposed method, minimum power estimate obtained by IMCRA is modified by SMT (spectral minima tracking) to reduce the speech distortion derived from the bias of the estimated minimum power. In addition, in order to effectively estimate minimum power by considering the distribution characteristic of the speech and noise spectrum, the presented method combines the minimum estimates provided by IMCRA and SMT depending on the weighting factor based on the subband. Performance of the proposed algorithm is evaluated by subjective and objective quality tests under various environments and better results compared with the conventional method are obtained.

The Study on Antenna Performancet Test for Surion Radio Installation and Optimal Positioning (수리온 통신 안테나 장착 및 최적위치 선정을 위한 안테나 성능시험에 관한 연구)

  • No, Sangwan;Lee, Soonyoung;Kim, Minsoo
    • Journal of Aerospace System Engineering
    • /
    • v.14 no.5
    • /
    • pp.122-129
    • /
    • 2020
  • Surion is required to install radios (U/VHF-AM, VHF-FM) capable of omni-directional communication. Therefore, this paper demonstrates the antenna performance test for the installation of the Surion communication antenna and the selection of optimal location. A simulation pattern analysis was performed employing the antenna, and a coupling test was performed by creating a new evaluation criterion. In addition, the results of the pattern flight test conducted at the previously suggested 1:20 turn and separation distance ratio were observed to be normal. However, the occurrence of voice cutoff was noted in the long-distance flight test. Therefore, in this paper, 1:300 (15 NM) is proposed as a new optimal ratio for predicting the long-distance flight test results in advance. Finally, the effectiveness of the proposed evaluation criteria was verified through long-distance flight tests. Consequently, it is expected to reduce the development schedule and cost by reducing the trial and error of the performance test for the Surion model. Also, the results of this study are expected to be used as standards for the installation of communication antenna and quality tests for other helicopters.

Resource Allocation Information Sorting Algorithm Variable Selection Scheme for MF-TDMA DAMA Satellite Communication System (MF-TDMA DAMA 위성통신 시스템에서의 자원할당정보 정렬 알고리즘 가변 선택기법 연구)

  • Park, Nam Hyoung;Han, Joo-Hee;Han, Ki Moon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.2
    • /
    • pp.1-7
    • /
    • 2020
  • In modern society, as technology has advanced and human life area has expanded, there has been an increasing demand for high-quality voice and video communications services without restrictions on time and place. In response to this demand, satellite communications systems that provide a wide range of communications and that offer multiple access are evolving day by day. In satellite communications systems such as Digital Video Broadcasting - Return Channel Via Satellite (DVB-RCS) and Warfighter Information Network-Tactical (WIN-T), the multi-frequency time division multiple access (MF-TDMA) demand assigned multiple access (DAMA) scheme is used for efficient resource allocation. In this scheme, since the satellite terminals periodically request resources from the network controller, and the network controller dynamically allocates resources, it is necessary to arrange resource allocation information from time to time. Shortening of the alignment time is a more important factor in a satellite communications system in which a long transmission delay occurs due to long-distance transmission and reception. In this paper, we propose a sorting algorithm variable-selection scheme that shortens the sorting time by cross-selecting the sorting algorithm based on a threshold value, while setting the number of frames in the MF-TDMA DAMA satellite communications system as the threshold value.

Real-Time Implementation of the G.729.1 Using ARM926EJ-S Processor Core (ARM926EJ-S 프로세서 코어를 이용한 G.729.1의 실시간 구현)

  • So, Woon-Seob;Kim, Dae-Young
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.33 no.8C
    • /
    • pp.575-582
    • /
    • 2008
  • In this paper we described the process and the results of real-time implementation of G.729.1 wideband speech codec which is standardized in SG15 of ITU-T. To apply the codec on ARM926EJ-S(R) processor core. we transformed some parts of the codec C program including basic operations and arithmetic functions into assembly language to operate the codec in real-time. G.729.1 is the standard wideband speech codec of ITU-T having variable bit rates of $8{\sim}32kbps$ and inputs quantized 16 bits PCM signal per sample at the rate of 8kHz or 16kHz sampling. This codec is interoperable with the G.729 and G.729A and the bandwidth extended wideband($50{\sim}7,000Hz$) version of existing narrowband($300{\sim}3,400Hz$) codec to enhance voice quality. The implemented G.729.1 wideband speech codec has the complexity of 31.2 MCPS for encoder and 22.8 MCPS for decoder and the execution time of the codec takes 11.5ms total on the target with 6.75ms and 4.76ms respectively. Also this codec was tested bit by bit exactly against all set of test vectors provided by ITU-T and passed all the test vectors. Besides the codec operated well on the Internet phone in real-time.

Uncommon Causes of Hoarseness (타질환과 동반된 애성)

  • 윤희병;김미자;정대현;박승훈;박옥경;목정민;전승하;강주원
    • Proceedings of the KOR-BRONCHOESO Conference
    • /
    • 1982.05a
    • /
    • pp.8.2-8
    • /
    • 1982
  • Hoarseness is the change of voice quality which represents the abnormal function of phonation and is the main symtom of the laryngeal diseases. The etiology of hoarseness are known more than 50 causes, among them, viral upper respiratory infection is the main cause of hoarseness and the laryngeal nodule and polyp, laryngeal paralysis, laryngeal cancer, laryngeal papilloma and the laryngeal tuberculosis are the other causes of hoarseness in that order. Recently, the authors experienced 4 cases of uncommon etiology of hoarseness, so we present the cases with the brief review of literatures. Case 1. 29 years old male Admitted in Dept. of neurosurgery due to Traffic Accident. He had a trauma on the anterior neck. Hoarseness was developed on 1 month after the accident. Laryngoscopic finding; Paramedian paralysis of left vocal cord. Displacement of left arytenoid cartilage. Case 2. 53 years old male Admitted in Dept. of General Surgery due to Clonorchis Sinensis, under the general endotracheal anesthesia, Choledochostomy was performed. Laryngoscopic finding; Median paralysis of left vocal cord. Case 3. 56 years old male Admitted in Dept. of Internal Medicine due to Aortic Aneurysm. Hoarseness was developed on 3 months prior to admission. Laryngoscopic finding; Intermediated position paralysis of left vocal cord. Displacement of left arytenoid cartilage. Case 4. 74 years old male Admitted in Dept. of Internal Medicine due to Bronchogenic carcinoma. Hoarseness was developed on 3 years prior to admission. Laryngoscopic finding; Paramedian paralysis of right vocal cord.

  • PDF

UPnP-based QoSAgent for QoS-guaranteed Streaming Service in Home Networks (서비스 품질이 보장되는 홈 네트워크 스트리밍 전송을 위한 UPnP 기반의 QoSAgent에 대한 연구)

  • Lee Hyun-Ryong;Moon Sung-Tae;Kim Jong-Won;Shin Dong-Yun
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.5B
    • /
    • pp.430-441
    • /
    • 2006
  • As the various A/V devices and home networks are delivered to users, home networks are changing to an entertainment network. It is expected that the required network bandwidth and the amount of usage of media content in home entertainment networks will be increased. Although the access networks and home networks becoming a high speed network, there remains the problems for QoS-guaranteed media content transfer in home networks. Also, in the home network, there can be network traffic caused by applications like video conferencing, video telephone, and VoIP(voice over IP) as well as inner network traffic of home network. Since media content transfer requires the real-time delivery, it is very important and basic requirement that is to transfer media content to A/V device user wants while keeping the media quality. Even though there are many middleware protocol for home networking, they provide basic device discovery and control or simple functions for QoS-guaranteed media content transfer that are not enough to provide QoS-guaranteed media transfer service that user wants. Thus, in this paper, we propose the technique based on UPnP(universal plug and play) protocol for QoS-guaranteed media content transfer in the home network. The proposed technique is compatible with UPnP and can be used with UPnP as additional functions. In this paper, we utilize VideoLAN application to verify the proposed technique. We add the additional modules that support the proposed technique's function to VideoLAN and we verify the its functions through various test scenarios.