Search | Korea Science

Comparison of Korean Speech De-identification Performance of Speech De-identification Model and Broadcast Voice Modulation (음성 비식별화 모델과 방송 음성 변조의 한국어 음성 비식별화 성능 비교)

Seung Min Kim;Dae Eol Park;Dae Seon Choi
- Smart Media Journal
- /
- v.12 no.2
- /
- pp.56-65
- /
- 2023
In broadcasts such as news and coverage programs, voice is modulated to protect the identity of the informant. Adjusting the pitch is commonly used voice modulation method, which allows easy voice restoration to the original voice by adjusting the pitch. Therefore, since broadcast voice modulation methods cannot properly protect the identity of the speaker and are vulnerable to security, a new voice modulation method is needed to replace them. In this paper, using the Lightweight speech de-identification model as the evaluation target model, we compare speech de-identification performance with broadcast voice modulation method using pitch modulation. Among the six modulation methods in the Lightweight speech de-identification model, we experimented on the de-identification performance of Korean speech as a human test and EER(Equal Error Rate) test compared with broadcast voice modulation using three modulation methods: McAdams, Resampling, and Vocal Tract Length Normalization(VTLN). Experimental results show VTLN modulation methods performed higher de-identification performance in both human tests and EER tests. As a result, the modulation methods of the Lightweight model for Korean speech has sufficient de-identification performance and will be able to replace the security-weak broadcast voice modulation.
https://doi.org/10.30693/SMJ.2023.12.2.56 인용 PDF

A Comparative Study of Western Singer's Voice and a Pansori Singer's Voice Based on Glottal Image and Acoustic Characteristics (성대형태 및 음향발현에서 성악 발성 및 판소리 발성의 비교 연구)

Kim, Sun-Sook
- Speech Sciences
- /
- v.11 no.2
- /
- pp.165-177
- /
- 2004
Western singers voice have been studied in music science since the early 20th century. However, Korean traditional singers voice have not yet been studied scientifically. This study is to find the physiological and acoustic characteristics of Pansori singers voices. Western singers participated for comparative purposes. Ten western singers and ten Pansori singers participated in this study. The subjects spoke and sung seven simple vowels /a, e, i, o, u, c, w/. An analysis of Glottal image was done by Scope View and acoustic characteristics of speech and singing voice were analyzed by CSL. The results are as follows: (1) Glottal gestures of Pansori singers showed asymmetric vocal folds. (2) Singing vowel formants of Pansori singers showed breathiness based on Spectrogram. (3) Music formant of western singers appeared in around 3kHz area, however, Pansori singers formant appeared in low frequency area. Modulation of vibrato showed 6 frequency per sec in case of western singers. Pansori singers showed no deep modulation of vibrato on spectrogram.
PDF

Effects of PSK Modulation Methods in Underwater Acoustic Communication (PSK 변조방식이 수중통신에 미치는 영향에 관한 연구)

Cho, Jin-Soo;Jung, Seung-Back;Shim, Tae-Bo
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.7
- /
- pp.366-374
- /
- 2007
In underwater wireless communication, needs for long distance communication using the high frequency are surpassing ones of short range communication by ultrasonic wave, and demands for transmitting and receiving various data such as voice or high resolution image data are increasing as well. In this work, we studied the effects on the real underwater communication depending on the difference of digital modulation methods. Simulation shows that only the performance of GMSK among many other PSK based modulation schemes(BPSK, QPSK, MSK, GMSK) is significant. Test condition simulates the oceanographic conditions along the 207-survey line, 15Km south of Busan and SNR is maintained 35dB or below. Simulated tests are composed of both transmitting image data($3{\times}10^5$ pixel, 4 bit per pixel) and voice communication($10^{-2}$BER, channel capacity of 1Kbps). Test results show that there are gain of about 7 seconds in transmission time in image transmission case, where channel capacity for BPSK, QPSK, and MSK and for GMSK were 65 Kbps and 45 Kbps, respectively and gain of about 8Km in distances in voice communication case.
https://doi.org/10.7776/ASK.2007.26.7.366 인용 PDF KSCI

Duty Ratio-Displacement Model in PWM Control of Voice Coil Actuator (보이스 코일 액츄에이터의 PWM 제어에서 듀티비-변위 모델 연구)

Hwang, Jin-Dong;Kwak, Yong-Kil;Kim, Ju-Hyun;Kim, Sun-Ho;Ahn, Jung-Hwan
- Journal of the Korean Society of Manufacturing Process Engineers
- /
- v.6 no.2
- /
- pp.59-66
- /
- 2007
Voice coil actuator is used linear motion system that requires precision positioning control. In order to control precision positioning of voice coil actuator, relation model between duty ratio and moving displacement of voice coil actuator is needed. This paper present a duty ratio - displacement model in PWM control of voice coil actuator. Transfer function of voice coil actuator is obtained by combining voice coil motor's equation of motion with the equation of circuit and characteristic of voice coil motor. Consider to initial condition of velocity and current, transfer function is transformed mathematical model. The induced model can predict output displacement, velocity and current according to duty ratio and amplitude. The model is verified by experimental tests such as velocity and displacement response of voice coil motor. Simulated results have tracking errors of less than 10 percent of experimental results.
PDF

A Study on Formants of Vowels for Speaker Recognition (화자 인식을 위한 모음의 포만트 연구)

Ahn Byoung-seob;Shin Jiyoung;Kang Sunmee
- MALSORI
- /
- no.51
- /
- pp.1-16
- /
- 2004
The aim of this paper is to analyze vowels in voice imitation and disguised voice, and to find the invariable phonetic features of the speaker. In this paper we examined the formants of monophthongs /a, u, i, o, {$\omega},{\;}{\varepsilon},{\;}{\Lambda}$/. The results of the present are as follows : $\circled1$ Speakers change their vocal tract features. $\circled2$ Vowels /a, ${\varepsilon}$, i/ appear to be proper for speaker recognition since they show invariable acoustic feature during voice modulation. $\circled3$ F1 does not change easily compared to higher formants. $\circled4$ F3-F2 appears to be constituent for a speaker identification in vowel /a/ and /$\varepsilon$/, and F4-F2 in vowel /i/. $\circled5$ Resulting of F-ratio, differences of each formants were more useful than individual formant of a vowel to speaker recognition.
PDF

Simulation of tracking errors for non-circular cutting using voice coil motor (VCM을 이용한 비원형 형상 가공의 궤적 오차 시뮬레이션)

Hwang J.D.;Kwak Y.K.;Kim S.H.;Ahan J.H.
- Proceedings of the Korean Society of Precision Engineering Conference
- /
- 2006.05a
- /
- pp.57-58
- /
- 2006
A Simulation model is developed to minimize the path tracking errors when the non-circular cutting is done by a VCM(voice coil motor) driven tool. The relationship between PWM(Pulse Width Modulation) duty ratio and velocity of voice coil motor is theoretically derived from combining the circuit equation for the coils and the motion equation for the magnetic rod of the voice coil motor. The path tracking errors are showed differently according to the rotational speed, the number of segments and the control period in digital control. Given a required accuracy in the non-circular cutting, the optimal values for those parameters are determined based on the developed simulation model.
PDF

A Study on the underwater communication system of ultrasonic transducer (압전 초음파 센서를 이용한 수중통신에 관한 연구)

Kim, Dong-Hyun;Woo, Hyoung-Gwan;Hwang, Hyun-Suk;Jin, Hong-Bum;Song, Joon-Tae
- Proceedings of the KIEE Conference
- /
- 2000.07c
- /
- pp.1658-1660
- /
- 2000
Simple signs were usually exchanged as the means of underwater communications. As people recently, need more informations for underwater activities, necessities of underwater communication systems exchanging hunman voice are increased. The purpose of this paper is understanding the ordinary characteristics of underwater communication and investigating the necessary conditions for a good underwater communication system by making a basic communication module. The experiment is achieved by applying AM (Amplitude Modulation) which is mainly used for the underwater communication systems and using common ultrasonic transducers. Ultrasonic transducers usually have narrow bandwidth for transducing electrical energy to mechanical energy. For improvement of sound reconstruction, transducers need more bandwidth which covers voice's frequency range, and goof linearity characteristics in this frequency range. As underwater transmissions have many factors to distort signals. Amplitude Modulation is not a proper way for underwater communications. Using digital signal by sampling human voice should be a good way for this systems, because digital communication simplify transmitting signals.
PDF

The Design and Implementation of S/W Packet Modem based on Frequency Hopping Legacy Radio System (재래식 주파수도약 통신장비용 S/W 패킷모뎀 개발 및 적용에 관한 연구)

Koo, Jung;Pyo, Sang-Ho;Kang, Kyeong-Sung;Kim, Ki-Hyung
- Journal of the Korea Institute of Military Science and Technology
- /
- v.14 no.2
- /
- pp.222-231
- /
- 2011
In this paper, we have proposed a method which can make it possible to stably transmit and receive data like the ARC-164 radio frequency hopping environment as a S/W packet modem with PSK modulation. This is a method that the S/W packet modem with PSK digital modulation and the use of PC sound cards change over from data to voice signals and then transmit/receive data. We confirmed not only that it is possible to solve the slow speed communication with the use of sending data through multi-channels and PSK modulation that has the ability to methodically improve transmission rates, but also that it is possible to send the state of frequency hopping stably. In conclusion, we've confirmed both tactical values that though the transmission rate may be a tad slow, a state of frequency hopping of more than 94% confidence plus voice and data can be sent via radio at the same time. In this paper, the proposed S/W packet modem is only an implemented S/W component, so when we apply it to aircraft that we don't consider EMC problems with, then we have the advantage of a wider use of conventional UHF/VHF/HF radio that is possible to voice communication. If we recognize these operational requirements, we can apply for a lot of field equipment efficiently.
https://doi.org/10.9766/KIMST.2011.14.2.222 인용 PDF KSCI

A Study on the Digital Design for Voice Modem Using the Multicarrier DS-CDMA in Powerline Channels (전력선 채널에서 멀티캐리어 DS-CDMA를 이용한 전력선 음성모뎀의 디지털부 구현에 관한 연구)

이상준;김민걸;이종성;구시경;박광철;오정현;김기두
- Proceedings of the IEEK Conference
- /
- 2000.06a
- /
- pp.77-80
- /
- 2000
In this paper, we implemented the voice modem using the multicarrier DS-CDMA in powerline channels. Both TMS320C5402 of Texas Instrument and FPGA FLEX 10K EPF10K100ARC240 of ALTERA are used to realize the proposed system. For robustness in the powerline channel, we used multicarrier DS-CDMA modulation, convolutional encoding/Viterbi decoding, and interleaving. Finally, we showed satisfactory performance in the laboratory experiment.
PDF

Voice Tremor in Parkinsonism : A Preliminary Study for Differential Diagnosis (파킨슨증의 음성진전 : 감별진단을 위한 예비연구)

Choi, Seong-Hee;Kim, Hyang-Hee;Lee, Won-Yong;Choi, Hong-Shik
- Speech Sciences
- /
- v.12 no.3
- /
- pp.19-33
- /
- 2005
Tremor is a main factor of parkinsonism. Voice tremor may be the first, later or the only symptom of a neurological disease and its frequency, amplitude, and regularity may differ among the diseases of different neural subsystems. Differential diagnosis between idiopathic Parkinson's disease (IPD) and multiple system atrophy (MSA) has been difficult. This study included three groups: (1) 6 IPD patients; (2) 6 MSA patients; and (3) 20 ageand sex-matched normal controls. The MDVP (Multidimensional Voice Program) was used to analyze the sustained /a/phonation. The results were as follows: (1) frequency perturbation parameters (jitter, sPPQ, Vf0) and FTRI of tremor parameter of two patient groups were statistically different from those of the controls (p < .01); (2) measures were higher in short-term and long-term f0 and amplitude perturbation in MSA than IPD; (3) however, any acoustic parameters between IPD and MSA were not statistically different; except for the rate of frequency tremor, 4$\sim$5 Hz in IPD, 5$\sim$11 Hz in MSA and (4) the pattern of regularity for voice tremor through histogram indicated that amplitude of IPD was irregular while both f0 and amplitude of MSA were irregular. In conclusion, F0, rate of frequency tremor, and pattern of f0 regularity may be predictors for differential diagnosis. These findings might signify that voice tremor of parkinsonism was resulted from modulation of f0.
PDF

Search Result 47, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)