• Title/Summary/Keyword: 음성 변조

Search Result 91, Processing Time 0.021 seconds

Comparison of Korean Speech De-identification Performance of Speech De-identification Model and Broadcast Voice Modulation (음성 비식별화 모델과 방송 음성 변조의 한국어 음성 비식별화 성능 비교)

  • Seung Min Kim;Dae Eol Park;Dae Seon Choi
    • Smart Media Journal
    • /
    • v.12 no.2
    • /
    • pp.56-65
    • /
    • 2023
  • In broadcasts such as news and coverage programs, voice is modulated to protect the identity of the informant. Adjusting the pitch is commonly used voice modulation method, which allows easy voice restoration to the original voice by adjusting the pitch. Therefore, since broadcast voice modulation methods cannot properly protect the identity of the speaker and are vulnerable to security, a new voice modulation method is needed to replace them. In this paper, using the Lightweight speech de-identification model as the evaluation target model, we compare speech de-identification performance with broadcast voice modulation method using pitch modulation. Among the six modulation methods in the Lightweight speech de-identification model, we experimented on the de-identification performance of Korean speech as a human test and EER(Equal Error Rate) test compared with broadcast voice modulation using three modulation methods: McAdams, Resampling, and Vocal Tract Length Normalization(VTLN). Experimental results show VTLN modulation methods performed higher de-identification performance in both human tests and EER tests. As a result, the modulation methods of the Lightweight model for Korean speech has sufficient de-identification performance and will be able to replace the security-weak broadcast voice modulation.

Limitations of Spectrogram Analysis for Smartphone Voice Recording File Forgery Detection (스마트폰 음성 녹음 파일 위변조 검출을 위한 스펙트로그램 분석의 한계점)

  • Sangmin Han;Yeongmin Son;Jae Wan Park
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.2
    • /
    • pp.545-551
    • /
    • 2023
  • As digital information is readily available to everyone today, the adoption of digital evidence is increasing. However, it is virtually impossible to determine the authenticity of forgery in the case of a voice recording file that has gone through a sophisticated editing process along with the spread of various voice file editing tools. This study aims to prove that forgery, which is difficult to distinguish from the original file, is possible by using insertion, deletion, linking, and synthetic editing technologies in voice recording files. This study presents the difficulty of detecting forgery by encoding a forged voice file with the same extension as the original. In addition, it was shown that forgery detection is impossible if additional transition band deletion and secondary encoding are performed only for experiments in which features occurred. Through this, this study is expected to contribute to the establishment of more stringent evidence admissibility criteria for adopting voice recording files as digital evidence.

A Study on Forgery Techniques of Smartphone Voice Recording File Structure and Metadata (스마트폰 음성녹음 파일 구조 및 메타데이터의 위변조 기법에 관한 연구)

  • Park, Jae Wan;Kwak, Won Jun;Lee, John Sanghyun
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.807-812
    • /
    • 2022
  • Recently, as the number of voice recording files submitted as court evidence increases, the number of cases claiming forgery is also increasing. If the audio recording file structure and metadata, which are objective grounds, are completely forged, it is actually impossible to detect forgery of the sophisticated audio recording file. It is extremely rare for the court to reject the file structure and metadata analysis performed with the forged audio recording file. The purpose of this study is to prove that forgery of voice recording file structure and metadata is easily possible. To this end, in this study, it was introduced that forgery detection is impossible when the 'mixed paste' function, which enables sophisticated editing based on the typification of the editing method of voice recording files, is applied. Moreover, it has been proven through experiments that forgery of file structure and metadata is possible. Therefore, a stricter standard for judging the admissibility of evidence is required when the audio recording file is adopted as digital evidence. This study will not only contribute to the standard of integrity in the adoption of digital evidence by judges, but will also contribute to the method of constructing a dataset for artificial intelligence in detecting forgery of recorded files that is expected to be developed in the future.

Speech Secure Communication Control System Using Chaos Generation Circuit (카오스 발생회로를 이용한 음성비화통신 제어시스템)

  • 여지환;이익수
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.6 no.2
    • /
    • pp.72-80
    • /
    • 1996
  • 본 논문은 카오스 발생회로(chaos generation circuit)를 설계 및 구현하고, 카오스 회로들간의 카오스 동기화(chaos synchronization) 알고리즘을 기초로 하여 카오스 변조통신을 구성하여 음성비화시스템의 구현에 관하여 실험적으로 검증했다. Pecorra와 Carroll 은 카오스 신호로서 카오스 시스템을 구동하면 카오스 동기화가 가능하다고 발표했다. 이러한 제어기법은 카오스 신호의 마스킹과 복원 알고리즘의 등장을 초래했다. 본 연구는 카오스 신호를 발생하기 위하여 상태변수 기법을 이용하여 로렌쯔(Lorenz) 카오스 발생회로를 하드웨어로 구현했다. 수치 실험 및 보드상의 실험에서 카오스 회로는 카오스의 동적특성을 나타냈으며, 카오스 발생회로들간의 카오스 동기제어를 아루었다. 음성비화를 위한 카오스 신호의 변조는 카오스 신호에 음성신호를 가산하여 송신하며, 광대역)spread spectrum)의 카오스 변조통신 (chaotic modulation communication)에서 음성정보는 수신시스템의 카오스 부시스템에서 카오스 신호를 빼내어 신호를 복원한다. 보드상에서 하드웨어로 구현한 카오스 변.복조 통신시스템을 구성하여 음성신호와 비화통신에 카오스 지능제어기법을 적용하였다.

  • PDF

Intelligibility Analysis on the Eavesdropping Sound of Glass Windows Using MTF-STI (MTF-STI를 이용한 유리창 도청음의 명료도 분석)

  • Kim, Hee-Dong;Kim, Yoon-Ho;Kim, Seock-Hyun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.1
    • /
    • pp.8-15
    • /
    • 2007
  • Speech intelligibility of the eavesdropping sound is investigated on a acoustic cavity - glass window coupled system. Using MLS (Maximum Length Sequency) signal as a sound source, acceleration and velocity responses of the glass window are measured by accelerometer and laser doppler vibrometer. MTF (Modulation Transfer Function) is used to identify tile speech transmission characteristics of the cavity and window system. STI (Speech Transmission Index) based upon MTF is calculated and speech intelligibility of the vibration sound of the glass window is estimated. Speech intelligibilities by the acceleration signal and the velocity signal are compared. Finally, intelligibility of the conversation sound is confirmed by the subjective test.

Chaotic Speech Secure Communication Using Feedback Masking Techniques (피드백 마스킹 기법을 사용한 카오스 음성비화통신)

  • 이익수;여지환
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2002.12a
    • /
    • pp.353-356
    • /
    • 2002
  • 본 논문은 카오스 신호를 이용하여 안전한 음성신호의 전송을 위한 아날로그 비화통신 시스템의 성능분석에 관한 연구이다. 기존의 카오스 동기화 및 카오스 변조통신 알고리즘을 개선하여 실제 통신환경에서 발생하는 다양한 조건들을 적용하여 음성신호의 복원능력을 모의실험으로 분석하였다. 일반적인 PC 제어기법과 제안한 피드백 마스킹 기법을 사용하여 송신단에서 음성신호를 카오스 신호로 마스킹하여 변조하고, 통신채널에 잡음신호를 추가하여 전송하였다. 수신단에서는 카오스 응답시스템을 이용하여 음성신호를 복조하고, 복원성능을 계산하기 위하여 아날로그 복원 에러신호의 평균전력을 제안하여 계산하였다. 실험결과 마스킹 정도, 파라미터들의 민감성, 채널잡음 등에 대하여 PC 제어기법보다 피드백 제어기법의 복원성능이 우수함을 확인할 수 있었다. 또한 로렌쯔 카오스 시스템을 비화통신시스템에 사용할 경우 파라미터들의 조합으로 암호키를 구성해야 하므로 키값들의 선정에 기준이 되는 파라미터 변화율에 대응하는 복원에러율의 관계를 실험 값으로 구하였다.

Speech Intelligibility Analysis on the Laser Detected Sound of the Glass Windows (유리창의 레이저 탐지음에 대한 음성명료도 분석)

  • Kim, Seock-Hyun;Lee, Hyun-Woo;Kim, Hee-Dong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.2
    • /
    • pp.127-134
    • /
    • 2009
  • In this study, possibility of the laser eavesdropping is investigated on the window glasses with various thicknesses, Glass windows are excited by maximum length sequency (MLS) signal and the vibration sound is detected by a laser doppler vibrometer. From the detected sound, speech intelligibility is objectively estimated. Speech transmission index (STI), which is based on the modulation transfer function (MTF). is calculated for the estimation. Finally, disturbing wave effect on the speech intelligibility is analysed by using an outside speaker and a window shaker attached on the glass window. The purpose of the study is to estimate the possibility of remote eavesdropping by the laser sensor and to evaluate the performance of the homemade window shaker to protect from the remote eavesdropping.

Experimental Results of SSB Modem in Shallow Sea (천해에서 SSB 모뎀의 실험결과 분석)

  • Ju, Hyng-Jun;Han, Jung-Woo;Kim, Ki-Man
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.6
    • /
    • pp.990-998
    • /
    • 2008
  • In this paper we achieve experimental data evaluation using SSB(Single-side band) modulation in the ocean. Present research in underwater communication is applying digital modulation, OFDM and MIMO system. However, Commercial modems using analog modulation techniques in oceans. So, we achieved experimental for modem appliance development of correct high quality in South Korea sea characteristics. This experimets achievd useing SSB analog modulation in Jin-hae shore of shallow water condition. Used data are tonal and LFM signal for getting underwater channel characterisitcs and female Korean speech for speech communications.

Effects of PSK Modulation Methods in Underwater Acoustic Communication (PSK 변조방식이 수중통신에 미치는 영향에 관한 연구)

  • Cho, Jin-Soo;Jung, Seung-Back;Shim, Tae-Bo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.7
    • /
    • pp.366-374
    • /
    • 2007
  • In underwater wireless communication, needs for long distance communication using the high frequency are surpassing ones of short range communication by ultrasonic wave, and demands for transmitting and receiving various data such as voice or high resolution image data are increasing as well. In this work, we studied the effects on the real underwater communication depending on the difference of digital modulation methods. Simulation shows that only the performance of GMSK among many other PSK based modulation schemes(BPSK, QPSK, MSK, GMSK) is significant. Test condition simulates the oceanographic conditions along the 207-survey line, 15Km south of Busan and SNR is maintained 35dB or below. Simulated tests are composed of both transmitting image data($3{\times}10^5$ pixel, 4 bit per pixel) and voice communication($10^{-2}$BER, channel capacity of 1Kbps). Test results show that there are gain of about 7 seconds in transmission time in image transmission case, where channel capacity for BPSK, QPSK, and MSK and for GMSK were 65 Kbps and 45 Kbps, respectively and gain of about 8Km in distances in voice communication case.