• Title/Summary/Keyword: 음성 녹음 파일

Search Result 20, Processing Time 0.04 seconds

A Study on Forgery Techniques of Smartphone Voice Recording File Structure and Metadata (스마트폰 음성녹음 파일 구조 및 메타데이터의 위변조 기법에 관한 연구)

  • Park, Jae Wan;Kwak, Won Jun;Lee, John Sanghyun
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.807-812
    • /
    • 2022
  • Recently, as the number of voice recording files submitted as court evidence increases, the number of cases claiming forgery is also increasing. If the audio recording file structure and metadata, which are objective grounds, are completely forged, it is actually impossible to detect forgery of the sophisticated audio recording file. It is extremely rare for the court to reject the file structure and metadata analysis performed with the forged audio recording file. The purpose of this study is to prove that forgery of voice recording file structure and metadata is easily possible. To this end, in this study, it was introduced that forgery detection is impossible when the 'mixed paste' function, which enables sophisticated editing based on the typification of the editing method of voice recording files, is applied. Moreover, it has been proven through experiments that forgery of file structure and metadata is possible. Therefore, a stricter standard for judging the admissibility of evidence is required when the audio recording file is adopted as digital evidence. This study will not only contribute to the standard of integrity in the adoption of digital evidence by judges, but will also contribute to the method of constructing a dataset for artificial intelligence in detecting forgery of recorded files that is expected to be developed in the future.

Limitations of Spectrogram Analysis for Smartphone Voice Recording File Forgery Detection (스마트폰 음성 녹음 파일 위변조 검출을 위한 스펙트로그램 분석의 한계점)

  • Sangmin Han;Yeongmin Son;Jae Wan Park
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.2
    • /
    • pp.545-551
    • /
    • 2023
  • As digital information is readily available to everyone today, the adoption of digital evidence is increasing. However, it is virtually impossible to determine the authenticity of forgery in the case of a voice recording file that has gone through a sophisticated editing process along with the spread of various voice file editing tools. This study aims to prove that forgery, which is difficult to distinguish from the original file, is possible by using insertion, deletion, linking, and synthetic editing technologies in voice recording files. This study presents the difficulty of detecting forgery by encoding a forged voice file with the same extension as the original. In addition, it was shown that forgery detection is impossible if additional transition band deletion and secondary encoding are performed only for experiments in which features occurred. Through this, this study is expected to contribute to the establishment of more stringent evidence admissibility criteria for adopting voice recording files as digital evidence.

A comparative analysis of metadata structures and attributes of Samsung smartphone voice recording files for forensic use (법과학적 활용을 위한 삼성 스마트폰 음성 녹음 파일의 메타데이터 구조 및 속성 비교 분석 연구)

  • Ahn, Seo-Yeong;Ryu, Se-Hui;Kim, Kyung-Wha;Hong, Ki-Hyung
    • Phonetics and Speech Sciences
    • /
    • v.14 no.3
    • /
    • pp.103-112
    • /
    • 2022
  • Due to the popularization of smartphones, most of the recorded speech files submitted as evidence of recent crimes are produced by smartphones, and the integrity (forgery) of the submitted speech files based on smartphones is emerging as a major issue in the investigation and trial process. Samsung smartphones with the highest domestic market share are distributed with built-in speech recording applications that can record calls and voice, and can edit recorded speech. Unlike editing through third-party speech (audio) applications, editing by their own builtin speech applications has a high similarity to the original file in metadata structures and attributes, so more precise analysis techniques need to prove integrity. In this study, we constructed a speech file metadata database for speech files (original files) recorded by 34 Samsung smartphones and edited speech files edited by their built-in speech recording applications. We analyzed by comparing the metadata structures and attributes of the original files to their edited ones. As a result, we found significant metadata differences between the original speech files and the edited ones.

Intelligent Classification and Context Analysis System of Voice Data (음성 데이터의 지능적 분류 및 컨텍스트 분석 시스템 구현)

  • Choi, HyeonSeok;Joo, SungHwan;Kim, DaeCheon;Park, YeChan;Yeom, Sanggil;Choo, HyeonSeung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2016.10a
    • /
    • pp.162-163
    • /
    • 2016
  • 사람은 의사소통을 위해 음성, 글자, 몸짓 등 다양한 매개체를 활용한다. 오늘날 스마트폰의 발달로 문자의 비중이 높아지고 있지만 음성 대화는 여전히 사람들 사이에서 가장 많이 사용되어지는 의사소통 수단이다. 음성 대화는 녹음해서 음성 데이터로 남길 수 있다. 음성을 녹음하는 과정은 간편하지만 녹음파일에서 원하는 데이터를 찾는 것은 많은 시간이 소모된다. 본 논문에서는 음성 데이터를 인식하여 텍스트화 시키고 문자화 된 데이터를 분석하여 사용자에게 효율적으로 분류하는 시스템을 제안한다. 이 시스템으로 사용자는 음성 데이터의 내용을 들어보지 않고 파악할 수 있으며 원하는 내용을 찾을 수도 있다.

Limitations of Analyzing Metadata and File Structure of Audio Files for Legal Evidence: Focusing on Samsung Smartphones (법적 증거 능력을 위한 오디오 파일의 메타데이터 및 파일 구조 분석의 한계: 삼성 스마트폰을 중심으로)

  • Sungwon Baek;Homin Son;Jae Wan Park
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.6
    • /
    • pp.1103-1109
    • /
    • 2023
  • Today, as the number of audio files submitted as legal evidence increases with the proliferation of smartphones, the integrity of audio files has become an important issue. Accordingly, the purpose of this study is to explore whether the metadata and file structure of audio files recorded on Samsung smartphones can be manipulated to be identical to the original. This study was based on Samsung smartphones, the most widely used in Korea, and conducted experiments on the built-in voice recording app and the 'Easy Voice Recorder' app, which is the most popular recording app. Through the experiments of this study, it was proven that the metadata and file structure of audio files can be manipulated. Therefore, this study reveals that metadata and file structure analysis have limitations in proving the integrity when audio files are analyzed for adoption as legal evidence. They also argue for the need to develop new voice file forgery technology that does not rely on metadata and file structure analysis.

The Development of Program for Remote Hearing Diagnosis and Generate Data (원격 난청진단 및 진단용 데이터 생성 프로그램 개발)

  • Kim, Ho-Jin;Yi, Se-Ung;Kim, Kyoung-Hoon;Nam, Ji-Seung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2001.10b
    • /
    • pp.1099-1102
    • /
    • 2001
  • 난청은 주위에서 흔히 볼 수 있는 병리적 현상이지만 현재로서는 난청도를 진단하기 위해서 병의원을 찾아 진단을 해야만 한다. 또한 실제로 병의원에서 진단하는 과정은 매우 간단하며 이는 PC기반의 프로그램을 개발함으로써도 진단이 가능할 정도이다. 이에 본 논문에서는 원격지에서도 인터넷을 통해 난청자가 자신의 난청도를 진단할 수 있는 프로그램을 개발하였다. 기본적으로 일정한 데이터 파일외 주파수 변조를 통해 각 주파수 영역별 난청정도를 스스로 진단 한 수 있으며, 필요에 따라서는 음성 녹음한 후 주파수 변조를 통해 진단용 파일을 생성하여 진단 및 교육을 할 수 있다. 진단후 결과는 데이터 베이스로 저장이 되며, 진단용 음성 파일 또한 데이터 베이스로 보관된다. 또한 본 프로그램은 대화형식의 진행으로 구성되어 일반 사용자에게 편리한 인터페이스를 제공한다.

  • PDF

Development of Device Prototypes for Toddler Language Learning using Sensors and TTS API (센서와 tts api를 이용한 유아용 언어 학습용 디바이스 프로토타입 개발)

  • Choi, Hyo Hyun;Yu, Kwang Sik
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.07a
    • /
    • pp.509-510
    • /
    • 2021
  • 본 논문에서는 라즈베리파이, 마이크, 스피커, 버튼센서, 진동센서, TTS(Text-To-Speech) api를 활용하여 유아용 언어 학습용 디바이스를 개발한다. 학습시키고 싶은 단어가 쓰여져 있는 상자를 유아가 건드리면 그 단어의 소리가 나는 것을 가정하였다. 사용자가 버튼을 통해 직접 단어를 녹음을 할 수 있으며 웹페이지를 통해 텍스트(영어)를 입력하면 text-to-speech api를 통해 텍스트(영어)에 맞는 음성파일을 제공받을 수 있다. 저장된 음성파일은 진동센서를 통해 진동이 감지되면 스피커를 통해서 출력이 되는 시스템으로 구성하였다.

  • PDF

Universal Personal Telecommunications using Specialized Resource Functions in the Intelligent Peripheral (Intelligent Peripheral의 특수 음성 자원을 이용한 Universal Personal Telecommunications 서비스)

  • Kim, Gi-Ryeong;Kim, Tae-Il;Choe, Go-Bong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.6
    • /
    • pp.1506-1514
    • /
    • 1996
  • This paper proposes enhanced features for the Universal Telecommunications (UPT), voice authentication and voice synthesis, using the specialized resources functions in the Intelligent peripheral(IP). The proposed voice authentication is able ti provide simple and user-friendly security mechanism and to prevent unauthorized users from fraudulently using the UPT number. Also, traditional UPT service deliveries only fixed message to the UPT user, but the proposed UPT service can support flexible message transfer by use of the voice synthesis.

  • PDF

Real Time Monitoring of Smart Baby Bed using Sound Sensor (사운드 센서 이용한 Smart 아기 침대의 실시간 모니터링)

  • Kwon, Mi-Rae;Park, Hwa-Jung;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.230-232
    • /
    • 2021
  • As the ratio of double-income households and parental leave use increase, there is an increasing demand for products that help when raising children alone. In particular, there is a lot of demand for baby beds that help raise children without difficulty even by themselves. Therefore, in this paper, we propose a real-time monitoring of a smart crib using a sound sensor. The proposed bed uses a sound sensor to detect the child's crying and condition, and the measured sensor output value can be checked with a mobile application. When the sound sensor output value is more than a certain value, a voice file such as a lullaby recorded with the voice of the parents is played, and if the sensor output value is less than a certain value, the playing voice file is stopped. If the sensor output value continues to exist after a certain period of time, a pop-up notification is sent to the mobile application. This allows the child to quickly calm down with a sense of stability and comfort through the recorded voices of the parents, and the parents can remotely monitor the child's condition in real time.

  • PDF

Development of the Remote-Educating Communication Tool using DCOM Voice Module (DCOM 음성 모듈을 이용한 원격 대화식 학습 도구의 개발)

  • Jang, Seung-Ju
    • The KIPS Transactions:PartA
    • /
    • v.10A no.2
    • /
    • pp.173-180
    • /
    • 2003
  • This paper proposes Remote Educating Communication Tool (RECT) that allows students and teachers to communicate using Web-based Bulletin Board System. The distance teaching using DCOM (Distributed Component Object Model) voice module is used to enhance academic accomplishments for students in computer class. The DCOM voice module to be used in distance learning is designed, implemented and applied to teachers and students in the computer class in order to measure and analyze academic results. The RECT server provides Q&A sessions between students and teachers in the BBS using recording and playback functions. The client RECT includes recording and playback functions. The client module of RECT receives and uses DCOM module. When recording, the client transmits voice files with the recorded content to the server.