• Title/Summary/Keyword: Audio Technology

Search Result 642, Processing Time 0.028 seconds

The Development of Virtual Reality Telemedicine System for Treatment of Acrophobia (고소공포증 치료를 위한 가상현실 원격진료 시스템의 개발)

  • Ryu Jong Hyun;Beack Seung Hwa;Paek Seung Eun;Hong Sung Chan
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.52 no.4
    • /
    • pp.252-257
    • /
    • 2003
  • Acrophobia is an abnormal fear of heights. Medications or cognitive-behavior methods have been mainly used as a treatment. Lately the virtual reality technology has been applied to that kind of anxiety disorders. A virtual environment provides patient with stimuli which arouses phobia, and exposing to that environment makes him having ability to over come the fear. Recently, the patient can take diagnose from a medical doctor in distance with the telemedicine system. The hospital and doctors can get the medical data, audio, video, signals in the actual examination room or operating room via a live interactive system. Audio visual and multimedia conference service, online questionary, ECG signal transfer system, update system are needed in this system. Virtual reality simulation system that composed with a position sensor, head mount display, and audio system, is also included in this telemedicine system. In this study, we tried this system to the acrophobia patient in distance.

Classification of Phornographic Videos Based on the Audio Information (오디오 신호에 기반한 음란 동영상 판별)

  • Kim, Bong-Wan;Choi, Dae-Lim;Lee, Yong-Ju
    • MALSORI
    • /
    • no.63
    • /
    • pp.139-151
    • /
    • 2007
  • As the Internet becomes prevalent in our lives, harmful contents, such as phornographic videos, have been increasing on the Internet, which has become a very serious problem. To prevent such an event, there are many filtering systems mainly based on the keyword-or image-based methods. The main purpose of this paper is to devise a system that classifies pornographic videos based on the audio information. We use the mel-cepstrum modulation energy (MCME) which is a modulation energy calculated on the time trajectory of the mel-frequency cepstral coefficients (MFCC) as well as the MFCC as the feature vector. For the classifier, we use the well-known Gaussian mixture model (GMM). The experimental results showed that the proposed system effectively classified 98.3% of pornographic data and 99.8% of non-pornographic data. We expect the proposed method can be applied to the more accurate classification system which uses both video and audio information.

  • PDF

Korean and Japanese EFL Learners' AV Benefit for the Perception of the Liquid Contrast in English (한국인 및 일본인 영어학습자의 유음 차이 지각에 미치는 시각/청각 효과)

  • Chung, Hyun-Song
    • MALSORI
    • /
    • no.60
    • /
    • pp.1-11
    • /
    • 2006
  • This paper investigated AV benefit of Korean and Japanese EFL learners' perception of the liquid contrast in English. In a perception experiment, the two English consonants /l/ and /r/ were embedded in initial and medial position in nonsense words in the context of the vowels /i, a, u/. Singletons and clusters were included in the speech material. Audio and video recordings were made using a total of 108 items. The items were presented to Korean and Japanese learners of English in three conditions: audio-alone (A), visual-alone (V) and audio-visual presentation (AV). The results showed that there was no evidence of AV benefit for the perception of the /l/-/r/ contrast for either Korean or Japanese learners of English. The results suggest that increasing auditory proficiency in identifying a non-native contrast is linked with an increasing proficiency in using visual cues to the contrast.

  • PDF

Automatic melody extraction algorithm using a convolutional neural network

  • Lee, Jongseol;Jang, Dalwon;Yoon, Kyoungro
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.12
    • /
    • pp.6038-6053
    • /
    • 2017
  • In this study, we propose an automatic melody extraction algorithm using deep learning. In this algorithm, feature images, generated using the energy of frequency band, are extracted from polyphonic audio files and a deep learning technique, a convolutional neural network (CNN), is applied on the feature images. In the training data, a short frame of polyphonic music is labeled as a musical note and a classifier based on CNN is learned in order to determine a pitch value of a short frame of audio signal. We want to build a novel structure of melody extraction, thus the proposed algorithm has a simple structure and instead of using various signal processing techniques for melody extraction, we use only a CNN to find a melody from a polyphonic audio. Despite of simple structure, the promising results are obtained in the experiments. Compared with state-of-the-art algorithms, the proposed algorithm did not give the best result, but comparable results were obtained and we believe they could be improved with the appropriate training data. In this paper, melody extraction and the proposed algorithm are introduced first, and the proposed algorithm is then further explained in detail. Finally, we present our experiment and the comparison of results follows.

Development of the central control system using IP PBX convergence with broadcasting function (방송기능이 있는 IP PBX 융합 중앙 관제 시스템 개발)

  • Kim, Sam-Taek
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.7
    • /
    • pp.1-6
    • /
    • 2021
  • Currently, virus infection such as Corona 19 has become commonplace, and interest in unmanned systems is increasing in the field for non-face-to-face ICT services. In this paper, the function and performance of remotely successfully controlling a store through video and audio using an IP PBX with a broadcasting function was verified through a test. And the fully unmanned system is not gaining credibility due to various technical problems, however the central control system is a very efficient and reliable system because the controller can directly control the customer while monitoring the access and the inside of the store through the video and audio. In the future, we plan to study a completely unmanned remote control system using A.I technology.

A DNN-Based Personalized HRTF Estimation Method for 3D Immersive Audio

  • Son, Ji Su;Choi, Seung Ho
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.13 no.1
    • /
    • pp.161-167
    • /
    • 2021
  • This paper proposes a new personalized HRTF estimation method which is based on a deep neural network (DNN) model and improved elevation reproduction using a notch filter. In the previous study, a DNN model was proposed that estimates the magnitude of HRTF by using anthropometric measurements [1]. However, since this method uses zero-phase without estimating the phase, it causes the internalization (i.e., the inside-the-head localization) of sound when listening the spatial sound. We devise a method to estimate both the magnitude and phase of HRTF based on the DNN model. Personalized HRIR was estimated using the anthropometric measurements including detailed data of the head, torso, shoulders and ears as inputs for the DNN model. After that, the estimated HRIR was filtered with an appropriate notch filter to improve elevation reproduction. In order to evaluate the performance, both of the objective and subjective evaluations are conducted. For the objective evaluation, the root mean square error (RMSE) and the log spectral distance (LSD) between the reference HRTF and the estimated HRTF are measured. For subjective evaluation, the MUSHRA test and preference test are conducted. As a result, the proposed method can make listeners experience more immersive audio than the previous methods.

Unleashing the Power of Digitization: National Mission for Manuscript's Analysis and Special Efforts in Enhancing Manuscript Usability and Preserving Cultural Heritage in Uttar Pradesh

  • Priyanka Jaiswal;Abhay Chaurasia;Ajay Pratap Singh
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.14 no.3
    • /
    • pp. 7-18
    • /
    • 2024
  • The present study focuses on the activities and efforts of the National Mission for Manuscripts (NMM) in the Uttar Pradesh region, which is known for its vast area, population, and rich cultural heritage. The aim is to examine the digitization work carried out by the NMM in this area, as digitization plays a crucial role in preserving our country's rich ancient heritage. The importance of safeguarding cultural heritage is universally acknowledged, and digitization serves as a vital tool in this endeavour. Through digitization, we can protect and preserve our heritage for future generations. The government has implemented several commendable initiatives for manuscript digitization, and the NMM stands as a prominent organization dedicated to the conservation of cultural heritage. The NMM possesses a diverse range of cultural heritage resources, including photographic slides, photographs, digital images, photo-negatives, motion pictures, audio spools, microfiche, LP records, endangered manuscripts, audio and videotapes, digital images, microfilms, digital audio and video files, and more. The mission has undertaken extensive digitization efforts to conserve and provide access to a significant portion of its collection. This study is unique as it explores the digital conservation and digitization practices of a premier institute working in the field of art and cultural heritage in Uttar Pradesh. With its extensive network of institutions, the mission aims to cover all manuscripts, digitize them, and consolidate them on a common platform for easy access and utilization.

A Study on Elemental Technology Identification of Sound Data for Audio Forensics (오디오 포렌식을 위한 소리 데이터의 요소 기술 식별 연구)

  • Hyejin Ryu;Ah-hyun Park;Sungkyun Jung;Doowon Jeong
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.34 no.1
    • /
    • pp.115-127
    • /
    • 2024
  • The recent increase in digital audio media has greatly expanded the size and diversity of sound data, which has increased the importance of sound data analysis in the digital forensics process. However, the lack of standardized procedures and guidelines for sound data analysis has caused problems with the consistency and reliability of analysis results. The digital environment includes a wide variety of audio formats and recording conditions, but current audio forensic methodologies do not adequately reflect this diversity. Therefore, this study identifies Life-Cycle-based sound data elemental technologies and provides overall guidelines for sound data analysis so that effective analysis can be performed in all situations. Furthermore, the identified elemental technologies were analyzed for use in the development of digital forensic techniques for sound data. To demonstrate the effectiveness of the life-cycle-based sound data elemental technology identification system presented in this study, a case study on the process of developing an emergency retrieval technology based on sound data is presented. Through this case study, we confirmed that the elemental technologies identified based on the Life-Cycle in the process of developing digital forensic technology for sound data ensure the quality and consistency of data analysis and enable efficient sound data analysis.

Study of DRM Application for the Portable Digital Audio Device (휴대용 디지털 오디오 기기에서의 DRM 적용에 관한 연구)

  • Cho, Nam-Kyu;Lee, Dong-Hwi;Lee, Dong-Chun;J. Kim, Kui-Nam;Park, Sang-Min
    • Convergence Security Journal
    • /
    • v.6 no.4
    • /
    • pp.21-27
    • /
    • 2006
  • With the introduction of sound source sharing over the high speed internet and portable digital audio, the digitalization of sound source has been rapidly expanded and the sales and distribution of sound sources of the former offline markets are stagnant. Also, the problem of infringement of copyright is being issued seriously through illegal reproduction and distribution of digitalized sound sources. To solve these problems, the DRM technology for protecting contents and copyrights in portable digital audio device began to be introduced. However, since the existing DRM was designed based on the fast processing CPU and network environment, there were many problems in directly applying to the devices with small screen resolution, low processing speed and network function such as digital portable audio devices which the contents are downloadable through the PC. In this study, the DRM structural model which maintains similar security level as PC environment in the limited hardware conditions such as portable digital audio devices is proposed and analyzed. The proposed model chose portable digital audio exclusive device as a target platform which showed much better result in the aspect of security and usability compared to the DRM structure of exiting portable digital audio device.

  • PDF

전력선통신 기반 A/V 신호 송수신기 설계 및 특성분석

  • Kim, Ji-Hyeong;Kim, Yong-Gap
    • Proceedings of the Korean Institute of Electrical and Electronic Material Engineers Conference
    • /
    • 2009.11a
    • /
    • pp.285-285
    • /
    • 2009
  • Due to a development of a modem technology as Power Line Communication(PLC) over 200 Mbps, the high-speed multi-media data trasmission could be currently possible. In This paper we develop a high quality media transmitter-receiver based on merging the HomePlug AV, which is 200 Mbps class PLC technology and HDMI Interface technology. Smart Live 6 software were used for the assessment of audio property. As the result of measurement of the HD class images by capturing from the receiver of the PLC, the quality of images couldn't be confirm any deterioration, which has compared with original reflections. In case of audio part as the result of confirmation of the Phase, Magnitude, it has been confirmed that over 90% of nomal transmition and receiving of acoustic signal.

  • PDF