• Title/Summary/Keyword: Audio device


Multimedia Messaging Service Adaptation for the Mobile Learning System Based on CC/PP

  • Kim, Su-Do;Park, Man-Gon
    • Journal of Korea Multimedia Society / v.11 no.6 / pp.883-890 / 2008
  • The development of high-speed third-generation mobile communication and handsets has made it possible to provide a variety of multimedia content through mobile services. MMS (Multimedia Messaging Service) can present various multimedia content, such as text, audio, images, and video, in a unified presentation format, which makes it applicable as a new type of ubiquitous learning. In this study, we propose a mobile learning system design that provides profiles conforming to the CC/PP standard and generates SMIL-based multimedia messages through adaptation steps that take into account the learning environment, the content type, and the properties of the learner's device.
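The adaptation step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: real CC/PP profiles are RDF documents, and the `profile` dictionary, its keys, and the `build_smil` helper below are hypothetical stand-ins. Only the SMIL element names (`smil`, `head`, `layout`, `root-layout`, `body`, `par`, `text`, `img`, `audio`, `video`) come from the SMIL language itself.

```python
import xml.etree.ElementTree as ET


def build_smil(profile, content):
    """Build a SMIL message containing only the media the device accepts.

    `profile` is a hypothetical, simplified stand-in for a CC/PP device profile.
    `content` maps SMIL media element names to source files.
    """
    smil = ET.Element("smil")
    head = ET.SubElement(smil, "head")
    layout = ET.SubElement(head, "layout")
    # Size the presentation to the screen reported by the profile.
    ET.SubElement(
        layout,
        "root-layout",
        width=str(profile["screen_width"]),
        height=str(profile["screen_height"]),
    )
    body = ET.SubElement(smil, "body")
    # <par> plays its children in parallel for the given duration.
    par = ET.SubElement(body, "par", dur=content["duration"])
    for kind in ("text", "img", "audio", "video"):
        # Adaptation: drop media types the device does not accept.
        if kind in content and kind in profile["accepts"]:
            ET.SubElement(par, kind, src=content[kind])
    return ET.tostring(smil, encoding="unicode")


# Hypothetical handset without video support.
profile = {"screen_width": 176, "screen_height": 144,
           "accepts": {"text", "img", "audio"}}
content = {"duration": "10s", "text": "lesson1.txt", "img": "lesson1.jpg",
           "audio": "lesson1.amr", "video": "lesson1.3gp"}
print(build_smil(profile, content))
```

With this profile the video element is omitted, while the text, image, and audio elements are kept. A fuller system would presumably also transcode or resize media rather than only filtering it.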


A Real-Time Sound Recognition System with a Decision Logic of Random Forest for Robots (Random Forest를 결정로직으로 활용한 로봇의 실시간 음향인식 시스템 개발)

  • Song, Ju-man;Kim, Changmin;Kim, Minook;Park, Yongjin;Lee, Seoyoung;Son, Jungkwan
    • The Journal of Korea Robotics Society / v.17 no.3 / pp.273-281 / 2022
  • In this paper, we propose a robot sound recognition system that detects various sound events in real time using a microphone on a robot. To achieve real-time performance, we use a VGG11 model, which includes several convolutional neural networks, with a real-time normalization scheme. The VGG11 model is trained on a database augmented across 24 environments (12 reverberation times × 2 signal-to-noise ratios). Additionally, a decision logic based on the random forest algorithm is designed to generate event signals for robot applications. For specific classes of acoustic events, this logic performs better than using the outputs of the network model alone. Experimental results show the performance of the proposed sound recognition system on a real-time device for robots.

Design and Implementation of Scent-Supported Educational Content using Arduino

  • Hye-kyung Kwon;Heesun Kim
    • International journal of advanced smart convergence / v.12 no.4 / pp.260-267 / 2023
  • Due to the development of science and technology in the 4th Industrial Revolution, a variety of content is being developed and utilized through educational courses linked to digital textbooks. Students use smart devices to engage in realistic virtual learning experiences, interacting with the content in digital textbooks. However, while much realistic content offers visual and auditory effects such as 3D VR, AR, and holograms, olfactory content that evokes actual sensations has not yet been introduced. Therefore, in this paper, we designed and implemented 4D educational content by adding the sense of smell to existing content. The implemented content was tested in classrooms through a curriculum-based evaluation. Classes taught with olfactory-enhanced content showed a higher percentage of correct answers than those using traditional audio-visual materials, indicating improved understanding.

Design of Music Player Application System based on Android (안드로이드 기반의 뮤직 플레이어 어플리케이션 시스템 설계)

  • Ryu, Chang-Su;Lee, Myung-Hwan;Hur, Chang-Wu
    • Journal of the Korea Institute of Information and Communication Engineering / v.18 no.5 / pp.1237-1242 / 2014
  • Since the smartphone appeared, music and multimedia playback have become essential functions of a single device. This is very convenient, but sound quality remains a point of contention, so many smartphone users rely on music player applications. Using these applications has led people to think about the relationship between music playback and sound quality. However, the applications are not perfect, so it is hard to choose a good one. This paper examines the sound-quality advantages of music player applications currently sold on Android Market using the RightMark Audio Analyzer program, and proposes an Android music player application system design that covers the disadvantages found in those applications.

A Speaker Detection System based on Stereo Vision and Audio (스테레오 시청각 기반의 화자 검출 시스템)

  • An, Jun-Ho;Hong, Kwang-Seok
    • Journal of Internet Computing and Services / v.11 no.6 / pp.21-29 / 2010
  • In this paper, we propose a system that detects the current speaker among a number of users. The proposed speaker detection system, based on stereo vision and audio, is mainly composed of the following: position estimation of speaker candidates using a stereo camera and microphones, detection of the current speaker, and acquisition of speaker information based on a mobile device. We use Haar-like features and the AdaBoost algorithm to detect the faces of speaker candidates with the stereo camera, and the position of each candidate is estimated by triangulation. Next, the Time Delay Of Arrival (TDOA) is estimated by Cross Power Spectrum Phase (CPSP) analysis to find the direction of the source with two microphones. Finally, we acquire the information of the speaker, including position, voice, and face, by comparing the information from the stereo camera with that from the two microphones. Furthermore, the proposed system includes a TCP client/server connection method for mobile service.
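The TDOA step can be sketched as follows. This is an illustrative, standard-library-only version of CPSP analysis (also known as GCC-PHAT), not the authors' code. The function names, the naive DFT, the 16 kHz sample rate, and the 0.2 m microphone spacing are all assumptions made for the example.

```python
import cmath
import math
import random


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform, adequate for short frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
                  for t in range(n))
        out.append(acc / n if inverse else acc)
    return out


def cpsp_delay(sig_a, sig_b):
    """Return the lag in samples by which sig_b trails sig_a."""
    n = len(sig_a)
    spec_a, spec_b = dft(sig_a), dft(sig_b)
    cross = []
    for a, b in zip(spec_a, spec_b):
        c = b * a.conjugate()
        mag = abs(c)
        # Keep only the phase of the cross-power spectrum (the "PHAT" weighting).
        cross.append(c / mag if mag > 1e-12 else 0j)
    corr = [v.real for v in dft(cross, inverse=True)]
    peak = max(range(n), key=lambda i: corr[i])
    # Indices past n/2 correspond to negative lags.
    return peak if peak <= n // 2 else peak - n


def arrival_angle_deg(lag, sample_rate, mic_spacing, speed_of_sound=343.0):
    """Convert a lag to a far-field arrival angle relative to broadside."""
    ratio = speed_of_sound * (lag / sample_rate) / mic_spacing
    return math.degrees(math.asin(max(-1.0, min(1.0, ratio))))


# Synthetic check: microphone B hears the same burst 5 samples later.
rng = random.Random(0)
burst = [rng.uniform(-1.0, 1.0) for _ in range(32)]
mic_a = burst + [0.0] * 32
mic_b = [0.0] * 5 + burst + [0.0] * 27
lag = cpsp_delay(mic_a, mic_b)
print(lag, arrival_angle_deg(lag, sample_rate=16000, mic_spacing=0.2))
```

Normalizing the cross-power spectrum to unit magnitude leaves only phase information, so the inverse transform peaks sharply at the true delay. A practical system would use an FFT library instead of the naive DFT shown here.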

The Design of Remote Control System using Bluetooth Wireless Technology (블루투스 무선기술을 응용한 원격제어 시스템의 설계)

  • 전형준;이창희
    • Journal of the Korea Computer Industry Society / v.4 no.4 / pp.547-552 / 2003
  • In this thesis, interference in Bluetooth networks requiring security was minimized, and the security of the piconet was strengthened by assigning an identical PIN code to the Bluetooth devices that establish a specific piconet during the authentication stage. To establish a Bluetooth piconet system, a unique ID was assigned to each Bluetooth device, communication algorithms with different data formats between devices were designed, and an embedded hardware module using an ARM processor and the uCOS-II RTOS was implemented. CPU efficiency in the module was increased by about 30% by modifying functions that include block parameters to work as nonblocking; with the increased efficiency of the whole piconet, the module could be used as an access point. The module could transmit a maximum of 10 image frames as well as audio signals by switching packets effectively according to channel conditions. Through the above process, video, audio, and data could be transmitted well by the Bluetooth managing program, and the possibility of a commercial remote control system using Bluetooth technology was suggested.


Deep Learning based Raw Audio Signal Bandwidth Extension System (딥러닝 기반 음향 신호 대역 확장 시스템)

  • Kim, Yun-Su;Seok, Jong-Won
    • Journal of IKEEE / v.24 no.4 / pp.1122-1128 / 2020
  • Bandwidth extension refers to restoring and expanding a narrowband (NB) signal that was damaged in the encoding and decoding process, due to limited channel capacity or the characteristics of the codec installed in the mobile communication device, and converting it to a wideband (WB) signal. Bandwidth extension research mainly focuses on voice signals; existing methods such as SBR (Spectral Band Replication) and IGF (Intelligent Gap Filling) work on the high band in the frequency domain and restore missing or damaged high bands based on complex feature extraction processes. In this paper, we propose an autoencoder-based deep learning model that outputs a bandwidth-extended signal. Using the residual connection of one-dimensional convolutional neural networks (CNNs), the model extends the bandwidth from a time-domain input signal of a fixed length without complicated pre-processing. In addition, it was confirmed that the damaged high band can be restored even when training on a dataset containing various types of sound sources, including music, rather than speech alone.
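The residual connection around a one-dimensional convolution, mentioned in this abstract, can be shown in isolation with a small sketch. It is not the proposed model: there is no training and no encoder/decoder structure. The function names, the odd kernel length, and the ReLU activation are assumptions chosen for illustration.

```python
def conv1d_same(signal, kernel):
    """Zero-padded 1-D convolution (cross-correlation form).

    Assumes an odd-length kernel so the output has the same length as the input.
    """
    half = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, weight in enumerate(kernel):
            idx = i + j - half
            if 0 <= idx < len(signal):
                acc += weight * signal[idx]
        out.append(acc)
    return out


def residual_block(signal, kernel):
    """Return signal + ReLU(conv1d(signal)).

    The skip path carries the time-domain input through unchanged, so the
    convolution only needs to model the correction added on top of it.
    """
    filtered = conv1d_same(signal, kernel)
    activated = [max(0.0, v) for v in filtered]  # ReLU, an assumed activation
    return [x + f for x, f in zip(signal, activated)]


frame = [1.0, -2.0, 3.0]
print(residual_block(frame, [0.0, 0.0, 0.0]))  # zero kernel: input passes through
print(residual_block(frame, [0.0, 1.0, 0.0]))  # identity kernel: adds ReLU(frame)
```

With a zero kernel the block reduces to the identity, which is the property that lets a residual network start from "copy the input" and learn only the missing content.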

MPEG-D USAC: Unified Speech and Audio Coding Technology (MPEG-D USAC: 통합 음성 오디오 부호화 기술)

  • Lee, Tae-Jin;Kang, Kyeong-Ok;Kim, Whan-Woo
    • The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.589-598 / 2009
  • As mobile devices become multi-functional and converge into a single platform, there is a strong need for a codec that can provide consistent quality for speech and music content. MPEG-D USAC standardization activities started at the 82nd MPEG meeting with a CfP, and WD3 was approved at the 88th MPEG meeting. MPEG-D USAC is a convergence of AMR-WB+ and HE-AAC V2. Specifically, USAC uses three core codecs (AAC, ACELP, and TCX) for low-frequency regions, SBR for high-frequency regions, and the MPEG Surround tool for stereo information. USAC can provide consistent sound quality for both speech and music content and can be applied to various applications such as multimedia download to mobile devices, digital radio, mobile TV, and audio books.

High Efficiency Switch-Mode LED driver for Visible Light Communication System (가시광 통신 시스템을 위한 고효율 스위치모드 LED 구동회로)

  • Kang, Jung-Min;Cho, Sang-Ho;Hong, Sung-Soo;Han, Sang-Kyoo;SaKong, Suk-Chin
    • The Transactions of the Korean Institute of Power Electronics / v.16 no.4 / pp.358-365 / 2011
  • Recently, the LED (Light Emitting Diode), which is replacing incandescent bulbs and fluorescent lights, has attracted great attention as the most promising candidate for the next-generation lighting source due to its environment-friendly characteristics, long life, and excellent efficiency. Moreover, since it is a semiconductor device that can convert electric energy to visible light at very high speed, it can also be used as a communication device. Therefore, VLC (Visible Light Communication) using the LED can perform near-field communication and lighting at the same time without additional expense. However, since the switching device of the conventional LED driver for VLC is operated in the linear region, it has several drawbacks, such as poor power conversion efficiency and serious heat generation. In contrast, since the proposed driver is operated in the on/off switching region, it features higher efficiency and reduced heat generation. To verify the validity of the proposed LED driver, experimental results are given from a prototype 20W-rated LED driver applied to a 3MHz bps broadcasting audio system.

Proposal of Hostile Command Attack Method Using Audible Frequency Band for Smart Speaker (스마트 스피커 대상 가청 주파수 대역을 활용한 적대적 명령어 공격 방법 제안)

  • Park, Tae-jun;Moon, Jongsub
    • Journal of Internet Computing and Services / v.23 no.4 / pp.1-9 / 2022
  • Recently, the functions of smart speakers have diversified, and their penetration rate is increasing. As they become more widespread, various techniques have been proposed to cause anomalous behavior in smart speakers. Among these attacks, Dolphin Attack, which causes anomalous behavior in a Voice Controllable System (VCS), is a representative method. With this method, a third party controls the VCS using the ultrasonic band (f>20kHz) without the user noticing. However, since the method uses the ultrasonic band, it requires an ultrasonic speaker or a dedicated ultrasonic device capable of outputting an ultrasonic signal. In this paper, a smart speaker is controlled without installing an additional device such as an ultrasonic device, by generating an audio signal modulated at a frequency (18 to 20 kHz) that lies within the human audible band but is difficult for a person to hear. As a result, with the proposed method, humans could not recognize the voice commands even though they were in the audible band, while the smart speaker could be controlled with a probability of 82 to 96%.