• Title/Summary/Keyword: voice image


Design of a Three Dimensional Audio System for Multicast Conferencing (멀티캐스트 화상회의를 위한 3-D 음향시스템 설계)

  • 김영오;고대식
    • The Journal of Korean Institute of Communications and Information Sciences / v.25 no.1B / pp.71-76 / 2000
  • In a multimedia teleconferencing system with many participants, the participants' faces can be perceived through the video image. However, distinguishing each participant's voice and conveying a sense of spaciousness is very hard, because the voices of all participants are processed as one-dimensional data. In this paper, we implemented a three-dimensional audio rendering system using the HRTF (Head-Related Transfer Function) and a distance-sense reproduction method, and determined the optimal locations of the participants for a teleconferencing system. In listening tests over elevation and azimuth angles, we showed that directional perception was better for azimuth angles than for elevation angles. In particular, we showed that placing participants using the HRTFs for azimuth angles of 10°, 90°, 270°, and 350° was efficient for a teleconferencing system with four participants. We also proposed using a distance cue to enhance realism and to place larger numbers of participants (more than five).


Voice-based messenger using NXT touch-sensor input unit and the Bluetooth wireless communication for the blind (터치 센서 입력기와 블루투스 무선 통신을 이용한 시각 장애인용 음성 기반 메신저)

  • Lee, Jung-Il;Kim, Soon-Cheol;Won, Hui-Chul
    • Journal of Korea Society of Industrial Information Systems / v.13 no.5 / pp.78-86 / 2008
  • Many people use messengers to talk with remote friends or to send urgent files to remote co-workers. Recently, it has also become possible to use a messenger with the user's image. However, these messenger technologies are of no use to the blind. To address this problem, we propose a voice-based messenger with a Braille system for the blind. The proposed messenger enables blind users to listen to sentences received from a remote user. It also lets them listen to the sentences they have written before sending, to check that the sentences are written correctly. The Braille system for writing sentences can be implemented using the programmable NXT system, which contains a 32-bit ARM-7 micro-controller, with four touch sensors. Finally, we apply Bluetooth technology for wireless communication between the Braille system and the proposed messenger.


Establishment for Efficiency Air-To-Ground Air Operation Model in Link-16 (Link-16 기반의 효율적인 공대지 항공작전 모델 설계)

  • Lee, Hyeong-Heon;Jang, Hyeong-Jun;Kim, Yeong-Gu;Lim, Jae-Sung
    • Journal of the Korea Institute of Military Science and Technology / v.13 no.5 / pp.861-868 / 2010
  • The CAS, X-ATK, and INT models, regarded as the most typical air-to-ground operation models in the ROKAF, are designed mainly as voice-centered systems between aircraft and ground control facilities. It is therefore critical to develop a new Link-16-based model for ROK-US combined operations among the F-15K, AWACS, M-SAM, and KDX-III, all equipped with Link-16. Former studies were limited to the CAS operation and focused mainly on reducing the voice transmission time needed to exchange information between mission steps while keeping the existing operation steps. This paper addresses that weakness by designing a new air-to-ground operation model for CAS, X-ATK, and INT missions using Enterprise Architecture OV6c, which enables aircraft and ground control facilities, or aircraft among themselves, to obtain real-time information on location, identification, and armament, as well as real-time image data, through the broadcasting function. Based on an analysis of the new operation model, we conclude that by simultaneously exchanging mission information between the nodes concerned through the broadcasting function of Link-16, it is possible to cut superfluous mission steps and reduce the mission time. This improves battle efficiency and decision-making tempo as well as battlefield situational awareness.

Lens Position Error Compensated Fast Auto-focus Algorithm in Mobile Phone Camera Using VCM (VCM을 이용한 휴대폰 카메라에서의 렌즈 위치 오차 보상 고속 자동 초점 알고리즘)

  • Han Chan-Ho;Kim Tae-Kyu;Kwon Seong-Geun
    • Journal of Korea Multimedia Society / v.9 no.5 / pp.585-594 / 2006
  • Because of size limits, most mobile phone cameras use a voice coil motor (VCM) instead of a step motor to control auto-focus. An optical system using a VCM has the property that the focus values vary even when the same current is applied. This means that a lens position error occurs because of the characteristics of the VCM. In this paper, an algorithm is proposed to compensate for the lens position error using the step size and the search count of each stage. In the proposed algorithm, a middle search stage with a -7 step is inserted into the conventional search algorithm for fast auto-focus searching, and the final search step size is set to +1 for precise focus control. In the experimental results, the proposed algorithm found the focus value faster than the conventional one. Moreover, the image quality obtained with the proposed algorithm was superior to that of the conventional one.


A Study on the Filter Modeling of Fading Channel for Digital Transmission (디지털 전송을 위한 페이딩 채널의 필터 모델링에 관한 연구)

  • 임승각;김노환
    • KSCI Review / v.2 no.1 / pp.55-67 / 1995
  • Recently, thanks to advances in semiconductor and computer technology for transmitting information to remote points, communication has changed from analog to digital form, making possible the high-speed transmission of voice, data, and moving images rather than voice only as in the past. This improves noise immunity and lowers cost, but at the expense of transmission bandwidth. In radio communication channels in particular, some method is needed to reduce fading, which is proportional to the transmission bandwidth. When designing a digital communication system, we must consider the fading effect in order to determine the transmitting power, the modulation/demodulation method, the transmission speed, and the bit error rate. This paper mainly concerns a channel simulator that describes the fading effect during transmission with a computer model, and digital filter modeling of the radio fading channel using the transmitted and received signals. By taking the inverse of the characteristic of the modeled filter, it is possible to improve the communication system by reducing the distortion and inter-symbol interference that occur in the channel.


Voice Assistant for Visually Impaired People (시각장애인을 위한 음성 도우미 장치)

  • Chae, Jun-Gy;Jang, Ji-Woo;Kim, Dong-Wan;Jung, Su-Jin;Lee, Ik Hyun
    • The Journal of Korean Institute of Information Technology / v.17 no.4 / pp.131-136 / 2019
  • People with impaired vision face many inconveniences in daily life, such as distinguishing colors, identifying currency notes, and sensing the ambient temperature. To assist visually impaired people, we propose a system that uses optical and infrared cameras. In the proposed system, the optical camera collects features related to colors and currency notes, while the infrared camera obtains temperature information. The user selects the desired service by pushing a button, and the appropriate voice information is provided through the speaker. The device can distinguish 16 kinds of colors, four different currency notes, and four levels of temperature, and the current accuracy is around 90%. Accuracy can be improved further through block-wise image input, machine learning, and a higher-grade infrared camera. In addition, the device will be attached to a walking stick so that it is easier to carry and more convenient to use.

Big Data Analysis Method for Recommendations of Educational Video Contents (사용자 추천을 위한 교육용 동영상의 빅데이터 분석 기법 비교)

  • Lee, Hyoun-Sup;Kim, JinDeog
    • Journal of the Korea Institute of Information and Communication Engineering / v.25 no.12 / pp.1716-1722 / 2021
  • Recently, the capacity of video content delivery services has been increasing significantly, and with it the importance of user recommendation. In addition, this content has a wide variety of characteristics, which makes it difficult to express them properly with only the few keywords (elements used in search, such as titles, tags, topics, and words) specified by the user. Consequently, existing recommendation systems that rely on user-defined keywords are limited in that they do not properly reflect the characteristics of the objects. In this paper, we compare the efficiency of a method using subtitles derived from voice data with that of an image comparison method using keyframes, in the recommendation module of an educational video service system. Furthermore, based on the experimental results, we propose the types and environments of video content in which each analysis technique can be used efficiently.

A Study on Interactive Talking Companion Doll Robot System Using Big Data for the Elderly Living Alone (빅데이터를 이용한 독거노인 돌봄 AI 대화형 말동무 아가야(AGAYA) 로봇 시스템에 관한 연구)

  • Song, Moon-Sun
    • The Journal of the Korea Contents Association / v.22 no.5 / pp.305-318 / 2022
  • We focused on the care effectiveness of interactive AI robots and developed an AI toy robot called 'Agaya' to contribute to more personalized, human-centered care. First, by applying P-TTS technology, users can maximize intimacy by selecting for themselves the voice of the person they want to hear. Second, users can find comfort in their own way through functions for storing good memories and recalling them. Third, the robot provides the five senses through the roles of eyes, nose, mouth, ears, and hands, in pursuit of better personalized services. Fourth, we attempted to develop technologies such as warm temperature maintenance, aroma, sterilization and fine dust removal, and a convenient charging method. These capabilities will expand the effective use of interactive robots by elderly people and contribute to building a positive image of the elderly as people who can plan their remaining years productively and independently.

An Efficient 2-dimensional Addressing Mode for Image Processor (영상처리용 프로세서를 위한 효율적인 이차원 어드레스 지정 기법)

  • Go, Yun-Ho;Yun, Byeong-Ju;Kim, Seong-Dae
    • Journal of the Institute of Electronics Engineers of Korea SP / v.38 no.5 / pp.486-497 / 2001
  • In this paper, we propose a new addressing mode that a programmable image processor can use to perform image-processing algorithms effectively. Conventional addressing modes are suitable for one-dimensional data such as voice, whereas the proposed addressing mode considers the two-dimensional characteristics of image data. The proposed instruction for two-dimensional addressing requires two operands to specify a pixel and does not require any change to the memory architecture. The proposed two-dimensional addressing mode has the following advantages. The proposed instruction combines the several instructions needed to load pixel data from external memory into a register. Hence, it reduces the required code size and so satisfies the high-performance and low-power requirements of an image processor. In addition, it exploits the inherent two-dimensional characteristics of image data and offers a user-friendly instruction to the assembly programmer. The proposed two-dimensional addressing mode is applicable to DSPs, media processors, graphics devices, and so on. In this paper, we present the new concept of a two-dimensional addressing mode and an efficient hardware implementation method for it.


Hand Biometric Information Recognition System of Mobile Phone Image for Mobile Security (모바일 보안을 위한 모바일 폰 영상의 손 생체 정보 인식 시스템)

  • Hong, Kyungho;Jung, Eunhwa
    • Journal of Digital Convergence / v.12 no.4 / pp.319-326 / 2014
  • A growing number of mobile security users who have experienced authentication failure by forgetting a password, a user name, or the response to a knowledge-based question prefer biometric information such as hand geometry, fingerprints, or voice for personal identification and authentication. Biometric verification for mobile security therefore provides assurance to both the customer and the seller on the internet. Our study focuses on a human hand biometric information recognition system for personal identification and authentication, using the hand's shape, palm features, and the lengths and widths of the fingers taken from photographs by mobile phones such as the iPhone 4 and Galaxy S2. The system consists of six processing steps: image acquisition, preprocessing, noise removal, standard hand feature extraction, individual feature pattern extraction, and hand biometric information recognition for personal identification and authentication from input images. The validity of the proposed system on mobile phone images is demonstrated by a successful recognition rate of 93.5% on 250 experimental hand shape and palm information images from 50 subjects.