• Title/Summary/Keyword: voice image

Search Result 293, Processing Time 0.03 seconds

Implementation of Digital Photo Frame using Embedded Linux System (임베디드 리눅스 시스템을 이용한 디지털 사진 액자 구현)

  • Hyun, Kyung-Seok;Lee, Myung-Eui
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.7 no.5
    • /
    • pp.901-906
    • /
    • 2006
  • In this paper, we describe the implementation of the digital photo frame system that displays the images coming through the memory card of a digital camera. Each image can be recorded with voice in this system, and a function of the mp3 player is implemented as well. We use Intel PXA255 to control the system and modify the bootloader and linux kernel. Also we adapt device driver for this system. For the realization of image display, voice recording and mp3 playing in the basis of the linux system, we program some of the Microwindows system configuration files and program applications here. This study will be a good example to access the development of the digital photo frame based on the linux system using less-power and high performed embedded processor.

  • PDF

Comparison of big data image analysis techniques for user curation (사용자 큐레이션을 위한 빅데이터 영상 분석 기법 비교)

  • Lee, Hyoun-Sup;Kim, Jin-Deog
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.563-565
    • /
    • 2021
  • The most important feature of the recently increasing content providing service is that the amount of content increase over time is very large. Accordingly, the importance of user curation is increasing, and various techniques are used to implement it. In this paper, among the techniques for video recommendation, the analysis technique using voice data and subtitles and the video comparison technique based on keyframe extraction are compared with the results of implementing and applying the video content of real big data. In addition, through the comparison result, a video content environment to which each analysis technique can be applied is proposed.

  • PDF

Investigating the Relationship Between Vehicle Front Images and Voice Assistants (자동차 전면부와 음성 어시스턴트의 스타일 관계 분석)

  • Min-Jung Park;So-Yeong Min;Tae-Su Kim;Hyeon-Jeong Suk
    • Science of Emotion and Sensibility
    • /
    • v.25 no.4
    • /
    • pp.129-138
    • /
    • 2022
  • In the context of the increasing applications of voice assistants in vehicles, we focused on the association between the visual appeal of the cars and the acoustic characteristics of the voice assistants. This study aimed to investigate the relationship between the visual appeal of the vehicle and the voice assistant based on their emotional characteristics. A total of 15 adjectives were used to assess the emotional characteristics of 12 types of cars and six types of voices. An online interview was carried out, instructing participants to match three adjectives with the presented car images or voices. This was followed with a brief interview to allow the participants to reflect on the adjective matches. Based on the assessments, we performed principal component analysis (PCA) to determine factors. We aimed to deploy the cars and voices and analyze the patterns of clustering. The PCA analysis revealed two factors profiled as "Light-Heavy" and "Comfortable-Radical." Both car and voice stimuli were deployed in a two-dimensional space showing the internal relationship within and between the two substances. Based on the coordination data, a hierarchical cluster grouped the 18 stimuli into four groups labeled as challenge, elegance, majesty, and vigor. This study identified two latent factors describing the emotional characteristics of both car images and voice types clustered into four groups based on their emotional characteristics. The coherent matches between car style and voice type are expected to address the design concept more successfully.

Transmission of Image Information in Wireless System (무선 환경에서의 영상 정보의 전송)

  • Jeonh, Sang-Hoon;Lim, Joon-Hong
    • Proceedings of the KIEE Conference
    • /
    • 2004.11c
    • /
    • pp.268-270
    • /
    • 2004
  • There are various researches on MPEG techniques. MPEG technique is used to digital TV(DTV) and internet image communication. Connection method between server and client is usually using wire. Applications may be expanded, if wireless technology is used between server and client system. In this paper, Bluetooth is used for connection method between server and client. Bluetooth offers fast and reliable transmissions of both voice and data over the globally available 2.4GHz ISM (Industrial, Scientific and Medical) band. One of the major application purposes of Bluetooth is the cable replacement for mobile and peripheral devices. Bluetooth has the advantage of small size, low power and low cost. It has the disadvantage of limited bandwidth and limited range. In order to transfer effectively image Information between server and client using Bluetooth, we apply MPEG-2 and MPEG-4 image compression techniques and the results are compared with each other.

  • PDF

Design and Implementation of the Image Creation System based on User-Media Interaction (사용자와 미디어 사이의 상호작용 기능 제공 기반 영상 창작 시스템 설계 및 구현)

  • Song, Bok Deuk;Kim, Sang Yun;Kim, Chae Kyu
    • Journal of Korea Multimedia Society
    • /
    • v.19 no.5
    • /
    • pp.932-938
    • /
    • 2016
  • Recently, interactive media which maximizes audience engagement by making the audience appeal on a stage in digital media environment has been distributed more widely. In fact, there has been active movement to develop and promote a new participatory media genre with higher immersion by applying this kind of interactive media concept to advertisement, film, game and e-learning. In the conventional interactive media, digital media had to be enjoyed in particular environment where diverse sensors were installed or through a certain device to recognize a user's motion and voice. This study attempted to design and implement an image creation system which ensures interactions between a user and media in popular distribution-enabled web environment and through PC and smart devices to minimize the image producer-user constraints.

Design of Medical Conferencing System using DICOM 3.0 (DICOM 3.0 표준안을 이용한 의료 화상회의 시스템의 설계)

  • Yoo, S.K.;Kang, Y.T.;Kim, K.M.;Bae, S.H.;Kim, N.H.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1997 no.05
    • /
    • pp.104-107
    • /
    • 1997
  • A medical teleconferencing and medical image transmision system has been developed for diagnosis of the medical images between the medical doctors who are far away. The medical teleconferencing system transmits the voice and image of the doctors using the video and audio capture boards. The medical image transmission system software uses the medical image standard DICOM 3.0 for the future expansibility and the open system interconectivity. The medical images usually use CR images.

  • PDF

Reversible Data Hiding Based on Block Median Preservation and image local characteristic

  • Qu, Xiao-Chao;Kim, Hyoung-Joong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.04a
    • /
    • pp.986-989
    • /
    • 2011
  • Reversible data hiding is a technique that can embed information into cover media (image, video, voice signal) and can recover the original cover media after extracting the embedded information. In this papa, we propose a new reversible data hiding methods that based on block median preservation and the image local characteristic. By using the median value of a block, a high payload can be got and by considering the image local characteristic, a lot of distortion can be avoided and a high PSNR can be got. In the experiment, our methods can generate better result than the previous reversible data hiding methods.

3-D Nano Topology Measurement using VCM (VCM(voice coil motor)를 이용한 3차원 나노 형상 측정 시스템)

  • Jung, Jong-Kyu;Youm, Woo-Sub;Park, Kiy-Hwan
    • Proceedings of the KSME Conference
    • /
    • 2007.05a
    • /
    • pp.1439-1443
    • /
    • 2007
  • In this paper, vibration reduction techniques of a voice coil motor (VCM) actuator are presented for AFM imaging system. The damping coefficient of the actuator driven by VCM with a flexure hinge is quite low and it cause the about 30dB peak amplitude response at the resonance frequency. To decrease this peak response, we design and apply elliptical band-stop filters to xy and z axis VCM actuator. Frequency response of each actuator with filter is measured to verify the effect of the filters. As a sensor, capacitive sensor is used. Vibration reduction rate of the xy actuator with the filter is also measured while real AFM scanning condition. As another method, closed loop control with the capacitive sensor is applied to the xy axis actuator to add an electrical damping effect and vibration reduction rate measured. These vibration reduction rates with each method are compared. In the case of z axis actuator, the frequency response of force (gap) control loop is measured. For comparison, the frequency response using a conventional PID controller is also obtained. Finally, the AFM image of a standard grid sample is measured with the designed controller to analyze the effect in the AFM imaging.

  • PDF

Fault Prediction of a Telecommunications Network using Association Rules Mining based on Voice of the Customer (VOC 기반 연관규칙 마이닝을 이용한 통신선로설비의 장애 예측)

  • Na, Gijoo;Han, Insup;Cho, Namwook
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.11 no.4
    • /
    • pp.13-24
    • /
    • 2015
  • Customer complaints handling helps organizations to retain existing customers and attract new customers, as well. As Voice of the Customer (VOC) is one of the main sources of customer complaints, many organizations utilize VOC to enhance customer satisfaction. Effective management of VOC has been proved as one of the best ways to maintain organization's brand image and reputation. In spite of its importance, little has been reported on the utilization of VOC to detect faults in a telecommunication industry. In this paper, association rule mining based on VOC is used to identify root fault causes of a telecommunications network. To do that, VOC of a Communication Service Provider has been collected first. Then, association rule mining has also been conducted with various support and confidence levels. As a result, root fault causes of the telecommunications network can be identified. It is expected that this study can be used as a basis for decisions about customer satisfaction management such as preventive maintenances or reduction of the customer maintenance cost.

HearCAM Embedded Platform Design (히어 캠 임베디드 플랫폼 설계)

  • Hong, Seon Hack;Cho, Kyung Soon
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.10 no.4
    • /
    • pp.79-87
    • /
    • 2014
  • In this paper, we implemented the HearCAM platform with Raspberry PI B+ model which is an open source platform. Raspberry PI B+ model consists of dual step-down (buck) power supply with polarity protection circuit and hot-swap protection, Broadcom SoC BCM2835 running at 700MHz, 512MB RAM solered on top of the Broadcom chip, and PI camera serial connector. In this paper, we used the Google speech recognition engine for recognizing the voice characteristics, and implemented the pattern matching with OpenCV software, and extended the functionality of speech ability with SVOX TTS(Text-to-speech) as the matching result talking to the microphone of users. And therefore we implemented the functions of the HearCAM for identifying the voice and pattern characteristics of target image scanning with PI camera with gathering the temperature sensor data under IoT environment. we implemented the speech recognition, pattern matching, and temperature sensor data logging with Wi-Fi wireless communication. And then we directly designed and made the shape of HearCAM with 3D printing technology.