• Title/Summary/Keyword: voice image

Search Result 293, Processing Time 0.031 seconds

Synchronization of the Train PIS using the reference clock and development of a subtitle authoring tool (레퍼런스 클럭을 이용한 객차 PI 시스템 동기화 및 자막 편집기 개발)

  • Kim, Jung-Hoon;Jang, Dong-Wook;Han, Kwang-Rok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.4
    • /
    • pp.1-10
    • /
    • 2007
  • This paper describes the development of a network-based passenger information system(PIS) which provides the convenience of the passenger of the train and heightens the effect of the subtitle service, the advertising and the shelter guidance broadcasting against the urgent event. The existing system uses VGA signal distributor in order to broadcast information with image and subtitle and voice guidance. In this paper we improve the existing system by applying the UDP and TCP/IP protocol and use a reference clock to solve a data loss and synchronization problem which occurs in this case. We also developed an XML-based subtitle authoring tool which can edit and play the subtitles with various 3D to improve the automatic guidance broadcasting and advertisement effect according to the operation schedule of the train. The system performance was evaluated through a simulation.

  • PDF

A Study on the Removal of Impulse Noiseusing Wavelet Transform Pair and Adaptive-Length Median filter (웨이브렛 변환쌍과 적응-길이 메디안 필터를 이용한 임펄스 노이즈 제거에 관한 연구)

  • 배상범;김남호
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.7
    • /
    • pp.1575-1581
    • /
    • 2003
  • As a society has progressed rapidly toward a highly advanced digital information age, a multimedia communication service for acquisition, transmission and storage of image data as well as voice has being commercialized externally and internally. However, in the process of digitalization or transmission of data, noise is generated by several causes, and researches for eliminating those noises have been continued until now. There were the existing FFT(fast fourier transform) and STFT(short time fourier transform) for removing noise but it's impossible to know information about time and time-frequency localization capabilities has conflictive relationship. Therefore, for overcoming these limits, wavelet transform which is presented as a new technique of signal processing field is being applied in many fields recently. Because it has time-frequency localization capabilities it's Possible for multiresolution analysis as well as easy to analyze various signal. And when two wavelet base were designed to form Hilbert transform pair, wavelet pair provide superior performance than the existing DWT(discrete wavelet transform) in data characteristic detection. Therefore in this parer, we removed impulse noise by using adaptive-length median filter and two dyadic wavelet base which is designed by truncated coefficient vector.

Digital Mirror System with Machine Learning and Microservices (머신 러닝과 Microservice 기반 디지털 미러 시스템)

  • Song, Myeong Ho;Kim, Soo Dong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.9
    • /
    • pp.267-280
    • /
    • 2020
  • Mirror is a physical reflective surface, typically of glass coated with a metal amalgam, and it is to reflect an image clearly. They are available everywhere anytime and become an essential tool for us to observe our faces and appearances. With the advent of modern software technology, we are motivated to enhance the reflection capability of mirrors with the convenience and intelligence of realtime processing, microservices, and machine learning. In this paper, we present a development of Digital Mirror System that provides the realtime reflection functionality as mirror while providing additional convenience and intelligence including personal information retrieval, public information retrieval, appearance age detection, and emotion detection. Moreover, it provides a multi-model user interface of touch-based, voice-based, and gesture-based. We present our design and discuss how it can be implemented with current technology to deliver the realtime mirror reflection while providing useful information and machine learning intelligence.

A Study on Free Indirect Discourse Emerged in the (영화 <여자, 정혜>에 연출된 자유간접화법의 의미 분석)

  • Kim, Jong-Wan
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.9
    • /
    • pp.60-68
    • /
    • 2017
  • Through this thesis, I wanted to understand the form of free indirect discourse of modern films. To this end, I first explored the notion of the polyphonie as a mixture of the speaker and the character' voice in order to establish a concept related to free indirect discourse. However, I could not overlook the differences in the form of novels and movies to apply the following theory to films. Based on the concept of narrative distance, I sought to explore the possibility of free indirect discourse from the dual position of the camera. Next, I introduced the concept of free indirect discourse in the film by introducing the concept of Time in G. Deleuze' CinemaII. In other words, the time from Deleuze is the past and the present cycle, and he sees the Time circulating like the Non-Euclidean space. I wanted to understand the form of free indirect discourse in films by analyzing the concept of Time as an analysis of the movie .

Commercially Available High-Speed Cameras Connected with a Laryngoscope for Capturing the Laryngeal Images (상용화 된 고속카메라와 후두내시경을 이용한 성대촬영 방법의 소개)

  • Nam, Do-Hyun;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.21 no.2
    • /
    • pp.133-138
    • /
    • 2010
  • Background and Objectives : High-speed imaging can be useful in studies of linguistic and artistic singing styles, and laryngeal examination of patients with voice disorders, particularly in irregular vocal fold vibrations. In this study, we introduce new laryngeal imaging systems which are commercially available high speed cameras connected with a laryngoscope. Materials and Method : The laryngeal images were captured from three different types of cameras. First, the adapter was made to connect with laryngoscope and Casio EX-F1 to capture the images using $2{\times}150$ Watt Halogen light source (EndoSTROB) at speeds of 1,200 tps (frame per second)($336{\times}96$). Second, Phantom Miro ex4 was used to capture the digital laryngeal images using Xenon Nova light source 175 Watt (STORZ) at speeds of 1,920 fps ($512{\times}384$). Finally, laryngeal images were captured using MotionXtra N-4 with 250 Watt halogen lamp (Olympus CLH-250) light source at speeds of 2,000tps ($384{\times}400$) by connecting with laryngoscope. All images were transformed into the Kymograph using KIPS (Kay's image processing Software) of Kay Pentex Inc. Results: Casio EX-F1 was too small to adjust the focus and screen size was diminished once the images were captured despite of high resolution images. High quality of color images could be obtained with Phantom Miro ex4 whereas good black and white images from Motion Xtra N-4 Despite of some limitations of illumination problems, limited recording time capacity, and time consuming procedures in Phantom Miro ex4 and Motion Xtra N-4, those portable devices provided high resolution images. Conclusion : All those high speed cameras could capture the laryngeal images by connecting with laryngoscope. High resolution images were able to be captured at the fixed position under the good lightness. Accordingly, these techniques could be applicable to observe the vocal fold vibration properties in the clinical practice.

  • PDF

Procrustes in Disguise: The Speakers in Robert Frost's Early Poems (프로크루스테스의 초상 : 로버트 프로스트 초기 시의 화자들)

  • Lee, Sam Chool
    • Cross-Cultural Studies
    • /
    • v.31
    • /
    • pp.95-118
    • /
    • 2013
  • Robert Frost's poetry has generally been considered fairly readable partly because of the simplicity or down-to-earth-ness of the messages that go along with the poet's projected public image and the 'traditional' forms he used. Against the grain of such general perception, this study reads some of the early poems of Robert Frost to re-characterize the beginning of the poet's career as a modernist attempt to challenge the dominant poetic conventions of the time: the genteel conventions. In reading the poems, this study focuses on frost's strategic method of using the speaker or persona regarding the delivery of meanings. Those readers who would like to find the immediate presence of Frost's voice in the poems, fail to distinguish the speaker and the poet, readily accepting the face value of what the speaker tries to convey: those messages which are in line with liberal individualism, like self-reliance, autonomous self, work ethics, etc. Frost's speakers, however, are rarely the mouthpiece of the poet himself. Rather, they are fictional characters who, while on the surface of the text appear to be hammering out a stable theme out of their everyday experience, under a heuristic scrutiny of the textual structure, turn out to be undermining the logic or the rationality of the theme, which can be identified as a modernist textual strategy that challenges the traditional conventions regarding the stability of meaning in a poetic text.

Design and Implementation of Vehicle Control Network Using WiFi Network System (WiFi 네트워크 시스템을 활용한 차량 관제용 네트워크의 설계 및 구현)

  • Yu, Hwan-Shin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.3
    • /
    • pp.632-637
    • /
    • 2019
  • Recent researches on autonomous driving of vehicles are becoming very active, and it is a trend to assist safe driving and improve driver's convenience. Autonomous vehicles are required to combine artificial intelligence, image recognition capability, and Internet communication between objects. Because mobile telecommunication networks have limitations in their processing, they can be easily implemented and scale using an easily expandable Wi-Fi network. We propose a wireless design method to construct such a vehicle control network. We propose the arrangement of AP and the software configuration method to minimize loss of data transmission / reception of mobile terminal. Through the design of the proposed network system, the communication performance of the moving vehicle can be dramatically increased. We also verify the packet structure of GPS, video, voice, and data communication that can be used for the vehicle through experiments on the movement of various terminal devices. This wireless design technology can be extended to various general purpose wireless networks such as 2.4GHz, 5GHz and 10GHz Wi-Fi. It is also possible to link wireless intelligent road network with autonomous driving.

Multi-view learning review: understanding methods and their application (멀티 뷰 기법 리뷰: 이해와 응용)

  • Bae, Kang Il;Lee, Yung Seop;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.1
    • /
    • pp.41-68
    • /
    • 2019
  • Multi-view learning considers data from various viewpoints as well as attempts to integrate various information from data. Multi-view learning has been studied recently and has showed superior performance to a model learned from only a single view. With the introduction of deep learning techniques to a multi-view learning approach, it has showed good results in various fields such as image, text, voice, and video. In this study, we introduce how multi-view learning methods solve various problems faced in human behavior recognition, medical areas, information retrieval and facial expression recognition. In addition, we review data integration principles of multi-view learning methods by classifying traditional multi-view learning methods into data integration, classifiers integration, and representation integration. Finally, we examine how CNN, RNN, RBM, Autoencoder, and GAN, which are commonly used among various deep learning methods, are applied to multi-view learning algorithms. We categorize CNN and RNN-based learning methods as supervised learning, and RBM, Autoencoder, and GAN-based learning methods as unsupervised learning.

A Study on Space Utilization according to Changes in Non-face-to-Face Consumer Use : Focused on bank offices

  • Hwang, Sungi;Ryu, Gihwan;Yun, Daiyeol;Kim, Heeyoung
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.271-278
    • /
    • 2020
  • Modern financial services go beyond the stage of internet banking, and new concepts of financial transactions such as Internet of Things, mobile banking, electronic payments, and fintech have emerged. As a result, banks are less influential in financial transactions, and changes are being demanded. In the present era, the basic business of banks has decreased, and it is transforming into a space where both consumer finance work and reside. The bank office stands for the brand image of the bank, and it is represented by trust with customers in the basic business of financial transactions, and the rise in real estate value is a natural social phenomenon due to the nature of the location and location of real estate owned by the bank. The business method and space of the bank office that meets the new paradigm of the modern society is an inefficient space only for the convenience and rest of consumers, but it must be used as a variety of spaces suitable for the region to increase the functional value of the bank office. Through this study, as a convenience space for consumers, various service facilities should be introduced to understand the characteristics of the region as a convenience space for consumers, and various service facilities should be introduced to meet the needs of consumers, and the bank office should be improved as a complex service space for local residents.

A Design Scheme for Multimedia Contents Considering Memory Constraints in IoT Devices (IoT 장치에서 메모리 용량 제한을 고려한 멀티미디어 콘텐츠 설계 기법)

  • Son, Kyung A
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.11
    • /
    • pp.1463-1469
    • /
    • 2020
  • Multimedia information, including video and voice, is highly utilized in that it is easily understood by people. For this reason, applications have been studied which store multimedia information in IoT devices and transmit information in conjunction with smartphones. The problem is that the size of information can be larger than the capacity of IoT devices due to video and image. In this paper, the multimedia content design technique, which takes into account the limitations of storage capacity, was studied when there is a limit of storage capacity. Considering that the video has a higher understanding of information than text, while the capacity is larger, the solution between information comprehension and capacity is sought. The size of static and dynamic media is a variable and the harm is solved in accordance with the linear planning method. Case studies have shown that the design techniques of this paper are useful.