• 제목/요약/키워드: Visual and Audio System

검색결과 148건 처리시간 0.027초

열차 내 승무원과의 원격대화 시스템 설계에 관한 연구 (Design of A/V Communication System for Passenger and Attendant in Train)

  • 장덕진;강송희;박현휴;강대호;허재석;송달호
    • 한국철도학회:학술대회논문집
    • /
    • 한국철도학회 2010년도 춘계학술대회 논문집
    • /
    • pp.448-454
    • /
    • 2010
  • Currently a KTX train of 20-car formation is 388m long and carries 931 passengers including one captain and three crews which is quite a few to cover the lengthy service area and many customers. On the other hand, if a passenger wants to talk to an attendant, he has to wait for an attendant passing by his/her seat or walk to an intercom which is placed at every other car. Any of these choices is inconvenient. So, in this paper, we presented a system design for developing an audio/visual communication system for a passenger and an attendant. The system was analyzed and designed according to the Object-Oriented methodology with UML (Unified Modeling Language). Based on a problem statement, a Use-case Diagram, Sequence Diagrams, Class Diagram, State Charts, collaboration Diagram were generated. The design will be used in system implementation to a HEMU-400X test train and to be tested.

  • PDF

영화배우 김혜수의 스크린 퍼포먼스 (Screen Performance of the Korean Actress Kim Hye-Soo)

  • 김종국
    • Journal of Information Technology Applications and Management
    • /
    • 제28권1호
    • /
    • pp.43-51
    • /
    • 2021
  • This article explores Kim Hye-soo's film acting from the perspective of performance, which means a socio-cultural action planned and intended for a certain purpose. Through the aspect of screen performance which the identity of the era that the performance study aims for is expressed through acting and reappeared in a system of verbal and non-verbal symbols, it was intended to enhance the academic value of Korean film acting. First, Kim Hye-soo's acting performance transforms by repeating genre acting. The sensuality and sexual attractiveness that evaluates Kim Hye-soo are repeated by the typical vision required by genre films, but the acting performance is not consumed or subordinated as a tool for visual pleasure. Second, Kim Hye-soo's body, face, emotion and audio are engraved with memories of the times, and the sociocultural identity of the performance is expressed through dynamic interaction between actions and reactions. Third, Kim Hye-soo's restored and recreated performance is sensitive to the changes of the times and is still in the process.

Beginning of a New Standard: Internet of Media Things

  • Kim, Sang-Kyun;Sahu, Nevadita;Preda, Marius
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권11호
    • /
    • pp.5182-5199
    • /
    • 2017
  • Recently, Internet of Things (IoT) drives a large variety of research, development, and new type of markets. All type of devices and sensors will be part of the Internet of Things and will be able to communicate not only plain data, but also audio-visual, olfactory, and haptic media data. In addition, as the devices and sensors getting smarter, it is highly probable that they can process acquired media and metadata to extract higher level of information (e.g., semantics). To support such enhanced functionalities, ISO/IEC SC29 WG11 (MPEG) starts a new standard project, ISO/IEC 23093, called Internet of Media Things (IoMT) to provide standard data formats and APIs for media things. This paper presents the standardization activities of IoMT focusing on explaining terms, standard scopes, and major media things with their use cases. One of the use cases, an IoT system for a blind pedestrian navigation assistance, is evaluated to prove its effectiveness.

학교 성교육의 현황 및 개선방향에 관한 연구 -대구시와 경북지역을 중심으로- (A Study on the State of Sex Education in High Schools and for Improvement in It -in case of Taegu city & Kyungbuk province-)

  • 김정옥
    • 한국가정과교육학회지
    • /
    • 제5권1호
    • /
    • pp.121-132
    • /
    • 1993
  • The purpose of this study is to evaluate the state and problems on sex education of junior & senior high schools in Taegu city & Kyungbuk province and then to try to find the solutions. This article is consisted of four parts: the present state of sex education, how to improve sex education, the problems of teacher training and the actual condition of educational materials for that. The item of questionnaires was analyzed by SAS-PC program. the results are marked with frequency, percentage, mean and $\chi$$^2$(chi-square). For effective and better sex education in high schools, development & distribution of various audio-visual teaching material, and systematic & correlative arrangement of educational contents are to be first requisite. Concerning about the teacher training it is needed that to establish the standard o general management and equipment of instruction system.

  • PDF

Speech Emotion Recognition with SVM, KNN and DSVM

  • Hadhami Aouani ;Yassine Ben Ayed
    • International Journal of Computer Science & Network Security
    • /
    • 제23권8호
    • /
    • pp.40-48
    • /
    • 2023
  • Speech Emotions recognition has become the active research theme in speech processing and in applications based on human-machine interaction. In this work, our system is a two-stage approach, namely feature extraction and classification engine. Firstly, two sets of feature are investigated which are: the first one is extracting only 13 Mel-frequency Cepstral Coefficient (MFCC) from emotional speech samples and the second one is applying features fusions between the three features: Zero Crossing Rate (ZCR), Teager Energy Operator (TEO), and Harmonic to Noise Rate (HNR) and MFCC features. Secondly, we use two types of classification techniques which are: the Support Vector Machines (SVM) and the k-Nearest Neighbor (k-NN) to show the performance between them. Besides that, we investigate the importance of the recent advances in machine learning including the deep kernel learning. A large set of experiments are conducted on Surrey Audio-Visual Expressed Emotion (SAVEE) dataset for seven emotions. The results of our experiments showed given good accuracy compared with the previous studies.

휘도 마스킹과 DC Modulus 알고리즘을 이용한 비디오 워터마킹 (A Blind Video Watermarking Technique Using Luminance Masking and DC Modulus Algorithm)

  • 장용원;김인택;한승수
    • 대한전기학회논문지:시스템및제어부문D
    • /
    • 제51권7호
    • /
    • pp.302-307
    • /
    • 2002
  • Digital watermarking is the technique, which embeds an invisible signal including signal including owner identification and copy control information into multimedia data such as audio, video, and images for copyright protection. A new MPEG watermark embedding algorithm using complex block effect based on the Human Visual System(HVS) is introduced in this paper. In this algorithm, $8{\times}8$ dark blocks are selected, and the watermark is embedded in the DC component of the discrete cosine transform(DCT) by using quantization and modulus calculation. This algorithm uses a blind watermark retrieval technique, which detects the embedded watermark without using the original image. The experimental results show that the proposed watermark technique is robust against MPEG coding, bitrate changes, and various GOP(Group of Picture) changes.

BER DEGRADATION DUE TO THE PHASE NOISE SPECTRAL SHAPE IN LMDS SYSTEMS

  • Kim, Youngsun;Song, Jong-In;Kim, Kiseon
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 ITC-CSCC -1
    • /
    • pp.113-116
    • /
    • 2000
  • Phase noise of oscillator gives the performance degradation significantly when a high carrier frequency and low transmission rate are used. The BER(Bit Error Rates) degradation of QPSK(Quadrature Phase Shift Keying) transmission is analyzed with the oscillator phase noise level specified in downstream physical interface of LMDS(Local Multipoint Distribution Services) which is described in DAVIC(Digital Audio Visual Council). The model used for the phase noise is a power-law model. We also investigated the effects of the various transmission rates on system performance. For the transmission rate below 0.5 Mbps, the BER performance is severely degraded and we verified that the transmission rate, 20 Mbps, is adequate for the downstream of LMDS systems.

  • PDF

키프레임 얼굴영상을 이용한 시청각음성합성 시스템 구현 (Implementation of Text-to-Audio Visual Speech Synthesis Using Key Frames of Face Images)

  • 김명곤;김진영;백성준
    • 대한음성학회지:말소리
    • /
    • 제43호
    • /
    • pp.73-88
    • /
    • 2002
  • In this paper, for natural facial synthesis, lip-synch algorithm based on key-frame method using RBF(radial bases function) is presented. For lips synthesizing, we make viseme range parameters from phoneme and its duration information that come out from the text-to-speech(TTS) system. And we extract viseme information from Av DB that coincides in each phoneme. We apply dominance function to reflect coarticulation phenomenon, and apply bilinear interpolation to reduce calculation time. At the next time lip-synch is performed by playing the synthesized images obtained by interpolation between each phonemes and the speech sound of TTS.

  • PDF

해양레저에 관한 기초적인 연구 - 해변휴양의 정서심리를 중심으로 - (A Fundamental Study on the Marine Leisure - focus on the Psychology of Emotion for Seashore Relaxation -)

  • 윤순동
    • 해양환경안전학회:학술대회논문집
    • /
    • 해양환경안전학회 2008년도 춘계학술발표회
    • /
    • pp.75-80
    • /
    • 2008
  • 해양레저의 실용분야에 대한 관심과 연구는 많으나 기초분야에 대한 연구는 드물다. 즉, 해양레저의 장점에 대한 연구가 필요한 실정이다. 필자는 해변환경의 시각적, 청각적 정의를 정서심리학을 바탕으로 미학적, 음악적으로 분석하였다. 결과적으로, 해변휴양을 통하여 긍정적인 정서를 얻을 수 있으며, 긍정적인 정서로 변화시킬 수 있음을 알았다.

  • PDF

오디오-비디오 정보 융합을 통한 멀티 모달 음성 인식 시스템 (Audio-Visual Integration based Multi-modal Speech Recognition System)

  • 이상운;이연철;홍훈섭;윤보현;한문성
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2002년도 추계학술발표논문집 (상)
    • /
    • pp.707-710
    • /
    • 2002
  • 본 논문은 오디오와 비디오 정보의 융합을 통한 멀티 모달 음성 인식 시스템을 제안한다. 음성 특징 정보와 영상 정보 특징의 융합을 통하여 잡음이 많은 환경에서 효율적으로 사람의 음성을 인식하는 시스템을 제안한다. 음성 특징 정보는 멜 필터 캡스트럼 계수(Mel Frequency Cepstrum Coefficients: MFCC)를 사용하며, 영상 특징 정보는 주성분 분석을 통해 얻어진 특징 벡터를 사용한다. 또한, 영상 정보 자체의 인식률 향상을 위해 피부 색깔 모델과 얼굴의 형태 정보를 이용하여 얼굴 영역을 찾은 후 강력한 입술 영역 추출 방법을 통해 입술 영역을 검출한다. 음성-영상 융합은 변형된 시간 지연 신경 회로망을 사용하여 초기 융합을 통해 이루어진다. 실험을 통해 음성과 영상의 정보 융합이 음성 정보만을 사용한 것 보다 대략 5%-20%의 성능 향상을 보여주고 있다.

  • PDF