• Title/Summary/Keyword: voice image

Search Result 297, Processing Time 0.026 seconds

Multidimensional Affective model-based Multimodal Complex Emotion Recognition System using Image, Voice and Brainwave (다차원 정서모델 기반 영상, 음성, 뇌파를 이용한 멀티모달 복합 감정인식 시스템)

  • Oh, Byung-Hun;Hong, Kwang-Seok
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2016.04a
    • /
    • pp.821-823
    • /
    • 2016
  • 본 논문은 다차원 정서모델 기반 영상, 음성, 뇌파를 이용한 멀티모달 복합 감정인식 시스템을 제안한다. 사용자의 얼굴 영상, 목소리 및 뇌파를 기반으로 각각 추출된 특징을 심리학 및 인지과학 분야에서 인간의 감정을 구성하는 정서적 감응요소로 알려진 다차원 정서모델(Arousal, Valence, Dominance)에 대한 명시적 감응 정도 데이터로 대응하여 스코어링(Scoring)을 수행한다. 이후, 스코어링을 통해 나온 결과 값을 이용하여 다차원으로 구성되는 3차원 감정 모델에 매핑하여 인간의 감정(단일감정, 복합감정)뿐만 아니라 감정의 세기까지 인식한다.

A priority scheme for IEEE 802.11 with guaranteeing QoS

  • Kim, Yong-Joong;Park, Hyo-Dal
    • Proceedings of the IEEK Conference
    • /
    • 2002.07c
    • /
    • pp.1594-1597
    • /
    • 2002
  • In this paper, we proposed the IEEE 802.11 CSMA/CA protocol with the priority scheme. The IEEE 802.11 CSMA/CA protocol is the standard in wireless LAN We applied the proposed method to the aeronautical mobile telecommunication environment. The CSMA/CA protocol has two frames : one is PCF frame for real time service like voice and image and the other DCF frame for contention services like data transmission. Now we proposed the priority scheme that has the different CW region according to the transmitted data. The simmulation results shows the proposed method's performance is improved, Because the collision probability is reduced by allowing the different CW between stations. And the time dalay results show the priority scheme is very appropriated.

  • PDF

Realization of remote medical (원격 진료의 구현)

  • 조의주;김천석;한경희;권락범
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2001.10a
    • /
    • pp.158-161
    • /
    • 2001
  • with the splendid development of internet environment Korea is a deverse proving ground and supplies the highest ADSL in the world. This thesis examines the remote medical treatment which connects the doctor's computer with the in each house and transmits blood pressure, pulsation, temperature, blood sugar, image picture stethoscope, voice.

  • PDF

Subword-based Lip Reading Using State-tied HMM (상태공유 HMM을 이용한 서브워드 단위 기반 립리딩)

  • Kim, Jin-Young;Shin, Do-Sung
    • Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.123-132
    • /
    • 2001
  • In recent years research on HCI technology has been very active and speech recognition is being used as its typical method. Its recognition, however, is deteriorated with the increase of surrounding noise. To solve this problem, studies concerning the multimodal HCI are being briskly made. This paper describes automated lipreading for bimodal speech recognition on the basis of image- and speech information. It employs audio-visual DB containing 1,074 words from 70 voice and tri-viseme as a recognition unit, and state tied HMM as a recognition model. Performance of automated recognition of 22 to 1,000 words are evaluated to achieve word recognition of 60.5% in terms of 22word recognizer.

  • PDF

Lipreading using The Fuzzy Degree of Simuliarity

  • Kurosu, Kenji;Furuya, Tadayoshi;Takeuchi, Shigeru;Soeda, Mitsuru
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1993.06a
    • /
    • pp.903-906
    • /
    • 1993
  • Lipreading through visual processing techniques help provide some useful systems for the hearing impaired to learn communication assistance. This paper proposes a method to understand spoken words by using visual images taken by a camera with a video-digitizer. The image is processed to obtain the contours of lip, which is approximated into a hexagon. The pattern lists, consisting of lengths and angles of hexagon, are compared and computed to get the fuzzy similarity between two lists. By similarity matching, the mouth shape is recognized as the one which has the pronounced voice. Some experiments, exemplified by recognition of the Japanese vowels, are given to show feasibilities of this method.

  • PDF

Recognition of the Korean Alphabet using Phase Synchronization of Neural Oscillator

  • Lee, Joon-Tark;Bum, Kwon-Yong
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.1
    • /
    • pp.93-99
    • /
    • 2004
  • Neural oscillator can be applied to oscillatory systems such as analyses of image information, voice recognition and etc. Conventional EBPA (Error back Propagation Algorithm) is not proper for oscillatory systems with the complicate input`s patterns because of its tedious training procedures and sluggish convergence problems. However, these problems can be easily solved by using a synchrony characteristic of neural oscillator with PLL(Phase Locked Loop) function and by using a simple Hebbian learning rule. Therefore, in this paper, a technique for Recognition of the Korean Alphabet using Phase Synchronized Neural Oscillator was introduced.

Realtime/Non-realtime Multimedia Traffic Transmission in cdma 2000 (cdma2000에서 실시간/비실시간 멀티미디어 트래픽 전송)

  • 이종찬
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.25 no.9A
    • /
    • pp.1340-1347
    • /
    • 2000
  • The international mobile telecommunication-2000(IMT-2000) system can support not only the non-realtime multimedia traffic such as data such as data image but also the realtime multimedia traffic such as voice, video. In the paper we propose multicode allocation and handoff schemes for efficient transmission of realtime and non-realtime data in cdma2000. In those schemes the bandwidth of target cell is reserved based on moving direction of mobiles to support QoS of realtime multimedia data and the reserved bandwidths is used by the non-realtime mobiles of the target cell until the mobiles want to perform hadoff. Our framework is able to guarantee QoS continuity of realtime multimedia data and carries the maximum number of subscriber. System performance is evaluated and compared with conventional scheme considering transmission delay channel utilization and blocking probability by computer simulation.

  • PDF

Clinical Application of the Laryngostroboscopy in the Laryngeal Disorders (후두 스트로보스코피의 임상적 응용)

  • 김광문;김기령;최홍식;전영명;박한규
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.3 no.1
    • /
    • pp.22-28
    • /
    • 1989
  • Laryngostroboscopy is one of the most practical techniques for clinical examination of the larynx. The videostroboscopy provides valuable information concerning the nature of vocal folds' vibration, an immediate image of the presence or absence of pathology, and a permanent record. Additionally, when used by trained observers in conjunction with other instrumentation, it can provide both qualitative and quantitative data on vocal function of both the normal and disordered larynx. The authors examined the 388 patients with voice disorders by videostroboscope. This paper describes the clinical procedure of laryngostroboscopy based on some introductory remarks on laryngeal anatomy and function. And the findings of parameters observed by the stroboscopy is noted for the laryngeal disorders.

  • PDF

A Study Video using Image and Voice Search (음성과 이미지를 이용한 동영상 검색에 관한 연구)

  • Sin, In-Gyeong;Park, Sung-Hyun;Ahn, Hyo-Chang;Rhee, Sang-Burm
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2012.11a
    • /
    • pp.568-571
    • /
    • 2012
  • 정보화 사회의 정보 기반 구조로서, 고속 정보망의 구축, 개인용 컴퓨터의 급속한 보급, 멀티미디어 기술의 발전 등으로 인하여 정보 서비스의 새로운 장이 열리고 있다. 동영상 데이터는 텍스트만이 아니라 영상정보, 음성정보등 각종 의미있는 다양한 멀티미디어 정보를 포함하고 있다. 본 논문에서는 동영상에서 음성과 영상을 분리하여 음성을 이용하여 음성열을 분할 및 복원하여 음성을 텍스트로 변환하여 텍스트색인파일을 만들고 영상은 이미지를 분할 및 히스토그램을 사용하여 이미지 샷을 검출하여 두 색인파일을 이용하여 인덱싱을 하여 동영상 검색에 활용한다.

Technology Trends in Wireless Communication for Railway Systems (철도전용 무선통신 기술 동향)

  • Lee, S.J.;Oh, S.C.;Yoon, B.S.;Jeong, H.S.
    • Electronics and Telecommunications Trends
    • /
    • v.36 no.4
    • /
    • pp.23-33
    • /
    • 2021
  • Wireless communication for train control is an active research field. The World Railway Federation in Europe developed GSM-R, which integrates the GSM-based voice call standard and train control signals. To provide advanced railway services, the LTE-R wireless communication system was developed in Korea for passenger services and wireless image information required by the railroad industry. Recently, direct communication technology for autonomous train driving has been studied to decrease the driving interval, and research is being conducted on a hyperloop train control system that runs at a maximum speed of 1,220km/h in a subvacuum environment of 0.001 atmosphere. In this paper, we summarize the trends in wireless communication technologies used for GSM-R/LTE-R railway systems. For future wireless communication in railway systems, we discuss autonomous train driving and the hyperloop railway control system, define wireless communication technology, and discuss trends in domestic and foreign technologies.