• Title/Summary/Keyword: MPEG System

Search Result 746, Processing Time 0.021 seconds

A 3D Audio-Visual Animated Agent for Expressive Conversational Question Answering

  • Martin, J.C.;Jacquemin, C.;Pointal, L.;Katz, B.
    • 한국정보컨버전스학회:학술대회논문집
    • /
    • 2008.06a
    • /
    • pp.53-56
    • /
    • 2008
  • This paper reports on the ACQA(Animated agent for Conversational Question Answering) project conducted at LIMSI. The aim is to design an expressive animated conversational agent(ACA) for conducting research along two main lines: 1/ perceptual experiments(eg perception of expressivity and 3D movements in both audio and visual channels): 2/ design of human-computer interfaces requiring head models at different resolutions and the integration of the talking head in virtual scenes. The target application of this expressive ACA is a real-time question and answer speech based system developed at LIMSI(RITEL). The architecture of the system is based on distributed modules exchanging messages through a network protocol. The main components of the system are: RITEL a question and answer system searching raw text, which is able to produce a text(the answer) and attitudinal information; this attitudinal information is then processed for delivering expressive tags; the text is converted into phoneme, viseme, and prosodic descriptions. Audio speech is generated by the LIMSI selection-concatenation text-to-speech engine. Visual speech is using MPEG4 keypoint-based animation, and is rendered in real-time by Virtual Choreographer (VirChor), a GPU-based 3D engine. Finally, visual and audio speech is played in a 3D audio and visual scene. The project also puts a lot of effort for realistic visual and audio 3D rendering. A new model of phoneme-dependant human radiation patterns is included in the speech synthesis system, so that the ACA can move in the virtual scene with realistic 3D visual and audio rendering.

  • PDF

A neural network model for recognizing facial expressions based on perceptual hierarchy of facial feature points (얼굴 특징점의 지각적 위계구조에 기초한 표정인식 신경망 모형)

  • 반세범;정찬섭
    • Korean Journal of Cognitive Science
    • /
    • v.12 no.1_2
    • /
    • pp.77-89
    • /
    • 2001
  • Applying perceptual hierarchy of facial feature points, a neural network model for recognizing facial expressions was designed. Input data were convolution values of 150 facial expression pictures by Gabor-filters of 5 different sizes and 8 different orientations for each of 39 mesh points defined by MPEG-4 SNHC (Synthetic/Natural Hybrid Coding). A set of multiple regression analyses was performed with the rating value of the affective states for each facial expression and the Gabor-filtered values of 39 feature points. The results show that the pleasure-displeasure dimension of affective states is mainly related to the feature points around the mouth and the eyebrows, while a arousal-sleep dimension is closely related to the feature points around eyes. For the filter sizes. the affective states were found to be mostly related to the low spatial frequency. and for the filter orientations. the oblique orientations. An optimized neural network model was designed on the basis of these results by reducing original 1560(39x5x8) input elements to 400(25x2x8) The optimized model could predict human affective rating values. up to the correlation value of 0.886 for the pleasure-displeasure, and 0.631 for the arousal-sleep. Mapping the results of the optimized model to the six basic emotional categories (happy, sad, fear, angry, surprised, disgusted) fit 74% of human responses. Results of this study imply that, using human principles of recognizing facial expressions, a system for recognizing facial expressions can be optimized even with a a relatively little amount of information.

  • PDF

Osteogenic Differentiation of Bone Marrow Stem Cells Using Thermo-Sensitive Hydrogels (온도감응성 수화젤을 이용한 골수간엽줄기세포의 골분화 유도)

  • Kim, Sun-Kyung;Hyun, Hoon;Kim, Soon-Hee;Yoon, Sun-Jung;Kim, Moon-Suk;Rhee, John-M.;Khang, Gil-Son;Lee, Hai-Bang
    • Polymer(Korea)
    • /
    • v.30 no.3
    • /
    • pp.196-201
    • /
    • 2006
  • Poly (ethylene glycol)-based diblock and triblock thermo- sensitive polyester copolymers were investigated for application on tissue engineering and injectable biomaterials in drug delivery system due to their nontoxicity, biocompatibility and biodegradability. We synthesized the diblock copolymers consisting of methoxy poly (ethylene glycol) (MPEG) (Mn=750 g/mole) and poly $(\varepsilon-caprolactone)$ (PCL) by ring opening polymerization of $\varepsilon-CL$ with MPEG as an initiator in the presence of HCl $Et_2O$. The effect of diblock copolymers on in vivo osteogenic differentiation of rat bone marrow stromal cells (BMSCS) with and without the presence of osteogenic supplements (dexamethasone) was investigated. Thin sections were cut from paraffin embedded tissues and histological sections were stained by H&E, von Kossa, and immunohistochemical staining for osteocalcin. In conclusion, dexamethasone containing thermo- sensitive hydrogel might be improved osteogenic differentiation of BMSCs. We expect the osteoinduction effect to be excellent when it uses stem cell or other osteogenic materials.

Encryption Method Based on Chaos Map for Protection of Digital Video (디지털 비디오 보호를 위한 카오스 사상 기반의 암호화 방법)

  • Yun, Byung-Choon;Kim, Deok-Hwan
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.49 no.1
    • /
    • pp.29-38
    • /
    • 2012
  • Due to the rapid development of network environment and wireless communication technology, the distribution of digital video has made easily and the importance of the protection for digital video has been increased. This paper proposes the digital video encryption system based on multiple chaos maps for MPEG-2 video encoding process. The proposed method generates secret hash key of having 128-bit characteristics from hash chain using Tent map as a basic block and generates $8{\times}8$ lattice cipher by applying this hash key to Logistic map and Henon map. The method can reduce the encryption overhead by doing selective XOR operations between $8{\times}8$ lattice cipher and some coefficient of low frequency in DCT block and it provides simple and randomness characteristic because it uses the architecture of combining chaos maps. Experimental results show that PSNR of the proposed method is less than or equal to 12 dB with respect to encrypted video, the time change ratio, compression ratio of the proposed method are 2%, 0.4%, respectively so that it provides good performance in visual security and can be applied in real time.

T-DMB Hybrid Data Service Part 1: Hybrid BIFS Technology (T-DMB 하이브리드 데이터 서비스 Part 1: 하이브리드 BIFS 기술)

  • Lim, Young-Kwon;Kim, Kyu-Heon;Jeong, Je-Chang
    • Journal of Broadcast Engineering
    • /
    • v.16 no.2
    • /
    • pp.350-359
    • /
    • 2011
  • Fast developments of broadcasting technologies since 1990s enabled not only High Definition Television service providing high quality audiovisual contents at home but also mobile broadcasting service providing audiovisual contents to high speed moving vehicle. Terrestrial Digital Multimedia Broadcasting (T-DMB) is one of the technologies developed for mobile broadcasting service, which has been successfully commercialized. One of the major technical breakthroughs achieved by T-DMB in addition to robust vehicular reception is an adoption of framework based on MPEG-4 System. It naturally enables integrated interactive data services by using Binary Format for Scene (BIFS) technology for scene description and representation of graphics object and Object Descriptor Framework representing multimedia service components as objects. T-DMB interactive data service has two fundamental limitations. Firstly, graphic data for interactive service should be always overlaid on top of a video not to be rendered out of it. Secondly, data for interactive service is only received by broadcasting channel. These limitations were considered as general in broadcasting systems. However, they are being considered as hard limitations for personalized data services using location information and user characteristics which are becoming widely used for data services of smart devices in these days. In this paper, the architecture of T-DMB hybrid data service is proposed which is utilizing broadcasting network, wireless internet and local storage for delivering BIFS data to overcome these limitations. This paper also presents hybrid BIFS technology to implement T-DMB hybrid data service while maintaining backward compatibility with legacy T-DMB players.

The Implementation of Multi-Channel Audio Codec for Real-Time operation (실시간 처리를 위한 멀티채널 오디오 코덱의 구현)

  • Hong, Jin-Woo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.2E
    • /
    • pp.91-97
    • /
    • 1995
  • This paper describes the implementation of a multi-channel audio codec for HETV. This codec has the features of the 3/2-stereo plus low frequency enhancement, downward compatibility with the smaller number of channels, backward compatibility with the existing 2/0-stereo system(MPEG-1 audio), and multilingual capability. The encoder of this codec consists of 6-channel analog audio input part with the sampling rate of 48 kHz, 4-channel digital audio input part and three TMS320C40 /DSPs. The encoder implements multi-channel audio compression using a human perceptual psychoacoustic model, and has the bit rate reduction to 384 kbit/s without impairment of subjective quality. The decoder consists of 6-channel analog audio output part, 4-channel digital audio output part, and two TMS320C40 DSPs for a decoding procedure. The decoder analyzes the bit stream received with bit rate of 384 kbit/s from the encoder and reproduces the multi-channel audio signals for analog and digital outputs. The multi-processing of this audio codec using multiple DSPs is ensured by high speed transfer of date between DSPs through coordinating communication port activities with DMA coprocessors. Finally, some technical considerations are suggested to realize the problem of real-time operation, which are found out through the implementation of this codec using the MPEG-2 layer II sudio coding algorithm and the use of the hardware architecture with commercial multiple DSPs.

  • PDF

Personalized EPG Application using Automatic User Preference Learning Method (사용자 선호도 자동 학습 방법을 이용한 개인용 전자 프로그램 가이드 어플리케이션 개발)

  • Lim Jeongyeon;Jeong Hyun;Kim Munchurl;Kang Sanggil;Kang Kyeongok
    • Journal of Broadcast Engineering
    • /
    • v.9 no.4 s.25
    • /
    • pp.305-321
    • /
    • 2004
  • With the advent of the digital broadcasting, the audiences can access a large number of TV programs and their information through the multiple channels on various media devices. The access to a large number of TV programs can support a user for many chances with which he/she can sort and select the best one of them. However, the information overload on the user inevitably requires much effort with a lot of patience for finding his/her favorite programs. Therefore, it is useful to provide the persona1ized broadcasting service which assists the user to automatically find his/her favorite programs. As the growing requirements of the TV personalization, we introduce our automatic user preference learning algorithm which 1) analyzes a user's usage history on TV program contents: 2) extracts the user's watching pattern depending on a specific time and day and shows our automatic TV program recommendation system using MPEG-7 MDS (Multimedia Description Scheme: ISO/IEC 15938-5) and 3) automatically calculates the user's preference. For our experimental results, we have used TV audiences' watching history with the ages, genders and viewing times obtained from AC Nielson Korea. From our experimental results, we observed that our proposed algorithm of the automatic user preference learning algorithm based on the Bayesian network can effectively learn the user's preferences accordingly during the course of TV watching periods.

Synthesis and Characterization of Poly(ethylene glycol) Grafted Polysuccinimide (폴리(에틸렌 글리콜)이 결합된 Polysuccinimide의 합성과 특성)

  • Lim, Nak-Hyun;Lee, Ha-Young;Kim, Moon-Suk;Khang, Gil-Son;Lee, Hai-Bang;Cho, Sun-Hang
    • Polymer(Korea)
    • /
    • v.29 no.1
    • /
    • pp.36-40
    • /
    • 2005
  • Poly(amino acid) derivatives have been widely investigated as a drug carrier in drug delivery system. Particularly,polysuccinimide (PSI) is one of the most promising drug carriers since it possesses suitable physicochemical characteristics for development of macromolecular prodrugs, due to biocompatibility and biodegradability. In this study, we deal with the synthesis of polyaspartamide having various functional groups such as methoxy-poly(ethylene glycol) (MPEG) via ring closing of PSI. PSI was synthesized by polyonensation polymerization of spartic acid. The variety of average molecular weight was confirmed with reacion time and catalyst content to observe the optimum condition of synthesis. MPEG, hydrophilic chain, was bonded to fabricate polymeric micell composed of hydrophilic and hydrophobic polymer. All materials were characterized by 1H-NMR, FT-IR and GPC. In addition, the formation of nanoparticle micelle as drug carrier were also examined. Micelle size was measured by ELS and AFM. The functionalized polysparamide formed nanoparticle micelle whose size ranged from 90 to 130 nm. In conclusion, we prepared polyaspartamide functionalized with PEG examined the possibility as drug carriers.

Adaptive QoS Study for Video Streaming Service In MMT Protocol (비디오 스트리밍 서비스를 위한 MMT 기반 적응적 QoS 연구)

  • Jo, Bokyun;Lee, Doohyun;Suh, Doug Young
    • Journal of Broadcast Engineering
    • /
    • v.20 no.1
    • /
    • pp.40-47
    • /
    • 2015
  • This paper discusses QoS enhancement in the Best-effort services of the service plan provided by MPEG Media Transport (MMT) systems for video streaming applications. Among MMT services, i.e. per-flow, per-class, and best-effort services, the server does not provide guaranteed bandwidth for the best-effort service only. Therefore, in the best-effort services, a bandwidth access priority is defined for various services, where the lowest priority is assigned to the low-level video services. To alleviate the issue of bandwidth limitation in the best-effort services, this paper investigates transmission of low-resolution video with low bitrate and up-sampling. Our experimental results prove the superiority of the proposed method in terms of delivered video quality.

PSIP Converter based on PMCP for Terrestrial/Cable Data Broadcasting Retransmission Service (지상파/케이블 데이터방송 재전송 서비스를 위한 PMCP 기반 PSIP 변환기)

  • Choi Ji Hoon;Kim Yong Ho;Choi Jin Soo;Hong Jin Woo
    • The KIPS Transactions:PartB
    • /
    • v.12B no.6 s.102
    • /
    • pp.647-654
    • /
    • 2005
  • In this paper, we implemented a terrestrial/cable PSIP converting system, so-called a PSIP converter, which is converting a terrestrial PSIP into a cable PSIP for a data broadcasting service in the interoperable network of terrestrial and cable, and define an interface between the PSIP converter and the OOB SI generator by using PMCP messages compliant to ATSC T3/Sl. The exiting PSIP converter just converts a terrestrial PSIP into a cable PSIP compliant to ATSC and OCAP standard and transmits by a MPEG-2 TS format. That is to say, it is not for the digital data broadcasting but for the digital broadcasting. In addition, the PSIP converter can support various types of PSIP information to the OOB SI generator by using PMCP messages defined by a hierarchical structure as per each channel, audio/video event, data event and so on.