• Title/Summary/Keyword: Encoder Model

An Optimal Selection of Frame Skip and Spatial Quantization for Low Bit Rate Video Coding (저속 영상부호화를 위한 최적 프레임 율과 공간 양자화 결정)

  • Bu, So-Young;Lee, Byung-Uk
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.6C
    • /
    • pp.842-847
    • /
    • 2004
  • We present a new video coding technique that trades off frame rate against picture quality for low bit rate video coding. We give a model equation for selecting the optimal frame rate from the motion content of the source video. The DCT quantization parameter (QP) can then be determined from the frame rate and the bit rate. For objective video quality measurement, we propose a simple and effective error measure for skipped frames. The proposed method improves video quality by up to 2 dB over the H.263 TMN5 encoder.

An end-to-end synthesis method for Korean text-to-speech systems (한국어 text-to-speech(TTS) 시스템을 위한 엔드투엔드 합성 방식 연구)

  • Choi, Yeunju;Jung, Youngmoon;Kim, Younggwan;Suh, Youngjoo;Kim, Hoirin
    • Phonetics and Speech Sciences
    • /
    • v.10 no.1
    • /
    • pp.39-48
    • /
    • 2018
  • A typical statistical parametric speech synthesis (text-to-speech, TTS) system consists of separate modules, such as a text analysis module, an acoustic modeling module, and a speech synthesis module. This causes two problems: 1) expert knowledge of each module is required, and 2) errors generated in each module accumulate as they pass through the following modules. An end-to-end TTS system could avoid such problems by synthesizing voice signals directly from an input string. In this study, we implemented an end-to-end Korean TTS system using Google's Tacotron, which is an end-to-end TTS system based on a sequence-to-sequence model with an attention mechanism. We used 4392 utterances spoken by a Korean female speaker, an amount that corresponds to 37% of the dataset Google used for training Tacotron. Our system obtained a mean opinion score (MOS) of 2.98 and a degradation mean opinion score (DMOS) of 3.25. We discuss the factors that affected training of the system. Experiments demonstrate that the post-processing network needs to be designed with the output language and input characters in mind, and that the maximum value of n for the n-grams modeled by the encoder should be kept small enough for the amount of training data.

Coordinate Estimation of Mobile Robot Using Optical Mouse Sensors (광 마우스 센서를 이용한 이동로봇 좌표추정)

  • Park, Sang-Hyung;Yi, Soo-Yeong
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.22 no.9
    • /
    • pp.716-722
    • /
    • 2016
  • Coordinate estimation is an essential function for autonomous navigation of a mobile robot. The optical mouse sensor is convenient and cost-effective for the coordinate estimation problem. With it, the position estimation error caused by slip and by model mismatch in the robot's motion equation can be overcome. One simple method of position estimation with the optical mouse sensor is to integrate the velocity data from the sensor over time. However, the unavoidable noise in the sensor data may degrade the position estimate under this simple integration method. In general, a mobile robot has ready-to-use motion information from the encoder sensors of its driving motors. Combining the velocity data from the optical mouse sensor with this motion information makes it possible to improve coordinate estimation performance. In this paper, a coordinate estimation algorithm for an autonomous mobile robot is presented, based on the well-known Kalman filter, which is useful for combining different types of sensors. Computer simulation results show the performance of the proposed localization algorithm for several types of trajectories in comparison with the simple integration method.
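
The sensor fusion idea in this abstract can be illustrated with a minimal one-dimensional sketch. It is not the authors' algorithm: the scalar velocity state, the use of encoder velocity increments as the control input, the noise variances `q` and `r`, and the function name `fuse_velocity` are all assumptions made for illustration.

```python
def fuse_velocity(encoder_v, mouse_v, dt, q=1e-3, r=1e-2):
    """Scalar Kalman filter over velocity, followed by integration to position.

    encoder_v: velocities derived from wheel encoders (hypothetical input)
    mouse_v:   velocities reported by the optical mouse sensor (hypothetical input)
    q, r:      assumed process and measurement noise variances
    Returns the list of integrated positions, one per sample.
    """
    if not encoder_v or not mouse_v:
        return []

    v_est = mouse_v[0]      # initial velocity estimate from the first mouse reading
    p = r                   # initial estimate variance
    prev_enc = encoder_v[0]
    position = 0.0
    track = []

    for v_enc, v_mouse in zip(encoder_v, mouse_v):
        # Predict: propagate the estimate with the change in encoder velocity.
        v_pred = v_est + (v_enc - prev_enc)
        p_pred = p + q
        prev_enc = v_enc

        # Update: correct the prediction with the mouse-sensor measurement.
        gain = p_pred / (p_pred + r)
        v_est = v_pred + gain * (v_mouse - v_pred)
        p = (1.0 - gain) * p_pred

        # Integrate the fused velocity instead of the raw sensor velocity.
        position += v_est * dt
        track.append(position)

    return track
```

When both sensors agree, the result matches plain integration. When the encoder reports a jump that the mouse sensor does not see (for example, wheel slip), the fused velocity falls between the two readings and settles back toward the measurement.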

Balancing and Driving Control of a Mecanum Wheel Ball Robot (메카넘 바퀴 볼 로봇의 자세제어 및 주행)

  • Hwang, Seung-Ik;Ha, Hwi-Myung;Lee, Jang-Myung
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.21 no.4
    • /
    • pp.336-341
    • /
    • 2015
  • This paper proposes a balancing and driving control system for a Mecanum wheel ball robot that has a two-axis structure and four motors. The inverted pendulum control method is adopted to maintain the balance of the ball robot while it is driving. For the balancing control, a non-model-based controller has been designed so that the device can be controlled simply, without the need for a complex formula. All the gains of the controller are adjusted heuristically during the experiments. The tilt angle, measured by IMU sensors, is used to generate the control input of the roll and pitch controllers, which drive the tilt angle to zero. For the driving control, a PID control algorithm has been adopted that uses the wheel angles and the encoder data. The performance of the designed control system has been verified through real experiments with the proposed ball robot.

Design and Implementation of Web Based Multimedia Courseware for Visual Basic Learning (Visual Basic 학습을 위한 웹기반 멀티미디어 코스웨어의 설계 및 구현)

  • Park, Sun-Young;Bang, Kee-Chun;Cha, Jae-Hyuk
    • Journal of Digital Contents Society
    • /
    • v.1 no.1
    • /
    • pp.111-124
    • /
    • 2000
  • As the web has become prevalent, web-based courseware is under active development in countries all over the world. Much Visual Basic courseware has also moved to web-based education, but it is not improving the effect of learning. In this study, to move away from that kind of approach, we developed web-based multimedia courseware that 1) adds multimedia elements that make the most of the web's characteristics, 2) raises learners' study effectiveness through an experimental environment, and 3) provides appropriate feedback by assessing learners' reactions. The courseware was built with Real Encoder, Real Player, ActiveX and related technologies, and we tried applying it in school education on the basis of a design model of the teaching-learning system.

Measuring Sentence Similarity using Morpheme Embedding Model and GRU Encoder for Question and Answering System (질의응답 시스템에서 형태소임베딩 모델과 GRU 인코더를 이용한 문장유사도 측정)

  • Lee, DongKeon;Oh, KyoJoong;Choi, Ho-Jin;Heo, Jeong
    • 한국어정보학회:학술대회논문집
    • /
    • 2016.10a
    • /
    • pp.128-133
    • /
    • 2016
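  • Sentence similarity analysis is an important technique that can be used to automate document evaluation. Recently, encoder-decoder language models based on recurrent neural networks have achieved remarkable results in the field of machine learning. In this paper, we present a Korean morpheme embedding model and a GRU (Gated Recurrent Unit)-based encoder, use them to train a language model on a Korean Wikipedia corpus, and propose a method of measuring sentence similarity so that a Korean question answering system can find evidence sentences from which the answer to a question can be inferred. Using the proposed morpheme embedding model and GRU-based encoding model, we obtained better sentence similarity results than the existing character embedding method, and we found that the approach can also be put to good use in question answering systems.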

Steady State and Transient Analysis of Switched Reluctance Motor Drive Fed from a Controlled AC-DC Rectifier

  • Moussa, Mona Fouad
    • Journal of Electrical Engineering and Technology
    • /
    • v.12 no.4
    • /
    • pp.1495-1502
    • /
    • 2017
  • The theory of operation of switched reluctance motors (SRM) depends on the reluctance torque, where energy is transferred to the stator winding only. Although the construction of the motor is simple, its electrical design is complex because of the switching configuration needed to deliver power to the stator coils. Moreover, because of the nonlinearity of the magnetic circuit, the SRM has torque ripple. This paper proposes a new strategy to drive an SRM from a single-phase AC supply. Each stator winding is connected to an AC-DC or AC-AC converter, forming what is called a branch. All branches are connected in parallel to a single-phase AC supply. A shaft encoder allows current production in the stator winding during the positive torque production region and terminates it during the negative torque production region. A magnetic flux is produced between stator poles when current is supplied from the AC supply to the stator coil, and this repeats for many cycles as long as the rate of change of stator inductance is positive. Different possible configurations of AC-AC or AC-DC converters are introduced to drive the SRM from the single-phase AC supply. A case study is presented for an SRM fed from the AC supply through a semi-controlled AC-DC converter. A simulation model is introduced and verified on an experimental rig for a two-phase SRM.

Study of parallelization methods for real-time HEVC encoder implementation (실시간 HEVC 인코더 구현을 위한 병렬화 기법에 관한 연구)

  • Ahn, Yongjo;Hwang, Taejin;Lee, Dongkyu;Kim, Sangmin;Oh, Seoung-Jun;Sim, Dong-Gyu
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2013.06a
    • /
    • pp.119-122
    • /
    • 2013
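  • HEVC (High Efficiency Video Coding), which is being standardized by the JCT-VC (Joint Collaborative Team on Video Coding) formed jointly by ITU-T VCEG and ISO/IEC MPEG, achieves roughly twice the compression efficiency of H.264/AVC. However, the increase in encoder complexity caused by the use of hierarchically structured variable-size blocks and by the recursive coding structure has been pointed out as a problem that needs to be addressed. This paper introduces various parallelization techniques for real-time implementation of the HEVC encoder currently under standardization, such as data-level parallelization using SIMD instructions and multi-threading using the CPU and GPU. It also introduces the operations and functional modules that are suitable for applying these parallelization techniques to an HEVC encoder. When the techniques were applied to the HM (HEVC reference model), the encoding speed was 20-30 fps for $832{\times}480$ video and 5-10 fps for $1920{\times}1080$ video.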

Fast Decoding Method of Distributed Video Based on Modeling of Parity Bit Requests (패리티 비트 요구량 모델링에 의한 분산 비디오의 고속 복호화 기법)

  • Kim, Man-Jae;Kim, Jin-Soo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.11
    • /
    • pp.2465-2473
    • /
    • 2012
  • Recently, the DVC (Distributed Video Coding) scheme has been actively studied as a low-complexity video encoding method. Most DVC schemes exploit a feedback channel to achieve better coding performance; however, this gives them a high decoding delay. To overcome this problem, this paper proposes a new fast DVC decoding method that uses a parity-bit request model. The model is obtained by using the bit-error rate sent by the encoder with the motion vector, which the decoder transmits through the feedback channel after generating side information. Several simulations show that the proposed method greatly improves the decoding speed compared to conventional schemes.

Fast Coding Mode Decision for H.264 Video Coding (H.264 동영상 압축을 위한 고속 부호화 모드 결정 방법)

  • 이제윤;전병우
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.6
    • /
    • pp.165-173
    • /
    • 2004
  • H.264 is the newest international video coding standard that provides high coding efficiency. A macroblock in H.264 has 7 different motion-compensation block sizes in the Inter mode, and several different prediction directions in the Intra mode. To achieve the highest possible coding efficiency, the H.264 reference model employs a complex mode decision technique based on rate-distortion (RD) optimization, which has high computational complexity. In this paper, we propose two techniques, 'early SKIP mode decision' and 'selective intra mode decision', which can further reduce the computational complexity. Simulation results show that, without considerable performance degradation, the proposed methods reduce encoding time by 30% on average and reduce the number of rate-distortion cost computations by 72%.
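
The RD-optimized mode decision mentioned in this abstract is commonly written as minimizing a Lagrangian cost J = D + λR over the candidate modes. The sketch below shows that selection with an early exit when the SKIP cost is already low. The threshold test is a hypothetical stand-in, not the criterion proposed in the paper, and the mode names, the `choose_mode` function, and the sample numbers are assumptions for illustration.

```python
def rd_cost(distortion, rate_bits, lam):
    """Lagrangian rate-distortion cost J = D + lambda * R."""
    return distortion + lam * rate_bits


def choose_mode(evaluators, lam, skip_threshold):
    """Pick the mode with the lowest RD cost, evaluating SKIP first.

    evaluators: dict mapping a mode name to a zero-argument function that
                returns (distortion, rate_bits); calling it models the
                expensive trial encode for that mode.
    skip_threshold: assumed cutoff; if the SKIP cost is at or below it,
                    the remaining modes are not evaluated.
    Returns (best_mode, best_cost, number_of_modes_evaluated).
    """
    # Stable sort: SKIP moves to the front, other modes keep their order.
    order = sorted(evaluators, key=lambda mode: mode != "SKIP")

    best_mode, best_cost, evaluated = None, float("inf"), 0
    for mode in order:
        distortion, rate_bits = evaluators[mode]()
        evaluated += 1
        cost = rd_cost(distortion, rate_bits, lam)
        if cost < best_cost:
            best_mode, best_cost = mode, cost
        # Early exit: accept SKIP without trying the other modes.
        if mode == "SKIP" and cost <= skip_threshold:
            break
    return best_mode, best_cost, evaluated


# Illustrative numbers only, not measurements from the paper.
modes = {
    "INTER_16x16": lambda: (80.0, 30),
    "SKIP": lambda: (120.0, 1),
    "INTRA_4x4": lambda: (60.0, 120),
}
print(choose_mode(modes, lam=1.0, skip_threshold=150.0))
print(choose_mode(modes, lam=1.0, skip_threshold=50.0))
```

With the loose threshold only SKIP is evaluated, which saves two cost computations but accepts a slightly higher cost than the exhaustive search would find. This is the speed-versus-efficiency trade-off that fast mode decision methods aim to keep small.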