• Title/Summary/Keyword: I-벡터

Search Result 380, Processing Time 0.036 seconds

Motion Flow Analysis using Bi-directional Prediction-Independent Framework in MPEG Compressed Domain (압축 영역에서의 양방향 예측 구조를 이용한 움직임 흐름 분석)

  • 김낙우;김태용;최종수
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.5
    • /
    • pp.13-22
    • /
    • 2004
  • Because video sequence consists of dynamic objects in nature, the object motion in video is an effective feature in describing the contents of video sequence and motion feature plays an important role in video retrieval. In this paper, we propose a method that converts motion vectors (MVs) to a uniform set on MPEG coded domain, independent of the frame type and the direction of prediction, and utilizes these normalized MVs (N-MVs) as motion descriptor to understand video contents. We describe a frame-type independent representation of the various types of frames presented in an MPEG video in which all frames can be considered equivalently, without full-decoding. In the experiments, we show that the proposed method is better than the conventional one in terms of performance.

Fast Decision Method of Adaptive Motion Vector Resolution (적응적 움직임 벡터 해상도 고속 결정 기법)

  • Park, Sang-hyo
    • Journal of Broadcast Engineering
    • /
    • v.25 no.3
    • /
    • pp.305-312
    • /
    • 2020
  • As a demand for a new video coding standard having higher coding efficiency than the existing standards is growing, recently, MPEG and VCEG has been developing and standardizing the next-generation video coding project, named Versatile Video Coding (VVC). Many inter prediction techniques have been introduced to increase the coding efficiency, and among them, an adaptive motion vector resolution (AMVR) technique has contributed on increasing the efficiency of VVC. However, the best motion vector can only be determined by computing many rate-distortion costs, thereby increasing encoding complexity. It is necessary to reduce the complexity for real-time video broadcasting and streaming services, but it is yet an open research topic to reduce the complexity of AMVR. Therefore, in this paper, an efficient technique is proposed, which reduces the encoding complexity of AMVR. For that, the proposed method exploits a special VVC tree structure (i.e., multi-type tree structure) to accelerate the decision process of AMVR. Experiment results show that the proposed decision method reduces the encoding complexity of VVC test model by 10% with a negligible loss of coding efficiency.

Vector Quantizer Based Speaker Normalization for Continuos Speech Recognition (연속음성 인식기를 위한 벡터양자화기 기반의 화자정규화)

  • Shin Ok-keun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.8
    • /
    • pp.583-589
    • /
    • 2004
  • Proposed is a speaker normalization method based on vector quantizer for continuous speech recognition (CSR) system in which no acoustic information is made use of. The proposed method, which is an improvement of the previously reported speaker normalization scheme for a simple digit recognizer, builds up a canonical codebook by iteratively training the codebook while the size of codebook is increased after each iteration from a relatively small initial size. Once the codebook established, the warp factors of speakers are estimated by comparing exhaustively the warped versions of each speaker's utterance with the codebook. Two sets of phones are used to estimate the warp factors: one, a set of vowels only. and the other, a set composed of all the Phonemes. A Piecewise linear warping function which corresponds to the estimated warp factor is adopted to warp the power spectrum of the utterance. Then the warped feature vectors are extracted to be used to train and to test the speech recognizer. The effectiveness of the proposed method is investigated by a set of recognition experiments using the TIMIT corpus and HTK speech recognition tool kit. The experimental results showed comparable recognition rate improvement with the formant based warping method.

Motion vector-tracing algorithms of video sequence (비디오 시퀀스의 움직임 추적 알고리즘)

  • 이재현
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.7
    • /
    • pp.927-936
    • /
    • 2002
  • This paper presents the extraction of a feature by motion vector for efficient content-based retrieval for digital video. in this paper, divided by general size block for the current frame by video, using BMA(block matching algorithm) for an estimate by block move based on a time frame. but in case BMA appeared on a different pattern fact of motion in the vector obtain for the BMA. solve in this a problem to application for full search method this method is detected by of on many calculations. I propose an alternative plan in this paper Limit the search region to $\pm$15 and search is a limit integer pixel. a result, in this paper is make an estimate motion vector in more accurately using motion vector in adjoin in blocks. however, refer to the block vector because occurrence synchronism. Such addition information is get hold burden receive to transmit therefore, forecasted that motion feature each block and consider for problems for establish search region. in this paper Algorithm based to an examination Motion Estimation method by for motion Compensation is proposed.

  • PDF

Peak-to-Average Power Ratio Reduction Technique Superimposing the Rotation Phases over Pilot and Data Symbols (회전 위상을 파일롯과 데이터 심볼에 덧붙인 첨두대 평균 전력비 저감 기법)

  • Han, Tae-Young;Choi, Jung-Hun;Kim, Nam
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.18 no.1 s.116
    • /
    • pp.53-61
    • /
    • 2007
  • This paper researches on the scheme superimposing the rotation phases over the pilot and data symbols in order to reduce the peak-to-average power ratio(PAPR) of the orthogonal frequency division multiplexing(OFDM) communication. The bandwidth and power efficiency are the main consideration. The phases of rotation vector are added to those of both pilot symbols and data symbols interlaying between any two pilot symbols in an OFDM block. Owing to this scheme the transmitter reduces the PAPR using the partial transmit sequences(PTS) and the receiver restores the data symbol utilizing the channel estimation of pilot symbols. Therefore, the bandwidth efficiency is accomplished by not using the further subcarriers for the reduction of PAPR and the enormous increase of bit error rate according to the receiving error of the side information, i.e. the phases of rotation vector, is prevented. In other words, both bandwidth-and power-efficiency and quality of communication performance can be improved.

A Study on the balancing of stereo image pairs (스테레오 영상간의 휘도 불균형 보정에 관한 연구)

  • 최명환;오세범;임정은;김용태;손광훈
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2000.11b
    • /
    • pp.131-134
    • /
    • 2000
  • 본 논문에서는 스테레오 좌.우 영상간의 휘도 불균형 보정에 관한 연구를 수행하였다. 좌영상과 우영상의 관계를 선형관계 $(I_L = aI_R +b)$로서 가정하고 휘도 불균형을 보정한 기존 알고리듬(global balancing)의 문제점을 분석하였다. 또한 이러한 문제점을 해결하기 위한 방법으로 두 가지 방법을 제안하는데 히스토그램 균일화를 통한 방법과 영상의 국부적 특징을 이용하여 선형변환을 적용하는 방법(local balancing)이 보다 정확한 변이벡터를 찾는 전처리 과정임을 모의실험을 통해 검증하였다.

  • PDF

Characterization of gp64 Gene of Bombyx mori Nucleopolyhedrovirus and Development of a Transient Expression Vector (누에 핵다각체병 바이러스 헤 gp64 유전자의 특성조사 및 transient 발현 벡터 개발)

  • 김미향;최재영;우수동;이해광;제연호
    • Microbiology and Biotechnology Letters
    • /
    • v.29 no.1
    • /
    • pp.18-24
    • /
    • 2001
  • Expression of the baculovirus major envelope glycoprotein gene(gp64) is regulated by transcription from botha early and late promoters. To develop a transient expression vector under the control of gp64 gene promoter, the gp64 gene of Bombyx mori nucleopolyhedrovirus-K1(BmNPV-K1) was characterized. The gp64 gene was local-ized at EcoR I-Pst I 7.38-kb fragment of the BmNPV-K1 genome. The EcorR 1-Pst I 7.38-kb fragment was cloned and the nucleotide sequence of 2,277 bases including the coding region of gp64 gene was determined. Based on these results, transient expression vector using gp64 gene promoter was constructed and named as pBm64. E.coli lacZ gene was introduced onto pBm64 as a reporter gene and expressed transiently in B. mori 5(Bm 5) cells. The expression vector transfected into the cells was maintained stably for 1 to 5 days. In order to confirm the expression of the reporter gene by gp64 promoter, recombinant virus was constructed. The recombinant virus has two independent transcription units in opposite orientations with two promoters; gp64 and polyhedrin gene promoters each initiating transcription of $\beta$-galactosidase and polyhedrin, respectively. Polyhedra formation and expression of $\beta$-galactosidase in Bm5 cells infected with the recombinant virus were observed with phase contrast microscope and in situ staining.

  • PDF

Speech Recognition Using Linear Discriminant Analysis and Common Vector Extraction (선형 판별분석과 공통벡터 추출방법을 이용한 음성인식)

  • 남명우;노승용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.4
    • /
    • pp.35-41
    • /
    • 2001
  • This paper describes Linear Discriminant Analysis and common vector extraction for speech recognition. Voice signal contains psychological and physiological properties of the speaker as well as dialect differences, acoustical environment effects, and phase differences. For these reasons, the same word spelled out by different speakers can be very different heard. This property of speech signal make it very difficult to extract common properties in the same speech class (word or phoneme). Linear algebra method like BT (Karhunen-Loeve Transformation) is generally used for common properties extraction In the speech signals, but common vector extraction which is suggested by M. Bilginer et at. is used in this paper. The method of M. Bilginer et al. extracts the optimized common vector from the speech signals used for training. And it has 100% recognition accuracy in the trained data which is used for common vector extraction. In spite of these characteristics, the method has some drawback-we cannot use numbers of speech signal for training and the discriminant information among common vectors is not defined. This paper suggests advanced method which can reduce error rate by maximizing the discriminant information among common vectors. And novel method to normalize the size of common vector also added. The result shows improved performance of algorithm and better recognition accuracy of 2% than conventional method.

  • PDF

Detection of Character Emotional Type Based on Classification of Emotional Words at Story (스토리기반 저작물에서 감정어 분류에 기반한 등장인물의 감정 성향 판단)

  • Baek, Yeong Tae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.18 no.9
    • /
    • pp.131-138
    • /
    • 2013
  • In this paper, I propose and evaluate the method that classifies emotional type of characters with their emotional words. Emotional types are classified as three types such as positive, negative and neutral. They are selected by classification of emotional words that characters speak. I propose the method to extract emotional words based on WordNet, and to represent as emotional vector. WordNet is thesaurus of network structure connected by hypernym, hyponym, synonym, antonym, and so on. Emotion word is extracted by calculating its emotional distance to each emotional category. The number of emotional category is 30. Therefore, emotional vector has 30 levels. When all emotional vectors of some character are accumulated, her/his emotion of a movie can be represented as a emotional vector. Also, thirty emotional categories can be classified as three elements of positive, negative, and neutral. As a result, emotion of some character can be represented by values of three elements. The proposed method was evaluated for 12 characters of four movies. Result of evaluation showed the accuracy of 75%.