• 제목/요약/키워드: feature coding

검색결과 204건 처리시간 0.029초

무선 패킷 네트워크에서의 채널 적응형 양방향 움직임 벡터 추적 기술 (Channel-Adaptive Bidirectional Motion Vector Tracking over Wireless Packet Network)

  • 변재영
    • 전자공학회논문지CI
    • /
    • 제44권1호
    • /
    • pp.94-101
    • /
    • 2007
  • 스트리밍 비디오 서비스는 최근 이종망으로 구성되는 무선망에서 중요한 어플리케이션으로 자리잡을 것으로 예상된다. 그러나 군집성있는 패킷 손실에 의해서 서비스의 충분한 품질이 보장되지 않는다. 무선망에서 패킷 손실에 대한 효율적인 해결책은 수신단에서 적절한 에러 은닉 기술을 사용하는 방법일 것이다. 그러나 대부분의 에러 은닉 기술은 손실 블록에 인접한 이웃 블록들이 군집성 패킷 손실에 의해 이미 손실되었기 때문에 효율적으로 손실된 블록열들을 복원하기 어렵다. 이를 해결하기 위해 손실된 MB에서의 움직임 선형 특성을 이용하는 bidirectional motion vector tracking (BMVT)가 이전에 제안되었었다. 본 논문에서는 BMVT 에러 은닉 기술을 향상시킨 채널 적응형 잉여 코딩 방식이 소개되어진다.

다면기법 SPFACS 영상객체를 이용한 AAM 알고리즘 적용 미소검출 설계 분석 (Using a Multi-Faced Technique SPFACS Video Object Design Analysis of The AAM Algorithm Applies Smile Detection)

  • 최병관
    • 디지털산업정보학회논문지
    • /
    • 제11권3호
    • /
    • pp.99-112
    • /
    • 2015
  • Digital imaging technology has advanced beyond the limits of the multimedia industry IT convergence, and to develop a complex industry, particularly in the field of object recognition, face smart-phones associated with various Application technology are being actively researched. Recently, face recognition technology is evolving into an intelligent object recognition through image recognition technology, detection technology, the detection object recognition through image recognition processing techniques applied technology is applied to the IP camera through the 3D image object recognition technology Face Recognition been actively studied. In this paper, we first look at the essential human factor, technical factors and trends about the technology of the human object recognition based SPFACS(Smile Progress Facial Action Coding System)study measures the smile detection technology recognizes multi-faceted object recognition. Study Method: 1)Human cognitive skills necessary to analyze the 3D object imaging system was designed. 2)3D object recognition, face detection parameter identification and optimal measurement method using the AAM algorithm inside the proposals and 3)Face recognition objects (Face recognition Technology) to apply the result to the recognition of the person's teeth area detecting expression recognition demonstrated by the effect of extracting the feature points.

A Construction Method of Expert Systems in an Integrated Environment

  • Chen, Hui
    • 한국지능정보시스템학회:학술대회논문집
    • /
    • 한국지능정보시스템학회 2001년도 The Pacific Aisan Confrence On Intelligent Systems 2001
    • /
    • pp.211-218
    • /
    • 2001
  • This paper introduces a method of constructing expert systems in an integrated environment for automatic software design. This integrated environment may be applicable from top-level system architecture design, data flow diagram design down to flow chart and coding. The system is integrated with three CASE tools, FSD (Functional Structure Diagram), DFD (Data Flow Diagram) and structured chart PAD (Problem Analysis Diagram), and respective expert systems with automatic design capability by reusing past design. The construction way of these expert systems is based on systematic acquisition of design knowledge stemmed from a systematic design work process of well-matured developers. The design knowledge is automatically acquired from respective documents and stored in the respective knowledge bases. By reusing it, a similar software system may be designed automatically. In order to develop these expert systems in a short period, these design knowledge is expressed by the unified frame structure, functions of th expert system units are partitioned mono-functions and then standardized components. As a result, the design cost of an expert system can be reduced to standard work procedures. Another feature of this paper is to introduce the integrated environment for automatic software design. This system features an essentially zero start-up cost for automatic design resulting in substantial saving of design man-hours in the resulting in substantial saving of design man-hours in the design life cycle, and the expected increase in software productivity after enough design experiences are accumulated.

  • PDF

선형적 특징을 추출하기 위한 퍼지 후프 방법 (Fuzzy Scheme for Extracting Linear Features)

  • 주문원;최영미
    • 한국멀티미디어학회논문지
    • /
    • 제2권2호
    • /
    • pp.129-136
    • /
    • 1999
  • 특정 이미지에서의 선형적 특정은 이미지를 분석하고 이해하는데 충분한 정보를 제공하기도 한다. 본고에 서는 이미지에서 선형적 특징을 추출하기 위한 신뢰성 있는 방법을 제시한다. 일반적으로 후프 변형 방법은 이러한 선형적 특정을 추출하는 최적의 방법 중의 하나로 인식되어 왔다. 대부분의 후프 기반 방법들은 특정 edge 모델올 선택하고, 인식된 edge 픽셀의 속성을 반영하는 변형식을 활용하여 파라미터 공간에 그 발생빈도 를 기록하는 과정을 거치게 된다. 주로 edge 픽셀의 gradient 크기와 방향이 선형적 특정을 결정하는데 사용되 지만, 본고에서는 그 값틀이 퍼지변수로 활용될 수 있음을 보이고 파라미터 공간에 누적값을 계산하는데 활용한다- 이 방법을 기존의 방법과 비교하기 위하여 에러 측정 방식을 제안하고, 실험을 한 결과, 기존의 방법과 비교하여 우수한 성능을 보인다.

  • PDF

Automatic Visual Feature Extraction And Measurement of Mushroom (Lentinus Edodes L.)

  • Heon-Hwang;Lee, C.H.;Lee, Y.K.
    • 한국농업기계학회:학술대회논문집
    • /
    • 한국농업기계학회 1993년도 Proceedings of International Conference for Agricultural Machinery and Process Engineering
    • /
    • pp.1230-1242
    • /
    • 1993
  • In a case of mushroom (Lentinus Edodes L.) , visual features are crucial for grading and the quantitative evaluation of the growth state. The extracted quantitative visual features can be used as a performance index for the drying process control or used for the automatic sorting and grading task. First, primary external features of the front and back sides of mushroom were analyzed. And computer vision based algorithm were developed for the extraction and measurement of those features. An automatic thresholding algorithm , which is the combined type of the window extension and maximum depth finding was developed. Freeman's chain coding was modified by gradually expanding the mask size from 3X3 to 9X9 to preserve the boundary connectivity. According to the side of mushroom determined from the automatic recognition algorithm size thickness, overall shape, and skin texture such as pattern, color (lightness) ,membrane state, and crack were quantified and measured. A portion of t e stalk was also identified and automatically removed , while reconstructing a new boundary using the Overhauser curve formulation . Algorithms applied and developed were coded using MS_C language Ver, 6.0, PC VISION Plus library functions, and VGA graphic function as a menu driven way.

  • PDF

A Preprocessing Algorithm for Efficient Lossless Compression of Gray Scale Images

  • Kim, Sun-Ja;Hwang, Doh-Yeun;Yoo, Gi-Hyoung;You, Kang-Soo;Kwak, Hoon-Sung
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2005년도 ICCAS
    • /
    • pp.2485-2489
    • /
    • 2005
  • This paper introduces a new preprocessing scheme to replace original data of gray scale images with particular ordered data so that performance of lossless compression can be improved more efficiently. As a kind of preprocessing technique to maximize performance of entropy encoder, the proposed method converts the input image data into more compressible form. Before encoding a stream of the input image, the proposed preprocessor counts co-occurrence frequencies for neighboring pixel pairs. Then, it replaces each pair of adjacent gray values with particular ordered numbers based on the investigated co-occurrence frequencies. When compressing ordered image using entropy encoder, we can expect to raise compression rate more highly because of enhanced statistical feature of the input image. In this paper, we show that lossless compression rate increased by up to 37.85% when comparing results from compressing preprocessed and non-preprocessed image data using entropy encoder such as Huffman, Arithmetic encoder.

  • PDF

MFCC를 이용한 GMM 기반의 음성/혼합 신호 분류 (Speech/Mixed Content Signal Classification Based on GMM Using MFCC)

  • 김지은;이인성
    • 전자공학회논문지
    • /
    • 제50권2호
    • /
    • pp.185-192
    • /
    • 2013
  • 본 논문에서는 MFCC를 이용한 GMM 기반의 음성과 혼합 신호 분류 알고리즘을 MPEG의 표준 코덱인 USAC에 적용하였다. 효과적인 패턴 인식을 위해 GMM을 이용하였고, EM알고리즘을 사용하여 최적의 GMM 파라미터를 추출하였다. 제안하는 분류 알고리즘은 두 가지 중요한 부분으로 나뉜다. 첫째는 GMM을 통해 최적의 파라미터를 추출하는 것 이고, 두 번째는 MFCC 값을 이용한 패턴인식을 통해 음성/혼합 신호를 분류하였다. 제안된 알고리즘의 성능을 평가한 결과 MFCC를 이용한 GMM 기반의 제안된 방법이 기존 USAC의 방법보다 우수한 음성/혼합 신호 분류 성능을 보였다.

Post-Processing for JPEG-Coded Image Deblocking via Sparse Representation and Adaptive Residual Threshold

  • Wang, Liping;Zhou, Xiao;Wang, Chengyou;Jiang, Baochen
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권3호
    • /
    • pp.1700-1721
    • /
    • 2017
  • The problem of blocking artifacts is very common in block-based image and video compression, especially at very low bit rates. In this paper, we propose a post-processing method for JPEG-coded image deblocking via sparse representation and adaptive residual threshold. This method includes three steps. First, we obtain the dictionary by online dictionary learning and the compressed images. The dictionary is then modified by the histogram of oriented gradient (HOG) feature descriptor and K-means cluster. Second, an adaptive residual threshold for orthogonal matching pursuit (OMP) is proposed and used for sparse coding by combining blind image blocking assessment. At last, to take advantage of human visual system (HVS), the edge regions of the obtained deblocked image can be further modified by the edge regions of the compressed image. The experimental results show that our proposed method can keep the image more texture and edge information while reducing the image blocking artifacts.

W-CDMA 시스템을 위한 가변율 음성코덱 설계 (Design of a variable rate speech codec for the W-CDMA system)

  • 정우성
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 제15회 음성통신 및 신호처리 워크샵(KSCSP 98 15권1호)
    • /
    • pp.142-147
    • /
    • 1998
  • Recently, 8 kb/s CS-ACELP coder of G.729 is atandardized by ITU-T SG15 and it has been reported that the speech quality of G729 is better than or equal to that of 32kb/s ADPCM. However G.729 is the fixed rate speech coder, and it does not consider the property of voice activity in mutual conversation. If we use the voice activity, we can reduce the average bit rate in half without any degradations of the speech quality. In this paper, we propose an efficient variable rate algorithm for G.729. The variable rate algorithm consists of two main subjects, the rate determination algorithm and algorithm, we combine the energy-thresholding method, the phonetic segmentation method by integration of various feature parameters obtained through the analysis procedure, and the variable hangover period method. Through the analysis of noise features, the 1 kb/s sub rate coder is designed for coding the background noise signal. So, we design the 4 kb/s sub rate coder for the unvoiced parts. The performance of the variable rate algorithm is evaluated by the comparison of speed quality and average bit rate with G.729. Subjective quality test is also done by MOS test. Conclusively, it is verified that the proposed variable rate CS-ACELP coder produced the same speech quality as G.729, at the average bit rate of 4.4 kb/s.

  • PDF

JPEG-2000 Gradient-Based Coding: An Application To Object Detection

  • Lee, Dae Yeol;Pinto, Guilherme O.;Hemami, Sheila S.
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2013년도 추계학술대회
    • /
    • pp.165-168
    • /
    • 2013
  • Image distortions, such as quantization errors, can have a severe negative impact on the performance of computer vision algorithms, and, more specifically, on object detection algorithms. State-of-the-art implementations of the JPEG-2000 image coder commonly allocate the available bits to minimize the Mean-Squared-Error (MSE) distortion between the original image and the resulting compressed image. However, considering that some state-of-the-art object detection methods use the gradient information as the main image feature, an improved object detection performance is expected for JPEG-2000 image coders that allocate the available bits to minimize the distortions on the gradient content. Accordingly, in this work, the Gradient Mean-Squared-Error (GMSE) based JPEG-2000 coder presents an improved object detection performance over the MSE based JPEG-2000 image coder when the object of interest is located at the same spatial location of the image regions with the strongest gradients and also for high bit-rates. For low bit-rates (e.g. 0.07bpp), the GMSE based JPEG-2000 image coder becomes overly selective in choosing the gradients to preserve, and, as a result, there is a greater chance of mismatch between the spatial locations of the gradients that the coder is trying to preserve and the spatial locations of the objects of interest.

  • PDF