• Title/Summary/Keyword: 블록매칭

Search Result 160, Processing Time 0.025 seconds

Design & Implementation of Lipreading System using Robust Lip Area Extraction (견고한 입술 영역 추출을 이용한 립리딩 시스템 설계 및 구현)

  • 이은숙;이호근;이지근;김봉완;이상설;이용주;정성태
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2003.05b
    • /
    • pp.524-527
    • /
    • 2003
  • 최근 들어 립리딩은 멀티모달 인터페이스 기술의 응용분야에서 많은 관심을 모으고 있다. 동적 영상을 이용한 립리딩 시스템에서 해결해야 할 주된 문제점은 상황 변화에 독립적인 얼굴 영역과 입술 영역을 추출하는 것이다. 본 논문에서는 움직임이 있는 영상에서 화자의 얼굴영역과 입술영역을 컬러, 조명등의 변화에 독립적으로 추출하기 위해 HSI 모델과 블록 매칭을 이용하였고 특징 점 추출에는 이미지 기반 방법인 PCA 기법을 이용하였다. 추출된 입술 파라미터와 음성 데이터에 각각 HMM 기반 패턴 인식 방법을 개별적으로 적용하여 단어를 인식하였고 각각의 인식 결과를 가중치를 주어 합병하였다. 실험 결과에 의하면 잡음으로 음성 인식률이 낮아지는 경우에 음성인식과 립리딩을 함께 사용함으로써 전체적인 인식 결과를 향상시킬 수 있었다.

  • PDF

A Study on Effective Moving Object Segmentation and Fast Tracking Algorithm (효율적인 이동물체 분할과 고속 추적 알고리즘에 관한 연구)

  • Jo, Yeong-Seok;Lee, Ju-Sin
    • The KIPS Transactions:PartB
    • /
    • v.9B no.3
    • /
    • pp.359-368
    • /
    • 2002
  • In this paper, we propose effective boundary line extraction algorithm for moving objects by matching error image and moving vectors, and fast tracking algorithm for moving object by partial boundary lines. We extracted boundary line for moving object by generating seeds with probability distribution function based on Watershed algorithm, and by extracting boundary line for moving objects through extending seeds, and then by using moving vectors. We processed tracking algorithm for moving object by using a part of boundary lines as features. We set up a part of every-direction boundary line for moving object as the initial feature vectors for moving objects. Then, we tracked moving object within current frames by using feature vector for the previous frames. As the result of the simulation for tracking moving object on the real images, we found that tracking processing of the proposed algorithm was simple due to tracking boundary line only for moving object as a feature, in contrast to the traditional tracking algorithm for active contour line that have varying processing cost with the length of boundary line. The operations was reduced about 39% as contrasted with the full search BMA. Tracking error was less than 4 pixel when the feature vector was $(15\times{5)}$ through the information of every-direction boundary line. The proposed algorithm just needed 200 times of search operation.

Sparse Signal Recovery with Parallel Orthogonal Matching Pursuit for Multiple Measurement Vectors (병렬OMP 기법을 통한 복수 측정 벡터기반 성긴 신호의 복원)

  • Park, Jeonghong;Ban, Tae Won;Jung, Bang Chul
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.10
    • /
    • pp.2252-2258
    • /
    • 2013
  • In this paper, parallel orthogonal matching pursuit (POMP) is proposed to supplement the simultaneous orthogonal matching pursuit (S-OMP) which has been widely used as a greedy algorithm for sparse signal recovery for multiple measurement vector (MMV) problem. The process of POMP is simple but effective: (1) multiple indexes maximally correlated with the observation vector are chosen at the first iteration, (2) the conventional S-OMP process is carried out in parallel for each selected index, (3) the index set which yields the minimum residual is selected for reconstructing the original sparse signal. Empirical simulations show that POMP for MMV outperforms than the conventional S-OMP both in terms of exact recovery ratio (ERR) and mean-squared error (MSE).

Automatic Detecting of Joint of Human Body and Mapping of Human Body using Humanoid Modeling (인체 모델링을 이용한 인체의 조인트 자동 검출 및 인체 매핑)

  • Kwak, Nae-Joung;Song, Teuk-Seob
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.4
    • /
    • pp.851-859
    • /
    • 2011
  • In this paper, we propose the method that automatically extracts the silhouette and the joints of consecutive input image, and track joints to trace object for interaction between human and computer. Also the proposed method presents the action of human being to map human body using joints. To implement the algorithm, we model human body using 14 joints to refer to body size. The proposed method converts RGB color image acquired through a single camera to hue, saturation, value images and extracts body's silhouette using the difference between the background and input. Then we automatically extracts joints using the corner points of the extracted silhouette and the data of body's model. The motion of object is tracted by applying block-matching method to areas around joints among all image and the human's motion is mapped using positions of joints. The proposed method is applied to the test videos and the result shows that the proposed method automatically extracts joints and effectively maps human body by the detected joints. Also the human's action is aptly expressed to reflect locations of the joints

Fast Disparity Estimation Method Considering Temporal and Spatial Redundancy Based on a Dynamic Programming (시.공간 중복성을 고려한 다이내믹 프로그래밍 기반의 고속 변이 추정 기법)

  • Yun, Jung-Hwan;Bae, Byung-Kyu;Park, Se-Hwan;Song, Hyok;Kim, Dong-Wook;Yoo, Ji-Sang
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.33 no.10C
    • /
    • pp.787-797
    • /
    • 2008
  • In this paper, we propose a fast disparity estimation method considering temporal and spatial redundancy based on a dynamic programming for stereo matching. For the first step, the dynamic programming is performed to estimate disparity vectors with correlation between neighboring pixels in an image. Next, we efficiently compensate regions, which disparity vectors are not allocated, with neighboring disparity vectors assuming that disparity vectors in same object are quite similar. Moreover, in case of video sequence, we can decrease a complexity with temporal redundancy between neighboring frames. For performance comparison, we generate an intermediate-view image using the estimated disparity vector. Test results show that the proposed algorithm gives $0.8{\sim}2.4dB$-increased PSNR(peak signal to noise ratio) compared to a conventional block matching algorithm, and the proposed algorithm also gives approximately 0.1dB-increased PSNR and $48{\sim}68%$-lower complexity compared to the disparity estimation method based on general dynamic programming.

Automatic Detecting and Tracking Algorithm of Joint of Human Body using Human Ratio (인체 비율을 이용한 인체의 조인트 자동 검출 및 객체 추적 알고리즘)

  • Kwak, Nae-Joung;Song, Teuk-Seob
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.4
    • /
    • pp.215-224
    • /
    • 2011
  • There have been studying many researches to detect human body and to track one with increasing interest on human and computer interaction. In this paper, we propose the algorithm that automatically extracts joints, linked points of human body, using the ratio of human body under single camera and tracks object. The proposed method gets the difference images of the grayscale images and ones of the hue images between input image and background image. Then the proposed method composes the results, splits background and foreground, and extracts objects. Also we standardize the ratio of human body using face' length and the measurement of human body and automatically extract joints of the object using the ratio and the corner points of the silhouette of object. After then, we tract the joints' movement using block-matching algorithm. The proposed method is applied to test video to be acquired through a camera and the result shows that the proposed method automatically extracts joints and effectively tracks the detected joints.

Fast Motion Estimation Algorithm using Selection of Candidates and Stability of Optimal Candidates (후보 선별과 최적후보 안정성을 이용한 고속 움직임 예측 알고리즘)

  • Kim, Jong Nam
    • Journal of Broadcast Engineering
    • /
    • v.23 no.5
    • /
    • pp.628-635
    • /
    • 2018
  • In this paper, we propose a fast motion estimation algorithm which is important in video encoding. So many fast motion estimation algorithms have been published for improving prediction quality and computational reduction. In the paper, we propose an algorithm that reduces unnecessary computation, while almost keeping prediction quality compared with the full search algorithm. The proposed algorithm calculates the sum of partial block matching error for each candidate, selects the candidates for the next step, compares the stability of optimal candidates with minimum error, and finds optimal motion vectors by determining the progress of the next step. By doing that, we can find the minimum error point as soon as possible and obtain fast computational speed by reducing unnecessary computations. Additionally, the proposed algorithm can be used with conventional fast motion estimation algorithms and prove it in the experimental results.

A High Speed Road Lane Detection based on Optimal Extraction of ROI-LB (관심영역(ROI-LB)의 최적 추출에 의한 차선검출의 고속화)

  • Cheong, Cha-Keon
    • Journal of Broadcast Engineering
    • /
    • v.14 no.2
    • /
    • pp.253-264
    • /
    • 2009
  • This paper presents an algorithm, aims at practical applications, for the high speed processing and performance enhancement of lane detection base on vision processing system. As a preprocessing for high speed lane detection, the vanishing line estimation and the optimal extraction of region of interest for lane boundary (ROI-LB) can be processed to reduction of detection region in which high speed processing is enabled. Image feature information is extracted only in the ROI-LB. Road lane is extracted using a non-parametric model fitting and Hough transform within the ROI-LB. With simultaneous processing of noise reduction and edge enhancement using the Laplacian filter, the reliability of feature extraction can be increased for various road lane patterns. Since outliers of edge at each block can be removed with clustering of edge orientation for each block within the ROI-LB, the performance of lane detection can be greatly improved. The various real road experimental results are presented to evaluate the effectiveness of the proposed method.

Intensity Correction of 3D Stereoscopic Images Using Binarization-Based Region Segmentation (이진화기반 영역분할을 이용한 3D입체영상의 밝기보정)

  • Kim, Sang-Hyun;Kim, Jeong-Yeop
    • The KIPS Transactions:PartB
    • /
    • v.18B no.5
    • /
    • pp.265-270
    • /
    • 2011
  • In this paper, we propose a method for intensity correction using binarization-based region segmentation in 3D stereoscopic images. In the proposed method, 3D stereoscopic right image is segmented using binarizarion. Small regions in the segmented image are eliminated. For each region in right image, a corresponding region in left image is decided through region matching using correlation coefficient. When region-based matching, in order to prevent overlap between regions, we remove a portion of the area closed to the region boundary using morphological filter. The intensity correction in left and right image can be performed through histogram specification between the corresponding regions. Simulation results show the proposed method has the smallest matching error than the conventional method when we generate the right image from the left image using block based motion compensation.

Gradual Block-based Efficient Lossy Location Coding for Image Retrieval (영상 검색을 위한 점진적 블록 크기 기반의 효율적인 손실 좌표 압축 기술)

  • Choi, Gyeongmin;Jung, Hyunil;Kim, Haekwang
    • Journal of Broadcast Engineering
    • /
    • v.18 no.2
    • /
    • pp.319-322
    • /
    • 2013
  • Image retrieval research activity has moved its focus from global descriptors to local descriptors of feature point such as SIFT. MPEG is Currently working on standardization of effective coding of location and local descriptors of feature point in the context mobile based image search driven application in the name of MPEG-7 CDVS (Compact Descriptor for Visual Search). The extracted feature points consist of two parts, location information and Descriptor. For efficient image retrieval, we proposed a novel method that is gradual block-based efficient lossy location coding to compress location information according to distribution in images. From experimental result, the number of average bits per feature point reduce 5~6% and the accuracy rate keep compared to state of the art TM 3.0.