• Title/Summary/Keyword: image clustering (영상 군집화)

Font Classification using NMF and EMD (NMF와 EMD를 이용한 영문자 활자체 폰트분류)

  • Lee, Chang-Woo;Kang, Hyun;Jung, Kee-Chul;Kim, Hang-Joon
    • Proceedings of the Korean Information Science Society Conference / 2004.04b / pp.688-690 / 2004
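  • Many methods for document structure analysis and automatic document classification have recently been published, aimed at efficiently managing and retrieving digitized document images. This paper proposes a method that automatically classifies fonts using the NMF (non-negative matrix factorization) algorithm. Based on the assumption that the distinguishing features of fonts can be represented as spatially localized parts, the proposed method learns the parts that distinguish each font from the entire set of font images and uses the learned parts as features for font classification. Templates are generated from the learned font features with a hierarchical clustering algorithm, and test patterns are classified by their EMD (earth mover's distance) to the template patterns. The experiments examine the spatially local features of the font images and show that these features are suitable for font identification. Used as a preprocessor for existing character recognition and document retrieval systems, the proposed method is expected to improve their performance.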
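
The template-matching step can be illustrated with a minimal sketch. It assumes each pattern has already been reduced to a 1-D histogram of feature weights, which is a simplification: the abstract does not say how the NMF features are arranged for the EMD. The function names `emd_1d` and `classify` and the template labels `"serif"` and `"sans"` are hypothetical.

```python
from itertools import accumulate


def emd_1d(p, q):
    """Earth mover's distance between two 1-D histograms with unit bin spacing."""
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    sp, sq = sum(p), sum(q)
    if sp <= 0 or sq <= 0:
        raise ValueError("histograms must have positive total mass")
    # Normalize so both histograms carry the same total mass (1.0).
    p = [x / sp for x in p]
    q = [x / sq for x in q]
    # In 1-D, the EMD equals the L1 distance between the cumulative sums.
    return sum(abs(a - b) for a, b in zip(accumulate(p), accumulate(q)))


def classify(test_hist, templates):
    """Return the label of the template with the smallest EMD to test_hist."""
    return min(templates, key=lambda name: emd_1d(test_hist, templates[name]))


# Hypothetical templates, standing in for cluster representatives.
templates = {"serif": [4, 3, 2, 1], "sans": [1, 2, 3, 4]}
label = classify([3, 3, 2, 2], templates)
```

The cumulative-sum shortcut holds only for 1-D histograms with equal total mass; a general EMD between multi-dimensional signatures requires solving a transportation problem.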

Two-Dimensional Shape Description of Objects using The Contour Fluctuation Ratio (윤곽선 변동율을 이용한 물체의 2차원 형태 기술)

  • 김민기
    • Journal of Korea Multimedia Society / v.5 no.2 / pp.158-166 / 2002
  • In this paper, we propose a contour shape description method that uses the CFR (contour fluctuation ratio) feature. The CFR is the ratio of the line length to the curve length of a contour segment. The line length is the distance between the two end points of a contour segment, and the curve length is the sum of the distances between all pairs of adjacent points on the segment. Because each CFR is computed from a contour segment, the segments must be invariant to rotation and scale. We obtain such segments by using interleaved contour segments, whose length is proportional to the entire contour length and which are generated from every point on the contour. The CFR can describe local or global features of a contour shape, depending on the unit length of the contour segment. We therefore describe the shape of an object with a feature vector that represents the distribution of CFRs, and we calculate similarity by comparing the feature vectors of segments with corresponding unit lengths. We implemented the proposed method and tested it on 165 rotated and scaled fish images of fifteen types. The experimental results show that the proposed method is not only invariant to rotation and scale but also superior to the NCCH and TRP methods in clustering power.
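
The CFR definition can be written down directly. The sketch below assumes a closed contour given as a list of (x, y) points and approximates "segment length proportional to the contour length" by a fixed fraction of the point count; that approximation, and the names `cfr` and `interleaved_cfrs`, are assumptions rather than details from the paper.

```python
import math


def cfr(segment):
    """Contour fluctuation ratio: end-to-end line length over curve length."""
    if len(segment) < 2:
        raise ValueError("a contour segment needs at least two points")
    line_length = math.dist(segment[0], segment[-1])
    curve_length = sum(math.dist(a, b) for a, b in zip(segment, segment[1:]))
    if curve_length == 0:
        raise ValueError("degenerate segment: all points coincide")
    return line_length / curve_length


def interleaved_cfrs(contour, fraction):
    """CFR of the segment starting at every point of a closed contour.

    Each segment spans `fraction` of the contour's points (an assumed
    stand-in for a length proportional to the whole contour length).
    """
    n = len(contour)
    k = max(2, round(n * fraction))  # points per segment
    return [cfr([contour[(i + j) % n] for j in range(k)]) for i in range(n)]


square = [(0, 0), (1, 0), (1, 1), (0, 1)]
ratios = interleaved_cfrs(square, 0.75)
```

A straight segment gives a CFR of 1, and the value drops toward 0 as the segment bends back on itself. A histogram of these ratios would then play the role of the feature vector described in the abstract.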

Depth Map Pre-processing using Gaussian Mixture Model and Mean Shift Filter (혼합 가우시안 모델과 민쉬프트 필터를 이용한 깊이 맵 부호화 전처리 기법)

  • Park, Sung-Hee;Yoo, Ji-Sang
    • Journal of the Korea Institute of Information and Communication Engineering / v.15 no.5 / pp.1155-1163 / 2011
  • In this paper, we propose a new pre-processing algorithm that is applied to depth maps to improve coding efficiency. The 3DV/FTV group in MPEG is currently working on a standard for 3DVC (3D video coding), but the compression method for depth map images has not been settled yet. In the proposed algorithm, we divide the histogram distribution of a given depth map with a GMM-based EM clustering method and then classify the depth map into several layered images. We then apply a different mean shift filter to each classified image according to whether it contains background or foreground. In other words, we try to maximize coding efficiency by preserving the boundary of each object while averaging the region inside the boundary. Experiments on many test images show that the proposed algorithm reduces the bit count by 19% to 20% and also reduces computation time.
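
The layering step can be sketched as a 1-D Gaussian mixture fitted with EM over depth values, followed by assigning each value to its most likely component. This is a minimal illustration, not the authors' implementation: the two-component setup, the initial means, the sample values, and the names `fit_gmm_1d` and `assign_layers` are all assumptions, and the per-layer mean shift filtering is not shown.

```python
import math


def gaussian(x, mean, var):
    """Density of a 1-D normal distribution."""
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)


def fit_gmm_1d(values, init_means, iterations=50, min_var=1e-3):
    """Fit a 1-D Gaussian mixture with EM, starting from init_means."""
    k, n = len(init_means), len(values)
    means = list(init_means)
    mu = sum(values) / n
    global_var = max(sum((v - mu) ** 2 for v in values) / n, min_var)
    variances = [global_var] * k
    weights = [1.0 / k] * k
    for _ in range(iterations):
        # E-step: responsibility of each component for each value.
        resp = []
        for v in values:
            dens = [w * gaussian(v, m, s)
                    for w, m, s in zip(weights, means, variances)]
            total = sum(dens) or 1e-300  # guard against underflow to zero
            resp.append([d / total for d in dens])
        # M-step: re-estimate weights, means, and variances.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj < 1e-12:
                continue  # leave an empty component unchanged
            weights[j] = nj / n
            means[j] = sum(r[j] * v for r, v in zip(resp, values)) / nj
            var = sum(r[j] * (v - means[j]) ** 2 for r, v in zip(resp, values)) / nj
            variances[j] = max(var, min_var)
    return weights, means, variances


def assign_layers(values, weights, means, variances):
    """Index of the most likely mixture component for each value."""
    return [max(range(len(means)),
                key=lambda j: weights[j] * gaussian(v, means[j], variances[j]))
            for v in values]


# Hypothetical 8-bit depth samples: a far layer near 30, a near layer near 200.
depths = [30, 32, 31, 29, 200, 202, 199, 201]
weights, means, variances = fit_gmm_1d(depths, init_means=[0, 255])
layers = assign_layers(depths, weights, means, variances)
```

In practice the number of components would have to be chosen for each depth map, and the fit would run on the histogram rather than on a handful of samples.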

Super-Pixel-Based Segmentation and Classification for UAV Image (슈퍼 픽셀기반 무인항공 영상 영역분할 및 분류)

  • Kim, In-Kyu;Hwang, Seung-Jun;Na, Jong-Pil;Park, Seung-Je;Baek, Joong-Hwan
    • Journal of Advanced Navigation Technology / v.18 no.2 / pp.151-157 / 2014
  • Recently, UAVs (unmanned aerial vehicles) have been frequently used not only for military purposes but also for civil purposes. A UAV navigates automatically by following coordinates entered in advance, using GPS information. However, this is impossible when the GPS signal cannot be received because of jamming or external interference. To solve this problem, we propose a real-time segmentation and classification algorithm for specific regions in UAV images. We use a super-pixel algorithm based on graph-based image segmentation as a pre-processing stage for feature extraction. We choose the most suitable model by analyzing various color models and mixture color models. For classification, we use a support vector machine, a machine learning algorithm that can work with a small quantity of training data. Eighteen color and texture feature vectors are extracted from the UAV image, and three classes of regions (river, vinyl house, and rice field) are then classified in real time through training and prediction processes.

A Study on Gesture Recognition Using Principal Factor Analysis (주 인자 분석을 이용한 제스처 인식에 관한 연구)

  • Lee, Yong-Jae;Lee, Chil-Woo
    • Journal of Korea Multimedia Society / v.10 no.8 / pp.981-996 / 2007
  • In this paper, we describe a method that recognizes gestures by using principal factor analysis to obtain motion feature information from sequential gesture images. The algorithm first segments a two-dimensional silhouette region containing the human gesture and then extracts geometric features from it. Here, principal factor analysis is used to select global feature information as meaningful key features that express gestures effectively. Motion history information, which represents the time variation of gestures and is obtained from the extracted features, constructs one gesture subspace. Finally, the model feature values projected into the gesture space are transformed into specific state symbols by a grouping algorithm, to be used as input symbols for an HMM, and the input gesture is recognized as one of the model gestures with high probability. Because the algorithm constructs gesture models from feature factors that have a high contribution rate under principal factor analysis, the proposed method achieves a higher recognition rate than methods that use only shape information of the human body, as in appearance-based methods, or that extract features intuitively from complicated gestures.

Traffic Sign Recognition by the Variant-Compensation and Circular Tracing (변형 보정과 원형 추적법에 의한 교통 표지판 인식)

  • Lee, Woo-Beom
    • Journal of the Institute of Convergence Signal Processing / v.9 no.3 / pp.188-194 / 2008
  • We propose a new method for traffic sign recognition, which is one component of a DAS (driving assistance system) in intelligent vehicles. Our approach uses a geometric method to estimate the degree of variation in traffic signs that vary in noise, rotation, and size, and it extracts the recognition symbol from the compensated traffic sign by using sequential color-based clustering. The proposed clustering method classifies traffic signs into attention, regulation, indication, and auxiliary classes. The circular tracing method is then used for the final traffic sign recognition. To evaluate the effectiveness of the proposed method, a set of varied traffic signs was built. The proposed method achieved a 95% recognition rate for a single variation and a 93% recognition rate for mixed variations.

Extraction of Water Body Area using Micro Satellite SAR: A Case Study of the Daecheng Dam of South korea (초소형 SAR 위성을 활용한 수체면적 추출: 대청댐 유역 대상)

  • PARK, Jongsoo;KANG, Ki-Mook;HWANG, Eui-Ho
    • Journal of the Korean Association of Geographic Information Studies / v.24 no.4 / pp.41-54 / 2021
  • Estimating water body area by remote sensing is essential for water resource management and for the analysis and prediction of water disaster damage. Satellite-based water body detection has mainly been performed with large satellites equipped with optical and SAR sensors. However, because of their long repeat cycle, timely use in the event of a disaster is limited. The recent active development of micro satellites offers an opportunity to overcome the temporal resolution limits of existing large satellites. The micro satellites currently in active operation are ICEYE of Finland and Capella of the United States, which are operated in clusters for earth observation. Because of this clustered operation, they have a short revisit cycle and high resolution, and their SAR sensors allow observation regardless of weather or time of day. In this study, we describe the operation status and characteristics of micro satellites and apply a water area estimation technique optimized for micro SAR satellite images to the Daecheong Dam basin on the Korean Peninsula. Accuracy was verified against a reference water area generated from the optical satellite Sentinel-2. The Capella satellite showed the smallest difference in area, and all three images showed high correlation. The results confirm that the water area can be estimated despite the low NESZ of micro satellites, and they suggest that the limitations of water resource and water disaster monitoring with existing large SAR satellites can be overcome.

Error Resilient Performance Evaluation of MPEG-4 and H.264/AVC (MPEG-4 와 H.264/AVC의 에러 강인 기술 성능 평가)

  • 정봉수;황영휘;전병우;김명돈;최송인
    • Journal of the Institute of Electronics Engineers of Korea SP / v.41 no.5 / pp.203-216 / 2004
  • Recent advances in video coding technology have resulted in rapid growth of applications in mobile communication. With this explosive growth, reliable transmission and error resilience techniques have become increasingly necessary to offer high-quality multimedia services. In this paper, we present the results of our evaluation of the error resilience performance of the MPEG-4 simple profile under H.324/M and of the H.264/AVC baseline profile over IP packet networks. In particular, we tested the error resilience tools of the MPEG-4 simple profile, such as resynchronization marker insertion and data partitioning, and those of the H.264/AVC baseline profile, such as the flexible macroblock ordering (FMO) scheme. The objective quality of the decoded video is measured in terms of rate and PSNR under various random bit error and burst error conditions.
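
For reference, PSNR between a reference frame and a decoded frame can be computed as below. This is a generic sketch that assumes 8-bit samples flattened into lists; the function name `psnr` and the sample values are illustrative and are not taken from the paper's test setup.

```python
import math


def psnr(reference, decoded, peak=255):
    """Peak signal-to-noise ratio in dB between two equal-length sample lists."""
    if not reference or len(reference) != len(decoded):
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - d) ** 2 for r, d in zip(reference, decoded)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10 * math.log10(peak ** 2 / mse)


# Hypothetical luma samples: every decoded sample is off by one level.
quality_db = psnr([100, 100, 100, 100], [101, 99, 101, 99])
```

With a mean squared error of 1 on 8-bit data, the result is 10·log10(255²), roughly 48.13 dB.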

Graph Cut-based Automatic Color Image Segmentation using Mean Shift Analysis (Mean Shift 분석을 이용한 그래프 컷 기반의 자동 칼라 영상 분할)

  • Park, An-Jin;Kim, Jung-Whan;Jung, Kee-Chul
    • Journal of KIISE:Software and Applications / v.36 no.11 / pp.936-946 / 2009
  • Graph cuts have recently attracted a lot of attention for image segmentation, as they can globally minimize energy functions composed of a data term, which reflects how well each pixel fits the prior information for each class, and a smoothness term, which penalizes discontinuities between neighboring pixels. Previous approaches to graph cuts-based automatic image segmentation generally used a GMM (Gaussian mixture model), with the means and covariance matrices calculated by the EM algorithm serving as prior information for each cluster. However, this is practicable only for clusters with a hyper-spherical or hyper-ellipsoidal shape, because each cluster is represented by a covariance matrix centered on the mean. For arbitrarily shaped clusters, this paper proposes graph cuts-based image segmentation using mean shift analysis. As prior information for estimating the data term, we use the set of mean trajectories toward each mode from initial means randomly selected in $L^*u^*v^*$ color space. Since the mean shift procedure is computationally expensive, we transform features in the continuous feature space into a 3D discrete grid and use a 3D kernel based on the first moment in the grid, which is needed to move the means to the modes. In the experiments, we investigate the problems of the recently popular mean shift-based and normalized cuts-based image segmentation methods, and the proposed method shows better performance than these two methods and than graph cuts-based automatic image segmentation using a GMM on the Berkeley segmentation dataset.
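
A mean trajectory toward a mode can be sketched as follows. The example uses a flat kernel in continuous feature space; it does not reproduce the paper's 3D discrete grid or its first-moment kernel. The function name `mean_shift_trajectory`, the bandwidth, and the sample points are assumptions for illustration.

```python
import math


def mean_shift_trajectory(points, start, bandwidth, max_iter=100, tol=1e-6):
    """Means visited while shifting from `start` toward a mode (flat kernel)."""
    mean = tuple(start)
    trajectory = [mean]
    for _ in range(max_iter):
        # Flat kernel: every point within `bandwidth` counts equally.
        window = [p for p in points if math.dist(p, mean) <= bandwidth]
        if not window:
            break  # nothing in range, so the mean cannot move
        new_mean = tuple(sum(c) / len(window) for c in zip(*window))
        trajectory.append(new_mean)
        if math.dist(new_mean, mean) < tol:
            break  # converged on a mode
        mean = new_mean
    return trajectory


# Hypothetical 3-D color features forming two well-separated groups.
points = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (10, 10, 10), (11, 10, 10)]
path = mean_shift_trajectory(points, start=(2, 2, 2), bandwidth=4)
```

Collecting such trajectories from several random starting means would give a point set that can follow arbitrarily shaped clusters, which is the role the abstract assigns to the prior information.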

Comparisons of MPEG-7 Texture Descriptors for Iris recognition (MPEG-7 텍스쳐 서술자의 홍채 인식에 대한 성능 비교)

  • Choo, Hyon-Gon;Kim, Whoi-Yul
    • The KIPS Transactions:PartB / v.11B no.4 / pp.421-428 / 2004
  • There are three texture descriptors in MPEG-7: Homogeneous Texture, Edge Histogram, and Texture Browsing. In this paper, we present a comparative analysis of the capability of the MPEG-7 texture descriptors for iris recognition, as an application of MPEG-7 descriptors. By comparing the clustering efficiency and error distribution of the descriptors on 560 iris images, we analyze their ability to discriminate between different iris groups. The results show that the Homogeneous Texture descriptor is the best discriminator of the three for recognizing iris patterns. However, compared with conventional iris recognition methods, more work is needed to improve the results.