• Title/Summary/Keyword: Data Set Comparing

Search Result 409, Processing Time 0.027 seconds

Moving Object Tracking Method in Video Data Using Color Segmentation (칼라 분할 방식을 이용한 비디오 영상에서의 움직이는 물체의 검출과 추적)

  • 이재호;조수현;김회율
    • Proceedings of the IEEK Conference
    • /
    • 2001.06d
    • /
    • pp.219-222
    • /
    • 2001
  • Moving objects in video data are main elements for video analysis and retrieval. In this paper, we propose a new algorithm for tracking and segmenting moving objects in color image sequences that include complex camera motion such as zoom, pan and rotating. The Proposed algorithm is based on the Mean-shift color segmentation and stochastic region matching method. For segmenting moving objects, each sequence is divided into a set of similar color regions using Mean-shift color segmentation algorithm. Each segmented region is matched to the corresponding region in the subsequent frame. The motion vector of each matched region is then estimated and these motion vectors are summed to estimate global motion. Once motion vectors are estimated for all frame of video sequences, independently moving regions can be segmented by comparing their trajectories with that of global motion. Finally, segmented regions are merged into the independently moving object by comparing the similarities of trajectories, positions and emerging period. The experimental results show that the proposed algorithm is capable of segmenting independently moving objects in the video sequences including complex camera motion.

  • PDF

Rule Weight-Based Fuzzy Classification Model for Analyzing Admission-Discharge of Dyspnea Patients (호흡곤란환자의 입-퇴원 분석을 위한 규칙가중치 기반 퍼지 분류모델)

  • Son, Chang-Sik;Shin, A-Mi;Lee, Young-Dong;Park, Hyoung-Seob;Park, Hee-Joon;Kim, Yoon-Nyun
    • Journal of Biomedical Engineering Research
    • /
    • v.31 no.1
    • /
    • pp.40-49
    • /
    • 2010
  • A rule weight -based fuzzy classification model is proposed to analyze the patterns of admission-discharge of patients as a previous research for differential diagnosis of dyspnea. The proposed model is automatically generated from a labeled data set, supervised learning strategy, using three procedure methodology: i) select fuzzy partition regions from spatial distribution of data; ii) generate fuzzy membership functions from the selected partition regions; and iii) extract a set of candidate rules and resolve a conflict problem among the candidate rules. The effectiveness of the proposed fuzzy classification model was demonstrated by comparing the experimental results for the dyspnea patients' data set with 11 features selected from 55 features by clinicians with those obtained using the conventional classification methods, such as standard fuzzy classifier without rule weights, C4.5, QDA, kNN, and SVMs.

Development of a CAD-based Utility for Topological Identification and Rasterized Mapping from Polygonal Vector Data (CAD 수단을 이용한 벡터형 공간자료의 위상 검출과 격자도면화를 위한 유틸리티 개발)

  • 조동범;임재현
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.27 no.4
    • /
    • pp.137-142
    • /
    • 1999
  • The purpose of this study is to develope a CAD-based tool for rasterization of polygonal vector map in AutoCAD. To identity the layer property of polygonal entity with user-defined coordinates as topology, algorithm in processing entity data of selection set that intersected with scan line was used, and the layers were extracted sequentially by sorted intersecting points in data-list. In addition to the functions for querying and modifying topology, two options for mapping were set up to construct plan projection type and to change meshes' properties in existing DTM data. In case of plan projection type, user-defined cell size of 3DFACE mesh is available for more detailed edge, and topological draping on landform can be executed in case of referring DTM data as an AutoCAD's drawing. The concept of algorithm was simple and clear, but some unexpectable errors were found in detecting intersected coordinates that were AutoCAD's error, not the utility's. Also, the routines to check these errors were included in algorithmic processing. Developed utility named MESHMAP was written in entity data control functions of AutoLISP language and dialog control language(DCL) for the purpose of user-oriented interactive usage. MESHMAP was proved to be more effective in data handling and time comparing with GRIDMAP module in LANDCADD which has similar function.

  • PDF

Effect of Heterogeneous Variance by Sex and Genotypes by Sex Interaction on EBVs of Postweaning Daily Gain of Angus Calves

  • Oikawa, T.;Hammond, K.;Tier, B.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.12 no.6
    • /
    • pp.850-853
    • /
    • 1999
  • Angus postweaning daily gain (PWDG) was analyzed to investigate effects of the heterogeneous variance and the genotypes by sex interaction on prediction of EBVs with data sets of various environmental levels. A whole data (16,239 records) was divided into six data sets according to averages of the best linear unbiased estimator (BLUE) of herd environment. The results comparing prediction models showed that single-trait model is adequate for most of the data sets except for the data set of poor environment for both of the bulls and the heifers where the heterogeneity of variance and the genotypes by sex interaction exists. In the prediction with the data set of the low environment level, the bull's EBVs by single-trait models had high product moment correlations with male EBVs of the bulls by the multitrait model. Whereas the heifer's EBVs had moderate correlations with female EBVs by the multitrait model. This moderate correlation seems to be resulted by the heterogeneity of variance and low heritability of the heifer's PWDG. The prediction models with heterogeneity of variance had little effect on the prediction of EBVs for the data sets with moderate to high genetic correlations.

Development of e-Mail Classifiers for e-Mail Response Management Systems (전자메일 자동관리 시스템을 위한 전자메일 분류기의 개발)

  • Kim, Kuk-Pyo;Kwon, Young-S.
    • Journal of Information Technology Services
    • /
    • v.2 no.2
    • /
    • pp.87-95
    • /
    • 2003
  • With the increasing proliferation of World Wide Web, electronic mail systems have become very widely used communication tools. Researches on e-mail classification have been very important in that e-mail classification system is a major engine for e-mail response management systems which mine unstructured e-mail messages and automatically categorize them. in this research we develop e-mail classifiers for e-mail Response Management Systems (ERMS) using naive bayesian learning and centroid-based classification. We analyze which method performs better under which conditions, comparing classification accuracies which may depend on the structure, the size of training data set and number of classes, using the different data set of an on-line shopping mall and a credit card company. The developed e-mail classifiers have been successfully implemented in practice. The experimental results show that naive bayesian learning performs better, while centroid-based classification is more robust in terms of classification accuracy.

Data Mining for Knowledge Management in a Health Insurance Domain

  • Chae, Young-Moon;Ho, Seung-Hee;Cho, Kyoung-Won;Lee, Dong-Ha;Ji, Sun-Ha
    • Journal of Intelligence and Information Systems
    • /
    • v.6 no.1
    • /
    • pp.73-82
    • /
    • 2000
  • This study examined the characteristicso f the knowledge discovery and data mining algorithms to demonstrate how they can be used to predict health outcomes and provide policy information for hypertension management using the Korea Medical Insurance Corporation database. Specifically this study validated the predictive power of data mining algorithms by comparing the performance of logistic regression and two decision tree algorithms CHAID (Chi-squared Automatic Interaction Detection) and C5.0 (a variant of C4.5) since logistic regression has assumed a major position in the healthcare field as a method for predicting or classifying health outcomes based on the specific characteristics of each individual case. This comparison was performed using the test set of 4,588 beneficiaries and the training set of 13,689 beneficiaries that were used to develop the models. On the contrary to the previous study CHAID algorithm performed better than logistic regression in predicting hypertension but C5.0 had the lowest predictive power. In addition CHAID algorithm and association rule also provided the segment characteristics for the risk factors that may be used in developing hypertension management programs. This showed that data mining approach can be a useful analytic tool for predicting and classifying health outcomes data.

  • PDF

A Stigmergy-and-Neighborhood Based Ant Algorithm for Clustering Data

  • Lee, Hee-Sang;Shim, Gyu-Seok
    • Management Science and Financial Engineering
    • /
    • v.15 no.1
    • /
    • pp.81-96
    • /
    • 2009
  • Data mining, specially clustering is one of exciting research areas for ant based algorithms. Ant clustering algorithm, however, has many difficulties for resolving practical situations in clustering. We propose a new grid-based ant colony algorithm for clustering of data. The previous ant based clustering algorithms usually tried to find the clusters during picking up or dropping down process of the items of ants using some stigmergy information. In our ant clustering algorithm we try to make the ants reflect neighborhood information within the storage nests. We use two ant classes, search ants and labor ants. In the initial step of the proposed algorithm, the search ants try to guide the characteristics of the storage nests. Then the labor ants try to classify the items using the guide in-formation that has set by the search ants and the stigmergy information that has set by other labor ants. In this procedure the clustering decision of ants is quickly guided and keeping out of from the stagnated process. We experimented and compared our algorithm with other known algorithms for the known and statistically-made data. From these experiments we prove that the suggested ant mining algorithm found the clusters quickly and effectively comparing with a known ant clustering algorithm.

Analysis of the Energy Consumption in Underfloor Air Distribution System depending on Outdoor Air Intake Rates (외기 도입에 따른 바닥급기 시스템의 에너지 사용량 분석)

  • Kim, Dong-Hee;Huh, Jung-Ho;Cho, Dong-Woo;Yu, Ki-Hyung;Yu, Ji-Yong
    • Proceedings of the SAREK Conference
    • /
    • 2006.06a
    • /
    • pp.826-831
    • /
    • 2006
  • In this paper, we discussed the energy performance of underfloor air distribution(UFAD) and overhead air distribution system according to outdoor air intake rates in a office building. For this, the laboratory(S lab.) is selected for measuring the thermal environments of UFAD system and overhead system. Based on the measured data, the TRNSYS simulation is used to evaluate the energy performance of UFAD system and the overhead system according to outdoor air intake rates. By increasing outdoor air intake rates from required outdoor air intake rates(100CMH) to maximum air intake rates, the energy savings of UFAD system comparing with overhead system are varied $15%{\sim}25.6%$ in summer, $12.8%{\sim}19%$ in fall/spring and not varied in winter(8%). As results of simulations on stratification height and cooling set temperature, the lower the stratification height and the higher cooling set temperature, the larger cooling energy savings of UFAD comparing with overhead system according to outdoor air intake rates.

  • PDF

A Classifier Capable of Handling Incomplete Data Set (불완전한 데이터를 처리할수 있는 분류기)

  • Lee, Jong-Chan;Lee, Won-Don
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.1
    • /
    • pp.53-62
    • /
    • 2010
  • This paper introduces a classification algorithm which can be applied to a learning problem with incomplete data sets, missing variable values or a class value. This algorithm uses a data expansion method which utilizes weighted values and probability techniques. It operates by extending a classifier which are considered to be in the optimal projection plane based on Fisher's formula. To do this, some equations are derived from the procedure to be applied to the data expansion. To evaluate the performance of the proposed algorithm, results of different measurements are iteratively compared by choosing one variable in the data set and then modifying the rate of missing and non-missing values in this selected variable. And objective evaluation of data sets can be achieved by comparing, the result of a data set with non-missing variable with that of C4.5 which is a known knowledge acquisition tool in machine learning.

Online Learning of Bayesian Network Parameters for Incomplete Data of Real World (현실 세계의 불완전한 데이타를 위한 베이지안 네트워크 파라메터의 온라인 학습)

  • Lim, Sung-Soo;Cho, Sung-Bae
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.33 no.12
    • /
    • pp.885-893
    • /
    • 2006
  • The Bayesian network(BN) has emerged in recent years as a powerful technique for handling uncertainty iii complex domains. Parameter learning of BN to find the most proper network from given data set has been investigated to decrease the time and effort for designing BN. Off-line learning needs much time and effort to gather the enough data and since there are uncertainties in real world, it is hard to get the complete data. In this paper, we propose an online learning method of Bayesian network parameters from incomplete data. It provides higher flexibility through learning from incomplete data and higher adaptability on environments through online learning. The results of comparison with Voting EM algorithm proposed by Cohen at el. confirm that the proposed method has the same performance in complete data set and higher performance in incomplete data set, comparing with Voting EM algorithm.