• Title/Summary/Keyword: Data classification

Search Result 7,984, Processing Time 0.042 seconds

The New Criterion of Classification System for Data Linkage (자료 연계성을 고려한 차종 분류 기준의 제시)

  • Kim, Yun-Seob;Oh, Ju-Sam;Kim, Hyun-Seok
    • International Journal of Highway Engineering
    • /
    • v.7 no.4 s.26
    • /
    • pp.57-68
    • /
    • 2005
  • Vehicle classification system in Korea is operated by two different types depending on operating purpose and place. 8-category classification system operates in Expressway and Provincial road, and 11-category classification system operates in National highway. These different operations decrease the efficiency of practical use of gathering data. Therefore, this study proposes new-modified vehicle classification system for solving this problem. For classification, this study not only focuses on mechanic survey system which is based on vehicle specs, it's also focuses on the applicability of roadside survey. This proposed classification system considers the tendency to vary of vehicle types, and the compatibility with the other classification systems. This system might be the most suitable system for our present situation.

  • PDF

Contribution to Improve Database Classification Algorithms for Multi-Database Mining

  • Miloudi, Salim;Rahal, Sid Ahmed;Khiat, Salim
    • Journal of Information Processing Systems
    • /
    • v.14 no.3
    • /
    • pp.709-726
    • /
    • 2018
  • Database classification is an important preprocessing step for the multi-database mining (MDM). In fact, when a multi-branch company needs to explore its distributed data for decision making, it is imperative to classify these multiple databases into similar clusters before analyzing the data. To search for the best classification of a set of n databases, existing algorithms generate from 1 to ($n^2-n$)/2 candidate classifications. Although each candidate classification is included in the next one (i.e., clusters in the current classification are subsets of clusters in the next classification), existing algorithms generate each classification independently, that is, without taking into account the use of clusters from the previous classification. Consequently, existing algorithms are time consuming, especially when the number of candidate classifications increases. To overcome the latter problem, we propose in this paper an efficient approach that represents the problem of classifying the multiple databases as a problem of identifying the connected components of an undirected weighted graph. Theoretical analysis and experiments on public databases confirm the efficiency of our algorithm against existing works and that it overcomes the problem of increase in the execution time.

Human activity classification using Neural Network

  • Sharma, Annapurna;Lee, Young-Dong;Chung, Wan-Young
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.05a
    • /
    • pp.229-232
    • /
    • 2008
  • A Neural network classification of human activity data is presented. The data acquisition system involves a tri-axial accelerometer in wireless sensor network environment. The wireless ad-hoc system has the advantage of small size, convenience for wearability and cost effectiveness. The system can further improve the range of user mobility with the inclusion of ad-hoc environment. The classification is based on the frequencies of the involved activities. The most significant Fast Fourier coefficients, of the acceleration of the body movement, are used for classification of the daily activities like, Rest walk and Run. A supervised learning approach is used. The work presents classification accuracy with the available fast batch training algorithms i.e. Levenberg-Marquardt and Resilient back propagation scheme is used for training and calculation of accuracy.

  • PDF

Discovering classification knowledge using Rough Set and Granular Computing (러프집합과 Granular Computing을 이용한 분류지식 발견)

  • Choi, Sang-Chul;Lee, Chul-Heui
    • Proceedings of the KIEE Conference
    • /
    • 2000.11d
    • /
    • pp.672-674
    • /
    • 2000
  • There are various ways in classification methodologies of data mining such as neural networks but the result should be explicit and understandable and the classification rules be short and clear. Rough set theory is a effective technique in extracting knowledge from incomplete and inconsistent information and makes an offer classification and approximation by various attributes with effect. This paper discusses granularity of knowledge for reasoning of uncertain concepts by using generalized rough set approximations based on hierarchical granulation structure and uses hierarchical classification methodology that is more effective technique for classification by applying core to upper level. The consistency rules with minimal attributes is discovered and applied to classifying real data.

  • PDF

An Automatic Document Classification with Bayesian Learning (베이지안 학습을 이용한 문서의 자동분류)

  • Kim, Jin-Sang;Shin, Yang-Kyu
    • Journal of the Korean Data and Information Science Society
    • /
    • v.11 no.1
    • /
    • pp.19-30
    • /
    • 2000
  • As the number of online documents increases enormously with the expansion of information technology, the importance of automatic document classification is greatly enlarged. In this paper, an automatic document classification method is investigated and applied to UseNet 20 newsgroup articles to test its efficacy. The classification system uses Naive Bayes classification algorithm and the experimental result shows that a randomly selected newsgroup arcicle can be classified into its own category over 77% accuracy.

  • PDF

Design and Implementation of Intelligent Agent System for Pattern Classification

  • Kim, Dae-su;Park, Ji-hoon;Chang, Jae-khun;Na, Guen-sik
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.7
    • /
    • pp.598-602
    • /
    • 2001
  • Recently, due to the widely use of personal computers and internet, many computer users requested intelligent system that can cope with various types of requirements and user-friendly interfaces. Based on this background, researches on the intelligent agent are now activating in various fields. In this paper, we modeled, designed and implemented an intelligent agent system for pattern classification by adopting intelligent agent concepts. We also investigated the pattern classification method by utilizing some pattern classification algorithms for the common data. As a result, we identified that 300 3-dimensional data are applied to three pattern classification algorithms and returned correct results. Our system showed a distinguished user-friendly interface feature by adopting various agents including graphic agent.

  • PDF

Development of e-Mail Classifiers for e-Mail Response Management Systems (전자메일 자동관리 시스템을 위한 전자메일 분류기의 개발)

  • Kim, Kuk-Pyo;Kwon, Young-S.
    • Journal of Information Technology Services
    • /
    • v.2 no.2
    • /
    • pp.87-95
    • /
    • 2003
  • With the increasing proliferation of World Wide Web, electronic mail systems have become very widely used communication tools. Researches on e-mail classification have been very important in that e-mail classification system is a major engine for e-mail response management systems which mine unstructured e-mail messages and automatically categorize them. in this research we develop e-mail classifiers for e-mail Response Management Systems (ERMS) using naive bayesian learning and centroid-based classification. We analyze which method performs better under which conditions, comparing classification accuracies which may depend on the structure, the size of training data set and number of classes, using the different data set of an on-line shopping mall and a credit card company. The developed e-mail classifiers have been successfully implemented in practice. The experimental results show that naive bayesian learning performs better, while centroid-based classification is more robust in terms of classification accuracy.

Fuzzy Classification Rule Learning by Decision Tree Induction

  • Lee, Keon-Myung;Kim, Hak-Joon
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.3 no.1
    • /
    • pp.44-51
    • /
    • 2003
  • Knowledge acquisition is a bottleneck in knowledge-based system implementation. Decision tree induction is a useful machine learning approach for extracting classification knowledge from a set of training examples. Many real-world data contain fuzziness due to observation error, uncertainty, subjective judgement, and so on. To cope with this problem of real-world data, there have been some works on fuzzy classification rule learning. This paper makes a survey for the kinds of fuzzy classification rules. In addition, it presents a fuzzy classification rule learning method based on decision tree induction, and shows some experiment results for the method.

Design of One-Class Classifier Using Hyper-Rectangles (Hyper-Rectangles를 이용한 단일 분류기 설계)

  • Jeong, In Kyo;Choi, Jin Young
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.41 no.5
    • /
    • pp.439-446
    • /
    • 2015
  • Recently, the importance of one-class classification problem is more increasing. However, most of existing algorithms have the limitation on providing the information that effects on the prediction of the target value. Motivated by this remark, in this paper, we suggest an efficient one-class classifier using hyper-rectangles (H-RTGLs) that can be produced from intervals including observations. Specifically, we generate intervals for each feature and integrate them. For generating intervals, we consider two approaches : (i) interval merging and (ii) clustering. We evaluate the performance of the suggested methods by computing classification accuracy using area under the roc curve and compare them with other one-class classification algorithms using four datasets from UCI repository. Since H-RTGLs constructed for a given data set enable classification factors to be visible, we can discern which features effect on the classification result and extract patterns that a data set originally has.

Image classification methods applicable multiple satellite imagery

  • Jeong, Jae-Jun;Kim, Kyung-Ok;Lee, Jong-Hun
    • Proceedings of the KSRS Conference
    • /
    • 2002.10a
    • /
    • pp.81-81
    • /
    • 2002
  • Classification is considered as one of the processes of extracting attributes from satellite imagery and is one of the usual functions in the commercial satellite image processing software. Accuracy of classification plays a key role in deciding the usage of its results. Many tremendous efforts far the higher accuracy have been done in such fields; training area selection, classification algorithm. Our research is one of these effort in different manners. In this research, we conduct classification using multiple satellite image data and evidential approach. We statistically consider the posterior probabilities and certainty in maximum likelihood classification and methodologically Dempster's orthogonal sums. Unfortunately, accuracy for the whole data sets has not assessed yet, but accuracy assessments in training fields and check fields shows accuracy improvement over 10% in overall accuracy and over 0.1 in kappa index.

  • PDF