• 제목/요약/키워드: Classification Problem

검색결과 1,728건 처리시간 0.033초

Blackboard Scheduler Control Knowledge for Recursive Heuristic Classification

  • Park, Young-Tack
    • 지능정보연구
    • /
    • 제1권1호
    • /
    • pp.61-72
    • /
    • 1995
  • Dynamic and explicit ordering of strategies is a key process in modeling knowledge-level problem-solving behavior. This paper addressed the important problem of howl to make the scheduler more knowledge-intensive in a way that facilitates the acquisition, integration, and maintenance of the scheduler control knowledge. The solution a, pp.oach described in this paper involved formulating the scheduler task as a heuristic classification problem, and then implementing it as a classification expert system. By doing this, the wide spectrum of known methods of acquiring, refining, and maintaining the knowledge of a classification expert system are a, pp.icable to the scheduler control knowledge. One important innovation of this research is that of recursive heuristic classification : this paper demonstrates that it is possible to formulate and solve a key subcomponent of heuristic classification as heuristic classification problem. Another key innovation is the creation of a method of dynamic heuristic classification : the classification alternatives that are selected among are dynamically generated in real-time and then evidence is gathered for and aginst these alternatives. In contrast, the normal model of heuristic classification is that of structured selection between a set of preenumerated fixed alternatives.

  • PDF

Supervised Learning-Based Collaborative Filtering Using Market Basket Data for the Cold-Start Problem

  • Hwang, Wook-Yeon;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • 제13권4호
    • /
    • pp.421-431
    • /
    • 2014
  • The market basket data in the form of a binary user-item matrix or a binary item-user matrix can be modelled as a binary classification problem. The binary logistic regression approach tackles the binary classification problem, where principal components are predictor variables. If users or items are sparse in the training data, the binary classification problem can be considered as a cold-start problem. The binary logistic regression approach may not function appropriately if the principal components are inefficient for the cold-start problem. Assuming that the market basket data can also be considered as a special regression problem whose response is either 0 or 1, we propose three supervised learning approaches: random forest regression, random forest classification, and elastic net to tackle the cold-start problem, comparing the performance in a variety of experimental settings. The experimental results show that the proposed supervised learning approaches outperform the conventional approaches.

Contribution to Improve Database Classification Algorithms for Multi-Database Mining

  • Miloudi, Salim;Rahal, Sid Ahmed;Khiat, Salim
    • Journal of Information Processing Systems
    • /
    • 제14권3호
    • /
    • pp.709-726
    • /
    • 2018
  • Database classification is an important preprocessing step for the multi-database mining (MDM). In fact, when a multi-branch company needs to explore its distributed data for decision making, it is imperative to classify these multiple databases into similar clusters before analyzing the data. To search for the best classification of a set of n databases, existing algorithms generate from 1 to ($n^2-n$)/2 candidate classifications. Although each candidate classification is included in the next one (i.e., clusters in the current classification are subsets of clusters in the next classification), existing algorithms generate each classification independently, that is, without taking into account the use of clusters from the previous classification. Consequently, existing algorithms are time consuming, especially when the number of candidate classifications increases. To overcome the latter problem, we propose in this paper an efficient approach that represents the problem of classifying the multiple databases as a problem of identifying the connected components of an undirected weighted graph. Theoretical analysis and experiments on public databases confirm the efficiency of our algorithm against existing works and that it overcomes the problem of increase in the execution time.

Fuzzy SVM for Multi-Class Classification

  • 나은영;홍덕헌;황창하
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 한국데이터정보과학회 2003년도 추계학술대회
    • /
    • pp.123-123
    • /
    • 2003
  • More elaborated methods allowing the usage of binary classifiers for the resolution of multi-class classification problems are briefly presented. This way of using FSVC to learn a K-class classification problem consists in choosing the maximum applied to the outputs of K FSVC solving a one-per-class decomposition of the general problem.

  • PDF

Robust Face Recognition under Limited Training Sample Scenario using Linear Representation

  • Iqbal, Omer;Jadoon, Waqas;ur Rehman, Zia;Khan, Fiaz Gul;Nazir, Babar;Khan, Iftikhar Ahmed
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제12권7호
    • /
    • pp.3172-3193
    • /
    • 2018
  • Recently, several studies have shown that linear representation based approaches are very effective and efficient for image classification. One of these linear-representation-based approaches is the Collaborative representation (CR) method. The existing algorithms based on CR have two major problems that degrade their classification performance. First problem arises due to the limited number of available training samples. The large variations, caused by illumintion and expression changes, among query and training samples leads to poor classification performance. Second problem occurs when an image is partially noised (contiguous occlusion), as some part of the given image become corrupt the classification performance also degrades. We aim to extend the collaborative representation framework under limited training samples face recognition problem. Our proposed solution will generate virtual samples and intra-class variations from training data to model the variations effectively between query and training samples. For robust classification, the image patches have been utilized to compute representation to address partial occlusion as it leads to more accurate classification results. The proposed method computes representation based on local regions in the images as opposed to CR, which computes representation based on global solution involving entire images. Furthermore, the proposed solution also integrates the locality structure into CR, using Euclidian distance between the query and training samples. Intuitively, if the query sample can be represented by selecting its nearest neighbours, lie on a same linear subspace then the resulting representation will be more discriminate and accurately classify the query sample. Hence our proposed framework model the limited sample face recognition problem into sufficient training samples problem using virtual samples and intra-class variations, generated from training samples that will result in improved classification accuracy as evident from experimental results. Moreover, it compute representation based on local image patches for robust classification and is expected to greatly increase the classification performance for face recognition task.

아이다부스트(Adaboost)와 원형기반함수를 이용한 다중표적 분류 기법 (Multi-target Classification Method Based on Adaboost and Radial Basis Function)

  • 김재협;장경현;이준행;문영식
    • 전자공학회논문지CI
    • /
    • 제47권3호
    • /
    • pp.22-28
    • /
    • 2010
  • 최근 기계학습 분야에서 커널머신을 이용한 대표적 분류기로 Adaboost가 주목받고 있다. Adaboost는 통계적 학습이론에 기반하여 뛰어난 일반화 성능을 보여주며, 다양한 패턴인식 문제에 적용되고 있다. 그러나, Adaboost는 이진 분류기이므로 다중표적 분류 문제에 곧바로 적용할 수 없다. 일반적으로 다중 분류 문제를 해결하는 기법으로 One-Vs-All 기법과 Pair-Wise 기법이 대표적이다. 이러한 두 기법은 다중 분류 문제를 여러 개의 이진 분류 문제로 분할하고, 이들을 다시 종합하여 최종 결정을 내리는 출력코딩이라는 일반적인 기법으로 실제 시스템 구성에 적합할만한 분류 성능을 보여주지 못하는 경우가 대부분이다. 본 논문에서는 이진 분류기인 Adaboost의 다중 분류 확장 방안으로 원형 기반 함수를 약한 분류기로 이용하는 Adaboost 기반 다중표적 분류 기법을 제안한다.

Thinking Science 프로그램 중 분류활동이 초등학교 5학년 학생의 분류문제해결능력에 미치는 영향 (The Effect of the classification problem solving of Thinking Science Program on the Classified Activities on Elementary School 5th grade category)

  • 이성현;한신
    • 대한지구과학교육학회지
    • /
    • 제4권2호
    • /
    • pp.102-107
    • /
    • 2011
  • In this study, elementary school science program, this category did not affect any troubleshooting analyzed. Thinking Science Program to buy for them in group activities by using one of the elements of a program of treatment and cognitive level effects were two kinds of research questions. 102, 5th grade four classes were involved, these two classes of the experimental group and the remaining two classes were divided into a control group. Pre-test between the two groups is compared to the level and classification problem-solving skills but the skills did not show a statistically significant difference. Thinking Science activity after application of classification and posttest the experimental group than in the control group problem solving abilities of students classified at the level of statistical significance was higher. Thinking Science program is a treatment effect for each level of analysis, tests, regardless of cognitive level was more effective. Through theses findings, Thinking Science activities 5th grade category classification problem-solving skills of students found to be effective in improving and these types of programs actively introduced in the field suggests that we need to see.

Optimal bandwidth in nonparametric classification between two univariate densities

  • ;강기훈
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2002년도 춘계 학술발표회 논문집
    • /
    • pp.1-5
    • /
    • 2002
  • We consider the problem of optimal bandwidth choice for nonparametric classification, based on kernel density estimators, where the problem of interest is distinguishing between two univariate distributions. When the densities intersect at a single point, optimal bandwidth choice depends on curvatures of the densities at that point. The problem of empirical bandwidth selection and classifying data in the tails of a distribution are also addressed.

  • PDF

분석과 종합문제의 분류 기준에 대한 연구 -러시아 구세프의 수학교과서를 중심으로- (A study on the classification standards of the problem of analysis and synthesis)

  • 권영인;서보억
    • 한국수학교육학회지시리즈E:수학교육논문집
    • /
    • 제20권2호
    • /
    • pp.231-248
    • /
    • 2006
  • 수학적인 사고에는 여러가지 유형이 있는데 그 중에서 가장 기본이 되는 사고유형이 분석과 종합이다. 이러한 사고유형을 담고 있는 분석과 종합문제는 수학문제의 가장 중요한 근간이라고 할 수 있다. 이러한 분석과 종합문제에 대한 체계적인 분석과 구체적인 논의를 위해 러시아의 구세프 교수가 편찬한 수학교과서를 중심으로 이러한 두 가지 유형의 문제에 대한 분석을 실시한다. 또한, 역사적으로 제시되어진 분석과 종합에 대한 다양한 문헌연구를 통해 두 가지 사고유형의 분류에 대한 기본적인 기준을 설정하고, 이를 바탕으로 분석문제와 종합문제에 대한 구체적인 분류기준을 제시하고자 한다.

  • PDF

Negative Selection Algorithm for DNA Pattern Classification

  • Lee, Dong-Wook;Sim, Kwee-Bo
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2004년도 ICCAS
    • /
    • pp.190-195
    • /
    • 2004
  • We propose a pattern classification algorithm using self-nonself discrimination principle of immune cells and apply it to DNA pattern classification problem. Pattern classification problem in bioinformatics is very important and frequent one. In this paper, we propose a classification algorithm based on the negative selection of the immune system to classify DNA patterns. The negative selection is the process to determine an antigenic receptor that recognize antigens, nonself cells. The immune cells use this antigen receptor to judge whether a self or not. If one composes ${\eta}$ groups of antigenic receptor for ${\eta}$ different patterns, these receptor groups can classify into ${\eta}$ patterns. We propose a pattern classification algorithm based on the negative selection in nucleotide base level and amino acid level. Also to show the validity of our algorithm, experimental results of RNA group classification are presented.

  • PDF