• 제목/요약/키워드: Classification Problem

검색결과 1,732건 처리시간 0.033초

Multi-Label Classification Approach to Location Prediction

  • Lee, Min Sung
    • 한국컴퓨터정보학회논문지
    • /
    • 제22권10호
    • /
    • pp.121-128
    • /
    • 2017
  • In this paper, we propose a multi-label classification method in which multi-label classification estimation techniques are applied to resolving location prediction problem. Most of previous studies related to location prediction have focused on the use of single-label classification by using contextual information such as user's movement paths, demographic information, etc. However, in this paper, we focused on the case where users are free to visit multiple locations, forcing decision-makers to use multi-labeled dataset. By using 2373 contextual dataset which was compiled from college students, we have obtained the best results with classifiers such as bagging, random subspace, and decision tree with the multi-label classification estimation methods like binary relevance(BR), binary pairwise classification (PW).

Optimizing artificial neural network architectures for enhanced soil type classification

  • Yaren Aydin;Gebrail Bekdas;Umit Isikdag;Sinan Melih Nigdeli;Zong Woo Geem
    • Geomechanics and Engineering
    • /
    • 제37권3호
    • /
    • pp.263-277
    • /
    • 2024
  • Artificial Neural Networks (ANNs) are artificial learning algorithms that provide successful results in solving many machine learning problems such as classification, prediction, object detection, object segmentation, image and video classification. There is an increasing number of studies that use ANNs as a prediction tool in soil classification. The aim of this research was to understand the role of hyperparameter optimization in enhancing the accuracy of ANNs for soil type classification. The research results has shown that the hyperparameter optimization and hyperparamter optimized ANNs can be utilized as an efficient mechanism for increasing the estimation accuracy for this problem. It is observed that the developed hyperparameter tool (HyperNetExplorer) that is utilizing the Covariance Matrix Adaptation Evolution Strategy (CMAES), Genetic Algorithm (GA) and Jaya Algorithm (JA) optimization techniques can be successfully used for the discovery of hyperparameter optimized ANNs, which can accomplish soil classification with 100% accuracy.

Use of Word Clustering to Improve Emotion Recognition from Short Text

  • Yuan, Shuai;Huang, Huan;Wu, Linjing
    • Journal of Computing Science and Engineering
    • /
    • 제10권4호
    • /
    • pp.103-110
    • /
    • 2016
  • Emotion recognition is an important component of affective computing, and is significant in the implementation of natural and friendly human-computer interaction. An effective approach to recognizing emotion from text is based on a machine learning technique, which deals with emotion recognition as a classification problem. However, in emotion recognition, the texts involved are usually very short, leaving a very large, sparse feature space, which decreases the performance of emotion classification. This paper proposes to resolve the problem of feature sparseness, and largely improve the emotion recognition performance from short texts by doing the following: representing short texts with word cluster features, offering a novel word clustering algorithm, and using a new feature weighting scheme. Emotion classification experiments were performed with different features and weighting schemes on a publicly available dataset. The experimental results suggest that the word cluster features and the proposed weighting scheme can partly resolve problems with feature sparseness and emotion recognition performance.

하이웨이 네트워크 기반 CNN 모델링 및 사전 외 어휘 처리 기술을 활용한 악성 댓글 분류 연구 (A Study on the Toxic Comments Classification Using CNN Modeling with Highway Network and OOV Process)

  • 이현상;이희준;오세환
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제29권3호
    • /
    • pp.103-117
    • /
    • 2020
  • Purpose Recently, various issues related to toxic comments on web portal sites and SNS are becoming a major social problem. Toxic comments can threaten Internet users in the type of defamation, personal attacks, and invasion of privacy. Over past few years, academia and industry have been conducting research in various ways to solve this problem. The purpose of this study is to develop the deep learning modeling for toxic comments classification. Design/methodology/approach This study analyzed 7,878 internet news comments through CNN classification modeling based on Highway Network and OOV process. Findings The bias and hate expressions of toxic comments were classified into three classes, and achieved 67.49% of the weighted f1 score. In terms of weighted f1 score performance level, this was superior to approximate 50~60% of the previous studies.

Genetic Algorithm Application to Machine Learning

  • Han, Myung-mook;Lee, Yill-byung
    • 한국지능시스템학회논문지
    • /
    • 제11권7호
    • /
    • pp.633-640
    • /
    • 2001
  • In this paper we examine the machine learning issues raised by the domain of the Intrusion Detection Systems(IDS), which have difficulty successfully classifying intruders. There systems also require a significant amount of computational overhead making it difficult to create robust real-time IDS. Machine learning techniques can reduce the human effort required to build these systems and can improve their performance. Genetic algorithms are used to improve the performance of search problems, while data mining has been used for data analysis. Data Mining is the exploration and analysis of large quantities of data to discover meaningful patterns and rules. Among the tasks for data mining, we concentrate the classification task. Since classification is the basic element of human way of thinking, it is a well-studied problem in a wide variety of application. In this paper, we propose a classifier system based on genetic algorithm, and the proposed system is evaluated by applying it to IDS problem related to classification task in data mining. We report our experiments in using these method on KDD audit data.

  • PDF

Using Artificial Neural Networks to detect Variance Change Point for Data Separation

  • 한영철;오경주;김태윤
    • 한국경영과학회:학술대회논문집
    • /
    • 대한산업공학회/한국경영과학회 2006년도 춘계공동학술대회 논문집
    • /
    • pp.1214-1220
    • /
    • 2006
  • In this article, it will be shown that a nonparametric and data-adaptive approach to the variance change point (VCP) detection problem is possible by formulating it as a pattern classification problem. Technical aspects of the VCP detector are discussed, which include its training strategy and selection of proper classification tool.

  • PDF

연관분석을 이용한 효과적인 표절검사 및 문서분류에 관한 연구 (A Study on Plagiarism Detection and Document Classification Using Association Analysis)

  • 황인수
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제23권3호
    • /
    • pp.127-142
    • /
    • 2014
  • Plagiarism occurs when the content is copied without permission or citation, and the problem of plagiarism has rapidly increased because of the digital era of resources available on the World Wide Web. An important task in plagiarism detection is measuring and determining similar text portions between a given pair of documents. One of the main difficulties of this task is that not all similar text fragments are examples of plagiarism, since thematic coincidences also tend to produce portions of similar text. In order to handle this problem, this paper proposed association analysis in data mining to detect plagiarism. This method is able to detect common actions performed by plagiarists such as word deletion, insertion and transposition, allowing to obtain plausible portions of plagiarized text. Experimental results employing an unsupervised document classification strategy showed that the proposed method outperformed traditionally used approaches.

A Recommendation System using Dynamic Profiles and Relative Quantification

  • Lee, Se-Il;Lee, Sang-Yong
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제7권3호
    • /
    • pp.165-170
    • /
    • 2007
  • Recommendation systems provide users with proper services using context information being input from many sensors occasionally under ubiquitous computing environment. But in case there isn't sufficient context information for service recommendation in spite of much context information, there can be problems of resulting in inexact result. In addition, in the quantification step to use context information, there are problems of classifying context information inexactly because of using an absolute classification course. In this paper, we solved the problem of lack of necessary context information for service recommendation by using dynamic profile information. We also improved the problem of absolute classification by using a relative classification of context information in quantification step. As the result of experiments, expectation preference degree was improved by 7.5% as compared with collaborative filtering methods using an absolute quantification method where context information of P2P mobile agent is used.

신경망의 결정론적 이완에 의한 자기공명영상 분류 (Classification of Magnetic Resonance Imagery Using Deterministic Relaxation of Neural Network)

  • 전준철;민경필;권수일
    • Investigative Magnetic Resonance Imaging
    • /
    • 제6권2호
    • /
    • pp.137-146
    • /
    • 2002
  • 목적: 본 논문에서는 신경망을 이용한 자기공명영상의 분류에 있어 결정론적 이완 방법(deterministic relaxation)과 응집 군집화(agglomerative clustering) 방법에 의한 개선된 영상 분류방법을 제시한다. 제안된 방법은 신경망을 이용한 영상의 분류시 지역적 최소치로의 수렴문제와 입력 패턴의 증대로 인하여 수렴 속가 늦어지는 문제를 해결한다. 대상 및 방법: 신경망을 이용한 영상의 분류는 지역적 계산과 병렬 계산이 가능한 특성을 갖고 있어 기존의 통계적 방법을 대신하는 방법으로 주목을 받고 있다. 그러나 일반적으로 신경망에 의한 분류알고리즘이 지닌 문제점의 하나는 에너지함수가 항상 전역적 최소치로 수렴하지 않고 지역적 최소치로도 수렴할 수 있다는 점이고, 또 다른 문제점은 반복수렴을 수행하는 에너지함수의 수렴속도가 너무 늦다는 점이다. 따라서 지역적 최소치로의 수렴을 방지하고 전역적 최소치로의 수렴속도를 가속화시키기 위하여 본 논문에서는 결정적 이완 알고리즘의 하나인 MFA(Mean Field Annealing) 방법을 적용하여 지역적 최소치로의 수렴문제를 해결하는 방법을 제시한다. MFA는 모의 애닐링의 통계적 성질을 변수의 평균값에 적용하는 결정론적인 수정 법칙들로 대신하고, 이러한 평균값을 최소화함으로서 수렴속도를 개선한 방법이다 아울러 신경망이 갖고 있는 문제점인 과다한 클래스 패턴의 생성에 따른 처리속도 지연의 문제점을 해결하기 위하여 응집 군집화 알고리즘을 이용하여 영상을 구성하는 군집을 결정하여 신경망에 입력되는 값을 초기화하여 영상패턴이 증가되는 것을 제한하였다. 결과: 본 논문에서 제시된 응집 군집화 방법 및 결정론적 이완 방법은 신경망에 의한 자기공명영상의 분류 시 발생할 수 있는 지역적 최적 치로의 수렴 문제를 해결하여 전역적 최적화로 신속히 수렴함을 알 수 있었다. 결론: 본 논문에서는 클러스터의 분석과 결정론적 이완 방법에 의하여 신경망에 의한 자기공명영상의 분류결과를 향상시키기 위한 새로운 방법을 소개하였으며 실험결과를 통하여 그러한 사실을 확인할 수 있었다.

  • PDF

지지벡터기계를 이용한 단어 의미 분류 (Word Sense Classification Using Support Vector Machines)

  • 박준혁;이성욱
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제5권11호
    • /
    • pp.563-568
    • /
    • 2016
  • 단어 의미 분별 문제는 문장에서 어떤 단어가 사전에 가지고 있는 여러 가지 의미 중 정확한 의미를 파악하는 문제이다. 우리는 이 문제를 다중 클래스 분류 문제로 간주하고 지지벡터기계를 이용하여 분류한다. 세종 의미 부착 말뭉치에서 추출한 의미 중의성 단어의 문맥 단어를 두 가지 벡터 공간에 표현한다. 첫 번째는 문맥 단어들로 이뤄진 벡터 공간이고 이진 가중치를 사용한다. 두 번째는 문맥 단어의 윈도우 크기에 따라 문맥 단어를 단어 임베딩 모델로 사상한 벡터 공간이다. 실험결과, 문맥 단어 벡터를 사용하였을 때 약 87.0%, 단어 임베딩을 사용하였을 때 약 86.0%의 정확도를 얻었다.