• Title/Abstract/Keywords: Supervised learning methods


Keyphrase Extraction Using Active Learning and Clustering

  • 이현우;차정원
    • 대한음성학회지:말소리 / No. 66 / pp.87-103 / 2008
  • We describe a new active learning method in the conditional random fields (CRFs) framework for keyphrase extraction. To reduce annotation effort, we use diversity and representativeness measures. We select highly diverse training candidates by sentence confidence value, and highly representative candidates by clustering the part-of-speech patterns of contexts. In experiments on a dialog corpus, our method achieves 86.80% and saves 88% of the training corpus compared with the supervised method. The experimental results show that the proposed method outperforms previous methods. Additionally, the proposed method can be applied easily to other applications, since its implementation is independent of the application.
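
The selection idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: grouping by identical part-of-speech pattern stands in for their clustering, picking the lowest-confidence sentence per group is an assumed reading of the confidence criterion, and the function name `select_for_annotation` and the tuple layout are hypothetical.

```python
from collections import defaultdict


def select_for_annotation(candidates, budget):
    """Pick up to `budget` sentence ids to annotate next.

    candidates: list of (sentence_id, confidence, pos_pattern) tuples, where
    confidence is the model's score for its own labeling of the sentence and
    pos_pattern is a hashable part-of-speech pattern of the context.
    """
    # Group sentences that share a POS pattern (a crude stand-in for clustering).
    clusters = defaultdict(list)
    for sentence_id, confidence, pos_pattern in candidates:
        clusters[pos_pattern].append((confidence, sentence_id))

    # From each group keep the sentence the model is least sure about.
    representatives = [min(members) for members in clusters.values()]

    # Annotate the least confident representatives first.
    representatives.sort()
    return [sentence_id for _, sentence_id in representatives[:budget]]
```

Taking one sentence per group keeps the selected batch from being dominated by near-duplicate contexts, which is the intuition behind combining a confidence measure with a representativeness measure.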


Classifications of Hadiths based on Supervised Learning Techniques

  • AbdElaal, Hammam M.;Bouallegue, Belgacem;Elshourbagy, Motasem;Matter, Safaa S.;AbdElghfar, Hany A.;Khattab, Mahmoud M.;Ahmed, Abdelmoty M.
    • International Journal of Computer Science & Network Security / Vol. 22, No. 11 / pp.1-10 / 2022
  • This study aims to build a model capable of classifying hadiths according to the reliability of their narrators (sahih, hassan, da'if, maudu) and according to what was attributed to the Prophet Muhammad (saying, doing, describing, reporting) using supervised learning algorithms. Based on the outputs of this model, it also seeks to discover a relationship between these classifications using statistical methods such as chi-square, information gain, and association rules, which might help avoid controversy and unproductive debate over the automatic classification of hadiths. The experimental results showed that there is a relation between these classifications: most sahih hadiths belong to the saying class, and most maudu hadiths belong to the reporting class. The most accurate classifier was MultinomialNB, which reached an accuracy of up to 0.9708, owing to its ability to handle high-dimensional problems and to identify the features most relevant to the target data in the training stage. It was followed by LinearSVC, which reached up to 0.9655, and finally KNeighborsClassifier, which reached up to 0.9644.
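
As a rough illustration of how a chi-square statistic can test for a relationship between two labelings of the same items, the sketch below computes Pearson's chi-square from two parallel label lists. The function name `chi_square_statistic` is hypothetical, and this shows the general technique rather than the study's own pipeline.

```python
from collections import Counter


def chi_square_statistic(labels_a, labels_b):
    """Pearson chi-square statistic for two parallel lists of category labels."""
    n = len(labels_a)
    observed = Counter(zip(labels_a, labels_b))  # joint counts per (a, b) cell
    totals_a = Counter(labels_a)                 # row totals
    totals_b = Counter(labels_b)                 # column totals

    statistic = 0.0
    for a, count_a in totals_a.items():
        for b, count_b in totals_b.items():
            # Count expected in this cell if the two labelings were independent.
            expected = count_a * count_b / n
            statistic += (observed[(a, b)] - expected) ** 2 / expected
    return statistic
```

A value near zero is consistent with independent labelings; larger values indicate that knowing one label says something about the other.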

Sensorless Speed Control of Direct Current Motor by Neural Network

  • 김종수;강성주
    • 한국정보통신학회논문지 / Vol. 7, No. 8 / pp.1743-1750 / 2003
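  • Recently, efforts have been made to drive motors using sensorless speed estimation methods that are accurate and reliable. This paper presents the results of a study that realizes sensorless speed control of a DC motor using a neural network, which is highly robust to disturbances [6-8]. Just as the human brain learns through experience, a neural network learns to produce the optimal output for a given input. Training is carried out with the backpropagation learning algorithm [8], using the voltage, current, and rotor speed obtained from a mathematical model of the DC motor as input and output data; once training is complete, the speed is estimated using the resulting optimal connection weights. The neural-network-based approach enables accurate speed estimation without complex algorithms. It also resolves known problems of DC motors, namely the degradation of motor performance caused by heating of the rotor winding and the resulting difficulty of speed control, so that it provides control characteristics that are robust to disturbances arising from operating conditions and yields excellent speed response over the entire speed range.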

An Improvement of LVQ3 Learning Using SVM

  • 김상운
    • 대한전자공학회:학술대회논문집 / 대한전자공학회 2001년도 하계종합학술대회 논문집(3) / pp.9-12 / 2001
  • Learning vector quantization (LVQ) is a supervised learning technique that uses class information to move the vector quantizer slightly, so as to improve the quality of the classifier decision regions. In this paper we propose a method for selecting the initial codebook vectors of a learning vector quantization (LVQ3) using support vector machines (SVM). The method is tested on artificial and real design data sets and compared with the conventional condensed nearest neighbor (CNN) method and its modifications (mCNN). The experiments show that the proposed method performs better than the conventional ones and could therefore be used efficiently for designing nonparametric classifiers.
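
The phrase "uses class information to move the vector quantizer slightly" can be made concrete with a minimal sketch of the basic LVQ1 update. Note that this is LVQ1, not the LVQ3 variant the paper studies, and it does not cover the SVM-based initialization; the function name `lvq1_step` and the default learning rate are assumptions for illustration.

```python
def lvq1_step(codebook, labels, x, y, learning_rate=0.1):
    """Apply one LVQ1 update in place and return the index of the winner.

    codebook: list of codebook vectors (lists of floats)
    labels:   class label of each codebook vector
    x, y:     one training sample and its class label
    """
    def squared_distance(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))

    # The winner is the codebook vector closest to the sample.
    winner = min(range(len(codebook)),
                 key=lambda i: squared_distance(codebook[i], x))

    # Pull the winner toward the sample if the classes match, push it away otherwise.
    sign = 1.0 if labels[winner] == y else -1.0
    codebook[winner] = [w + sign * learning_rate * (xi - w)
                        for w, xi in zip(codebook[winner], x)]
    return winner
```

Because only the winning vector moves, the quality of the initial codebook strongly influences the final decision regions, which is why initialization is a natural target for improvement.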


A Neural Network for Concept Learning: Recognitron

  • 이기한;황희융;김춘석
    • 대한전기학회:학술대회논문집 / 대한전기학회 1989년도 하계종합학술대회 논문집 / pp.495-499 / 1989
  • A concept is the set of selected neurons in a stable state of a neural network. The Recognitron uses a parallel feedback structure to support concept learning. A number of clusters can exist in response to a given input, each of which makes up a selective neuron. There are supervised and unsupervised learning methods in concept learning. In this paper, we have chosen unsupervised learning. Also, a new concept called relaxational learning has been introduced to stop runaway weights.


GAN-based Data Augmentation Methods for Topology Optimization

  • 이승혜;이유진;이기학;이재홍
    • 한국공간구조학회논문집 / Vol. 21, No. 4 / pp.39-48 / 2021
  • In this paper, a GAN-based data augmentation method is proposed for topology optimization. In machine learning, the total amount of data determines the accuracy and robustness of the trained neural network architectures, especially supervised learning networks. Because insufficient data tends to lead to overfitting or underfitting of the architectures, a data augmentation method is needed to increase the amount of data and reduce overfitting when training a machine learning model. In this study, the Generative Adversarial Network (GAN) is used to augment the topology optimization dataset. The produced dataset has been compared with the original dataset.

The Investigation of Employing Supervised Machine Learning Models to Predict Type 2 Diabetes Among Adults

  • Alhmiedat, Tareq;Alotaibi, Mohammed
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 16, No. 9 / pp.2904-2926 / 2022
  • Currently, diabetes is the most common chronic disease in the world, affecting 23.7% of the population in the Kingdom of Saudi Arabia. Diabetes may be the cause of lower-limb amputations, kidney failure and blindness among adults. Therefore, diagnosing the disease in its early stages is essential in order to save human lives. With the revolution in technology, Artificial Intelligence (AI) could play a central role in the early prediction of diabetes by employing Machine Learning (ML) technology. In this paper, we developed a diagnosis system using machine learning models for the detection of type 2 diabetes among adults, adopting two different diabetes datasets, one for training and the other for testing, to analyze and enhance the prediction accuracy. This work offers enhanced classification accuracy as a result of employing several pre-processing methods before applying the ML models. According to the obtained results, the implemented Random Forest (RF) classifier offers the best classification accuracy, with a classification score of 98.95%.

Naive Bayes Approach in Kernel Density Estimation

  • 샹총량;유샹루;아메드 압둘하킴 알-압시;강대기
    • 한국정보통신학회:학술대회논문집 / 한국정보통신학회 2014년도 춘계학술대회 / pp.76-78 / 2014
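  • Naive Bayes learning is a popular, fast, and effective supervised learning method that performs well on labeled datasets containing some noise. However, the conditional independence assumption of naive Bayes somewhat restricts its ability to handle real-world data. Researchers have proposed methods to relax this conditional independence assumption, including attribute weighting and kernel density estimation. In this paper, we propose a new approach called NB Based on Attribute Weighting in Kernel Density Estimation (NBAWKDE), which uses kernel density estimation and attribute weighting to improve the learning performance of naive Bayes.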
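
One way to combine the two ingredients named in this abstract is sketched below: each class-conditional attribute density is estimated with a Gaussian kernel, and each attribute's log-density is scaled by a weight. This is an assumed, generic formulation, not the NBAWKDE algorithm itself, whose details the abstract does not give; the function names, the fixed bandwidth, and the weighting-by-exponent scheme are all illustrative choices.

```python
import math


def kde(value, samples, bandwidth):
    """Gaussian kernel density estimate at `value` from 1-D `samples`."""
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))
    return norm * sum(math.exp(-0.5 * ((value - s) / bandwidth) ** 2)
                      for s in samples)


def predict(x, training, weights=None, bandwidth=1.0):
    """Return the most probable class for feature vector `x`.

    training: dict mapping class label -> list of feature vectors
    weights:  optional per-attribute weights (assumed to act as exponents
              on each attribute's likelihood); defaults to all ones
    """
    weights = weights or [1.0] * len(x)
    total = sum(len(rows) for rows in training.values())
    best_label, best_score = None, -math.inf
    for label, rows in training.items():
        # Start from the log prior of the class.
        score = math.log(len(rows) / total)
        for j, (value, weight) in enumerate(zip(x, weights)):
            density = kde(value, [row[j] for row in rows], bandwidth)
            # A tiny constant guards against log(0) for far-away points.
            score += weight * math.log(density + 1e-300)
        if score > best_score:
            best_label, best_score = label, score
    return best_label
```

With all weights equal to one this reduces to naive Bayes with kernel density estimates; setting a weight to zero removes that attribute's influence entirely.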


Microblog Sentiment Analysis Method Based on Spectral Clustering

  • Dong, Shi;Zhang, Xingang;Li, Ya
    • Journal of Information Processing Systems / Vol. 14, No. 3 / pp.727-739 / 2018
  • This study evaluates the viewpoints of user focus incidents using microblog sentiment analysis, which has been actively researched in academia. Most existing works have adopted traditional supervised machine learning methods to analyze emotions in microblogs; however, these approaches may not be suitable for Chinese due to linguistic differences. This paper proposes a new microblog sentiment analysis method that mines associated microblog emotions based on a popular microblog through user-building combined with spectral clustering to analyze microblog content. Experimental results on a public microblog benchmark corpus show that the proposed method can improve identification accuracy and reduce manual labeling time compared to existing methods.

An Efficient Data Augmentation for 3D Medical Image Segmentation

  • 박상근
    • 융복합기술연구소 논문집 / Vol. 11, No. 1 / pp.1-5 / 2021
  • Deep learning based methods achieve state-of-the-art accuracy; however, they typically rely on supervised training with large labeled datasets. In many medical applications, labeling medical images is known to require significant expertise and much time, and typical hand-tuned approaches to data augmentation fail to capture the complex variations in such images. This paper proposes a 3D image augmentation method to overcome these difficulties. It enriches the diversity of training data samples, which is essential in medical image segmentation tasks, and thus reduces the overfitting caused by the typically small scale of medical image datasets. Our numerical experiments demonstrate that the proposed approach provides significant improvements over state-of-the-art methods for 3D medical image segmentation.