• Title/Summary/Keyword: Automatic Classifier

Search Result 203, Processing Time 0.022 seconds

Classifications of Hadiths based on Supervised Learning Techniques

  • AbdElaal, Hammam M.;Bouallegue, Belgacem;Elshourbagy, Motasem;Matter, Safaa S.;AbdElghfar, Hany A.;Khattab, Mahmoud M.;Ahmed, Abdelmoty M.
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.11
    • /
    • pp.1-10
    • /
    • 2022
  • This study aims to build a model is capable of classifying the categories of hadith, according to the reliability of hadith' narrators (sahih, hassan, da'if, maudu) and according to what was attributed to the Prophet Muhammad (saying, doing, describing, reporting ) using the supervised learning algorithms, with a view to discover a relationship between these classifications, based on the outputs of this model, which might be useful to avoid the controversy and useless debate on automatic classifications of hadith, using some of the statistical methods such as chi-square, information gain and association rules. The experimental results showed that there is a relation between these classifications, most of Sahih hadiths are belong to saying class, and most of maudu hadiths are belong to reporting class. Also the best classifier had given high accuracy was MultinomialNB, it achieved higher accuracy reached up to 0.9708 %, for his ability to process high dimensional problems and identifying the most important features that are relevant to target data in training stage. Followed by LinearSVC classifier, reached up to 0.9655, and finally, KNeighborsClassifier reached up to 0.9644.

Automatic Classification Method for Time-Series Image Data using Reference Map (Reference Map을 이용한 시계열 image data의 자동분류법)

  • Hong, Sun-Pyo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.58-65
    • /
    • 1997
  • A new automatic classification method with high and stable accuracy for time-series image data is presented in this paper. This method is based on prior condition that a classified map of the target area already exists, or at least one of the time-series image data had been classified. The classified map is used as a reference map to specify training areas of classification categories. The new automatic classification method consists of five steps, i.e., extraction of training data using reference map, detection of changed pixels based upon the homogeneity of training data, clustering of changed pixels, reconstruction of training data, and classification as like maximum likelihood classifier. In order to evaluate the performance of this method qualitatively, four time-series Landsat TM image data were classified by using this method and a conventional method which needs a skilled operator. As a results, we could get classified maps with high reliability and fast throughput, without a skilled operator.

  • PDF

Intracerebral Hemorrhage Auto Recognition in Computed Tomography Images (CT 영상에서 뇌출혈의 자동인식)

  • Choi, Seok-Yoon;Kang, Se-Sik;Kim, Chang-Soo;Kim, Jung-Hoon;Kim, Dong-Hyun;Ye, Soo-Young;Ko, Seong-Jin
    • Journal of radiological science and technology
    • /
    • v.36 no.2
    • /
    • pp.141-148
    • /
    • 2013
  • The CT examination sometimes fail to localize the cerebral hemorrhage part depending on the seriousness and may embarrass the pathologist if he/she is not trained enough for emergencies. Therefore, an assisting role is necessary for examination, automatic and quick detection of the cerebral hemorrhage part, and supply of the quantitative information in emergencies. the computer based automatic detection and recognition system may be of a great service to the bleeding part detection. As a result of this research, we succeeded not only in automatic detection of the cerebral hemorrhage part by grafting threshold value handling, morphological operation, and roundness calculation onto the bleeding part but also in development of the PCA based classifier to screen any wrong choice in the detection candidate group. We think if we apply the new developed system to the cerebral hemorrhage patient in his critical condition, it will be very valuable data to the medical team for operation planning.

Design and Implementation of an Automatic Scoring Model Using a Voting Method for Descriptive Answers (투표 기반 서술형 주관식 답안 자동 채점 모델의 설계 및 구현)

  • Heo, Jeongman;Park, So-Young
    • Journal of the Korea Society of Computer and Information
    • /
    • v.18 no.8
    • /
    • pp.17-25
    • /
    • 2013
  • TIn this paper, we propose a model automatically scoring a student's answer for a descriptive problem by using a voting method. Considering the model construction cost, the proposed model does not separately construct the automatic scoring model per problem type. In order to utilize features useful for automatically scoring the descriptive answers, the proposed model extracts feature values from the results, generated by comparing the student's answer with the answer sheet. For the purpose of improving the precision of the scoring result, the proposed model collects the scoring results classified by a few machine learning based classifiers, and unanimously selects the scoring result as the final result. Experimental results show that the single machine learning based classifier C4.5 takes 83.00% on precision while the proposed model improve the precision up to 90.57% by using three machine learning based classifiers C4.5, ME, and SVM.

A Study on Analog and Digital Meter Recognition Based on Image Processing Technique (영상처리 기법에 기반한 아날로그 및 디지틀 계기의 자동인식에 관한 연구)

  • 김경호;진성일;이용범;이종민
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.32B no.9
    • /
    • pp.1215-1230
    • /
    • 1995
  • The purpose of this paper is to build a computer vision system that endows an autonomous mobile robot the ability of automatic measuring of the analog and digital meters installed in nuclear power plant(NPP). This computer vision system takes a significant part in the organization of automatic surveillance and measurement system having the instruments and gadzets in NPP under automatic control situation. In the meter image captured by the camera, the meter area is sorted out using mainly the thresholding and the region labeling and the meter value recognition process follows. The positions and the angles of the needles in analog meter images are detected using the projection based method. In the case of digital meters, digits and points are extracted and finally recognized through the neural network classifier. To use available database containing relevant information about meters and to build fully automatic meter recognition system, the segmentation and recognition of the function-name in the meter printed around the meter area should be achieved for enhancing identification reliability. For thus, the function- name of the meter needs to be identified and furthermore the scale distributions and values are also required to be analyzed for building the more sophisticated system and making the meter recognition fully automatic.

  • PDF

Power Disturbance Classifier Using Wavelet-Based Neural Network

  • Choi Jae-Ho;Kim Hong-Kyun;Lee Jin-Mok;Chung Gyo-Bum
    • Journal of Power Electronics
    • /
    • v.6 no.4
    • /
    • pp.307-314
    • /
    • 2006
  • This paper presents a wavelet and neural network based technology for the monitoring and classification of various types of power quality (PQ) disturbances. Simultaneous and automatic detection and classification of PQ transients, is recommended, however these processes have not been thoroughly investigated so far. In this paper, the hardware and software of a power quality data acquisition system (PQDAS) is described. In this system, an auto-classifying system combines the properties of the wavelet transform with the advantages of a neural network. Additionally, to improve recognition rate, extraction technology is considered.

Automatic Generation of XML Documents Using Rule-Based Document Classifier (규칙기반 문서 분류기를 이용한 XML 문서 의 자동생성)

  • 김효정;민미경
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2000.11a
    • /
    • pp.125-128
    • /
    • 2000
  • 인터넷 중심의 정보화 사회가 되면서 기존의 문서는 대부분 전자 문서로 대치되어 가고 있다. 전자 문서간의 호환과 표준화를 위하여 XML(eXtensible Markup Language)이 웹 문서의 표준으로 지정되었으나, 현재까지 사용되고 있는 문서들이 XML 형태의 문서가 아니므로 이를 수동으로 변환해야 하는 어려움이 있다. 본 논문에서는 규칙기반 분서 분류기(Rule-Based Document Classifier)를 설계하여 다양한 형태의 문서를 자동으로 분류하고 그룹화한다. 그룹화된 문서를 이용하여 자동으로 DTD(Document Type Definition)를 생성하고, 자동 생성된 DTD를 이용하여 XML 형태의 문서로 자동 변환할 수 있는 자동 XML 변환기를 제시한다. 이러한 방법은 문서들을 자동으로 분류하고, 문서의 행태에 변화가 있을 때에도 유사한 문서로 분류할수 있을 뿐만 아니라 문서를 재분류할 때 DTD의 중복 생성을 줄일 수 있는 등의 장점을 갖는다.

  • PDF

Random Forest Classifier-based Ship Type Prediction with Limited Ship Information of AIS and V-Pass

  • Jeon, Ho-Kun;Han, Jae Rim
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.4
    • /
    • pp.435-446
    • /
    • 2022
  • Identifying ship types is an important process to prevent illegal activities on territorial waters and assess marine traffic of Vessel Traffic Services Officer (VTSO). However, the Terrestrial Automatic Identification System (T-AIS) collected at the ground station has over 50% of vessels that do not contain the ship type information. Therefore, this study proposes a method of identifying ship types through the Random Forest Classifier (RFC) from dynamic and static data of AIS and V-Pass for one year and the Ulsan waters. With the hypothesis that six features, the speed, course, length, breadth, time, and location, enable to estimate of the ship type, four classification models were generated depending on length or breadth information since 81.9% of ships fully contain the two information. The accuracy were average 96.4% and 77.4% in the presence and absence of size information. The result shows that the proposed method is adaptable to identifying ship types.

Performance Comparison of Automatic Classification Using Word Embeddings of Book Titles (단행본 서명의 단어 임베딩에 따른 자동분류의 성능 비교)

  • Yong-Gu Lee
    • Journal of the Korean Society for information Management
    • /
    • v.40 no.4
    • /
    • pp.307-327
    • /
    • 2023
  • To analyze the impact of word embedding on book titles, this study utilized word embedding models (Word2vec, GloVe, fastText) to generate embedding vectors from book titles. These vectors were then used as classification features for automatic classification. The classifier utilized the k-nearest neighbors (kNN) algorithm, with the categories for automatic classification based on the DDC (Dewey Decimal Classification) main class 300 assigned by libraries to books. In the automatic classification experiment applying word embeddings to book titles, the Skip-gram architectures of Word2vec and fastText showed better results in the automatic classification performance of the kNN classifier compared to the TF-IDF features. In the optimization of various hyperparameters across the three models, the Skip-gram architecture of the fastText model demonstrated overall good performance. Specifically, better performance was observed when using hierarchical softmax and larger embedding dimensions as hyperparameters in this model. From a performance perspective, fastText can generate embeddings for substrings or subwords using the n-gram method, which has been shown to increase recall. The Skip-gram architecture of the Word2vec model generally showed good performance at low dimensions(size 300) and with small sizes of negative sampling (3 or 5).

A Feature Vector Extraction Method For the Automatic Classification of Power Quality Disturbances (전력 외란 자동 식별을 위한 특징 벡터 추출 기법)

  • Lee, Chul-Ho;Lee, Jae-Sang;Cho, Kwan-Young;Chung, Ji-Hyun;Nam, Sang-Won
    • Proceedings of the KIEE Conference
    • /
    • 1996.11a
    • /
    • pp.404-406
    • /
    • 1996
  • The objective of this paper is to present a new feature-vector extraction method for the automatic detection and classification of power quality(PQ) disturbances, where FFT, DWT(Discrete Wavelet Transform), and data compression are utilized to extract an appropriate feature vector. In particular, the proposed classifier consists of three parts: i.e., (i) automatic detection of PQ disturbances, where the wavelet transform and signal power estimation method are utilized to detect each disturbance, (ii) feature vector extraction from the detected disturbance, and (iii) automatic classification, where Multi-Layer Perceptron(MLP) is used to classify each disturbance from the corresponding extracted feature vector. To demonstrate the performance and applicability of the proposed classification algorithm, some test results obtained by analyzing 7-class power quality disturbances generated by the EMTP are also provided.

  • PDF