• Title/Summary/Keyword: Content-based Classification

Search Result 445, Processing Time 0.026 seconds

Natural Image Labeling and Classification Technique by Color-Spatial Histogram and Production Rules (칼라-공간 히스토그램과 생성 규칙을 이용한 자연 영상 레이블링 및 분류 기법)

  • 김준영;신수연;김우생
    • Proceedings of the IEEK Conference
    • /
    • 2002.06d
    • /
    • pp.153-156
    • /
    • 2002
  • The image labeling and classification is one of the important tasks for a content-based image retrieval and an image understanding. This paper propose a new technique to label and classify natural images with a color-spatial histogram and production rules. We show that our proposed method is very efficient for a natural image composed of a few regions.

  • PDF

Collaborative Filtering and Genre Classification for Music Recommendation

  • Byun, Jeong-Yong;Nasridinov, Aziz
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.11a
    • /
    • pp.693-694
    • /
    • 2014
  • This short paper briefly describes the proposed music recommendation method that provides suitable music pieces to a listener depending on both listeners' ratings and content of music pieces. The proposed method consists of two methods. First, listeners' ratings prediction method is a combination the traditional user-based and item-based collaborative filtering methods. Second, genre classification method is a combination of feature extraction and classification procedures. The feature extraction step obtains audio signal information and stores it in data structure, while the second one classifies the music pieces into various genres using decision tree algorithm.

Semantic Feature Analysis for Multi-Label Text Classification on Topics of the Al-Quran Verses

  • Gugun Mediamer;Adiwijaya
    • Journal of Information Processing Systems
    • /
    • v.20 no.1
    • /
    • pp.1-12
    • /
    • 2024
  • Nowadays, Islamic content is widely used in research, including Hadith and the Al-Quran. Both are mostly used in the field of natural language processing, especially in text classification research. One of the difficulties in learning the Al-Quran is ambiguity, while the Al-Quran is used as the main source of Islamic law and the life guidance of a Muslim in the world. This research was proposed to relieve people in learning the Al-Quran. We proposed a word embedding feature-based on Tensor Space Model as feature extraction, which is used to reduce the ambiguity. Based on the experiment results and the analysis, we prove that the proposed method yields the best performance with the Hamming loss 0.10317.

Proposing and Validating a Classification Method based on Knowledge Structure to Identify High-Quality Presentation Slides (고품질 슬라이드 선별을 위한 지식구조 기반 분류 기법)

  • Jung, Wonchul;Kim, Seongchan;Yi, Mun Y.
    • KIISE Transactions on Computing Practices
    • /
    • v.20 no.12
    • /
    • pp.676-681
    • /
    • 2014
  • In order to discern and classify high-quality slides, our research proposes a classification method that utilizes a knowledge structure containing information on the presentation slides. After analyzing whether our knowledge structure captures the content's quality information, we developed a classification method based on the knowledge structure produced from the analysis results. With the proposed method, we compared results classified by quality of presentation slides. Through this comparison, we verified that the slides in the high quality group could be classified and were able to retrieve high quality slides. The results show that, by utilizing the cognitive model of a knowledge structure, our method can increase the effectiveness of classification when search or recommendation is conducted mainly with high-quality slides.

A Study on the classification scheme for the design of Directory Search Engine on the web (web 데이터베이스의 디렉토리 설계를 위한 분류체계 연구)

  • 이명희
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.10 no.1
    • /
    • pp.243-268
    • /
    • 1999
  • The purpose of this study is to develop the classification scheme in subject-based directory search engine for educational research information on the web. Five classification systems. Yahoo Korea, Argus Clearinghouse, DDC, ERIC thesaurus and KEDI thesaurus were measured in terms of coverage of subject fields, system logic, accuracy of terminology and efficiency of searching. For the design of Classification Scheme, this study considered the content of subject areas, features of information resources and efficiency based on users. Finally, the Classification Scheme was established in terms of 16 main divisions and 47 sub-divisions in educational research information.

  • PDF

Machine Learning Based Automatic Categorization Model for Text Lines in Invoice Documents

  • Shin, Hyun-Kyung
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.12
    • /
    • pp.1786-1797
    • /
    • 2010
  • Automatic understanding of contents in document image is a very hard problem due to involvement with mathematically challenging problems originated mainly from the over-determined system induced by document segmentation process. In both academic and industrial areas, there have been incessant and various efforts to improve core parts of content retrieval technologies by the means of separating out segmentation related issues using semi-structured document, e.g., invoice,. In this paper we proposed classification models for text lines on invoice document in which text lines were clustered into the five categories in accordance with their contents: purchase order header, invoice header, summary header, surcharge header, purchase items. Our investigation was concentrated on the performance of machine learning based models in aspect of linear-discriminant-analysis (LDA) and non-LDA (logic based). In the group of LDA, na$\"{\i}$ve baysian, k-nearest neighbor, and SVM were used, in the group of non LDA, decision tree, random forest, and boost were used. We described the details of feature vector construction and the selection processes of the model and the parameter including training and validation. We also presented the experimental results of comparison on training/classification error levels for the models employed.

Android malicious code Classification using Deep Belief Network

  • Shiqi, Luo;Shengwei, Tian;Long, Yu;Jiong, Yu;Hua, Sun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.1
    • /
    • pp.454-475
    • /
    • 2018
  • This paper presents a novel Android malware classification model planned to classify and categorize Android malicious code at Drebin dataset. The amount of malicious mobile application targeting Android based smartphones has increased rapidly. In this paper, Restricted Boltzmann Machine and Deep Belief Network are used to classify malware into families of Android application. A texture-fingerprint based approach is proposed to extract or detect the feature of malware content. A malware has a unique "image texture" in feature spatial relations. The method uses information on texture image extracted from malicious or benign code, which are mapped to uncompressed gray-scale according to the texture image-based approach. By studying and extracting the implicit features of the API call from a large number of training samples, we get the original dynamic activity features sets. In order to improve the accuracy of classification algorithm on the features selection, on the basis of which, it combines the implicit features of the texture image and API call in malicious code, to train Restricted Boltzmann Machine and Back Propagation. In an evaluation with different malware and benign samples, the experimental results suggest that the usability of this method---using Deep Belief Network to classify Android malware by their texture images and API calls, it detects more than 94% of the malware with few false alarms. Which is higher than shallow machine learning algorithm clearly.

A Study on the Reorganization of the Knowledge Classification Scheme (학문분류표의 재설정에 관한 연구)

  • 정연경
    • Journal of the Korean Society for information Management
    • /
    • v.17 no.2
    • /
    • pp.37-66
    • /
    • 2000
  • This study attempts to reorganize the knowledge classification system for the research fields and majors in education by designing a new classification schedule. Content analysis of the majors and curriculums in the universities and major areas of the academic professors in Korea, and the comparison with the various headings in several classification systems for research fields were carried out. Based upon the comparison with library classification systems and reviews and opinions of subject specialists in major disciplines, finally, a knowledge classification system composed of three parts - schedules, tables and a relative index - was presented. The proposed classification scheme was tested for classifying the research projects listed in the 1998 catalog of the academic research funded by Korea Research Foundation. Also, several ways for developing a more useful knowledge classification scheme to organize disciplinary information effectively and to encourage interdisciplinary research were suggested.

  • PDF

A Biosignal-Based Human Interface Controlling a Power-Wheelchair for People with Motor Disabilities

  • Kim, Ki-Hong;Kim, Hong-Kee;Kim, Jong-Sung;Son, Wook-Ho;Lee, Soo-Young
    • ETRI Journal
    • /
    • v.28 no.1
    • /
    • pp.111-114
    • /
    • 2006
  • An alternative human interface enabling people with severe motor disabilities to control an assistive system is presented. Since this interface relies on the biosignals originating from the contraction of muscles on the face during particular movements, even individuals with a paralyzed limb can use it with ease. For real-world application, a dedicated hardware module employing a general-purpose digital signal processor was implemented and its validity tested on an electrically powered wheelchair. Furthermore, an additional attempt to reduce error rates to a minimum for stable operation was also made based on the entropy information inherent in the signals during the classification phase. In the experiments, most of the five participating subjects could control the target system at their own will, and thus it is found that the proposed interface can be considered a potential alternative for the interaction of the severely disabled with electronic systems.

  • PDF

Academic Registration Text Classification Using Machine Learning

  • Alhawas, Mohammed S;Almurayziq, Tariq S
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.1
    • /
    • pp.93-96
    • /
    • 2022
  • Natural language processing (NLP) is utilized to understand a natural text. Text analysis systems use natural language algorithms to find the meaning of large amounts of text. Text classification represents a basic task of NLP with a wide range of applications such as topic labeling, sentiment analysis, spam detection, and intent detection. The algorithm can transform user's unstructured thoughts into more structured data. In this work, a text classifier has been developed that uses academic admission and registration texts as input, analyzes its content, and then automatically assigns relevant tags such as admission, graduate school, and registration. In this work, the well-known algorithms support vector machine SVM and K-nearest neighbor (kNN) algorithms are used to develop the above-mentioned classifier. The obtained results showed that the SVM classifier outperformed the kNN classifier with an overall accuracy of 98.9%. in addition, the mean absolute error of SVM was 0.0064 while it was 0.0098 for kNN classifier. Based on the obtained results, the SVM is used to implement the academic text classification in this work.