• Title/Summary/Keyword: Consistency for classification

Search Result 131, Processing Time 0.026 seconds

A Study of calculation method for consistency with the fusion index (일관성 있는 기술융합지수 산출 방법 연구)

  • Kim, Byung-Chul
    • Journal of Digital Convergence
    • /
    • v.12 no.12
    • /
    • pp.227-232
    • /
    • 2014
  • Technology convergence is a global megatrends. In addition, developed contries should study for the new technology convergence production. Index of technology convergence has side effects that increase the value of production in the technology convergence and add to difference list about plan, however, this is difficult to verify. This paper presents a method to maintain consistency using list and confirm this when evaluation.

An Analysis of Service Classification Systems Provided by Major Korean Search Portals (주요 포털들의 서비스 분류체계 비교 분석)

  • Park, So-Yeon
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.44 no.2
    • /
    • pp.241-262
    • /
    • 2010
  • This study aims to perform an evaluation of classification systems provided by major Korean search portals, Naver, Nate, Daum, and Yahoo-Korea. These classification systems are evaluated in terms of the consistency of classification system, logicality of classification system, ease of interface, clarity of category names, order of category and site listing, and hierarchical structure. The results of this study show that each search portal provides separate classification systems for their services. These results imply that it is crucial for search portals to implement a common classification system and a common interface for their services. This study could contribute to the development and improvement of portals' classification systems.

A Study on Classification System of Korean Literatures Thesaurus (고전 용어 시소러스의 분류 체계에 관한 연구)

  • Yoo Yeong-Jun
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.40 no.2
    • /
    • pp.415-434
    • /
    • 2006
  • This study aim to develop a classification system to classify the descriptors, which is been in korean literatures. Firstly this classification structure is categorized on six facets and the classification system is constructed on a deductive method based on korean literature knowledge. The study compared the classification system with various thesaurus's classification system in humane studies and by the comparison, the classification system of korean literature's terms find out having some merits as using the facet method. On account of these merits the classification system has achieved a consistency of categorization independently and reduced a complexity of classification structure. And by categorizing the common categories, the study has reduced the size of schedules. Finally, the classification system has advanced the structure in the process of classifying the descriptors.

Color & Texture Attribute Classification System of Fashion Item Image for Standardizing Learning Data in Fashion AI (패션 AI의 학습 데이터 표준화를 위한 패션 아이템 이미지의 색채와 소재 속성 분류 체계)

  • Park, Nanghee;Choi, Yoonmi
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.44 no.2
    • /
    • pp.354-368
    • /
    • 2020
  • Accurate and versatile image data-sets are essential for fashion AI research and AI-based fashion businesses based on a systematic attribute classification system. This study constructs a color and texture attribute hierarchical classification system by collecting fashion item images and analyzing the metadata of fashion items described by consumers. Essential dimensions to explain color and texture attributes were extracted; in addition, attribute values for each dimension were constructed based on metadata and previous studies. This hierarchical classification system satisfies consistency, exclusiveness, inclusiveness, and flexibility. The image tagging to confirm the usefulness of the proposed classification system indicated that the contents of attributes of the same image differ depending on the annotator that require a clear standard for distinguishing differences between the properties. This classification system will improve the reliability of the training data for machine learning, by providing standardized criteria for tasks such as tagging and annotating of fashion items.

A Study of User′s Perspective and Satisfaction in National Assembly Library Electronic Library System (국회도서관 전자도서관시스템에 대한 이용자의 기대와 만족에 대한 연구)

  • Hong, Ki-Churl
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.36 no.2
    • /
    • pp.265-284
    • /
    • 2002
  • The purposes of this study are research for user's perspective and satisfaction, and analyze degree of difference in user's perspective and satisfaction through electronic library system of National Assembly Library. According to analysis of questionnaire, score of the factors that quantity of information, latest of information, quality of information, system of classification, accuracy of information are higher than average in user's perspective. Also score of the factors that speed of response, quality of information, consistency of information, consistency of retrieval, quantity of information, accuracy of information are higher than average in user's satisfaction. Result of analysis show that latest of information, quantity of information, variety of information, system of classification, accuracy of information are higher gap in user's perspective and satisfaction.

Monitoring of Graveyards in Mountainous Areas with Simulated KOMPSAT-2 imagery

  • Chang, Eun-Mi;Kim, Min-Ho;Lee, Byung-Whan;Heo, Min
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.1409-1411
    • /
    • 2003
  • The application of simulated KOMPSAT-2 imagery to monitor graveyards is to be developed. Positions calculated from image were compared with those obtained from Geographic Positioning System. With 24 checkpoints, the position of graveyards showed within 5-meter range. Unsupervised classification, supervised classification, and objected-orientation classification algorithms were used to extract the graveyard. Unsupervised classification with masking processes based on National topographic data gives the best result. The graveyards were categorized with four types in field studies while the two types of graveyards were shown in descriptive statistics. Cluster Analysis and discriminant analysis showed the consistency with two types of tombs. It was hard to get a specific spectral signature of graveyards, as they are covered with grasses at different levels and shaded from the surrounding trees. The slopes and aspects of location of graveyards did not make any difference in the spectral signatures. This study gives the basic spectral characteristics for further development of objected-oriented classification algorithms and plausibility of KOMPSAT-2 images for management of mountainous areas in the aspect of position accuracy and classification accuracy.

  • PDF

A Study on the Classification Scheme of the Internet Search Engine (인터넷 탐색엔진에 관한 연구)

  • 김영보
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.8 no.1
    • /
    • pp.197-227
    • /
    • 1997
  • The main purpose of this study is ① to settle and to analyze the classification of the Internet Search Engine comparitively, and ② to build the compatible model of Internet Search Engine classification in order to seek information on the Internet resources. specially in the branch of the Computers and Internet areas. For this study, four Internet Search Engine (Excite, 1-Detect, Simmany, Yahoo Korea!), Inspec Classification and two distionaries were used. The major findings and result of analysis are summarized as follows : 1. The basis of the classification is the scope of topics, the system logic, the clearness, the efficiency. 2. The scope of topics is analyzed comparitively by the number of items from each Search Engine. In the result, Excite is the most superior of the four 3. The system logic is analyzed comparitively by the casuality balance and consistency of the items from each Search Engine. In the result, Excite is the most superior of the four 4. The clearness is analyzed comparitively by the clearness and accuracy of items, the recognition of the searchers. In the result, Excite is the most superior of the four. 5 The efficiency is analyzed comparitively by the exactness of indexing and decreasing the effort of the searchers. In the result, Yahoo Korea! is the most superior of the four. 6 The compatible model of Internet Search Engine classification is estavlished to uplift the scope of topics, the system logic, the clearness, and the efficiency. The model divides the area mainly based upon the topics and resources using‘bookmark’and‘shadow’concept.

  • PDF

Evaluation of Attribute Selection Methods and Prior Discretization in Supervised Learning

  • Cha, Woon Ock;Huh, Moon Yul
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.879-894
    • /
    • 2003
  • We evaluated the efficiencies of applying attribute selection methods and prior discretization to supervised learning, modelled by C4.5 and Naive Bayes. Three databases were obtained from UCI data archive, which consisted of continuous attributes except for one decision attribute. Four methods were used for attribute selection : MDI, ReliefF, Gain Ratio and Consistency-based method. MDI and ReliefF can be used for both continuous and discrete attributes, but the other two methods can be used only for discrete attributes. Discretization was performed using the Fayyad and Irani method. To investigate the effect of noise included in the database, noises were introduced into the data sets up to the extents of 10 or 20%, and then the data, including those either containing the noises or not, were processed through the steps of attribute selection, discretization and classification. The results of this study indicate that classification of the data based on selected attributes yields higher accuracy than in the case of classifying the full data set, and prior discretization does not lower the accuracy.

A review of tree-based Bayesian methods

  • Linero, Antonio R.
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.6
    • /
    • pp.543-559
    • /
    • 2017
  • Tree-based regression and classification ensembles form a standard part of the data-science toolkit. Many commonly used methods take an algorithmic view, proposing greedy methods for constructing decision trees; examples include the classification and regression trees algorithm, boosted decision trees, and random forests. Recent history has seen a surge of interest in Bayesian techniques for constructing decision tree ensembles, with these methods frequently outperforming their algorithmic counterparts. The goal of this article is to survey the landscape surrounding Bayesian decision tree methods, and to discuss recent modeling and computational developments. We provide connections between Bayesian tree-based methods and existing machine learning techniques, and outline several recent theoretical developments establishing frequentist consistency and rates of convergence for the posterior distribution. The methodology we present is applicable for a wide variety of statistical tasks including regression, classification, modeling of count data, and many others. We illustrate the methodology on both simulated and real datasets.

An Automated Topic Specific Web Crawler Calculating Degree of Relevance (연관도를 계산하는 자동화된 주제 기반 웹 수집기)

  • Seo Hae-Sung;Choi Young-Soo;Choi Kyung-Hee;Jung Gi-Hyun;Noh Sang-Uk
    • Journal of Internet Computing and Services
    • /
    • v.7 no.3
    • /
    • pp.155-167
    • /
    • 2006
  • It is desirable if users surfing on the Internet could find Web pages related to their interests as closely as possible. Toward this ends, this paper presents a topic specific Web crawler computing the degree of relevance. collecting a cluster of pages given a specific topic, and refining the preliminary set of related web pages using term frequency/document frequency, entropy, and compiled rules. In the experiments, we tested our topic specific crawler in terms of the accuracy of its classification, crawling efficiency, and crawling consistency. First, the classification accuracy using the set of rules compiled by CN2 was the best, among those of C4.5 and back propagation learning algorithms. Second, we measured the classification efficiency to determine the best threshold value affecting the degree of relevance. In the third experiment, the consistency of our topic specific crawler was measured in terms of the number of the resulting URLs overlapped with different starting URLs. The experimental results imply that our topic specific crawler was fairly consistent, regardless of the starting URLs randomly chosen.

  • PDF