• Title/Summary/Keyword: classifying

Mapping Categories of Heterogeneous Sources Using Text Analytics (텍스트 분석을 통한 이종 매체 카테고리 다중 매핑 방법론)

  • Kim, Dasom;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.22 no.4 / pp.193-215 / 2016
  • In recent years, the proliferation of diverse social networking services has led users to use many media simultaneously, depending on their individual purposes and tastes. Moreover, when collecting information about a particular theme, they usually draw on various media such as social networking services, Internet news, and blogs. However, each document circulated through these media is placed in a category according to each source's own policy and standards, which hinders any attempt to study a specific category across different kinds of sources. For example, documents about "an application for foreign travel" can be classified under "Information Technology," "Travel," or "Life and Culture," depending on the standard of each source. Likewise, because sources differ in how they define categories and how finely they specify them, similar categories can be named and structured differently from source to source. To overcome these limitations, this study proposes a method for mapping categories between different sources across various media while leaving the existing category system of each medium as it is. Specifically, by re-classifying individual documents from the viewpoint of each source and storing the results as extra attributes, this study proposes a logical layer through which users can search for documents from multiple heterogeneous sources with different category names as if they belonged to the same source. In addition, experiments on 6,000 news articles collected from two Internet news portals compared accuracy across sources, between supervised and semi-supervised learning, and between homogeneous and heterogeneous learning data.
It is particularly interesting that, in some categories, the classification accuracy of semi-supervised learning using heterogeneous learning data was higher than that of supervised learning and semi-supervised learning that used homogeneous learning data. This study makes the following contributions. First, it proposes a logical plan for a system that integrates and manages heterogeneous media with different classification systems while leaving the existing physical classification systems as they are. In particular, the results show that classification accuracy varies widely with the heterogeneity of the learning data; this is expected to spur further studies that improve the proposed methodology by analyzing the characteristics of each category. In addition, as demand grows for searching, collecting, and analyzing documents from diverse media, Internet search is no longer restricted to a single medium. However, since each medium has a different category structure and different category names, it is very difficult to search for a specific category across heterogeneous media. The proposed methodology is also significant in that, when users select a desired site, it lets them query all documents according to that site's category classification standards while preserving the characteristics and structure of each existing site. The proposed methodology needs to be complemented in the following respects. First, since its performance was compared and evaluated only indirectly, future studies need to test its accuracy more directly. That is, after documents of the target source are re-classified according to the category system of the existing source, the accuracy of the classification needs to be verified through evaluation by actual users.
In addition, classification accuracy needs to be improved by refining the methodology. Furthermore, understanding the characteristics of the categories in which heterogeneous semi-supervised learning achieved higher classification accuracy than supervised learning may help in obtaining heterogeneous documents from diverse media and in devising ways to use them to improve the accuracy of document classification.
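
The logical layer described in this abstract can be pictured with a minimal Python sketch. It is an illustration under assumptions, not the authors' implementation: the `Document` class, the `annotate` and `search` helpers, the source names `portal_a` and `portal_b`, and the keyword rules standing in for trained classifiers are all hypothetical. The sketch only shows the idea of keeping each document's native category untouched while storing the category predicted for every other source as an extra attribute.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    doc_id: str
    text: str
    source: str            # source the document was collected from
    native_category: str   # category assigned by that source, never modified
    mapped: dict = field(default_factory=dict)  # extra attributes: other source -> predicted category


def annotate(doc, classifiers):
    """Store the category predicted for every other source as an extra attribute."""
    for source, classify in classifiers.items():
        if source != doc.source:
            doc.mapped[source] = classify(doc.text)
    return doc


def search(docs, view_source, category):
    """Return documents that fall under `category` in the category system of `view_source`."""
    hits = []
    for doc in docs:
        if doc.source == view_source:
            label = doc.native_category
        else:
            label = doc.mapped.get(view_source)
        if label == category:
            hits.append(doc)
    return hits


# Hypothetical keyword rules standing in for per-source trained classifiers.
def portal_a_classifier(text):
    return "Travel" if "travel" in text.lower() else "Information Technology"


def portal_b_classifier(text):
    return "Life and Culture" if "travel" in text.lower() else "IT/Science"


classifiers = {"portal_a": portal_a_classifier, "portal_b": portal_b_classifier}

docs = [
    Document("d1", "New application for foreign travel released", "portal_a", "Information Technology"),
    Document("d2", "Best travel routes this summer", "portal_b", "Life and Culture"),
]
for d in docs:
    annotate(d, classifiers)

# Search both sources through the category system of portal_b.
result_ids = [d.doc_id for d in search(docs, "portal_b", "Life and Culture")]
```

In this toy data, `d1` keeps its native "Information Technology" label from `portal_a` but is also retrievable under "Life and Culture" when the user searches through the category system of `portal_b`.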

The study about fire counter plan through the properties qualities research of the main temple properties in Korea (우리나라 중요사찰문화재의 문화재 보유특성 조사를 통한 화재대응방안에 관한 연구)

  • Shin, Ho-Jun;Jung, Eun-Ji;Lee, Ji-Hyang;Kim, Jung-Ho;Back, Min-Ho
    • Proceedings of the Korea Institute of Fire Science and Engineering Conference / 2008.11a / pp.495-501 / 2008
  • This study examines a basic fire response plan by classifying the type, material, and transfer of 66 major temple properties among the 124 major wooden properties in Korea.

Classifying Scratch Defects on Billets Using Image Processing and SVM (영상처리와 SVM을 이용한 Billet의 스크래치 결함 분류)

  • Lee, Sang Jun;Kim, Sang Woo
    • Journal of Institute of Control, Robotics and Systems / v.19 no.3 / pp.256-261 / 2013
  • In steel manufacturing, research on defect inspection receives considerable attention for quality control. This paper proposes an algorithm to detect scratch defects on steel billets. The algorithm takes ROIs (Regions of Interest) and extracts 11 features that represent the properties of a defect in each ROI. An SVM (Support Vector Machine) is used to classify ROIs as defect or normal. The algorithm classifies a frame image of a billet as a defect image if it contains one or more defect ROIs. In the experiments, the proposed algorithm achieved reliable classification accuracy.
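
The frame-level decision rule in this abstract (a frame is a defect image if one or more of its ROIs is classified as a defect) can be sketched in Python as follows. This is a minimal sketch under assumptions: the abstract does not say which 11 features are used or which kernel the SVM has, so `linear_decision` is only a stand-in for a trained linear SVM, and `WEIGHTS`, `BIAS`, and the example feature vectors are hypothetical values chosen for illustration.

```python
def linear_decision(features, weights, bias):
    """Stand-in for a trained linear SVM: the sign of w.x + b picks the class."""
    score = sum(w * x for w, x in zip(weights, features)) + bias
    return "defect" if score > 0 else "normal"


def classify_frame(rois, roi_classifier):
    """A frame is a defect image if one or more of its ROIs is classified as defect."""
    return "defect" if any(roi_classifier(f) == "defect" for f in rois) else "normal"


# Hypothetical weights and bias for an 11-feature ROI vector (not from the paper).
WEIGHTS = [1.0] * 11
BIAS = -5.5


def roi_classifier(features):
    return linear_decision(features, WEIGHTS, BIAS)


normal_roi = [0.1] * 11  # low feature values: score below zero
defect_roi = [0.9] * 11  # high feature values: score above zero

frame_without_defect = [normal_roi, normal_roi]
frame_with_defect = [normal_roi, defect_roi, normal_roi]

label_a = classify_frame(frame_without_defect, roi_classifier)
label_b = classify_frame(frame_with_defect, roi_classifier)
```

Because the rule is an "any" over ROIs, a single false positive at the ROI level flags the whole frame, so in a setup like this the ROI classifier's false-positive rate largely determines the frame-level false-alarm rate.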

A Classification Methodology of Structural Types of RC Buildings for Improving Seismic Fragility Functions (지진취약도 함수 개선을 위한 철근콘크리트 건물의 구조 유형 분류 방안)

  • Kim, Taewan
    • Journal of the Earthquake Engineering Society of Korea / v.24 no.6 / pp.285-292 / 2020
  • The method used to classify structural types of concrete buildings in existing seismic fragility functions is too simple to estimate the fragility of existing residential buildings and neighborhood living facilities, especially those below five stories. Their structural types depend on information contained in the building register, such as the main use, total floor area, number of stories, permission date, and first-story floor area of the individual building. This information is not fully considered when classifying types in the existing functions; therefore, the goal of this study was to suggest a methodology that classifies structural types of concrete buildings by utilizing such information. The results showed that the suggested methodology can classify structural types better than the existing methodology. Nevertheless, the methodology still needs to be simplified, because fragility estimation demands speed rather than accuracy.

One Channel Five-Way Classification Algorithm For Automatically Classifying Speech

  • Lee, Kyo-Sik
    • The Journal of the Acoustical Society of Korea / v.17 no.3E / pp.12-21 / 1998
  • In this paper, we describe a one-channel five-way V/U/M/N/S (Voice/Unvoice/Nasal/Silent) classification algorithm for automatically classifying speech. The decision-making process is viewed as a pattern recognition problem. Two aspects of the algorithm are developed: feature selection and classifier type. The feature selection procedure identifies a set of features for making the V/U/M/N/S classification. The classifiers used are vector quantization (VQ), a neural network (NN), and a decision tree method. Five actual sentences spoken by six speakers, three male and three female, are tested with the proposed classifiers. In a set of measurement tests, the proposed classifiers show fairly good accuracy for the V/U/M/N/S decision.
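
Of the three classifier types named in this abstract, vector quantization is the simplest to illustrate. The Python sketch below shows a generic nearest-codeword classifier, not the paper's algorithm. The two features (normalized energy and zero-crossing rate), the codebook values, and the restriction to three labels (V, U, S) are assumptions made for illustration; in practice the codebooks would be learned from labeled speech frames by a clustering procedure.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def vq_classify(vector, codebooks):
    """Return the label whose codebook holds the codeword nearest to `vector`."""
    best_label, best_dist = None, float("inf")
    for label, codewords in codebooks.items():
        for codeword in codewords:
            d = squared_distance(vector, codeword)
            if d < best_dist:
                best_label, best_dist = label, d
    return best_label


# Hypothetical codebooks over (normalized energy, zero-crossing rate).
# Only three of the five classes are shown; the values are illustrative.
CODEBOOKS = {
    "V": [(0.8, 0.1), (0.9, 0.2)],     # voiced: high energy, low zero-crossing rate
    "U": [(0.3, 0.8), (0.2, 0.7)],     # unvoiced: low energy, high zero-crossing rate
    "S": [(0.05, 0.1), (0.02, 0.05)],  # silent: very low energy
}

frames = [(0.85, 0.15), (0.25, 0.75), (0.03, 0.08)]
labels = [vq_classify(f, CODEBOOKS) for f in frames]
```

Each frame receives the label of its closest codeword, so the decision regions are determined entirely by how the codebooks are laid out in feature space.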

A study of new classifying method of target manufacturing cost to the product components by using customer's function evaluation (소비자 기능평가에 근거한 원가목표에 대한 계층적 세분화에 관한 연구)

  • 하재경
    • Journal of Korean Society of Industrial and Systems Engineering / v.20 no.42 / pp.87-98 / 1997
  • product to assign objectives for target manufacturing cost on the basis of consumers' function evaluation. The principal purposes of the study are to improve product differentiation for products with major usability functions and to prepare effective steps for product concept formulation. Since little research has taken this approach, this study suggests a new method that uses conjoint analysis to classify manufacturing cost goals based on customers' function evaluation. The ultimate goal of this study is to compare and check the cost estimate for each structure, and eventually to decide target cost values that the R&D team can reasonably understand.

Discriminant Analysis of Bullying Participant Roles among Children (아동의 또래괴롭힘 참여유형의 판별변인 분석)

  • Kim, Youn-Hwa;Han, Sae-Young
    • Korean Journal of Child Studies / v.32 no.3 / pp.19-41 / 2011
  • This paper examined gender-specific behaviors in children and classified the types of bullying behavior identified among 1,181 fifth- and sixth-grade elementary school students. Differences were identified in individual variables, family variables, and school variables. The collected data were subjected to descriptive and comparative statistical analysis using the SPSS software program. The results showed that, in boys, multiple discriminant analysis yielded a function of individual, family, and school variables that was effective in classifying the bully, reinforcer, assistant, victim, outsider, and defender types. In girls, multiple discriminant analysis yielded a function of individual variables that was effective in classifying the same six types.

Standard Occupation Classification in Surveyors (측량사관련 표준직업분류에 관한 연구)

  • Lee Young-Jin;Moon Sung-Ho;Jung Kwang-Ho;Song Jun-Ho
    • Proceedings of the Korean Society of Surveying, Geodesy, Photogrammetry, and Cartography Conference / 2006.04a / pp.15-20 / 2006
  • This study examines domestic surveying jobs in order to present directions for improving the classification of surveying occupations. KSCO, which is based on ISCO-88, was found to treat specialized surveyors as an independent group, unlike foreign classifications of surveying jobs. Foreign standard industry classifications classify surveying in more detail than KSCO does. Thus, the surveying job classification in KSCO urgently needs improvement so that data on the field can be produced, analyzed, and processed, and the survey information industry can be classified efficiently and systematically. Surveying is shifting from analog to digital with the application of new technologies such as GPS, GIS, and RS. Thus, the field of geoinformation needs to be newly classified and improved.
