• Title/Summary/Keyword: Classification modeling

Search Result 600, Processing Time 0.025 seconds

A Study on Automatic Classification of Class Diagram Images (클래스 다이어그램 이미지의 자동 분류에 관한 연구)

  • Kim, Dong Kwan
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.3
    • /
    • pp.1-9
    • /
    • 2022
  • UML class diagrams are used to visualize the static aspects of a software system and are involved from analysis and design to documentation and testing. Software modeling using class diagrams is essential for software development, but it may be not an easy activity for inexperienced modelers. The modeling productivity could be improved with a dataset of class diagrams which are classified by domain categories. To this end, this paper provides a classification method for a dataset of class diagram images. First, real class diagrams are selected from collected images. Then, class names are extracted from the real class diagram images and the class diagram images are classified according to domain categories. The proposed classification model has achieved 100.00%, 95.59%, 97.74%, and 97.77% in precision, recall, F1-score, and accuracy, respectively. The accuracy scores for the domain categorization are distributed between 81.1% and 95.2%. Although the number of class diagram images in the experiment is not large enough, the experimental results indicate that it is worth considering the proposed approach to class diagram image classification.

Keyword Reorganization Techniques for Improving the Identifiability of Topics (토픽 식별성 향상을 위한 키워드 재구성 기법)

  • Yun, Yeoil;Kim, Namgyu
    • Journal of Information Technology Services
    • /
    • v.18 no.4
    • /
    • pp.135-149
    • /
    • 2019
  • Recently, there are many researches for extracting meaningful information from large amount of text data. Among various applications to extract information from text, topic modeling which express latent topics as a group of keywords is mainly used. Topic modeling presents several topic keywords by term/topic weight and the quality of those keywords are usually evaluated through coherence which implies the similarity of those keywords. However, the topic quality evaluation method based only on the similarity of keywords has its limitations because it is difficult to describe the content of a topic accurately enough with just a set of similar words. In this research, therefore, we propose topic keywords reorganizing method to improve the identifiability of topics. To reorganize topic keywords, each document first needs to be labeled with one representative topic which can be extracted from traditional topic modeling. After that, classification rules for classifying each document into a corresponding label are generated, and new topic keywords are extracted based on the classification rules. To evaluated the performance our method, we performed an experiment on 1,000 news articles. From the experiment, we confirmed that the keywords extracted from our proposed method have better identifiability than traditional topic keywords.

Dynamic Text Categorizing Method using Text Mining and Association Rule

  • Kim, Young-Wook;Kim, Ki-Hyun;Lee, Hong-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.10
    • /
    • pp.103-109
    • /
    • 2018
  • In this paper, we propose a dynamic document classification method which breaks away from existing document classification method with artificial categorization rules focusing on suppliers and has changing categorization rules according to users' needs or social trends. The core of this dynamic document classification method lies in the fact that it creates classification criteria real-time by using topic modeling techniques without standardized category rules, which does not force users to use unnecessary frames. In addition, it can also search the details through the relevance analysis by calculating the relationship between the words that is difficult to grasp by word frequency alone. Rather than for logical and systematic documents, this method proposed can be used more effectively for situation analysis and retrieving information of unstructured data which do not fit the category of existing classification such as VOC (Voice Of Customer), SNS and customer reviews of Internet shopping malls and it can react to users' needs flexibly. In addition, it has no process of selecting the classification rules by the suppliers and in case there is a misclassification, it requires no manual work, which reduces unnecessary workload.

The History of Mathematical Problem Solving and the Modeling Perspective (수학 문제 해결의 역사와 모델링 관점)

  • Lee Dae Hyun;Seo Kwan Seok
    • Journal for History of Mathematics
    • /
    • v.17 no.4
    • /
    • pp.123-132
    • /
    • 2004
  • In this paper, we reviewed the history of mathematical problem solving since 1900 and investigated problem solving in modeling perspective which is focused on the 21th century. In modeling perspective, problem solvers solve the realistic problem which includes contextualized situations in which mathematics is useful. In this case, the problem is different from the traditional problems which are routine, close, and words problem, etc. Problem solving in modeling perspective emphasizes mathematizing. Most of all, what is important enables students to use mathematics in everyday problem solving situation.

  • PDF

Component classification modeling for component circulation market activation (컴포넌트 유통시장 활성화를 위한 분류체계 모델링)

  • 이서정;조은숙
    • The Journal of Society for e-Business Studies
    • /
    • v.7 no.3
    • /
    • pp.49-60
    • /
    • 2002
  • Many researchers have studied component technologies with concept, methodology and implementation for partial business domain, however there are rarely researches for component classification to manage these systematically. In this paper, we suggest a component classification model, which can make component reusability higher and can derive higher productivity of software development. We take four focuses generalization, abstraction, technology and size. The generalization means which category a component belongs to. The abstraction means how specific a component encapsulates its inside. The technology means which platform for hardware environment a component can be plugged in. The size means the physical component volume.

  • PDF

New Splitting Criteria for Classification Trees

  • Lee, Yung-Seop
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.3
    • /
    • pp.885-894
    • /
    • 2001
  • Decision tree methods is the one of data mining techniques. Classification trees are used to predict a class label. When a tree grows, the conventional splitting criteria use the weighted average of the left and the right child nodes for measuring the node impurity. In this paper, new splitting criteria for classification trees are proposed which improve the interpretablity of trees comparing to the conventional methods. The criteria search only for interesting subsets of the data, as opposed to modeling all of the data equally well. As a result, the tree is very unbalanced but extremely interpretable.

  • PDF

PCA-based Linear Dynamical Systems for Multichannel EEG Classification (다채널 뇌파 분류를 위한 주성분 분석 기반 선형동적시스템)

  • Lee, Hyekyoung;Park, Seungjin
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.10d
    • /
    • pp.232-234
    • /
    • 2002
  • EEG-based brain computer interface (BCI) provides a new communication channel between human brain and computer. The classification of EEG data is an important task in EEG-based BCI. In this paper we present methods which jointly employ principal component analysis (PCA) and linear dynamical system (LDS) modeling for the task of EEG classification. Experimental study for the classification of EEG data during imagination of a left or right hand movement confirms the validity of our proposed methods.

  • PDF

Blackboard Scheduler Control Knowledge for Recursive Heuristic Classification

  • Park, Young-Tack
    • Journal of Intelligence and Information Systems
    • /
    • v.1 no.1
    • /
    • pp.61-72
    • /
    • 1995
  • Dynamic and explicit ordering of strategies is a key process in modeling knowledge-level problem-solving behavior. This paper addressed the important problem of howl to make the scheduler more knowledge-intensive in a way that facilitates the acquisition, integration, and maintenance of the scheduler control knowledge. The solution a, pp.oach described in this paper involved formulating the scheduler task as a heuristic classification problem, and then implementing it as a classification expert system. By doing this, the wide spectrum of known methods of acquiring, refining, and maintaining the knowledge of a classification expert system are a, pp.icable to the scheduler control knowledge. One important innovation of this research is that of recursive heuristic classification : this paper demonstrates that it is possible to formulate and solve a key subcomponent of heuristic classification as heuristic classification problem. Another key innovation is the creation of a method of dynamic heuristic classification : the classification alternatives that are selected among are dynamically generated in real-time and then evidence is gathered for and aginst these alternatives. In contrast, the normal model of heuristic classification is that of structured selection between a set of preenumerated fixed alternatives.

  • PDF

A Study on Statistical Modeling of Spatial Land-use Change Prediction (토지이용 공간변화 예측의 통계학적 모형에 관한 연구)

  • 김의홍
    • Spatial Information Research
    • /
    • v.5 no.2
    • /
    • pp.177-183
    • /
    • 1997
  • S1he concept of a class in the land-use classification system can be equally applied to a class in the land-use-change classification. The maximum likelihood method using linear discriminant function and Markov transition matrix method were integrated to a synthetic modeling effort in order to project spatial allocation of land-use-change and quantitative assignment of that prediction as a whole. The algorithm of both the multivariate discriminant function and the Markov chain matrix were discussed and the test of synthetic model on the study area was resulted in the projection of '90 year as well as '95 year land -use classification. The accuracy and the issue of modeling improvement were discussed eventually.

  • PDF

Analysis of BIM Technology Structure and Core Technology Using Patent Co-classification Network Analysis (특허 동시분류 네트워크 분석을 활용한 BIM 기술구조와 핵심기술 분석)

  • Park, Yoo-Na;Lee, Hye-Jin;Lee, Seok-Hyoung;Choi, Hee-Seok
    • Journal of KIBIM
    • /
    • v.10 no.2
    • /
    • pp.1-11
    • /
    • 2020
  • BIM(Building Information Modeling) is a salient technology for influential innovation in the construction industry. The patent network analysis is useful for suggesting the direction of technology development and exploring the research and development field. Therefore, the purpose of this study is to analyze the BIM technology structure and core technologies according to the convergence of BIM technology and market expansion. In this study, social network analysis was conducted by establishing a co-classification IPC network for the United States BIM patent. In particular, the characteristics of the major technical areas in the BIM technology network were identified through centrality analysis. G06F017/00, digital computing or data processing method, is a core technology field in the BIM network. Arrangements, apparatus or systems for transmission of digital information, H04L029/00 is an influential technology across the network. B25J009/00 for program controlled manipulators is an intermediary technology field and G06T019/00, manipulating 3D models or images for computer graphics, is an important field for technological development competitiveness.