• 제목/요약/키워드: Association rule mining

검색결과 351건 처리시간 0.023초

연관규칙을 이용한 근골격계 질환 예방 - 다변량 로지스틱 회귀분석의 결과를 기반으로 - (Preventing the Musculoskeletal Disorders using Association Rule - Based on Result of Multiple Logistic Regression -)

  • 박승헌;이석환
    • 대한안전경영과학회지
    • /
    • 제9권4호
    • /
    • pp.29-38
    • /
    • 2007
  • We adapted association rules of data mining in order to investigate the relation among the factors of musculoskeletal disorders and proposed the method of preventing the musculoskeletal disorders associated with multiple logistic regression in previous study. This multiple logistic regression was difficult to establish the method of preventing musculoskeletal disorders in case factors can't be managed by worker himself, i.e., age, gender, marital status. In order to solve this problem, we devised association rules of factors of musculoskeletal disorders and proposed the interactive method of preventing the musculoskeletal disorders, by applying association rules with the result of multiple logistic regression in previous study. The result of correlation analysis showed that prevention method of one part also prevents musculoskeletal disorders of other parts of body.

Toward Successful Management of Vocational Rehabilitation Services for People with Disabilities: A Data Mining Approach

  • Kim, Yong Seog
    • Industrial Engineering and Management Systems
    • /
    • 제11권4호
    • /
    • pp.371-384
    • /
    • 2012
  • This study proposes a multi-level data analysis approach to identify both superficial and latent relationships among variables in the data set obtained from a vocational rehabilitation (VR) services program of people with significant disabilities. At the first layer, data mining and statistical predictive models are used to extract the superficial relationships between dependent and independent variables. To supplement the findings and relationships from the analysis at the first layer, association rule mining algorithms at the second layer are employed to extract additional sets of interesting associative relationships among variables. Finally, nonlinear nonparametric canonical correlation analysis (NLCCA) along with clustering algorithm is employed to identify latent nonlinear relationships. Experimental outputs validate the usefulness of the proposed approach. In particular, the identified latent relationship indicates that disability types (i.e., physical and mental) and severity (i.e., severe, most severe, not severe) have a significant impact on the levels of self-esteem and self-confidence of people with disabilities. The identified superficial and latent relationships can be used to train education program designers and policy developers to maximize the outcomes of VR training programs.

특허 마이닝을 이용한 국방과학기술 연결망 연구 (A Study on Networks of Defense Science and Technology using Patent Mining)

  • 김경수;조남욱
    • 품질경영학회지
    • /
    • 제49권1호
    • /
    • pp.97-112
    • /
    • 2021
  • Purpose: The purpose of this paper is to analyze the technology convergence and its characteristics, focusing on the defense technologies in South Korea. Methods: Patents applied by the Agency for Defense Development (ADD) during 1979~2019 were utilized in this paper. Information Entropy analysis has been conducted on the patents to analyze the usability and potential for development. To analyze the trend of technology convergence in defense technologies, Social Network Analysis(SNA) and Association Rule Mining Analysis were applied to the co-occurrence networks of International Patent Classification (IPC) codes. Results: The results show that sensor, communication, and aviation technologies played a key role in recent development of defense science and technology. The co-occurrence network analysis also showed that the convergence has gradually enhanced over time, and the convergence between different technology sectors largely emerged, showing that the convergence has been diversified. Conclusion: By analyzing the patents of the defense technologies during the last 30 years, this study presents the comprehensive perspectives on trends and characteristics of technology convergence in defense industry. The results of this study are expected to be used as a guideline for decision making in the government's R&D policies in defence industry.

특허 마이닝을 이용한 국방관련 국제특허분류 개선 방안 연구 (A Study on the Improvement of the Defense-related International Patent Classification using Patent Mining)

  • 김경수;조남욱
    • 품질경영학회지
    • /
    • 제50권1호
    • /
    • pp.21-33
    • /
    • 2022
  • Purpose: As most defense technologies are classified as confidential, the corresponding International Patent Classifications (IPCs) require special attention. Consequently, the list of defense-related IPCs has been managed by the government. This paper aims to evaluate the defense-related IPCs and propose a methodology to revalidate and improve the IPC classification scheme. Methods: The patents in military technology and their corresponding IPCs during 2009~2020 were utilized in this paper. Prior to the analysis, patents are divided into private and public sectors. Social network analysis was used to analyze the convergence structure and central defense technology, and association rule mining analysis was used to analyze the convergence pattern. Results: While the public sector was highly cohesive, the private sector was characterized by easy convergence between technologies. In addition, narrow convergence was observed in the public sector, and wide convergence was observed in the private sector. As a result of analyzing the core technologies of defense technology, defense-related IPC candidates were identified. Conclusion: This paper presents a comprehensive perspective on the structure of convergence of defense technology and the pattern of convergence. It is also significant because it proposed a method for revising defense-related IPCs. The results of this study are expected to be used as guidelines for preparing amendments to the government's defense-related IPC.

연관규칙 마이닝에서 랜덤화를 이용한 프라이버시 보호 기법에 관한 연구 (On the Privacy Preserving Mining Association Rules by using Randomization)

  • 강주성;조성훈;이옥연;홍도원
    • 정보처리학회논문지C
    • /
    • 제14C권5호
    • /
    • pp.439-452
    • /
    • 2007
  • 본 논문에서는 랜덤화 기법을 이용한 프라이버시 보존형 데이터 마이닝(PPDM) 기술에 대하여 논한다. 계산 효율성 때문에 실용화 되지 못하고 있는 안전한 다자간 계산(SMC) 기반 PPDM은 현재의 컴퓨팅 환경에서는 실용성 없는 다분히 이론적인 것이다. 그래서 우리는 실용적인 PPDM 기술에 집중하여 가장 널리 사용되고 있는 랜덤화 기법에 대한 연구 결과를 소개한다. 특히, 랜덤화를 이용한 실용적인 PPDM 분야에서 가장 중요한 프라이버시 측도 개념을 심도 있게 분석하였으며, 연관규칙 마이닝에서의 프라이버시 보호 기술에 초점을 맞춘다. Evfimievski 등이 제안한 select-a-size 범주에 속하는 새로운 랜덤화 작용소인 binomial-selector 개념을 제안하고, 적절한 파라미터를 찾기 위한 시뮬레이션 결과를 제시한다. 기존의 cut-and-paste 랜덤화 작용소는 아이템 집합이 큰 경우에는 매우 비효율적이며 복원된 지지도의 분산이 크다는 단점을 지니고 있다. 여기에서 제안하는 binomial-selector 랜덤화 작용소는 cut-and-paste 작용소가 갖는 단점들을 보완한다.

인과적 확인 측도에 의한 연관성 규칙 탐색 (Proposition of causally confirmed measures in association rule mining)

  • 박희창
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권4호
    • /
    • pp.857-868
    • /
    • 2014
  • 대량의 데이터로부터 과거에 알려지지 않았던 유용한 정보를 발견하는 기술인 데이터 마이닝 기법은 오늘날 빅 데이터 시대에 가장 대표적인 분석 기법이라고 할 수 있다. 이들 중에서도 연관성 규칙은 지지도, 신뢰도, 향상도 등의 여러 가지 흥미도 측도를 기반으로 하여 항목들 간의 관련성을 찾아내는 것이다. 그러나 기본적인 연관성 평가 기준만으로는 두 항목 간의 인과관계를 설명할 수 없을 뿐만 아니라 연관성의 방향도 파악할 수 없다. 본 논문에서는 이러한 문제를 해결하기 위해 인과적 확인 연관성 평가 기준을 제안하는 동시에, 제안한 평가 기준들이 흥미도 측도의 조건을 충족하는지의 여부를 점검하였다. 본 논문에서 제안한 인과적 확인 향상도는 세 가지 조건 모두를 만족하는 것으로 입증되었다. 인과적 확인 지지도와 인과적 확인 신뢰도는 동시 발생 확률의 값에 따라 단조 증가하는 조건과 각 항목의 주변 확률의 값에 따라 단조 감소하는 조건은 만족하였다. 또한 예제를 통해 기본적인 연관성 평가 기준과 인과적 연관성 평가 기준, 그리고 인과적 확인 연관성 평가 기준을 비교해 본 결과, 본 논문에서 제안하는 인과적 확인 측도들이 다른 평가 기준에 비해 가장 바람직한 측도라는 사실을 파악하였다.

The HCARD Model using an Agent for Knowledge Discovery

  • Gerardo Bobby D.;Lee Jae-Wan;Joo Su-Chong
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제14권3호
    • /
    • pp.53-58
    • /
    • 2005
  • In this study, we will employ a multi-agent for the search and extraction of data in a distributed environment. We will use an Integrator Agent in the proposed model on the Hierarchical Clustering and Association Rule Discovery(HCARD). The HCARD will address the inadequacy of other data mining tools in processing performance and efficiency when use for knowledge discovery. The Integrator Agent was developed based on CORBA architecture for search and extraction of data from heterogeneous servers in the distributed environment. Our experiment shows that the HCARD generated essential association rules which can be practically explained for decision making purposes. Shorter processing time had been noted in computing for clusters using the HCARD and implying ideal processing period than computing the rules without HCARD.

  • PDF

연관규칙 탐색에서 새로운 흥미도 척도의 제안 (A New Interestingness Measure in Association Rules Mining)

  • 안광일;김성집
    • 대한산업공학회지
    • /
    • 제29권1호
    • /
    • pp.41-48
    • /
    • 2003
  • In this paper, we present a new measure to evaluate the interestingness of association rules. Ultimately. to evaluate whether a rule is interesting or not is subjective. However, an interestingness measure is useful in that it shows the cause for pruning uninteresting rules statistically or logically. Some interestingness measures have been developed in association rules mining. We present an overview of interestingness measures and propose a new measure. A comparative study of some interestingness measures is made on an example dataset and a real dataset. Our experiments show that the new measure can avoid the discovery of misleading rules.

Data Mining and FNN-Driven Knowledge Acquisition and Inference Mechanism for Developing A Self-Evolving Expert Systems

  • Kim, Jin-Sung
    • 한국산학기술학회:학술대회논문집
    • /
    • 한국산학기술학회 2003년도 Proceeding
    • /
    • pp.99-104
    • /
    • 2003
  • In this research, we proposed the mechanism to develop self evolving expert systems (SEES) based on data mining (DM), fuzzy neural networks (FNN), and relational database (RDB)-driven forward/backward inference engine. Most former researchers tried to develop a text-oriented knowledge base (KB) and inference engine (IE). However, thy have some limitations such as 1) automatic rule extraction, 2) manipulation of ambiguousness in knowledge, 3) expandability of knowledge base, and 4) speed of inference. To overcome these limitations, many of researchers had tried to develop an automatic knowledge extraction and refining mechanisms. As a result, the adaptability of the expert systems was improved. Nonetheless, they didn't suggest a hybrid and generalized solution to develop self-evolving expert systems. To this purpose, in this study, we propose an automatic knowledge acquisition and composite inference mechanism based on DM, FNN, and RDB-driven inference. Our proposed mechanism has five advantages empirically. First, it could extract and reduce the specific domain knowledge from incomplete database by using data mining algorithm. Second, our proposed mechanism could manipulate the ambiguousness in knowledge by using fuzzy membership functions. Third, it could construct the relational knowledge base and expand the knowledge base unlimitedly with RDBMS (relational database management systems). Fourth, our proposed hybrid data mining mechanism can reflect both association rule-based logical inference and complicate fuzzy logic. Fifth, RDB-driven forward and backward inference is faster than the traditional text-oriented inference.

  • PDF

VOC 기반 연관규칙 마이닝을 이용한 통신선로설비의 장애 예측 (Fault Prediction of a Telecommunications Network using Association Rules Mining based on Voice of the Customer)

  • 나기주;한인섭;조남욱
    • 디지털산업정보학회논문지
    • /
    • 제11권4호
    • /
    • pp.13-24
    • /
    • 2015
  • Customer complaints handling helps organizations to retain existing customers and attract new customers, as well. As Voice of the Customer (VOC) is one of the main sources of customer complaints, many organizations utilize VOC to enhance customer satisfaction. Effective management of VOC has been proved as one of the best ways to maintain organization's brand image and reputation. In spite of its importance, little has been reported on the utilization of VOC to detect faults in a telecommunication industry. In this paper, association rule mining based on VOC is used to identify root fault causes of a telecommunications network. To do that, VOC of a Communication Service Provider has been collected first. Then, association rule mining has also been conducted with various support and confidence levels. As a result, root fault causes of the telecommunications network can be identified. It is expected that this study can be used as a basis for decisions about customer satisfaction management such as preventive maintenances or reduction of the customer maintenance cost.