• Title/Abstract/Keywords: Knowledge Mining

Search results: 580 items (processing time 0.023 seconds)

Artificial Intelligence and Pattern Recognition Using Data Mining Algorithms

  • Al-Shamiri, Abdulkawi Yahya Radman
    • International Journal of Computer Science & Network Security
    • /
    • Vol. 21, No. 7
    • /
    • pp.221-232
    • /
    • 2021
  • In recent years, as huge amounts of data have accumulated in large databases, the need for accurate tools that analyze data and extract information and knowledge from large, multi-source databases has increased. Hence, new techniques have emerged that will contribute to the development of all other sciences. Knowledge discovery techniques are among these technologies; one popular knowledge discovery technique is data mining, which aims to discover knowledge from huge amounts of data. Such modern knowledge discovery technologies will contribute to the development of all other fields. Data mining is an important and interesting technique with many different algorithms. Therefore, this paper presents an overview of data mining and clarifies the most important of those algorithms and their uses.

The Knowledge-Based Design Paradigm through Web Data Mining and Knowledge Management Framework

  • 양종열
    • 디자인학연구
    • /
    • Vol. 15, No. 4
    • /
    • pp.159-168
    • /
    • 2002
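  • The world has entered the knowledge information society. Information technology is one of the factors that gave rise to knowledge management and can be seen as the driving force accelerating its development. In addition, information technology and the Internet have advanced remarkably in recent years. Accordingly, this study aims to create hidden knowledge about customers from the vast amount of Internet data in a rapidly changing digital environment through web data mining, and to build a knowledge-based design paradigm that applies this knowledge to a knowledge management framework, so that useful knowledge about customers can be created in real time in the digital environment and designs that satisfy customer needs can be developed. To achieve this purpose, the theoretical review first examines prior studies on knowledge management processes and web data mining, and the study then proposes a new knowledge-based design paradigm that combines the two (in this study, eCRM in the true sense, implemented by integrating web data mining with the knowledge management process, is called the knowledge-based design paradigm).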


Data Mining and FNN-Driven Knowledge Acquisition and Inference Mechanism for Developing A Self-Evolving Expert Systems

  • Kim, Jin-Sung
    • 한국산학기술학회: Conference Proceedings
    • /
    • 한국산학기술학회 2003 Proceedings
    • /
    • pp.99-104
    • /
    • 2003
  • In this research, we propose a mechanism for developing self-evolving expert systems (SEES) based on data mining (DM), fuzzy neural networks (FNN), and a relational database (RDB)-driven forward/backward inference engine. Most earlier researchers tried to develop a text-oriented knowledge base (KB) and inference engine (IE). However, these approaches have limitations in 1) automatic rule extraction, 2) manipulation of ambiguousness in knowledge, 3) expandability of the knowledge base, and 4) speed of inference. To overcome these limitations, many researchers have tried to develop automatic knowledge extraction and refining mechanisms. As a result, the adaptability of expert systems was improved. Nonetheless, they did not suggest a hybrid and generalized solution for developing self-evolving expert systems. To this end, in this study, we propose an automatic knowledge acquisition and composite inference mechanism based on DM, FNN, and RDB-driven inference. Empirically, our proposed mechanism has five advantages. First, it can extract and reduce specific domain knowledge from an incomplete database by using a data mining algorithm. Second, it can manipulate the ambiguousness in knowledge by using fuzzy membership functions. Third, it can construct a relational knowledge base and expand it without limit using an RDBMS (relational database management system). Fourth, our proposed hybrid data mining mechanism can reflect both association rule-based logical inference and complicated fuzzy logic. Fifth, RDB-driven forward and backward inference is faster than traditional text-oriented inference.


Development of Data Mining System for Ship Design using Combined Genetic Programming with Self Organizing Map

  • 이경호;박종훈;한영수;최시영
    • 한국CDE학회논문집
    • /
    • Vol. 14, No. 6
    • /
    • pp.382-389
    • /
    • 2009
  • Recently, companies have come to require knowledge management as a tool for competitiveness. Companies have built Enterprise Resource Planning (ERP) systems in order to manage large amounts of knowledge, but it is not easy to formalize knowledge within an organization. We focus on a data mining system based on genetic programming (GP). Such a system can be a useful tool for deriving and extracting the necessary information and knowledge from large accumulated data. However, when there is not enough data to perform the learning process of genetic programming, we have to either reduce the number of input parameters or increase the amount of learning or training data. In this study, we suggest an enhanced data mining method that combines genetic programming with a self-organizing map and reduces the number of input parameters. Experimental results from a prototype implementation are also discussed.

A LF based Answer Indexing Method for Encyclopedia Question-Answering System

  • 김현진;이충희;오효정;왕지현;장영길
    • 한국정보과학회: Conference Proceedings
    • /
    • 한국정보과학회 2005 한국컴퓨터종합학술대회 Proceedings, Vol. 32, No. 1 (B)
    • /
    • pp.511-513
    • /
    • 2005
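  • This paper proposes a method for implementing a fast and accurate encyclopedia question-answering system using an answer indexing method. The proposed answer indexing method recognizes answer candidates belonging to about 160 answer type categories in the target documents, and defines and stores answer candidates together with keywords belonging to the index categories as index units. In particular, predicate information is indexed in LF (Logical Form) units to improve indexing accuracy. For answer ranking, essential words are determined from information such as the sentence constituent and word weight of each word in the user's question, and these essential words are used to rank answers. This methodology reflects the characteristics of the encyclopedia document domain, where predicate information must be used to be effective, and is suited to an encyclopedia question-answering system that guarantees fast response times.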


Development of Data Mining Tool for the Utilization of Shipbuilding Knowledge based on Genetic Programming

  • 이경호;박종훈;최영복;장영훈;오준
    • 대한조선학회논문집
    • /
    • Vol. 43, No. 6
    • /
    • pp.700-706
    • /
    • 2006
  • With the development of information technology, companies stress the need for knowledge management. Companies build ERP systems that include knowledge management, but it is not easy to formalize knowledge within an organization. They find that building an information system helps knowledge management. Here, we focus on engineering knowledge. Because engineering data embodies experts' experience and know-how, it is a treasure house of knowledge. Korean shipyards lead the world shipbuilding industry and have accumulated a large store of knowledge and data, but they lack a data mining tool to utilize the accumulated data. This paper describes the development of data mining tools based on genetic programming (GP) for the utilization of shipbuilding knowledge.

Self-Evolving Expert Systems based on Fuzzy Neural Network and RDB Inference Engine

  • Kim, Jin-Sung
    • 지능정보연구
    • /
    • Vol. 9, No. 2
    • /
    • pp.19-38
    • /
    • 2003
  • In this research, we propose a mechanism for developing self-evolving expert systems (SEES) based on data mining (DM), fuzzy neural networks (FNN), and a relational database (RDB)-driven forward/backward inference engine. Most researchers have tried to develop a text-oriented knowledge base (KB) and inference engine (IE). However, this approach has limitations in 1) automatic rule extraction, 2) manipulation of ambiguousness in knowledge, 3) expandability of the knowledge base, and 4) speed of inference. To overcome these limitations, knowledge engineers have tried to develop an automatic knowledge extraction mechanism. As a result, the adaptability of expert systems was improved. Nonetheless, they did not suggest a hybrid and generalized solution for developing self-evolving expert systems. To this end, we propose an automatic knowledge acquisition and composite inference mechanism based on DM, FNN, and an RDB-driven inference engine. Our proposed mechanism has five advantages. First, it can extract and reduce specific domain knowledge from an incomplete database by using data mining technology. Second, it can manipulate the ambiguousness in knowledge by using fuzzy membership functions. Third, it can construct a relational knowledge base and expand it without limit using an RDBMS (relational database management system) module. Fourth, our proposed hybrid data mining mechanism can reflect both association rule-based logical inference and complicated fuzzy relationships. Fifth, RDB-driven forward and backward inference takes less time than traditional text-oriented inference.
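
The abstract above mentions handling ambiguous knowledge with fuzzy membership functions. As a rough illustration only, here is a minimal Python sketch of one common choice, the triangular membership function. The shape, the variable ("temperature"), and the set boundaries are assumptions made for this example; the abstract does not say which membership functions the paper uses.

```python
from typing import Dict, Tuple


def triangular(x: float, a: float, b: float, c: float) -> float:
    """Membership degree of x in a triangular fuzzy set.

    a and c are the feet (membership 0), b is the peak (membership 1).
    Assumes a < b < c.
    """
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


# Hypothetical fuzzy sets for a "temperature" variable (illustrative values).
FUZZY_SETS: Dict[str, Tuple[float, float, float]] = {
    "low": (0.0, 10.0, 20.0),
    "medium": (15.0, 25.0, 35.0),
    "high": (30.0, 40.0, 50.0),
}


def fuzzify(x: float) -> Dict[str, float]:
    """Map a crisp value to its membership degree in every fuzzy set."""
    return {label: triangular(x, *abc) for label, abc in FUZZY_SETS.items()}
```

With these assumed sets, a crisp value such as 18.0 belongs partly to "low" and partly to "medium", which is the kind of graded, overlapping description a fuzzy rule base can reason over.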


A Methodology for Searching Frequent Pattern Using Graph-Mining Technique

  • 홍준석
    • Journal of Information Technology Applications and Management
    • /
    • Vol. 26, No. 1
    • /
    • pp.65-75
    • /
    • 2019
  • As the use of the XML-based semantic web increases in the field of data management, many studies have attempted to extract useful information from data stored in ontologies using association rule mining. Ontology data has the advantage that data can be expressed freely, because it has a flexible and scalable structure, unlike a conventional database with a predefined structure. On the other hand, this flexibility makes it difficult to find frequent patterns with a uniform analysis method. The goal of this study is to provide a basis for extracting useful knowledge from an ontology by searching for frequently occurring subgraph patterns, applying transaction-based graph mining techniques to the ontology schema graph data and instance graph data that constitute the ontology. To overcome the structural limitations of existing ontology mining, the frequent pattern search methodology in this study adopts graph mining techniques, applying an iterative node chunking method to search for frequent patterns in the ontology's graph data structure. Our suggested methodology will play an important role in knowledge extraction.
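
To make "transaction-based" frequent pattern search concrete, the following Python sketch counts the support of single-edge patterns across a set of graph transactions. This is only the simplest first level of a frequent subgraph search, not the iterative node chunking method of the paper. The triple representation and the example labels are assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple

# An edge is a (source label, relation, target label) triple; this
# representation is an assumption made for the sketch.
Edge = Tuple[str, str, str]


def frequent_edges(graphs: Iterable[List[Edge]], min_support: int) -> Dict[Edge, int]:
    """Return edges that occur in at least min_support graph transactions."""
    counts: Counter = Counter()
    for edges in graphs:
        # Transaction-based support: count each edge once per graph,
        # no matter how often it repeats inside that graph.
        counts.update(set(edges))
    return {edge: n for edge, n in counts.items() if n >= min_support}


# Hypothetical graph transactions.
GRAPHS: List[List[Edge]] = [
    [("Person", "worksFor", "Company"), ("Person", "livesIn", "City")],
    [("Person", "worksFor", "Company"), ("Company", "locatedIn", "City")],
    [("Person", "worksFor", "Company"), ("Person", "livesIn", "City"),
     ("Person", "livesIn", "City")],
]

FREQUENT = frequent_edges(GRAPHS, min_support=2)
```

A full frequent subgraph search would then grow these frequent edges into larger candidate patterns and recount their support, which is where most of the cost and the algorithmic differences lie.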

A Novel Approach for Mining High-Utility Sequential Patterns in Sequence Databases

  • Ahmed, Chowdhury Farhan;Tanbeer, Syed Khairuzzaman;Jeong, Byeong-Soo
    • ETRI Journal
    • /
    • Vol. 32, No. 5
    • /
    • pp.676-686
    • /
    • 2010
  • Mining sequential patterns is an important research issue in data mining and knowledge discovery with broad applications. However, existing sequential pattern mining approaches consider only binary frequency values of items in sequences and assign equal importance/significance values to distinct items. Therefore, they cannot accurately represent many real-world scenarios. In this paper, we propose a novel framework for mining high-utility sequential patterns, which extracts information more applicable to real life from sequence databases with non-binary frequency values of items in sequences and different importance/significance values for distinct items. Moreover, for mining high-utility sequential patterns, we propose two new algorithms: UtilityLevel is a high-utility sequential pattern mining algorithm with a level-wise candidate generation approach, and UtilitySpan is a high-utility sequential pattern mining algorithm with a pattern growth approach. Extensive performance analyses show that our algorithms are very efficient and scalable for mining high-utility sequential patterns.
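
The contrast between binary frequency and utility can be shown with a small Python sketch. It is not UtilityLevel or UtilitySpan. It only computes utility as quantity times a per-item importance value, and the data layout, item names, and weights are all assumptions made for illustration.

```python
from typing import Dict, List

Itemset = Dict[str, int]    # item -> quantity (non-binary frequency)
Sequence = List[Itemset]    # ordered list of itemsets

# Hypothetical per-item importance values (e.g. unit profit).
UNIT_UTILITY: Dict[str, int] = {"a": 2, "b": 5, "c": 1}

# Hypothetical sequence database with two sequences.
DATABASE: List[Sequence] = [
    [{"a": 2, "b": 1}, {"c": 3}],
    [{"a": 1}, {"b": 2, "c": 1}],
]


def sequence_utility(seq: Sequence, unit_utility: Dict[str, int]) -> int:
    """Sum of quantity * importance over every item occurrence in a sequence."""
    return sum(qty * unit_utility[item]
               for itemset in seq
               for item, qty in itemset.items())


def item_support(item: str, db: List[Sequence]) -> int:
    """Binary-frequency view: number of sequences that contain the item."""
    return sum(any(item in itemset for itemset in seq) for seq in db)


def item_utility(item: str, db: List[Sequence], unit_utility: Dict[str, int]) -> int:
    """Utility view: total quantity * importance of the item in the database."""
    return sum(itemset.get(item, 0) * unit_utility[item]
               for seq in db
               for itemset in seq)
```

In this toy database, items "a" and "b" both appear in two sequences, so a frequency-only view treats them alike, while their utilities (6 and 15) differ because of quantities and importance values.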

A bio-text mining system using keywords and patterns in a grid environment

  • Kwon, Hyuk-Ryul;Jung, Tae-Sung;Kim, Kyoung-Ran;Jahng, Hye-Kyoung;Cho, Wan-Sup;Yoo, Jae-Soo
    • 한국산업정보학회: Conference Proceedings
    • /
    • 한국산업정보학회 2007 Spring Conference
    • /
    • pp.48-52
    • /
    • 2007
  • As a huge amount of literature including biological data is being generated in the post-genome era, it has become difficult for researchers to find useful knowledge in biological databases. Bio-text mining and related natural language processing techniques are key to intelligent knowledge retrieval from biological databases. We propose a bio-text mining technique for biologists who seek knowledge in the huge body of literature. First, a web robot is used to extract and transform related literature from remote databases. To improve retrieval speed, we generate an inverted file for keywords in the literature. Then, a text mining system is used to extract given knowledge patterns and keywords. Finally, we construct a grid computing environment to guarantee processing speed in text mining even for huge literature databases. In an experiment on 10,000 bio-literature documents, the system shows 95% precision and 98% recall.
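
The inverted file and the precision/recall figures mentioned above can be illustrated with a minimal Python sketch. The tokenizer, the AND-style keyword search, and the sample documents are assumptions made for this example and do not describe the paper's actual system.

```python
import re
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple


def build_inverted_index(docs: Dict[int, str]) -> Dict[str, Set[int]]:
    """Map each lowercase keyword to the set of document ids containing it."""
    index: Dict[str, Set[int]] = defaultdict(set)
    for doc_id, text in docs.items():
        for token in re.findall(r"[a-z0-9]+", text.lower()):
            index[token].add(doc_id)
    return index


def search(index: Dict[str, Set[int]], keywords: Iterable[str]) -> Set[int]:
    """Return ids of documents that contain all of the given keywords."""
    postings = [index.get(k.lower(), set()) for k in keywords]
    return set.intersection(*postings) if postings else set()


def precision_recall(retrieved: Set[int], relevant: Set[int]) -> Tuple[float, float]:
    """Precision = hits / retrieved, recall = hits / relevant."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical mini-corpus.
DOCS = {
    1: "BRCA1 interacts with RAD51",
    2: "RAD51 binds DNA",
    3: "BRCA1 mutation and cancer",
}
INDEX = build_inverted_index(DOCS)
```

Because a lookup touches only the posting sets of the query keywords rather than scanning every document, an index of this kind is a standard way to speed up keyword retrieval over a large collection.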
