• Title/Abstract/Keywords: Knowledge Mining

580 search results (processing time: 0.025 seconds)

A Six Sigma Methodology Using Data Mining: A Case Study of "P" Steel Manufacturing Company

  • 장길상
    • The Journal of Information Systems (Korea Association of Information Systems)
    • /
    • Vol. 20, No. 3
    • /
    • pp.1-24
    • /
    • 2011
  • Recently, six sigma has been widely adopted in a variety of industries as a disciplined, data-driven problem solving approach or methodology supported by a handful of powerful statistical tools in order to reduce variation through continuous process improvement. Also, data mining has been widely used to discover unknown knowledge from a large volume of data using various modeling techniques such as neural network, decision tree, regression analysis, etc. This paper proposes a six sigma methodology based on data mining for effectively and efficiently processing massive data in driving six sigma projects. The proposed methodology is applied in the hot stove system which is a major energy-consuming process in a "P" steel company for improvement of heat efficiency through reduction of energy consumption. The results show optimal operation conditions and reduction of the hot stove energy cost by 15%.

Enhanced Genetic Programming Approach for a Ship Design

  • Lee, Kyung-Ho;Han, Young-Soo;Lee, Jae-Joon
    • Journal of Ship and Ocean Technology
    • /
    • Vol. 11, No. 4
    • /
    • pp.21-28
    • /
    • 2007
  • Recently, the importance of utilizing engineering data has been gradually increasing. Engineering data contain the experience and know-how of experts. Data mining techniques are useful for extracting knowledge or information from accumulated existing data. This paper deals with generating optimal polynomials using genetic programming (GP) as a module of a data mining system. Low-order Taylor series are used to approximate the polynomial easily as a nonlinear function that fits the accumulated data. The overfitting problem is unavoidable because, in real applications, the number of learning samples is small. This problem can be handled with the extended data set and function node stabilization methods. A data mining system for ship design based on polynomial genetic programming is presented.

Environmental Consciousness Data Modeling by Association Rules

  • 박희창;조광현
    • Korean Data and Information Science Society: Conference Proceedings
    • /
    • Korean Data and Information Science Society 2004 Autumn Conference
    • /
    • pp.115-124
    • /
    • 2004
  • Data mining is a method for finding useful information in large amounts of data in a database. It is used to discover hidden knowledge, unexpected patterns, relations, and new rules in massive data. Data mining methods include association rules, decision trees, clustering, neural networks, and so on. Association rule mining searches for interesting relationships among items in a given large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary quality measures for an association rule: support, confidence, and lift. We analyze Gyeongnam social indicator survey data using the association rule technique for environmental information discovery. The resulting association rules can be used for environmental preservation and environmental improvement.
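
The three quality measures named in this abstract (support, confidence, and lift) can be illustrated with a minimal sketch. The function name `rule_measures` and the toy transactions are hypothetical and are not taken from the paper or from the Gyeongnam survey data; the sketch only shows the standard definitions of the measures.

```python
def rule_measures(transactions, antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent.

    support    = P(antecedent and consequent)
    confidence = P(consequent | antecedent)
    lift       = confidence / P(consequent)
    """
    antecedent, consequent = set(antecedent), set(consequent)
    n = len(transactions)
    # Count transactions containing each side of the rule and both sides together.
    count_a = sum(1 for t in transactions if antecedent <= set(t))
    count_c = sum(1 for t in transactions if consequent <= set(t))
    count_both = sum(1 for t in transactions if (antecedent | consequent) <= set(t))

    support = count_both / n
    confidence = count_both / count_a if count_a else 0.0
    lift = confidence / (count_c / n) if count_c else 0.0
    return support, confidence, lift


# Hypothetical survey-style records: each set lists the items one respondent reported.
transactions = [
    {"recycles", "uses_public_transit"},
    {"recycles", "uses_public_transit", "saves_water"},
    {"recycles"},
    {"saves_water"},
]

support, confidence, lift = rule_measures(
    transactions, {"recycles"}, {"uses_public_transit"}
)
print(f"support={support:.3f} confidence={confidence:.3f} lift={lift:.3f}")
```

A lift above 1 indicates that the antecedent and consequent occur together more often than they would if they were independent, which is why lift is commonly reported alongside support and confidence.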

Design and Implementation of a Web Mining System Using WMSQL

  • 최성경;박민호;이근호;백인구;한기준
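    • Korean Institute of Information Scientists and Engineers: Conference Proceedings
    • /
    • Korean Institute of Information Scientists and Engineers 2000 Spring Conference Proceedings, Vol. 27, No. 1 (B)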
    • /
    • pp.166-168
    • /
    • 2000
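  • As the World-Wide Web (WWW) has grown, information retrieval methodologies for effectively finding the information users want on the web have become an important research issue, and a number of commercial information retrieval systems have emerged as a result. However, because of the unstructured and diverse nature of web data, the diversity of users, and problems with the quality and quantity of information, these systems make it difficult to obtain information that matches the user's intent and needs. In addition, they only retrieve general information from the large volume of data on the web and lack functions for effective knowledge discovery and management. In this paper, we analyze the problems of previous information retrieval systems and, to address them, design and implement a web mining system based on related research on web mining, a new approach to knowledge discovery on the web. In particular, to convey the user's intent accurately, we use WMSQL, a query language similar in form to conventional SQL, to develop Web Content Mining, which performs web mining directly on the content of web documents, so that meaningful and implicit knowledge can be extracted from unstructured web data.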

Genome data mining for everyone

  • Lee, Gir-Won;Kim, Sang-Soo
    • BMB Reports
    • /
    • Vol. 41, No. 11
    • /
    • pp.757-764
    • /
    • 2008
  • The genomic sequences of a huge number of species have been determined. Typically, these genome sequences and the associated annotation data are accessed through Internet-based genome browsers that offer a user-friendly interface. Intelligent use of the data should expedite biological knowledge discovery. Such activity is collectively called data mining and involves queries that can be simple, complex, and even combinational. Various tools have been developed to make genome data mining available to computational and experimental biologists alike. In this mini-review, some tools that have proven successful will be introduced along with examples taken from published reports.

Web Recommendation Mechanism Based on Case-Based Reasoning and Web Data Mining

  • Kim, Jin-Sung
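    • Korean Institute of Intelligent Systems: Conference Proceedings
    • /
    • Korea Fuzzy Logic and Intelligent Systems Society 2002 Autumn Conference and General Meeting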
    • /
    • pp.443-446
    • /
    • 2002
  • In this research, we suggest a Web-based hybrid recommendation mechanism using CBR (Case-Based Reasoning) and web data mining. Data mining is used as an efficient mechanism for reasoning about the relationships among goods, customers' preferences, and future behavior. CBR systems are normally used for problems in which it is difficult to define rules. We use CBR as an AI tool to recommend similar purchase cases. Web-log data gathered from a real-world Internet shopping mall are used to illustrate the quality of the proposed mechanism. The results show that the hybrid recommendation mechanism based on CBR and web data mining can reflect both association knowledge and purchase information about former customers.

Association Rule of Gyeongnam Social Indicator Survey Data for Environmental Information

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 16, No. 1
    • /
    • pp.59-69
    • /
    • 2005
  • Data mining is a method for finding useful information in large amounts of data in a database. It is used to discover hidden knowledge, unexpected patterns, relations, and new rules in massive data. Data mining methods include decision trees, association rules, clustering, neural networks, and so on. We analyze the 2001 Gyeongnam social indicator survey data using the association rule technique for environmental information. Association rule mining searches for interesting relationships among items in a given large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary quality measures for an association rule: support, confidence, and lift. The resulting association rules can be used for environmental preservation and environmental improvement.

Mining Frequent Itemsets with Normalized Weight in Continuous Data Streams

  • Kim, Young-Hee;Kim, Won-Young;Kim, Ung-Mo
    • Journal of Information Processing Systems
    • /
    • Vol. 6, No. 1
    • /
    • pp.79-90
    • /
    • 2010
  • A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. The continuous characteristic of streaming data necessitates the use of algorithms that require only one scan over the stream for knowledge discovery. Data mining over data streams should support the flexible trade-off between processing time and mining accuracy. In many application areas, mining frequent itemsets has been suggested to find important frequent itemsets by considering the weight of itemsets. In this paper, we present an efficient algorithm WSFI (Weighted Support Frequent Itemsets)-Mine with normalized weight over data streams. Moreover, we propose a novel tree structure, called the Weighted Support FP-Tree (WSFP-Tree), that stores compressed crucial information about frequent itemsets. Empirical results show that our algorithm outperforms comparative algorithms under the windowed streaming model.

국민건강영양조사 자료를 이용한 만성신장질환 분류기법 연구 (The Study of Chronic Kidney Disease Classification using KHANES data)

  • 이홍기;명성민
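    • Korea Society of Computer and Information: Conference Proceedings
    • /
    • Korea Society of Computer and Information, Proceedings of the 61st Winter Conference 2020, Vol. 28, No. 1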
    • /
    • pp.271-272
    • /
    • 2020
  • Data mining is known to be useful in the medical field when no evidence favoring a particular treatment option is available. The healthcare field collects huge volumes of structured and unstructured data in order to find unknown information or knowledge for effective diagnosis and clinical decision making. The 5,179 records considered for analysis were collected from the Korean National Health and Nutrition Examination Survey (KHANES) over two years. The data were split into training and test sets to fit and evaluate the model. We predicted chronic kidney disease (CKD) using data mining methods such as naive Bayes, logistic regression, CART, and artificial neural networks (ANN). The results help select significant features and data mining techniques for the lifestyle factors related to CKD.

Studying Factors Affecting Environmental Accounting Implementation in Mining Enterprises in Vietnam

  • NGUYEN, Thi Kim Tuyen
    • The Journal of Asian Finance, Economics and Business
    • /
    • Vol. 7, No. 5
    • /
    • pp.131-144
    • /
    • 2020
  • The study investigates the impact of factors on environmental accounting implementation in mining enterprises in Binh Dinh province, Vietnam. The survey was carried out in three phases: 1) a draft survey form; 2) in-depth interviews with experts; 3) questionnaire design. The survey respondents were people who had knowledge of environmental information in mining enterprises in Binh Dinh province, including accountants, chief accountants, financial deputy directors, and directors. The questionnaire was sent directly or through the Google Forms tool. The author received 162 responses from the survey respondents, of which 13 were unusable due to missing data. Thus, 149 valid responses were used. This study employs Cronbach's alpha analysis, exploratory factor analysis, and multivariate regression analysis. The results showed the influence of five different factors on environmental accounting implementation in mining enterprises in Binh Dinh province: stakeholder pressure, corporate characteristics, coercive pressure of government agencies, environmental awareness of senior managers, and accountants' qualifications in environmental accounting. While stakeholder pressure has a negligible influence, the remaining four factors (coercive pressure of government agencies, environmental awareness of senior managers, corporate characteristics, and accountants' qualifications in environmental accounting) have a significant effect on environmental accounting implementation in mining enterprises in Binh Dinh province, Vietnam.
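
Cronbach's alpha, which this abstract names as its reliability analysis, can be sketched as follows. This is a minimal illustration of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the response matrix are hypothetical and do not come from the study.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of respondent rows, each with k item scores.

    Uses sample variances (n - 1 denominator) for both items and totals.
    """
    k = len(item_scores[0])
    # Transpose rows (respondents) into columns (items).
    columns = list(zip(*item_scores))
    item_variances = [variance(col) for col in columns]
    total_variance = variance([sum(row) for row in item_scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical 5-point Likert responses: 5 respondents, 3 questionnaire items.
responses = [
    [4, 5, 4],
    [3, 3, 2],
    [5, 5, 5],
    [2, 3, 2],
    [4, 4, 5],
]

alpha = cronbach_alpha(responses)
print(f"Cronbach's alpha = {alpha:.3f}")
```

Values closer to 1 indicate that the items of a scale vary together, so the scale is treated as internally consistent before factor analysis or regression is applied.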