• 제목/요약/키워드: Data mining analysis

검색결과 2,171건 처리시간 0.033초

Study of Mental Disorder Schizophrenia, based on Big Data

  • Hye-Sun Lee
    • International Journal of Advanced Culture Technology
    • /
    • 제11권4호
    • /
    • pp.279-285
    • /
    • 2023
  • This study provides academic implications by considering trends of domestic research regarding therapy for Mental disorder schizophrenia and psychosocial. For the analysis of this study, text mining with the use of R program and social network analysis method have been used and 65 papers have been collected The result of this study is as follows. First, collected data were visualized through analysis of keywords by using word cloud method. Second, keywords such as intervention, schizophrenia, research, patients, program, effect, society, mind, ability, function were recorded with highest frequency resulted from keyword frequency analysis. Third, LDA (latent Dirichlet allocation) topic modeling result showed that classified into 3 keywords: patient, subjects, intervention of psychosocial, efficacy of interventions. Fourth, the social network analysis results derived connectivity, closeness centrality, betweennes centrality. In conclusion, this study presents significant results as it provided basic rehabilitation data for schizophrenia and psychosocial therapy through new research methods by analyzing with big data method by proposing the results through visualization from seeking research trends of schizophrenia and psychosocial therapy through text mining and social network analysis.

A Study of Web Usage Mining for eCRM

  • Hyuncheol Kang;Jung, Byoung-Cheol
    • Communications for Statistical Applications and Methods
    • /
    • 제8권3호
    • /
    • pp.831-840
    • /
    • 2001
  • In this study, We introduce the process of web usage mining, which has lately attracted considerable attention with the fast diffusion of world wide web, and explain the web log data, which Is the main subject of web usage mining. Also, we illustrate some real examples of analysis for web log data and look into practical application of web usage mining for eCRM.

  • PDF

A Comparison of Capabilities of Data Mining Tools

  • Choi, Youn-Seok;Kim, Jong-Geoun;Lee, Jong-Hee
    • Communications for Statistical Applications and Methods
    • /
    • 제8권2호
    • /
    • pp.531-541
    • /
    • 2001
  • In this study, we compare the capabilities of the data mining tools of the most updated version objectively and provide the useful information in which enterprises and universities chose them. In particular, we compare the SAS/Enterprise Miner 3.0, SPSS/Clementine 5.2 and IBM/Intelligent Miner 6.1 which are well known and easily gotten.

  • PDF

프라이버시 보존형 데이터 마이닝 방법 및 척도 분석 (Privacy Preserving Data Mining Methods and Metrics Analysis)

  • 홍은주;홍도원;서창호
    • 디지털융복합연구
    • /
    • 제16권10호
    • /
    • pp.445-452
    • /
    • 2018
  • 생활의 모든 것들이 데이터화 되어가고 있는 세상에서 데이터의 양은 기하급수적으로 증가하고 있다. 이러한 데이터는 수집 및 분석을 통하여 새로운 데이터로 가공되어진다. 새로운 데이터는 병원, 금융, 기업 등 여러 분야에서 다양한 용도로 사용되고 있다. 그러나 기존의 데이터에는 개인들의 민감한 정보가 포함되어 있기 때문에 수집 및 분석과정에서 개인의 프라이버시 노출 우려가 있다. 해결 방안으로 프라이버시 보존형 데이터 마이닝(PPDM)기술이 있다. PPDM은 프라이버시를 보존하면서 동시에 데이터로부터 유용한 정보를 추출하는 방법이다. 본 논문에서는 PPDM을 조사하고 데이터의 프라이버시와 유틸리티를 평가하기 위한 다양한 측정방법을 분석한다.

Framework for False Alarm Pattern Analysis of Intrusion Detection System using Incremental Association Rule Mining

  • Chon Won Yang;Kim Eun Hee;Shin Moon Sun;Ryu Keun Ho
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2004년도 Proceedings of ISRS 2004
    • /
    • pp.716-718
    • /
    • 2004
  • The false alarm data in intrusion detection systems are divided into false positive and false negative. The false positive makes bad effects on the performance of intrusion detection system. And the false negative makes bad effects on the efficiency of intrusion detection system. Recently, the most of works have been studied the data mining technique for analysis of alert data. However, the false alarm data not only increase data volume but also change patterns of alert data along the time line. Therefore, we need a tool that can analyze patterns that change characteristics when we look for new patterns. In this paper, we focus on the false positives and present a framework for analysis of false alarm pattern from the alert data. In this work, we also apply incremental data mining techniques to analyze patterns of false alarms among alert data that are incremental over the time. Finally, we achieved flexibility by using dynamic support threshold, because the volume of alert data as well as included false alarms increases irregular.

  • PDF

데이터마이닝을 이용한 국민연금 부정수급 예측모형 개발 - 손해배상금 불성실 신고를 대상으로 - (An Application of Data-Mining Tool in Fraud Pension Payment Prediction)

  • 차경엽
    • Communications for Statistical Applications and Methods
    • /
    • 제17권1호
    • /
    • pp.1-8
    • /
    • 2010
  • 최근 사회복지분야에서 부정수급, 횡령 등이 빈번히 발생함에 따라 비리를 방지하기 위한 체계적인 관리 방안이 요구되고 있다. 데이터마이닝은 다수의 이해관계자와 많은 예산이 투입되는 사업을 관리하는데 효과적인 방법이다. 본 연구는 국민연금의 부정 수급자 관리방안으로 데이터마이닝을 이용한 예측모형을 개발하였다. 분석결과, 수급자의 급여, 연금 가입, 사고내역 정보가 부정수급의 특성 요인으로 나타났으며 이를 의사결정나무 모형, 로지스틱 회귀모형, 인공신경망 모형에 적용한 결과 의사결정나무 모형의 예측력이 가장 우수한 것으로 분석되었다.

Practical Text Mining for Trend Analysis: Ontology to visualization in Aerospace Technology

  • Kim, Yoosin;Ju, Yeonjin;Hong, SeongGwan;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권8호
    • /
    • pp.4133-4145
    • /
    • 2017
  • Advances in science and technology are driving us to the better life but also forcing us to make more investment at the same time. Therefore, the government has provided the investment to carry on the promising futuristic technology successfully. Indeed, a lot of resources from the government have supported into the science and technology R&D projects for several decades. However, the performance of the public investments remains unclear in many ways, so thus it is required that planning and evaluation about the new investment should be on data driven decision with fact based evidence. In this regard, the government wanted to know the trend and issue of the science and technology with evidences, and has accumulated an amount of database about the science and technology such as research papers, patents, project reports, and R&D information. Nowadays, the database is supporting to various activities such as planning policy, budget allocation, and investment evaluation for the science and technology but the information quality is not reached to the expectation because of limitations of text mining to drill out the information from the unstructured data like the reports and papers. To solve the problem, this study proposes a practical text mining methodology for the science and technology trend analysis, in case of aerospace technology, and conduct text mining methods such as ontology development, topic analysis, network analysis and their visualization.

e-Commerce 쇼핑몰의 소비자 서비스 강화를 위한 활용연구 (A Study on System Applications of e-CRM to Enforcement of consumer Service)

  • 김연정
    • 대한가정학회지
    • /
    • 제43권3호
    • /
    • pp.1-10
    • /
    • 2005
  • The purpose of this study was to investigate the enforcement strategy for Consumer Service marketing of an e-Commerce shopping mall. An e-CRM for a Cosmetic e-Commerce shopping mall, Data Warehousing(DW) component, analysis of data mining of the DW, and web applications and strategies had to developed for marketing of consumer service satisfaction. The major findings were as follows: An RFM analysis was used for consumer classification, which is a fundamental process of e-CRM application. The components of the DW were web sales data and consumer data fields. The visual process of consumer segmentations (superior consumer class) for e-CRM solutions is presented. The association analysis algorithm of data mining to up-selling and cross-selling indicates an association rule. These e-CRM results apply web DB marketing and operating principles to a shopping mall. Therefore, the system applications of e-CRM to Consumer services indicate a marketing strategy for consumer-oriented management.

데이터마이닝을 이용한 설문조사의 심층 분석 (An In-depth Survey Analysis Applying Data Mining Techniques)

  • 김완섭;이수원
    • 공학교육연구
    • /
    • 제9권4호
    • /
    • pp.71-82
    • /
    • 2006
  • 학과의 교육목표 달성을 위해서는 순환형 자율 개선 구조를 운영하기 위한 시스템이 필요하며, 설문조사 분석을 통한 교육시스템의 개선은 교육목표 달성을 위한 중요한 요소 중의 하나이다. 일반적으로 설문조사 분석에서는 항목별로 통계적인 분포를 조사하거나 두 개의 항목간의 연관성을 조사하는 분석 방법이 주로 사용된다. 그러나 이러한 분석 방법은 다양한 항목들 간의 상호 연관성을 분석하지 못하는 한계가 있으므로 보다 심층적인 분석방법이 필요하다. 본 논문에서는 데이터마이닝 기법을 적용한 심층적인 분석 기법을 제시한다. 데이터마이닝이란 대용량의 데이터에 숨겨져 있는 지식을 추출해 내는 기법으로 설문분석에도 효과적으로 이용될 수 있다. 본 분석에서는 Clementine 데이터마이닝 도구를 사용하여 숭실대학교 컴퓨터학과의 재학생에 대한 설문자료에 대한 심층 분석을 수행하였다. 분석의 결과로 '학점'과 다른 항목들과의 연관성을 계층적으로 분석할 수 있었으며, '학점'에 대한 학생상담과 학과의 교육 프로그램 개선에 실제적으로 사용할 수 있는 유용한 정보들을 획득할 수 있었다.

텍스트마이닝을 이용한 약물유해반응 보고자료 분석 (Analysis of Adverse Drug Reaction Reports using Text Mining)

  • 김현희;유기연
    • 한국임상약학회지
    • /
    • 제27권4호
    • /
    • pp.221-227
    • /
    • 2017
  • Background: As personalized healthcare industry has attracted much attention, big data analysis of healthcare data is essential. Lots of healthcare data such as product labeling, biomedical literature and social media data are unstructured, extracting meaningful information from the unstructured text data are becoming important. In particular, text mining for adverse drug reactions (ADRs) reports is able to provide signal information to predict and detect adverse drug reactions. There has been no study on text analysis of expert opinion on Korea Adverse Event Reporting System (KAERS) databases in Korea. Methods: Expert opinion text of KAERS database provided by Korea Institute of Drug Safety & Risk Management (KIDS-KD) are analyzed. To understand the whole text, word frequency analysis are performed, and to look for important keywords from the text TF-IDF weight analysis are performed. Also, related keywords with the important keywords are presented by calculating correlation coefficient. Results: Among total 90,522 reports, 120 insulin ADR report and 858 tramadol ADR report were analyzed. The ADRs such as dizziness, headache, vomiting, dyspepsia, and shock were ranked in order in the insulin data, while the ADR symptoms such as vomiting, 어지러움, dizziness, dyspepsia and constipation were ranked in order in the tramadol data as the most frequently used keywords. Conclusion: Using text mining of the expert opinion in KIDS-KD, frequently mentioned ADRs and medications are easily recovered. Text mining in ADRs research is able to play an important role in detecting signal information and prediction of ADRs.