• 제목/요약/키워드: Data Management Techniques

검색결과 1,739건 처리시간 0.032초

데이터마이닝을 활용한 한국프로야구 승패예측모형 수립에 관한 연구 (Using Data Mining Techniques to Predict Win-Loss in Korean Professional Baseball Games)

  • 오윤학;김한;윤재섭;이종석
    • 대한산업공학회지
    • /
    • 제40권1호
    • /
    • pp.8-17
    • /
    • 2014
  • In this research, we employed various data mining techniques to build predictive models for win-loss prediction in Korean professional baseball games. The historical data containing information about players and teams was obtained from the official materials that are provided by the KBO website. Using the collected raw data, we additionally prepared two more types of dataset, which are in ratio and binary format respectively. Dividing away-team's records by the records of the corresponding home-team generated the ratio dataset, while the binary dataset was obtained by comparing the record values. We applied seven classification techniques to three (raw, ratio, and binary) datasets. The employed data mining techniques are decision tree, random forest, logistic regression, neural network, support vector machine, linear discriminant analysis, and quadratic discriminant analysis. Among 21(= 3 datasets${\times}$7 techniques) prediction scenarios, the most accurate model was obtained from the random forest technique based on the binary dataset, which prediction accuracy was 84.14%. It was also observed that using the ratio and the binary dataset helped to build better prediction models than using the raw data. From the capability of variable selection in decision tree, random forest, and stepwise logistic regression, we found that annual salary, earned run, strikeout, pitcher's winning percentage, and four balls are important winning factors of a game. This research is distinct from existing studies in that we used three different types of data and various data mining techniques for win-loss prediction in Korean professional baseball games.

Design of the Database Learning System based on Learner Management Techniques

  • Ahn, Jeong-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • 제15권4호
    • /
    • pp.707-716
    • /
    • 2004
  • Recently, many areas of application such as statistics and industrial engineering are interested in the effective education of databases. In this article we design and implement a database learning system based on learner management techniques. The system supports a personalized/ team-centered learning environment, monitoring the learning attitude of learners, and a method for the assessment.

  • PDF

우주발사체 시스템 개발에 있어서의 SE관리기법 적용 (Application of SE Management Techniques for space Launch System Development)

  • 조미옥;조병규;오범석;박정주;조광래
    • 시스템엔지니어링워크숍
    • /
    • 통권4호
    • /
    • pp.90-94
    • /
    • 2004
  • System engineering(SE) management techniques applied for space launch system development are introduced to assess the current status and address the effwctiveness of these techniques. Management plans and guides are prepared for the work breakdown structure , data, comfiguration, interface control, Quality assurance, procurement, reliability, risk and verification/validation . Further improvement is required for the system engineering management plan(SEMP) to merge the international cooperation into current engineering managment system.

  • PDF

A Survey of State-of-the-Art Multi-Authority Attribute Based Encryption Schemes in Cloud Environment

  • Reetu, Gupta;Priyesh, Kanungo;Nirmal, Dagdee
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제17권1호
    • /
    • pp.145-164
    • /
    • 2023
  • Cloud computing offers a platform that is both adaptable and scalable, making it ideal for outsourcing data for sharing. Various organizations outsource their data on cloud storage servers for availing management and sharing services. When the organizations outsource the data, they lose direct control on the data. This raises the privacy and security concerns. Cryptographic encryption methods can secure the data from the intruders as well as cloud service providers. Data owners may also specify access control policies such that only the users, who satisfy the policies, can access the data. Attribute based access control techniques are more suitable for the cloud environment as they cover large number of users coming from various domains. Multi-authority attribute-based encryption (MA-ABE) technique is one of the propitious attribute based access control technique, which allows data owner to enforce access policies on encrypted data. The main aim of this paper is to comprehensively survey various state-of-the-art MA-ABE schemes to explore different features such as attribute and key management techniques, access policy structure and its expressiveness, revocation of access rights, policy updating techniques, privacy preservation techniques, fast decryption and computation outsourcing, proxy re-encryption etc. Moreover, the paper presents feature-wise comparison of all the pertinent schemes in the field. Finally, some research challenges and directions are summarized that need to be addressed in near future.

데이터 사이언스 기술의 지하수 분야 응용 사례 분석 및 발전 방향 (Applications of Data Science Technologies in the Field of Groundwater Science and Future Trends)

  • 정진아;이재민;이수비;양우종;한원식
    • 한국지하수토양환경학회지:지하수토양환경
    • /
    • 제28권spc호
    • /
    • pp.18-39
    • /
    • 2023
  • Rapid development of geophysical exploration and hydrogeologic monitoring techniques has yielded remarkable increase of datasets related to groundwater systems. Increased number of datasets contribute to understanding of general aquifer characteristics such as groundwater yield and flow, but understanding of complex heterogenous aquifers system is still a challenging task. Recently, applications of data science technique have become popular in the fields of geophysical explorations and monitoring, and such attempts are also extended in the groundwater field. This work reviewed current status and advancement in utilization of data science in groundwater field. The application of data science techniques facilitates effective and realistic analyses of aquifer system, and allows accurate prediction of aquifer system change in response to extreme climate events. Due to such benefits, data science techniques have become an effective tool to establish more sustainable groundwater management systems. It is expected that the techniques will further strengthen the theoretical framework in groundwater management to cope with upcoming challenges and limitations.

데이터 마이닝과 통계적 기법을 통합한 최적화 기법 (Optimization Methodology Integrated Data Mining and Statistical Method)

  • 정혜진;송서일
    • 한국품질경영학회:학술대회논문집
    • /
    • 한국품질경영학회 2006년도 추계 학술대회
    • /
    • pp.205-210
    • /
    • 2006
  • Nowaday manufacture technology and manufacture environment are changing rapidly. By development of computer and enlargement of technique, most of manufacture field are computerized. It is measured automatically do much quality characteristics thereby and great many data happen in a day. corporations is important if have gotten fast information that are useful from wide data to go first in international competition according to these change. Statistical process control(SPC) techniques are used as a problem solution tool at manufacturing process until present. However, this statistical methods is not applied more extensively because have much restrictions in realistic problem. In this paper, wish to develop more realistic and scientific new statistical design techniques doing to integrate data mining(DM) and statistical methods by the alternative to cope these problem. First step selects significant factor using DM techniques from datas of manufacturing process including much factors and second step wish to find optimum of process after get the estimated response function through response surf ace methodology(RSM) that is statistical techniques.

  • PDF

원천 시스템 환경을 고려한 데이터 추출 방식의 비교 및 Index DB를 이용한 추출 방식의 구현 -ㅅ 은행 사례를 중심으로- (A Comparison of Data Extraction Techniques and an Implementation of Data Extraction Technique using Index DB -S Bank Case-)

  • 김기운
    • 경영과학
    • /
    • 제20권2호
    • /
    • pp.1-16
    • /
    • 2003
  • Previous research on data extraction and integration for data warehousing has concentrated mainly on the relational DBMS or partly on the object-oriented DBMS. Mostly, it describes issues related with the change data (deltas) capture and the incremental update by using the triggering technique of active database systems. But, little attention has been paid to data extraction approaches from other types of source systems like hierarchical DBMS, etc. and from source systems without triggering capability. This paper argues, from the practical point of view, that we need to consider not only the types of information sources and capabilities of ETT tools but also other factors of source systems such as operational characteristics (i.e., whether they support DBMS log, user log or no log, timestamp), and DBMS characteristics (i.e., whether they have the triggering capability or not, etc), in order to find out appropriate data extraction techniques that could be applied to different source systems. Having applied several different data extraction techniques (e.g., DBMS log, user log, triggering, timestamp-based extraction, file comparison) to S bank's source systems (e.g., IMS, DB2, ORACLE, and SAM file), we discovered that data extraction techniques available in a commercial ETT tool do not completely support data extraction from the DBMS log of IMS system. For such IMS systems, a new date extraction technique is proposed which first creates Index database and then updates the data warehouse using the Index database. We illustrates this technique using an example application.

데이터 마이닝과 통계적 기법을 통합한 최적화 기법 (Optimization Methodology Integrated Data Mining and Statistical Method)

  • 송서일;신상문;정혜진
    • 품질경영학회지
    • /
    • 제34권4호
    • /
    • pp.33-39
    • /
    • 2006
  • These days manufacture technology and manufacture environment are changing rapidly. By development of computer and enlargement of technique, most of manufacture field are computerized. In order to win international competition, it is important for companies how fast get the useful information from vast data. Statistical process control(SPC) techniques have been used as a problem solution tool at manufacturing process until present. However, these statistical methods are not applied more extensively because it has much restrictions in realistic problems. These statistical techniques have lots of problems when much data and factors are analyzed. In this paper, we proposed more practical and efficient a new statistical design technique which integrated data mining (DM) and statistical methods as alternative of problems. First step is selecting significant factor using DM feature selection algorithm from data of manufacturing process including many factors. Second step is finding optimum of process after estimating response function through response surface methodology(RSM) that is a statistical techniques

데이터 마이닝 기법의 기업도산예측 실증분석 (A Study of Data Mining Techniques in Bankruptcy Prediction)

  • Lee, Kidong
    • 한국경영과학회지
    • /
    • 제28권2호
    • /
    • pp.105-127
    • /
    • 2003
  • In this paper, four different data mining techniques, two neural networks and two statistical modeling techniques, are compared in terms of prediction accuracy in the context of bankruptcy prediction. In business setting, how to accurately detect the condition of a firm has been an important event in the literature. In neural networks, Backpropagation (BP) network and the Kohonen self-organizing feature map, are selected and compared each other while in statistical modeling techniques, discriminant analysis and logistic regression are also performed to provide performance benchmarks for the neural network experiment. The findings suggest that the BP network is a better choice among the data mining tools compared. This paper also identified some distinctive characteristics of Kohonen self-organizing feature map.

고품질 데이터를 지원하는 교통데이터 웨어하우스 구축 기법 (An Integrated Framework for Data Quality Management of Traffic Data Warehouses)

  • 황재일;박승용;나연묵
    • 한국공간정보시스템학회 논문지
    • /
    • 제10권4호
    • /
    • pp.89-95
    • /
    • 2008
  • 본 논문에서는 교통데이터 웨어 하우스에서 데이터 품질 관리를 위한 통합기법을 제안한다. 고속도로 교통관리시스템(FTMS)과 우회도로 교통정보시스템(ARTIS) 으로부터 대용량 교통데이터를 수집하여 데이터 웨어하우스를 구축하기 위한 방안을 기술하고, 다양한 분석을 위한 고품질 교통데이터를 제공하기 위한 통합 데이터 품질관리 기법을 제안하고 구현 평가한다. 제안된 통합 데이터 품질관리 기법을 활용하면 연구자들에게 검증된 고품질 교통데이터를 제공할 수 있고, 데이터처리와 평가를 위한 별도의 비용을 절감할 수 있을 것으로 기대된다.

  • PDF