• 제목/요약/키워드: data process

검색결과 23,601건 처리시간 0.045초

Big Data Platform Based on Hadoop and Application to Weight Estimation of FPSO Topside

  • Kim, Seong-Hoon;Roh, Myung-Il;Kim, Ki-Su;Oh, Min-Jae
    • Journal of Advanced Research in Ocean Engineering
    • /
    • 제3권1호
    • /
    • pp.32-40
    • /
    • 2017
  • Recently, the amount of data to be processed and the complexity thereof have been increasing due to the development of information and communication technology, and industry's interest in such big data is increasing day by day. In the shipbuilding and offshore industry also, there is growing interest in the effective utilization of data, since various and vast amounts of data are being generated in the process of design, production, and operation. In order to effectively utilize big data in the shipbuilding and offshore industry, it is necessary to store and process large amounts of data. In this study, it was considered efficient to apply Hadoop and R, which are mostly used in big data related research. Hadoop is a framework for storing and processing big data. It provides the Hadoop Distributed File System (HDFS) for storing big data, and the MapReduce function for processing. Meanwhile, R provides various data analysis techniques through the language and environment for statistical calculation and graphics. While Hadoop makes it is easy to handle big data, it is difficult to finely process data; and although R has advanced analysis capability, it is difficult to use to process large data. This study proposes a big data platform based on Hadoop for applications in the shipbuilding and offshore industry. The proposed platform includes the existing data of the shipyard, and makes it possible to manage and process the data. To check the applicability of the platform, it is applied to estimate the weights of offshore structure topsides. In this study, we store data of existing FPSOs in Hadoop-based Hortonworks Data Platform (HDP), and perform regression analysis using RHadoop. We evaluate the effectiveness of large data processing by RHadoop by comparing the results of regression analysis and the processing time, with the results of using the conventional weight estimation program.

개인 소프트웨어 프로세스 지원을 위한 도구 (A Tool to Support Personal Software Process)

  • 신현일;정경학;송일선;최호진;백종문
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제34권8호
    • /
    • pp.752-762
    • /
    • 2007
  • 개발자 개개인의 소프트웨어 개발 프로세스를 개선시켜 소프트웨어의 품질을 향상시킬 수 있도록 돕는 기법으로 PSP(Personal Software Process)가 널리 쓰이고 있다. PSP에 제시된 측정 및 분석활동을 지속적으로 수행함으로써 개별 개발자는 자신의 개발 프로세스에 내재된 약점을 파악할 수 있고, 이렇게 수집된 과거 프로젝트의 데이타를 이용하여 공수와 품질에 대한 예측의 정확도를 높일 수 있다. 그러나 수동으로 행해지는 데이타 수집의 오버헤드와 개발작업-측정작업 간의 문맥전환에 따른 집중력 분산의 문제점으로 인해 신뢰도 높은 데이타를 수집하기가 쉽지 않은 것이 현실이다. 한편, PSP에 제시된 문서형태의 프로세스 가이드는 프로세스 정보 검색의 불편함과 추가적인 정보를 삽입하는 데 어려움을 가지고 있다. 본 논문에서는 이러한 문제점들을 해결하기 위해 개발된 PSP 지원도구를 소개한다. 개발된 도구는 데이타 수집의 신뢰성을 높이기 위해 데이타 자동 수집 기능을 제공하고, PSP 프로세스 정보의 효율적인 검색을 위한 EPG(Electronic Process Guide) 기능 및 추가적인 프로세스 정보의 저장을 위한 경험 저장소 기능을 제공한다.

Note on the Transformed Geometric Poisson Processes

  • Park, Jeong-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • 제8권2호
    • /
    • pp.135-141
    • /
    • 1997
  • In this paper, it is investigated the properties of the transformed geometric Poisson process when the intensity function of the process is a distribution of the continuous random variable. If the intensity function of the transformed geometric Poisson process is a Pareto distribution then the transformed geometric Poisson process is a strongly P-process.

  • PDF

연관규칙과 순차패턴을 이용한 프로세스 마이닝 (A Process Mining using Association Rule and Sequence Pattern)

  • 정소영;권수태
    • 산업경영시스템학회지
    • /
    • 제31권2호
    • /
    • pp.104-111
    • /
    • 2008
  • A process mining is considered to support the discovery of business process for unstructured process model, and a process mining algorithm by using the associated rule and sequence pattern of data mining is developed to extract information about processes from event-log, and to discover process of alternative, concurrent and hidden activities. Some numerical examples are presented to show the effectiveness and efficiency of the algorithm.

A Study on the Monitoring of Reject Rate in High Yield Process

  • Nam, Ho-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • 제18권3호
    • /
    • pp.773-782
    • /
    • 2007
  • The statistical process control charts are very extensively used for monitoring of process mean, deviation, defect rate or reject rate. In this paper we consider a control chart to monitor the process reject rate in the high yield process, which is based on the observed cumulative probability of the number of items inspected until r defective items are observed. We first propose selection of the optimal value of r in the CPC-r charts, and also consider the usefulness of the chart in high yield process such as semiconductor or TFT-LCD manufacturing process.

  • PDF

Multivariate Control Charts for Autocorrelated Process

  • Cho, Gyo-Young;Park, Mi-Ra
    • Journal of the Korean Data and Information Science Society
    • /
    • 제14권2호
    • /
    • pp.289-301
    • /
    • 2003
  • In this paper, we propose Shewhart control chart and EWMA control chart using the autocorrelated data which are common in chemical and process industries and lead to increase the number of false alarms when conventional control charts are applied. The effect of autocorrelated data is modeled as a autoregressive process, and canonical analysis is used to reduce the dimensionality of the data set and find the canonical variables that explain as much of the data variation as possible. Charting statistics are constructed based on the residual vectors from the canonical variables which are uncorrelated over time, and the control charts for these statistics can attenuate the autocorrelation in the process data. The charting procedures are illustrated with a numerical example and simulation is conducted to investigate the performances of the proposed control charts.

  • PDF

데이터마이닝 기법을 이용한 제조 공정내의 불량항목별 예측방법 (Defect Type Prediction Method in Manufacturing Process Using Data Mining Technique)

  • 변성규;강창욱;심성보
    • 산업경영시스템학회지
    • /
    • 제27권2호
    • /
    • pp.10-16
    • /
    • 2004
  • Data mining technique is the exploration and analysis, by automatic or semiautomatic means, of large quantities of data in order to discover meaningful patterns and rules. This paper uses a data mining technique for the prediction of defect types in manufacturing Process. The Purpose of this Paper is to model the recognition of defect type Patterns and Prediction of each defect type before it occurs in manufacturing process. The proposed model consists of data handling, defect type analysis, and defect type prediction stages. The performance measurement shows that it is higher in prediction accuracy than logistic regression model.

업무 프로세스와 데이터 요구사항의 통합 모델링 (A Modeling Approach to Integrate Business Processes and Data Requirements)

  • 장무경
    • 대한안전경영과학회:학술대회논문집
    • /
    • 대한안전경영과학회 2011년도 춘계학술대회
    • /
    • pp.329-338
    • /
    • 2011
  • Business processes are often of long duration, and include internal worker's decision making, which makes business processes to be exposed to many exceptional situations. These properties of business processes makes it difficult to design processes to support uncertainties from internal or external environments. The behavioral properties of business processes mainly depends on the data aspects of business processes. To formalize the data aspect of process modeling, this paper proposes a graph-based model, called Data Dependency Graph (DDG), constructed from dependency relationships specified between business data. The paper also defines a mechanism of describing a set of mapping rules that generates a process model semantically equivalent to a DDG, which is accomplished by allocating data dependencies to component activities.

  • PDF

준지도학습 기반 반도체 공정 이상 상태 감지 및 분류 (Semi-Supervised Learning for Fault Detection and Classification of Plasma Etch Equipment)

  • 이용호;최정은;홍상진
    • 반도체디스플레이기술학회지
    • /
    • 제19권4호
    • /
    • pp.121-125
    • /
    • 2020
  • With miniaturization of semiconductor, the manufacturing process become more complex, and undetected small changes in the state of the equipment have unexpectedly changed the process results. Fault detection classification (FDC) system that conducts more active data analysis is feasible to achieve more precise manufacturing process control with advanced machine learning method. However, applying machine learning, especially in supervised learning criteria, requires an arduous data labeling process for the construction of machine learning data. In this paper, we propose a semi-supervised learning to minimize the data labeling work for the data preprocessing. We employed equipment status variable identification (SVID) data and optical emission spectroscopy data (OES) in silicon etch with SF6/O2/Ar gas mixture, and the result shows as high as 95.2% of labeling accuracy with the suggested semi-supervised learning algorithm.

Analysis On Encryption Process In Data For Satellite

  • Bae, Hee-Jin
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2008년도 International Symposium on Remote Sensing
    • /
    • pp.216-219
    • /
    • 2008
  • It is necessary to study encryption for protection and safe transmission of the important information. Specially, the security in satellite data is also getting more and more important. This paper introduces DES and TDES algorithm, studies how to apply to satellite data with those algorithms and process of encryption and decryption for satellite data. Proposed encryption process in this paper will be utilized in satellite data for encryption in many satellites.

  • PDF