• 제목/요약/키워드: Data Transformation

검색결과 2,070건 처리시간 0.029초

SAMPLE ENTROPY IN ESTIMATING THE BOX-COX TRANSFORMATION

  • Rahman, Mezbahur;Pearson, Larry M.
    • Journal of the Korean Data and Information Science Society
    • /
    • 제12권1호
    • /
    • pp.103-125
    • /
    • 2001
  • The Box-Cox transformation is a well known family of power transformation that brings a set of data into agreement with the normality assumption of the residuals and hence the response variable of a postulated model in regression analysis. This paper proposes a new method for estimating the Box-Cox transformation using maximization of the Sample Entropy statistic which forces the data to get closer to normal as much as possible. A comparative study of the proposed procedure with the maximum likelihood procedure, the procedure via artificial regression estimation, and the recently introduced maximization of the Shapiro-Francia W' statistic procedure is given. In addition, we generate a table for the optimal spacings parameter in computing the Sample Entropy statistic.

  • PDF

Box-Cox Power Transformation Using R

  • Baek, Hoh Yoo
    • 통합자연과학논문집
    • /
    • 제13권2호
    • /
    • pp.76-82
    • /
    • 2020
  • If normality of an observed data is not a viable assumption, we can carry out normal-theory analyses by suitable transforming data. Power transformation by Box and Cox, one of the transformation methods, is derived the power which maximized the likelihood function. But it doesn't induces the closed form in mathematical analysis. In this paper, we compose some R the syntax of which is easier than other statistical packages for deriving the power with using numerical methods. Also, by using R, we show the transformed data approximately distributed the normal through Q-Q plot in univariate and bivariate cases with some examples. Finally, we present the value of a goodness-of-fit statistic(AD) and its p-value for normal distribution. In the similar procedure, this method can be extended to more than bivariate case.

Unlocking Digital Transformation: The Pivotal Role of Data Analytics and Business Intelligence Strategies

  • Edwin Omol;Lucy Mburu;Paul Abuonji
    • International Journal of Knowledge Content Development & Technology
    • /
    • 제14권3호
    • /
    • pp.77-91
    • /
    • 2024
  • This article aims to comprehensively analyze the crucial role played by data analytics and business intelligence (BI) strategies in propelling digital transformation within diverse industries. Through an extensive literature review and examination of real-world case studies, the study employs a systematic analysis of scholarly works and industry reports. This approach provides a panoramic view of how organizations utilize data-driven insights for competitive advantages, improved customer experiences, and fostering innovation. The findings underscore the pivotal significance of data analytics and BI strategies in influencing strategic decision-making, enhancing operational efficiency, and ensuring long-term sustainability across various industries. The study stands out in its originality by offering a unique synthesis of insights derived from scholarly works and real-world case studies, contributing to a holistic understanding of the transformative impact of data analytics and BI on contemporary business practices. While the study provides valuable insights, limitations include the scope of available literature and case studies. The implications call for further research to explore emerging trends and evolving challenges in the dynamic landscape of data analytics and BI. The practical implications highlight the tangible benefits organizations can derive from integrating data analytics and BI strategies, emphasizing their role in shaping strategic decisions and fostering operational efficiency. In a broader context, the study delves into the social implications of the symbiotic relationship between data analytics, BI, and digital transformation. It explores how these strategies impact broader societal and economic aspects, influencing innovation and sustainability.

비즈니스 인텔리전스 환경에서 변환 관리를 이용한 데이터 품질 향상에 대한 연구 (A Study on Data Quality Management in Business Intelligence Environments)

  • 이춘열
    • 경영정보학연구
    • /
    • 제6권2호
    • /
    • pp.65-77
    • /
    • 2004
  • 비즈니스 인텔리전스를 위한 통합 정보시스템의 운영을 위하여서는 무엇보다도 기업 내부와 외부에서 발생한 자료들을 상호 연계하여 통합 관리하여야 한다. 데이터의 통합관리를 위하여서는 기존의 데이터와 데이터들 사이의 일대일 매핑이 아니라 데이터의 생성부터 통합 저장까지의 변환 과정을 총괄적으로 표현하고 관리하여야 한다. 본 연구는 정보구조그래프를 확장함으로써 데이터의 변환구조들 뿐만이 아니라 세부 처리 단계들까지 통합 관리할 수 있는 방안을 제시하며, 이를 이용하여 비즈니스 인텔리전스와 같은 통합환경에서 데이터베이스의 품질 향상을 위한 활용방안을 제시한다.

국가측지좌표계 전환에 따른 변환계수 결정 및 도시기반정보 데이터베이스 변환 -원주시를 대상으로- (Transformation UIS DB and Determination of Coordinates Transformation Coefficients for Wonju City with Translation of Nation Geodetic Datum)

  • 이현직;유지호
    • 한국측량학회지
    • /
    • 제25권2호
    • /
    • pp.141-148
    • /
    • 2007
  • 측량법 개정으로 세계측지계를 전면 사용하게 됨에 따라 기존 한국측지계로 구축된 원주시 UIS DB의 좌표 변환을 수행해야 한다. 그러나 원주시는 한국측지계의 구성과와 신성과로 이원화된 기준점 성과를 이용하여 지형공간정보 자료가 구축되어 있어 좌표체계가 통일되어 있지 않음으로 각 기준점 성과에 대한 세계측지계 변환 방안이 요구된다. 이에 본 연구에서는 한국측지계의 기준점 성과별 좌표변환계수를 산출하고, 변환계수에 대한 정확도를 검증하여 한국측지계상 이원화된 원주시 도시기반정보 DB를 효과적으로 세계측지계로 전환할 수 있었다.

Gemas: Enhancing the Distribution of Integrated Eco-Friendly Marketing Strategies towards Digital Transformation and Global Competitiveness

  • Diana AQMALA;Febrianur Ibnu Fitroh Sukono PUTRA
    • 유통과학연구
    • /
    • 제22권5호
    • /
    • pp.39-57
    • /
    • 2024
  • Purpose: Various policies continue to be strengthened to develop Micro, Small and Medium Enterprises (MSMEs), which have a strategic role in the economy through the pillars of corporatization, capacity and financing to support strong and inclusive economic growth. Efforts to transform MSMEs marketing strategies are undertaken through eco-friendly digitalization to increase resilience and more productive and innovative capacity. Research design, data and methodology: This research is an exploratory qualitative approach taken to investigate the transformation of eco-friendly marketing strategies for MSMEs to increase competitiveness at the global level. The samples obtained were 425 MSMEs assisted by the DKI Jakarta, Bali, Java, Borneo, and Sumatera. The data collection technique used non-probability sampling (snowball sampling). Data is analyzed through collection, reduction, analysis, validity testing, presentation and conclusion. Results: This research shows the transformation of eco-friendly digital-based MSME marketing strategies occurred through four stages, namely production and institutional activities, expanding market share, digitalization and financing, and export market access. Conclusions: Eco-friendly digital transformation allows MSMEs competencies to be refined to improve business processes and business competitiveness at the international level. The contribution of this marketing strategy transformation is expanding MSMEs access to financial institutions (fintech), marketplaces, and QRIS (QR Code Indonesian Standard) digital payments.

농업·농촌 디지털 전환을 위한 빅데이터 활성화 방안 연구 (Big Data Activation Plan for Digital Transformation of Agriculture and Rural)

  • 이원석;손경자;전대호;신용태
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제9권8호
    • /
    • pp.235-242
    • /
    • 2020
  • 4차 산업혁명 시대를 맞아 우리 농업·농촌의 디지털 전환을 추진하고 다가오는 인공지능 시대를 대비하기 위하여, 필요한 양질의 데이터를 수집하고 분석해서 활용할 수 있는 체계와 시스템 구축이 필요하다. 이를 위해 농업인이나 농정담당자 등 다양한 이해 관계자들이 느끼는 문제점이나 이슈들을 조사·분석하여, 공동 활용을 위한 빅데이터 플랫폼 확충, 지속 가능한 빅데이터 거버넌스 구축 그리고 수요자 기반의 빅데이터 활용 기반 활성화 등 우리 농업·농촌의 디지털 전환을 추진하기 위해서 반드시 선결되어야 할 빅데이터 활성화를 위한 전략적 방안들을 제시하고자 한다.

소수 데이터의 신경망 학습에 의한 카메라 보정 (Camera Calibration Using Neural Network with a Small Amount of Data)

  • 도용태
    • 센서학회지
    • /
    • 제28권3호
    • /
    • pp.182-186
    • /
    • 2019
  • When a camera is employed for 3D sensing, accurate camera calibration is vital as it is a prerequisite for the subsequent steps of the sensing process. Camera calibration is usually performed by complex mathematical modeling and geometric analysis. On the other contrary, data learning using an artificial neural network can establish a transformation relation between the 3D space and the 2D camera image without explicit camera modeling. However, a neural network requires a large amount of accurate data for its learning. A significantly large amount of time and work using a precise system setup is needed to collect extensive data accurately in practice. In this study, we propose a two-step neural calibration method that is effective when only a small amount of learning data is available. In the first step, the camera projection transformation matrix is determined using the limited available data. In the second step, the transformation matrix is used for generating a large amount of synthetic data, and the neural network is trained using the generated data. Results of simulation study have shown that the proposed method as valid and effective.

데이터웨어하우스에서 이질적 형태를 가진 데이터의 추출을 위한 Extraction Transformation Transportation(ETT) 시스템 설계 및 구현 (Extraction Transformation Transportation (ETT) system Design and implementation for extracting heterogeneous Data on Data Warehouse)

  • 여성주;왕지남
    • 산업경영시스템학회지
    • /
    • 제24권67호
    • /
    • pp.49-60
    • /
    • 2001
  • Data warehouse(DW) manages all information in a Enterprise and also offers the specific information to users. However, it might be difficult to develope an effective DW system due to varieties in computing facilities, data base, and operating systems. The heterogeneous system environments make it harder to extract data and to provide proper information to usesr in real time. Also commonly occurred is data inconsistency of non-integrated legacy system, which requires an effective and efficient data extraction flow control as well as data cleansing. We design the integrated automatic ETT(Extraction Transformation Transportation) system to control data extraction flow and suggest implementation methodology. Detail analysis and design are given to specify the proposed ETT approach with a real implementation.

  • PDF

포아송 분포를 가정한 Wafer 수준 Statistical Bin Limits 결정방법과 표본크기 효과에 대한 평가 (Methods and Sample Size Effect Evaluation for Wafer Level Statistical Bin Limits Determination with Poisson Distributions)

  • 박성민;김영식
    • 산업공학
    • /
    • 제17권1호
    • /
    • pp.1-12
    • /
    • 2004
  • In a modern semiconductor device manufacturing industry, statistical bin limits on wafer level test bin data are used for minimizing value added to defective product as well as protecting end customers from potential quality and reliability excursion. Most wafer level test bin data show skewed distributions. By Monte Carlo simulation, this paper evaluates methods and sample size effect regarding determination of statistical bin limits. In the simulation, it is assumed that wafer level test bin data follow the Poisson distribution. Hence, typical shapes of the data distribution can be specified in terms of the distribution's parameter. This study examines three different methods; 1) percentile based methodology; 2) data transformation; and 3) Poisson model fitting. The mean square error is adopted as a performance measure for each simulation scenario. Then, a case study is presented. Results show that the percentile and transformation based methods give more stable statistical bin limits associated with the real dataset. However, with highly skewed distributions, the transformation based method should be used with caution in determining statistical bin limits. When the data are well fitted to a certain probability distribution, the model fitting approach can be used in the determination. As for the sample size effect, the mean square error seems to reduce exponentially according to the sample size.