• 제목/요약/키워드: in-database analytics

검색결과 23건 처리시간 0.026초

마케팅 관점으로 본 빅 데이터 분석 사례연구 : 은행업을 중심으로 (Big Data Analytics Case Study from the Marketing Perspective : Emphasis on Banking Industry)

  • 박성수;이건창
    • 한국IT서비스학회지
    • /
    • 제17권2호
    • /
    • pp.207-218
    • /
    • 2018
  • Recently, it becomes a big trend in the banking industry to apply a big data analytics technique to extract essential knowledge from their customer database. Such a trend is based on the capability to analyze the big data with powerful analytics software and recognize the value of big data analysis results. However, there exits still a need for more systematic theory and mechanism about how to adopt a big data analytics approach in the banking industry. Especially, there is no study proposing a practical case study in which big data analytics is successfully accomplished from the marketing perspective. Therefore, this study aims to analyze a target marketing case in the banking industry from the view of big data analytics. Target database is a big data in which about 3.5 million customers and their transaction records have been stored for 3 years. Practical implications are derived from the marketing perspective. We address detailed processes and related field test results. It proved critical for the big data analysts to consider a sense of Veracity and Value, in addition to traditional Big Data's 3V (Volume, Velocity, and Variety), so that more significant business meanings may be extracted from the big data results.

PDM 시스템을 활용한 Product Data Analytics 교육 훈련 (Education and Training of Product Data Analytics using Product Data Management System)

  • 도남철
    • 한국CDE학회논문집
    • /
    • 제22권1호
    • /
    • pp.80-88
    • /
    • 2017
  • Product data analytics (PDA) is a data-driven analysis method that uses product data management (PDM) databases as its operational data. It aims to understand and evaluate product development processes indirectly through the analysis of product data from the PDM databases. To educate and train PDA efficiently, this study proposed an approach that employs courses for both product development and PDA in a class. The participant group for product development provides a PDM database as a result of their product development activities, and the other group for PDA analyses the PDM database and provides analysis result to the product development group who can explain causes of the result. The collaboration between the two groups can enhance the efficiency of the education and training course on PDA. This study also includes an application example of the approach to a graduate class on PDA and discussion of its result.

분산 인 메모리 DBMS 기반 병렬 K-Means의 In-database 분석 함수로의 설계와 구현 (Design and Implementation of Distributed In-Memory DBMS-based Parallel K-Means as In-database Analytics Function)

  • 구해모;남창민;이우현;이용재;김형주
    • 정보과학회 컴퓨팅의 실제 논문지
    • /
    • 제24권3호
    • /
    • pp.105-112
    • /
    • 2018
  • 데이터의 양이 증가하면서 단일 노드 데이터베이스로는 저장과 처리를 동시에 수행하기에는 부족하다. 따라서, 데이터를 분산시켜 복수 노드로 구성된 분산 데이터베이스에 저장되고 있으며 분석 역시 효율성을 위해 병렬 기능을 제공해야한다. 전통적인 분석 방식은 데이터베이스에서 분석 노드로 데이터를 이동시킨 후 분석을 수행하기 때문에 네트워크의 비용이 발생하며 사용자가 분석을 위해 분석 프레임 워크도 다를 수 있어야한다. 본 연구는 군집화 분석 기법인 K-Means 군집화 알고리즘을 관계형 데이터 베이스와 칼럼 기반 데이터베이스를 이용한 분산 데이터베이스 환경에서 SQL로 구현하는 In-database 분석 함수로의 설계와 구현 그리고 관계형 데이터베이스에서의 성능 최적화 방법을 제안한다.

Multi-dimensional Query Authentication for On-line Stream Analytics

  • Chen, Xiangrui;Kim, Gyoung-Bae;Bae, Hae-Young
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제4권2호
    • /
    • pp.154-173
    • /
    • 2010
  • Database outsourcing is unavoidable in the near future. In the scenario of data stream outsourcing, the data owner continuously publishes the latest data and associated authentication information through a service provider. Clients may register queries to the service provider and verify the result's correctness, utilizing the additional authentication information. Research on On-line Stream Analytics (OLSA) is motivated by extending the data cube technology for higher multi-level abstraction on the low-level-abstracted data streams. Existing work on OLSA fails to consider the issue of database outsourcing, while previous work on stream authentication does not support OLSA. To close this gap and solve the problem of OLSA query authentication while outsourcing data streams, we propose MDAHRB and MDAHB, two multi-dimensional authentication approaches. They are based on the general data model for OLSA, the stream cube. First, we improve the data structure of the H-tree, which is used to store the stream cube. Then, we design and implement two authentication schemes based on the improved H-trees, the HRB- and HB-trees, in accordance with the main stream query authentication framework for database outsourcing. Along with a cost models analysis, consistent with state-of-the-art cost metrics, an experimental evaluation is performed on a real data set. It exhibits that both MDAHRB and MDAHB are feasible for authenticating OLSA queries, while MDAHRB is more scalable.

저자동시인용분석에 의한 Business Analytics 분야의 지적 구조 분석: 2002 ~ 2020 (The Intellectual Structure of Business Analytics by Author Co-citation Analysis : 2002 ~ 2020)

  • 임혜정;서창교
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제30권1호
    • /
    • pp.21-44
    • /
    • 2021
  • Purpose The opportunities and approaches to big data have grown in various ways in the digital era. Business analytics is nowadays an inevitable strategy for organizations to earn a competitive advantage in order to survive in the challenged environments. The purpose of this study is to analyze the intellectual structure of business analytics literature to have a better insight for the organizations to the field. Design/methodology/approach This research analyzed with the data extracted from the database Web of Science. Total of 427 documents and 23,760 references are inserted into the analysis program CiteSpace. Author co-citation analysis is used to analyze the intellectual structure of the business analytics. We performed clustering analysis, burst detection and timeline analysis with the data. Findings We identified seven sub- areas of business analytics field. The top four sub-areas are "Big Data Analytics Infrastructure", "Performance Management System", "Interactive Exploration", and "Supply Chain Management". We also identified the top 5 references with the strongest citation bursts including Trkman et al.(2010) and Davenport(2006). Through timeline analysis we interpret the clusters that are expected to be the trend subjects in the future. Lastly, limitation and further research suggestion are discussed as concluding remarks.

제품자료 분석을 통한 제품설계 실험 실패 요인 분석 (Analysis of Failure in Product Design Experiments by using Product Data Analytics)

  • 도남철
    • 대한산업공학회지
    • /
    • 제40권4호
    • /
    • pp.366-374
    • /
    • 2014
  • This study assessed and analysed a result of a product design experiment through Product Data Analytics (PDA), to find reasons for failure of some projects in the experiment. PDA is a computer-based data analysis that uses Product Data Management (PDM) databases as its operational databases. The study examines 20 product design projects in the experiment, which are prepared to follow same product development process by using an identical PDM system. The design result in the PDM database is assessed and analysed by On-Line Analytical Processing (OLAP) and data mining tools in PDA. The assesment and analysis reveals the lateness in creation of 3D CAD models as the main reason of the failure.

실시간 스트림 데이터 분석을 위한 시각화 가속 기술 및 시각적 분석 시스템 (Fast Visualization Technique and Visual Analytics System for Real-time Analyzing Stream Data)

  • 정성민;연한별;정대교;유상봉;김석연;장윤
    • 한국컴퓨터그래픽스학회논문지
    • /
    • 제22권4호
    • /
    • pp.21-30
    • /
    • 2016
  • 위험관리 시스템은 단 시간에 의사결정하기 위해 스트림 데이터를 실시간으로 분석 할 수 있어야 한다. 많은 데이터 분석 시스템은 CPU와 디스크 데이터베이스로 구성되어 있다. 하지만, cpu 기반 시스템은 스트림 데이터를 실시간으로 분석하는데 어려움이 있다. 스트림 데이터는 1ms부터 1시간, 1일까지 생성주기가 다양하다. 한 개의 센서가 생성하는 데이터는 작다. 하지만 수 만개의 센서가 생성하는 데이터는 매우 크다. 예를 들어 10만개 센서가 1초에 1GB 데이터를 생성한다면, CPU 기반 시스템은 이를 분석 할 수 없다. 이러한 이유로 실시간 스트림 데이터 분석 시스템은 빠른 처리 속도와 확장성이 필요하다. 본 논문에서는 GPU와 하이브리드 데이터베이스를 이용한 시각화 가속 기술을 제안한다. 제안한 기술을 평가하기 위해 우리는 지하 파이프라인에 설치된 센서와 트윗 데이터를 활용하여 실시간 릭 탐지 시각적 분석 시스템에 적용했다.

Enhanced Regular Expression as a DGL for Generation of Synthetic Big Data

  • Kai, Cheng;Keisuke, Abe
    • Journal of Information Processing Systems
    • /
    • 제19권1호
    • /
    • pp.1-16
    • /
    • 2023
  • Synthetic data generation is generally used in performance evaluation and function tests in data-intensive applications, as well as in various areas of data analytics, such as privacy-preserving data publishing (PPDP) and statistical disclosure limit/control. A significant amount of research has been conducted on tools and languages for data generation. However, existing tools and languages have been developed for specific purposes and are unsuitable for other domains. In this article, we propose a regular expression-based data generation language (DGL) for flexible big data generation. To achieve a general-purpose and powerful DGL, we enhanced the standard regular expressions to support the data domain, type/format inference, sequence and random generation, probability distributions, and resource reference. To efficiently implement the proposed language, we propose caching techniques for both the intermediate and database queries. We evaluated the proposed improvement experimentally.

In-Database Analytics : DB 내에서의 효율적인 정보 분석 방안

  • 장성우
    • 한국경영과학회:학술대회논문집
    • /
    • 한국경영과학회 2006년도 추계학술대회
    • /
    • pp.637-640
    • /
    • 2006
  • DB는 더 이상 단순 데이터 관리의 장소가 아니며, 실시간 정보 분석의 핵심 요소임 . 데이터 측면의 RTE 구현 방안. DB의 통합. 단순화, 표준화, 전문화, 정보 전달 체인의 효율화, 통합 DB 상에서의 정보 분석. 정보 분석업무의 개선, 단순분석의 실시간화,고급분석의 전문화

  • PDF

Trend Analysis of the Agricultural Industry Based on Text Analytics

  • Choi, Solsaem;Kim, Junhwan;Nam, Seungju
    • Agribusiness and Information Management
    • /
    • 제11권1호
    • /
    • pp.1-9
    • /
    • 2019
  • This research intends to propose the methodology for analyzing the current trends of agriculture, which directly connects to the survival of the nation, and through this methodology, identify the agricultural trend of Korea. Based on the relationship between three types of data - policy reports, academic articles, and news articles - the research deducts the major issues stored by each data through LDA, the representative topic modeling method. By comparing and analyzing the LDA results deducted from each data source, this study intends to identify the implications regarding the current agricultural trends of Korea. This methodology can be utilized in analyzing industrial trends other than agricultural ones. To go on further, it can also be used as a basic resource for contemplation on potential areas in the future through insight on the current situation. database of the profitability of a total of 180 crop types by analyzing Rural Development Administration's survey of agricultural products income of 115 crop types, small land profitability index survey of 53 crop types, and Statistics Korea's survey of production costs of 12 crop types. Furthermore, this research presents the result and developmental process of a web-based crop introduction decision support system that provides overseas cases of new crop introduction support programs, as well as databases of outstanding business success cases of each crop type researched by agricultural institutions.