• Title/Summary/Keyword: in-database analytics

Search Result 23, Processing Time 0.02 seconds

Big Data Analytics Case Study from the Marketing Perspective : Emphasis on Banking Industry (마케팅 관점으로 본 빅 데이터 분석 사례연구 : 은행업을 중심으로)

  • Park, Sung Soo;Lee, Kun Chang
    • Journal of Information Technology Services
    • /
    • v.17 no.2
    • /
    • pp.207-218
    • /
    • 2018
  • Recently, it becomes a big trend in the banking industry to apply a big data analytics technique to extract essential knowledge from their customer database. Such a trend is based on the capability to analyze the big data with powerful analytics software and recognize the value of big data analysis results. However, there exits still a need for more systematic theory and mechanism about how to adopt a big data analytics approach in the banking industry. Especially, there is no study proposing a practical case study in which big data analytics is successfully accomplished from the marketing perspective. Therefore, this study aims to analyze a target marketing case in the banking industry from the view of big data analytics. Target database is a big data in which about 3.5 million customers and their transaction records have been stored for 3 years. Practical implications are derived from the marketing perspective. We address detailed processes and related field test results. It proved critical for the big data analysts to consider a sense of Veracity and Value, in addition to traditional Big Data's 3V (Volume, Velocity, and Variety), so that more significant business meanings may be extracted from the big data results.

Education and Training of Product Data Analytics using Product Data Management System (PDM 시스템을 활용한 Product Data Analytics 교육 훈련)

  • Do, Namchul
    • Korean Journal of Computational Design and Engineering
    • /
    • v.22 no.1
    • /
    • pp.80-88
    • /
    • 2017
  • Product data analytics (PDA) is a data-driven analysis method that uses product data management (PDM) databases as its operational data. It aims to understand and evaluate product development processes indirectly through the analysis of product data from the PDM databases. To educate and train PDA efficiently, this study proposed an approach that employs courses for both product development and PDA in a class. The participant group for product development provides a PDM database as a result of their product development activities, and the other group for PDA analyses the PDM database and provides analysis result to the product development group who can explain causes of the result. The collaboration between the two groups can enhance the efficiency of the education and training course on PDA. This study also includes an application example of the approach to a graduate class on PDA and discussion of its result.

Design and Implementation of Distributed In-Memory DBMS-based Parallel K-Means as In-database Analytics Function (분산 인 메모리 DBMS 기반 병렬 K-Means의 In-database 분석 함수로의 설계와 구현)

  • Kou, Heymo;Nam, Changmin;Lee, Woohyun;Lee, Yongjae;Kim, HyoungJoo
    • KIISE Transactions on Computing Practices
    • /
    • v.24 no.3
    • /
    • pp.105-112
    • /
    • 2018
  • As data size increase, a single database is not enough to serve current volume of tasks. Since data is partitioned and stored into multiple databases, analysis should also support parallelism in order to increase efficiency. However, traditional analysis requires data to be transferred out of database into nodes where analytic service is performed and user is required to know both database and analytic framework. In this paper, we propose an efficient way to perform K-means clustering algorithm inside the distributed column-based database and relational database. We also suggest an efficient way to optimize K-means algorithm within relational database.

Multi-dimensional Query Authentication for On-line Stream Analytics

  • Chen, Xiangrui;Kim, Gyoung-Bae;Bae, Hae-Young
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.4 no.2
    • /
    • pp.154-173
    • /
    • 2010
  • Database outsourcing is unavoidable in the near future. In the scenario of data stream outsourcing, the data owner continuously publishes the latest data and associated authentication information through a service provider. Clients may register queries to the service provider and verify the result's correctness, utilizing the additional authentication information. Research on On-line Stream Analytics (OLSA) is motivated by extending the data cube technology for higher multi-level abstraction on the low-level-abstracted data streams. Existing work on OLSA fails to consider the issue of database outsourcing, while previous work on stream authentication does not support OLSA. To close this gap and solve the problem of OLSA query authentication while outsourcing data streams, we propose MDAHRB and MDAHB, two multi-dimensional authentication approaches. They are based on the general data model for OLSA, the stream cube. First, we improve the data structure of the H-tree, which is used to store the stream cube. Then, we design and implement two authentication schemes based on the improved H-trees, the HRB- and HB-trees, in accordance with the main stream query authentication framework for database outsourcing. Along with a cost models analysis, consistent with state-of-the-art cost metrics, an experimental evaluation is performed on a real data set. It exhibits that both MDAHRB and MDAHB are feasible for authenticating OLSA queries, while MDAHRB is more scalable.

The Intellectual Structure of Business Analytics by Author Co-citation Analysis : 2002 ~ 2020 (저자동시인용분석에 의한 Business Analytics 분야의 지적 구조 분석: 2002 ~ 2020)

  • Lim, Hyae Jung;Suh, Chang Kyo
    • The Journal of Information Systems
    • /
    • v.30 no.1
    • /
    • pp.21-44
    • /
    • 2021
  • Purpose The opportunities and approaches to big data have grown in various ways in the digital era. Business analytics is nowadays an inevitable strategy for organizations to earn a competitive advantage in order to survive in the challenged environments. The purpose of this study is to analyze the intellectual structure of business analytics literature to have a better insight for the organizations to the field. Design/methodology/approach This research analyzed with the data extracted from the database Web of Science. Total of 427 documents and 23,760 references are inserted into the analysis program CiteSpace. Author co-citation analysis is used to analyze the intellectual structure of the business analytics. We performed clustering analysis, burst detection and timeline analysis with the data. Findings We identified seven sub- areas of business analytics field. The top four sub-areas are "Big Data Analytics Infrastructure", "Performance Management System", "Interactive Exploration", and "Supply Chain Management". We also identified the top 5 references with the strongest citation bursts including Trkman et al.(2010) and Davenport(2006). Through timeline analysis we interpret the clusters that are expected to be the trend subjects in the future. Lastly, limitation and further research suggestion are discussed as concluding remarks.

Analysis of Failure in Product Design Experiments by using Product Data Analytics (제품자료 분석을 통한 제품설계 실험 실패 요인 분석)

  • Do, Namchul
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.40 no.4
    • /
    • pp.366-374
    • /
    • 2014
  • This study assessed and analysed a result of a product design experiment through Product Data Analytics (PDA), to find reasons for failure of some projects in the experiment. PDA is a computer-based data analysis that uses Product Data Management (PDM) databases as its operational databases. The study examines 20 product design projects in the experiment, which are prepared to follow same product development process by using an identical PDM system. The design result in the PDM database is assessed and analysed by On-Line Analytical Processing (OLAP) and data mining tools in PDA. The assesment and analysis reveals the lateness in creation of 3D CAD models as the main reason of the failure.

Fast Visualization Technique and Visual Analytics System for Real-time Analyzing Stream Data (실시간 스트림 데이터 분석을 위한 시각화 가속 기술 및 시각적 분석 시스템)

  • Jeong, Seongmin;Yeon, Hanbyul;Jeong, Daekyo;Yoo, Sangbong;Kim, Seokyeon;Jang, Yun
    • Journal of the Korea Computer Graphics Society
    • /
    • v.22 no.4
    • /
    • pp.21-30
    • /
    • 2016
  • Risk management system should be able to support a decision making within a short time to analyze stream data in real time. Many analytical systems consist of CPU computation and disk based database. However, it is more problematic when existing system analyzes stream data in real time. Stream data has various production periods from 1ms to 1 hour, 1day. One sensor generates small data but tens of thousands sensors generate huge amount of data. If hundreds of thousands sensors generate 1GB data per second, CPU based system cannot analyze the data in real time. For this reason, it requires fast processing speed and scalability for analyze stream data. In this paper, we present a fast visualization technique that consists of hybrid database and GPU computation. In order to evaluate our technique, we demonstrate a visual analytics system that analyzes pipeline leak using sensor and tweet data.

Enhanced Regular Expression as a DGL for Generation of Synthetic Big Data

  • Kai, Cheng;Keisuke, Abe
    • Journal of Information Processing Systems
    • /
    • v.19 no.1
    • /
    • pp.1-16
    • /
    • 2023
  • Synthetic data generation is generally used in performance evaluation and function tests in data-intensive applications, as well as in various areas of data analytics, such as privacy-preserving data publishing (PPDP) and statistical disclosure limit/control. A significant amount of research has been conducted on tools and languages for data generation. However, existing tools and languages have been developed for specific purposes and are unsuitable for other domains. In this article, we propose a regular expression-based data generation language (DGL) for flexible big data generation. To achieve a general-purpose and powerful DGL, we enhanced the standard regular expressions to support the data domain, type/format inference, sequence and random generation, probability distributions, and resource reference. To efficiently implement the proposed language, we propose caching techniques for both the intermediate and database queries. We evaluated the proposed improvement experimentally.

In-Database Analytics : DB 내에서의 효율적인 정보 분석 방안

  • Jang, Seong-U
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2006.11a
    • /
    • pp.637-640
    • /
    • 2006
  • DB는 더 이상 단순 데이터 관리의 장소가 아니며, 실시간 정보 분석의 핵심 요소임 . 데이터 측면의 RTE 구현 방안. DB의 통합. 단순화, 표준화, 전문화, 정보 전달 체인의 효율화, 통합 DB 상에서의 정보 분석. 정보 분석업무의 개선, 단순분석의 실시간화,고급분석의 전문화

  • PDF

Trend Analysis of the Agricultural Industry Based on Text Analytics

  • Choi, Solsaem;Kim, Junhwan;Nam, Seungju
    • Agribusiness and Information Management
    • /
    • v.11 no.1
    • /
    • pp.1-9
    • /
    • 2019
  • This research intends to propose the methodology for analyzing the current trends of agriculture, which directly connects to the survival of the nation, and through this methodology, identify the agricultural trend of Korea. Based on the relationship between three types of data - policy reports, academic articles, and news articles - the research deducts the major issues stored by each data through LDA, the representative topic modeling method. By comparing and analyzing the LDA results deducted from each data source, this study intends to identify the implications regarding the current agricultural trends of Korea. This methodology can be utilized in analyzing industrial trends other than agricultural ones. To go on further, it can also be used as a basic resource for contemplation on potential areas in the future through insight on the current situation. database of the profitability of a total of 180 crop types by analyzing Rural Development Administration's survey of agricultural products income of 115 crop types, small land profitability index survey of 53 crop types, and Statistics Korea's survey of production costs of 12 crop types. Furthermore, this research presents the result and developmental process of a web-based crop introduction decision support system that provides overseas cases of new crop introduction support programs, as well as databases of outstanding business success cases of each crop type researched by agricultural institutions.