• Title/Summary/Keyword: BIG

Search Result 11,558, Processing Time 0.039 seconds

Big Data Smoothing and Outlier Removal for Patent Big Data Analysis

  • Choi, JunHyeog;Jun, Sunghae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.8
    • /
    • pp.77-84
    • /
    • 2016
  • In general statistical analysis, we need to make a normal assumption. If this assumption is not satisfied, we cannot expect a good result of statistical data analysis. Most of statistical methods processing the outlier and noise also need to the assumption. But the assumption is not satisfied in big data because of its large volume and heterogeneity. So we propose a methodology based on box-plot and data smoothing for controling outlier and noise in big data analysis. The proposed methodology is not dependent upon the normal assumption. In addition, we select patent documents as target domain of big data because patent big data analysis is a important issue in management of technology. We analyze patent documents using big data learning methods for technology analysis. The collected patent data from patent databases on the world are preprocessed and analyzed by text mining and statistics. But the most researches about patent big data analysis did not consider the outlier and noise problem. This problem decreases the accuracy of prediction and increases the variance of parameter estimation. In this paper, we check the existence of the outlier and noise in patent big data. To know whether the outlier is or not in the patent big data, we use box-plot and smoothing visualization. We use the patent documents related to three dimensional printing technology to illustrate how the proposed methodology can be used for finding the existence of noise in the searched patent big data.

A Study on Policies to Revitalize the Public Big Data in Seoul (서울시 공공빅데이터 활성화 방안 연구)

  • Choi, Bong;Yun, Jongjin;Um, Taehyee
    • Knowledge Management Research
    • /
    • v.20 no.3
    • /
    • pp.73-89
    • /
    • 2019
  • The purpose of this study is to investigate the current state of public Big Data in Seoul and suggest policy directions for the revitalization of Seoul's public Big Data. Big Data is perceived as innovation resources under the era of 4th Industrial revolution and Data economy. Especially, public Big Data serves a significant role in terms of universal access for citizens, startup, and enterprise compared with the private sector. Seoul reorganized a substructure of government's focus on Big Data and established organizations such as Big Data Campus and Urban Data Science Lab. Although the number of public open Data has increased in Seoul, there exists not much Data with characteristics similar to Big Data, such as volume, velocity, and value. In order to present the direction of Big Data policy in Seoul, we investigate the current status of Big Data Campus and Urban Data Science Lab operated by Seoul City. Considering the results of this study, we have proposed several directions that Seoul can use in establishing big data related strategies.

A Study on the Factors Affecting the Decision Making Satisfaction and User Behavior of Big Data Characteristics (빅데이터 특성이 의사결정 만족도와 이용행동에 영향을 미치는 요인에 관한 연구)

  • Kim, Byung-Gon;Yoon, Il-Ki;Kim, Ki-Won
    • Journal of Information Technology Applications and Management
    • /
    • v.28 no.1
    • /
    • pp.13-31
    • /
    • 2021
  • The purpose of this study is to find the factors that influence big data characteristics on decision satisfaction and utilization behavior, analyze the extent of their influence, and derive differences from existing studies. To summarize the results of this study, First, the study found that among the three categories that classify the characteristics of big data, qualitative attributes such as representation, purpose, interpretability, and innovation in the value innovation category greatly enhance decision confidence and decision effectiveness of decision makers who make decisions using big data. Second, the study found that, among the three categories that classify the characteristics of big data, the individuality properties belonging to the social impact category improve decision confidence and decision effectiveness of decision makers who use big data to make decisions. However, collectivity and bias characteristics have been shown to increase decision confidence, but not the effectiveness of decision making. Third, the study found that among the three categories that classify the characteristics of big data, the attributes of inclusiveness, realism, etc. in the integrity category greatly improve decision confidence and decision effectiveness of decision makers who make decisions using big data. Fourth, it was analyzed that using big data in organizational decision making has a positive impact on the behavior of big data users when the decision-making confidence and finally, decision-making effect of decision-makers increases.

Impact of Big Data Analytics on Indian E-Tailing from SCM to TCS

  • Avinash BM;Divakar GM;Rajasekhara Mouly Potluri;Megha B
    • Journal of Distribution Science
    • /
    • v.22 no.8
    • /
    • pp.65-76
    • /
    • 2024
  • Purpose: The study aims to recognize the relationship between big data analytics capabilities, big data analytics process, and perceived business performance from supply chain management to total customer satisfaction. Research design, data and methodology: The study followed a quantitative approach with a descriptive design. The data was collected from leading e-commerce companies in India using a structured questionnaire, and the data was coded and decoded using MS Excel, SPSS, and R language. It was further tested using Cronbach's alpha, KMO, and Bartlett's test for reliability and internal consistency. Results: The results showed that the big data analytics process acts as a robust mediator between big data analytics capabilities and perceived business performance. The 'direct, indirect and total effect of the model' and 'PLS-SEM model' showed that the big data analytics process directly impacts business performance. Conclusions: A complete indirect relationship exists between big data analytics capabilities and perceived business performance through the big data analytics process. The research contributesto e-commerce companies' understanding of the importance of big data analytics capabilities and processes.

Big Data in Smart Tourism: A Perspective Article

  • Park, Sangwon
    • Journal of Smart Tourism
    • /
    • v.1 no.3
    • /
    • pp.3-5
    • /
    • 2021
  • The advancement of Information Communication Technology has provided tourism researchers with a golden opportunity to access big data, which plays a critical role in smart tourism. Recognizing the current issue, this paper discusses the evolution of the literature on tourism big data focusing on conceptual understanding of and types of big data, and insights from big data analytics. Indeed, this article provides important research agenda for future tourism researchers who would like to conduct academic research about big data and smart tourism.

Study on the Direction of Universal Big Data and Big Data Education-Based on the Survey of Big Data Experts (보편적 빅데이터와 빅데이터 교육의 방향성 연구 - 빅데이터 전문가의 인식 조사를 기반으로)

  • Park, Youn-Soo;Lee, Su-Jin
    • Journal of The Korean Association of Information Education
    • /
    • v.24 no.2
    • /
    • pp.201-214
    • /
    • 2020
  • Big data is gradually expanding in diverse fields, with changing the data-related legislation. Moreover it would be interest in big data education. However, it requires a high level of knowledge and skills in order to utilize Big Data and it takes a long time for education spends a lot of money for training. We study that in order to define Universal Big Data used to the industrial field in a wide range. As a result, we make the paradigm for Big Data education for college students. We survey to the professional the Big Data definition and the Big Data perception. According to the survey, the Big Data related-professional recognize that is a wider definition than Computer Science Big Data is. Also they recognize that the Big Data Processing dose not be required Big Data Processing Frameworks or High Performance Computers. This means that in order to educate Big Data, it is necessary to focus on the analysis methods and application methods of Universal Big Data rather than computer science (Engineering) knowledge and skills. Based on the our research, we propose the Universal Big Data education on the new paradigm.

Providing Service Model Based on Concept and Requirements of Spatial Big Data (공간 빅데이터의 개념 및 요구사항을 반영한 서비스 제공 방안)

  • Kim, Geun Han;Jun, Chul Min;Jung, Hui Cheul;Yoon, Jeong Ho
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.24 no.4
    • /
    • pp.89-96
    • /
    • 2016
  • By reviewing preceding studies of big data and spatial big data, spatial big data was defined as one part of big data, which spatialize location information and systematize time series data. Spatial big data, as one part of big data, should not be separated with big data and application methods within the system is to be examined. Therefore in this study, services that spatial big data is required to provide were suggested. Spatial big data must be available of various spatial analysis and is in need of services that considers present and future spatial information. Not only should spatial big data be able to detect time series changes in location, but also analyze various type of big data using attribute information of spatial data. To successfully provide the requirements of spatial big data and link various type of big data with spatial big data, methods of forming sample points and extracting attribute information were proposed in this study. The increasing application of spatial information related to big data is expected to attribute to the development of spatial data industry and technological advancement.

Study for Spatial Big Data Concept and System Building (공간빅데이터 개념 및 체계 구축방안 연구)

  • Ahn, Jong Wook;Yi, Mi Sook;Shin, Dong Bin
    • Spatial Information Research
    • /
    • v.21 no.5
    • /
    • pp.43-51
    • /
    • 2013
  • In this study, the concept of spatial big data and effective ways to build a spatial big data system are presented. Big Data is defined as 3V(volume, variety, velocity). Spatial big data is the basis for evolution from 3V's big data to 6V's big data(volume, variety, velocity, value, veracity, visualization). In order to build an effective spatial big data, spatial big data system building should be promoted. In addition, spatial big data system should be performed a national spatial information base, convergence platform, service providers, and providers as a factor of production. The spatial big data system is made up of infrastructure(hardware), technology (software), spatial big data(data), human resources, law etc. The goals for the spatial big data system build are spatial-based policy support, spatial big data platform based industries enable, spatial big data fusion-based composition, spatial active in social issues. Strategies for achieving the objectives are build the government-wide cooperation, new industry creation and activation, and spatial big data platform built, technologies competitiveness of spatial big data.

An Empirical Study on the Influencing Factors for Big Data Intented Adoption: Focusing on the Strategic Value Recognition and TOE Framework (빅데이터 도입의도에 미치는 영향요인에 관한 연구: 전략적 가치인식과 TOE(Technology Organizational Environment) Framework을 중심으로)

  • Ka, Hoi-Kwang;Kim, Jin-soo
    • Asia pacific journal of information systems
    • /
    • v.24 no.4
    • /
    • pp.443-472
    • /
    • 2014
  • To survive in the global competitive environment, enterprise should be able to solve various problems and find the optimal solution effectively. The big-data is being perceived as a tool for solving enterprise problems effectively and improve competitiveness with its' various problem solving and advanced predictive capabilities. Due to its remarkable performance, the implementation of big data systems has been increased through many enterprises around the world. Currently the big-data is called the 'crude oil' of the 21st century and is expected to provide competitive superiority. The reason why the big data is in the limelight is because while the conventional IT technology has been falling behind much in its possibility level, the big data has gone beyond the technological possibility and has the advantage of being utilized to create new values such as business optimization and new business creation through analysis of big data. Since the big data has been introduced too hastily without considering the strategic value deduction and achievement obtained through the big data, however, there are difficulties in the strategic value deduction and data utilization that can be gained through big data. According to the survey result of 1,800 IT professionals from 18 countries world wide, the percentage of the corporation where the big data is being utilized well was only 28%, and many of them responded that they are having difficulties in strategic value deduction and operation through big data. The strategic value should be deducted and environment phases like corporate internal and external related regulations and systems should be considered in order to introduce big data, but these factors were not well being reflected. The cause of the failure turned out to be that the big data was introduced by way of the IT trend and surrounding environment, but it was introduced hastily in the situation where the introduction condition was not well arranged. The strategic value which can be obtained through big data should be clearly comprehended and systematic environment analysis is very important about applicability in order to introduce successful big data, but since the corporations are considering only partial achievements and technological phases that can be obtained through big data, the successful introduction is not being made. Previous study shows that most of big data researches are focused on big data concept, cases, and practical suggestions without empirical study. The purpose of this study is provide the theoretically and practically useful implementation framework and strategies of big data systems with conducting comprehensive literature review, finding influencing factors for successful big data systems implementation, and analysing empirical models. To do this, the elements which can affect the introduction intention of big data were deducted by reviewing the information system's successful factors, strategic value perception factors, considering factors for the information system introduction environment and big data related literature in order to comprehend the effect factors when the corporations introduce big data and structured questionnaire was developed. After that, the questionnaire and the statistical analysis were performed with the people in charge of the big data inside the corporations as objects. According to the statistical analysis, it was shown that the strategic value perception factor and the inside-industry environmental factors affected positively the introduction intention of big data. The theoretical, practical and political implications deducted from the study result is as follows. The frist theoretical implication is that this study has proposed theoretically effect factors which affect the introduction intention of big data by reviewing the strategic value perception and environmental factors and big data related precedent studies and proposed the variables and measurement items which were analyzed empirically and verified. This study has meaning in that it has measured the influence of each variable on the introduction intention by verifying the relationship between the independent variables and the dependent variables through structural equation model. Second, this study has defined the independent variable(strategic value perception, environment), dependent variable(introduction intention) and regulatory variable(type of business and corporate size) about big data introduction intention and has arranged theoretical base in studying big data related field empirically afterwards by developing measurement items which has obtained credibility and validity. Third, by verifying the strategic value perception factors and the significance about environmental factors proposed in the conventional precedent studies, this study will be able to give aid to the afterwards empirical study about effect factors on big data introduction. The operational implications are as follows. First, this study has arranged the empirical study base about big data field by investigating the cause and effect relationship about the influence of the strategic value perception factor and environmental factor on the introduction intention and proposing the measurement items which has obtained the justice, credibility and validity etc. Second, this study has proposed the study result that the strategic value perception factor affects positively the big data introduction intention and it has meaning in that the importance of the strategic value perception has been presented. Third, the study has proposed that the corporation which introduces big data should consider the big data introduction through precise analysis about industry's internal environment. Fourth, this study has proposed the point that the size and type of business of the corresponding corporation should be considered in introducing the big data by presenting the difference of the effect factors of big data introduction depending on the size and type of business of the corporation. The political implications are as follows. First, variety of utilization of big data is needed. The strategic value that big data has can be accessed in various ways in the product, service field, productivity field, decision making field etc and can be utilized in all the business fields based on that, but the parts that main domestic corporations are considering are limited to some parts of the products and service fields. Accordingly, in introducing big data, reviewing the phase about utilization in detail and design the big data system in a form which can maximize the utilization rate will be necessary. Second, the study is proposing the burden of the cost of the system introduction, difficulty in utilization in the system and lack of credibility in the supply corporations etc in the big data introduction phase by corporations. Since the world IT corporations are predominating the big data market, the big data introduction of domestic corporations can not but to be dependent on the foreign corporations. When considering that fact, that our country does not have global IT corporations even though it is world powerful IT country, the big data can be thought to be the chance to rear world level corporations. Accordingly, the government shall need to rear star corporations through active political support. Third, the corporations' internal and external professional manpower for the big data introduction and operation lacks. Big data is a system where how valuable data can be deducted utilizing data is more important than the system construction itself. For this, talent who are equipped with academic knowledge and experience in various fields like IT, statistics, strategy and management etc and manpower training should be implemented through systematic education for these talents. This study has arranged theoretical base for empirical studies about big data related fields by comprehending the main variables which affect the big data introduction intention and verifying them and is expected to be able to propose useful guidelines for the corporations and policy developers who are considering big data implementationby analyzing empirically that theoretical base.

Big data and statistics (빅데이터와 통계학)

  • Kim, Yongdai;Cho, Kwang Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.5
    • /
    • pp.959-974
    • /
    • 2013
  • We investigate the roles of statistics and statisticians in the big data era. Definition and application areas of big data are reviewed and statistical characteristics of big data and their meanings are discussed. Various statistical methodologies applicable to big data analysis are illustrated, and two real big data projects are explained.