• Title/Summary/Keyword: Data analysis

Missing Data Imputation Using Permanent Traffic Counts on National Highways (일반국토 상시 교통량자료를 이용한 교통량 결측자료 추정)

  • Ha, Jeong-A;Park, Jae-Hwa;Kim, Seong-Hyeon
    • Journal of Korean Society of Transportation / v.25 no.1 s.94 / pp.121-132 / 2007
  • Until now, permanent traffic volumes on national highways have been counted by Automatic Vehicle Classification (AVC). When the counted data have missing items or errors, the data must be revised to remain statistically reliable. This study estimates corrected data using autoregression and seasonal AutoRegressive Integrated Moving Average (ARIMA) models. Verification of seasonal ARIMA shows that the longer the missing period, the greater the error. Autoregression yields better verification results than seasonal ARIMA. Traffic data are affected more by the present state than by past patterns. However, autoregression can be applied only where the data include similar neighboring patterns, and even then the data cannot be corrected when data are missing due to low quality or errors. Therefore, such data should be corrected using past patterns and seasonal ARIMA when the missing data occur over short periods.

A Review and Application of Library User Comments Data Analysis Tool: Focused on the LibQUAL+ Survey Comments (도서관 이용자 코멘트 데이터 분석도구 리뷰 및 적용: LibQUAL+ 설문 데이터를 중심으로)

  • Byun, Jeayeon;Shim, Wonsik
    • Journal of the Korean Society for Information Management / v.30 no.3 / pp.157-181 / 2013
  • Using user satisfaction surveys and LibQUAL+ instruments, libraries are increasingly gathering qualitative data such as verbatim user comments as well as quantitative data. Such qualitative data can be utilized as clues in establishing library service strategies: to better understand user issues, to identify areas for service improvement, and to prioritize user needs. For this, it is necessary to analyze user comments data and to apply results to the delivery of service and the library policies. This study is an attempt to investigate ways in which user comments data can be made useful in libraries. It identifies different methods of analyzing user comments data from LibQUAL+ surveys and compares qualitative data analysis software programs and taxonomies. It also presents the results of applying these tools to a subset of actual user comments data gathered from a recent LibQUAL+ survey at a major university library in Korea.

Buying Customer Classification in Automotive Corporation with Decision Tree (의사결정트리를 통한 자동차산업의 구매패턴분류)

  • Lee, Byoung-Yup;Park, Yong-Hoon;Yoo, Jae-Soo
    • The Journal of the Korea Contents Association / v.10 no.2 / pp.372-380 / 2010
  • Generally, data mining is the process of analyzing data from different perspectives and summarizing it into useful information that can be used to increase revenue, cut costs, or both. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases. Data mining is one of the fastest growing fields in the computer industry. As computer technology has improved, massive amounts of customer data have been stored in databases. Using this data, decision makers can extract useful information through data mining to make valuable plans. Data mining offers service providers great opportunities to get closer to customers. Data mining doesn't always require the latest technology, but it does require a magic eye that looks beyond the obvious to find and use the hidden knowledge to drive marketing strategies. The automotive market faces an explosion of customer data, but the rate of customer growth is slowing. Therefore, we need to determine which customers are profitable clients worth retaining. This paper builds a customer loyalty detection model and analyzes customer buying patterns in the automotive market through data mining, using a decision tree (Quinlan's C4.5) and basic statistical methods.
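As a rough illustration of the C4.5 approach named in the abstract above: C4.5 chooses split attributes by gain ratio (information gain divided by split information). The sketch below computes that criterion only; it is not the paper's model, and the customer records, attribute names, and function names are hypothetical.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def gain_ratio(rows, attribute, target):
    """C4.5 split criterion: information gain divided by split information."""
    labels = [row[target] for row in rows]
    groups = {}
    for row in rows:
        groups.setdefault(row[attribute], []).append(row[target])
    total = len(rows)
    # Weighted entropy of the target after splitting on the attribute.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    # Split information is the entropy of the attribute's own value distribution.
    split_info = entropy([row[attribute] for row in rows])
    return info_gain / split_info if split_info > 0 else 0.0

# Hypothetical customer records, invented for illustration (not from the paper).
customers = [
    {"segment": "fleet", "financing": "lease", "repeat_buyer": "yes"},
    {"segment": "fleet", "financing": "loan", "repeat_buyer": "yes"},
    {"segment": "retail", "financing": "lease", "repeat_buyer": "no"},
    {"segment": "retail", "financing": "loan", "repeat_buyer": "no"},
]

# Pick the attribute with the highest gain ratio as the root split.
best = max(["segment", "financing"],
           key=lambda a: gain_ratio(customers, a, "repeat_buyer"))
```

In this toy data, `segment` separates repeat buyers perfectly while `financing` carries no information, so `segment` would be chosen as the first split.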

Multi-dimension Categorical Data with Bayesian Network (베이지안 네트워크를 이용한 다차원 범주형 분석)

  • Kim, Yong-Chul
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.11 no.2 / pp.169-174 / 2018
  • In general, analysis of variance (ANOVA) for continuous data and the chi-square test for discrete data are used for statistical analysis of effects and associations. In multidimensional data, analysis of hierarchical structure is required, and a statistical linear model is adopted. The structure of the linear model requires the data to be normally distributed. Multidimensional categorical data analysis methods are used for analyzing causal relations, interactions, and correlations. In this paper, a Bayesian network model using probability distributions is proposed to shorten the analysis procedure and to analyze interactions and causal relationships in categorical data analysis.

Frequency and Social Network Analysis of the Bible Data using Big Data Analytics Tools R (빅데이터 분석도구 R을 이용한 성경 데이터의 빈도와 소셜 네트워크 분석)

  • Ban, ChaeHoon;Ha, JongSoo;Kim, Dong Hyun
    • Journal of the Korea Institute of Information and Communication Engineering / v.24 no.2 / pp.166-171 / 2020
  • Big data processing technology, which can store and analyze data and obtain new knowledge, has gained importance in many fields of society. Big data is emerging as an important issue in the field of information and communication technology, and interest in the technology continues to grow. R, a tool that can analyze big data, is a language and environment for statistics-based information analysis. In this paper, we use R to analyze Bible data, specifically the four Gospels of the New Testament. We collect the Bible data and filter it for analysis. R is used to investigate the frequency distribution of the text and to analyze the Bible through social network analysis, in which words from a sentence are paired and the relationships between words are analyzed for accurate data analysis.

An Analysis of Domestic Research Trend on Research Data Using Keyword Network Analysis (키워드 네트워크 분석을 이용한 연구데이터 관련 국내 연구 동향 분석)

  • Sangwoo Han
    • Journal of Korean Library and Information Science Society / v.54 no.4 / pp.393-414 / 2023
  • The goal of this study is to investigate domestic research trends in studies of research data. To achieve this goal, articles related to research data were collected from RISS. After data cleansing, 134 author keywords were extracted from a total of 58 articles, and keyword network analysis was performed. As a result, first, the number of studies related to research data in Korea is still only 58, indicating that many more related studies need to be conducted in the future. Second, among interdisciplinary studies, most research related to research data was concentrated in library and information science. Third, frequency analysis of author keywords identified 'research data management', 'research data sharing', 'data repository', and 'open science' as the most frequent keywords, showing that research on research data focuses on these topics. The keyword network analysis also showed that high-frequency keywords occupy a central position in degree centrality and betweenness centrality and serve as core keywords in related studies. Through the results of this study, we were able to identify recent trends related to research data and areas that require intensive research in the future.
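The keyword frequency and degree centrality steps described in the abstract above can be sketched as follows. This is a minimal illustration, assuming that two keywords are linked when they appear in the same article; the keyword lists are invented rather than taken from the study, and betweenness centrality is omitted for brevity.

```python
from collections import Counter
from itertools import combinations

# Hypothetical author-keyword lists, one per article (not the study's data).
articles = [
    ["research data management", "data repository", "open science"],
    ["research data management", "research data sharing"],
    ["research data sharing", "open science", "research data management"],
]

# Keyword frequency: number of articles in which each keyword appears.
frequency = Counter(k for kws in articles for k in set(kws))

# Co-occurrence network: connect every pair of keywords sharing an article.
neighbors = {}
for kws in articles:
    for a, b in combinations(sorted(set(kws)), 2):
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)

# Degree centrality: fraction of the other nodes a keyword is linked to.
n = len(neighbors)
degree_centrality = {k: len(v) / (n - 1) for k, v in neighbors.items()}
```

Here the most frequent keyword is also linked to every other keyword, which mirrors the abstract's observation that high-frequency keywords tend to sit at the center of the network.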

Metadata Analysis of Open Government Data by Formal Concept Analysis (형식 개념 분석을 통한 공공데이터의 메타데이터 분석)

  • Kim, Haklae
    • The Journal of the Korea Contents Association / v.18 no.1 / pp.305-313 / 2018
  • Public open data is a database or electronic file produced by a public agency or government. The government is opening public data through open data portals and individual agency websites. In practice, however, data users face limits in searching for and using the public data they want. In particular, it takes a great deal of effort and time to understand the characteristics of data sets and to combine different data sets. This study suggests the possibility of interlinking data sets by analyzing the common relationships among item names held by public data. The data sets are collected from the open data portal, and the item names included in the data sets are extracted. The extracted item names are organized into a formal context and formal concepts through formal concept analysis. Each formal concept has a list of data sets as its extent and a set of item names as its intent, and the common items in the intent are analyzed to determine the possibility of connecting data. The results derived from the formal concept analysis can be effectively applied to the semantic connection of public data, and to data standards and quality improvement for public data release.
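To make the extent/intent idea in the abstract above concrete, here is a small sketch of formal concept analysis over data sets and their item names. The data set names, item names, and helper functions are hypothetical, and the brute-force enumeration is only practical for tiny contexts; it is not the procedure used in the paper.

```python
from itertools import combinations

# Hypothetical formal context: data set -> item (column) names it contains.
context = {
    "bus_stops": {"name", "latitude", "longitude"},
    "parks": {"name", "latitude", "longitude", "area"},
    "libraries": {"name", "address"},
}

def common_items(datasets):
    """Intent: item names shared by every data set in the group."""
    sets = [context[d] for d in datasets]
    # By convention, the empty group shares all item names.
    return set.intersection(*sets) if sets else set().union(*context.values())

def datasets_with(items):
    """Extent: data sets that contain all of the given item names."""
    return {d for d, cols in context.items() if items <= cols}

def formal_concepts():
    """Enumerate (extent, intent) pairs by closing every subset of data sets.
    Exponential in the number of data sets, so suitable for small examples only."""
    concepts = set()
    names = list(context)
    for r in range(len(names) + 1):
        for group in combinations(names, r):
            intent = common_items(group)
            extent = datasets_with(intent)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts

concepts = formal_concepts()
```

A concept such as extent `{bus_stops, parks}` with intent `{name, latitude, longitude}` suggests that those two data sets could be linked on their shared location items.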

Big Data Analysis for Public Libraries Utilizing Big Data Platform: A Case Study of Daejeon Hanbat Library (도서관 빅데이터 플랫폼을 활용한 공공도서관 빅데이터 분석 연구: 대전한밭도서관을 중심으로)

  • On, Jeongmee;Park, Sung Hee
    • Journal of the Korean Society for Information Management / v.37 no.3 / pp.25-50 / 2020
  • Since big data platform services for public libraries began on January 1, 2016, libraries have used big data to improve their work performance. This paper examines use cases of library big data and attempts to draw up a plan to improve the effectiveness of library big data. For this purpose, we first examine the big data used on the library big data platform, the usage patterns of big data, and the services and policies derived from big data analysis. Next, the limitations and advantages of the library big data platform are examined by comparing data analysis in the integrated library management system (ILUS) currently used in public libraries with data analysis through the library big data platform. In the case analysis, the big data usage patterns found were program planning and execution, collection, and other types, and the services and policies were summarized as customizing bookshelf themes for book curation and reading promotion programs, increasing collection utilization, building a collection based on special topics, and disclosure of loan status data. In the comparative analysis, ILUS is specialized in statistical analysis at the level of the library collection, while the big data platform enables selective and flexible analysis according to various attributes (age, gender, region, time of loan, etc.), reducing analysis time. Finally, the limitations revealed in the case analysis and comparative analysis are summarized, and suggestions for improvement are presented.

Analysis of the Core Concepts of Middle School Informatics Textbook Using Big Data Analysis Techniques (빅데이터 분석 방법을 이용한 중학교 정보 교과서 핵심 개념 분석)

  • Woon, Daewoong;Choe, Hyunjong
    • Journal of Creative Information Culture / v.5 no.2 / pp.157-164 / 2019
  • Big data has recently been utilized and developed in many areas of our society. Big data analysis techniques are frequently used in politics, the economy, and society to grasp the meanings hidden in data. However, although big data analysis has been used in some case studies that analyze educational data, analysis of curricula and their direction is still inadequate. Therefore, this study aims to identify and analyze the core concepts of middle school informatics textbooks using big data analysis techniques. Text mining was used as the big data analysis method for the informatics textbooks. Through the core concepts identified using these techniques, we could confirm the concepts to be emphasized in the textbooks and the possibility of using big data in the field of education.

Partition-based Big Data Analysis and Visualization Algorithm (빅데이터 분석을 위한 파티션 기반 시각화 알고리즘)

  • Hong, Jun-Ki
    • The Journal of Bigdata / v.5 no.1 / pp.147-154 / 2020
  • Today, research is actively being conducted to derive meaningful results from big data. In this paper, we propose a partition-based big data analysis algorithm that can analyze the correlation between variables by setting the data areas of big data as partitions and calculating the representative values of each partition. In this paper, the analyzed visualization results are compared according to the partition size of a proposed partition-based big data analysis (PBDA) algorithm that can control the size of the partition. In order to verify the proposed PBDA algorithm, the big data of 'A' is analyzed, and meaningful results are obtained through the analysis of changes in sales volume of products according to changes in temperature and sales price.
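The partitioning idea in the abstract above can be sketched roughly as follows. The abstract does not specify how partitions or representative values are defined, so this sketch assumes fixed-width partitions along one variable and uses the mean as the representative value; the function name and the observations are hypothetical.

```python
from statistics import mean

def partition_means(points, partition_size):
    """Group (x, y) points into fixed-width partitions along x and return
    {partition_start: (mean_x, mean_y)} as each partition's representative value."""
    buckets = {}
    for x, y in points:
        start = (x // partition_size) * partition_size
        buckets.setdefault(start, []).append((x, y))
    return {
        start: (mean(p[0] for p in pts), mean(p[1] for p in pts))
        for start, pts in sorted(buckets.items())
    }

# Hypothetical (temperature, units sold) observations; not the paper's data.
observations = [(1, 10), (3, 14), (6, 20), (8, 24), (11, 30), (13, 34)]

# The partition size controls how much detail the summary keeps.
coarse = partition_means(observations, partition_size=10)
fine = partition_means(observations, partition_size=5)
```

Plotting the representative points instead of every raw observation is one way the partition size could trade detail for readability when comparing visualization results.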