• Title/Summary/Keyword: 탐색적 데이터 분석

Search Result 900, Processing Time 0.024 seconds

Exploring the Possibilities of Operation Data Use for Data-Driven Management in National R&D API Management System (데이터 기반 경영을 위한 국가R&D API관리시스템의 운영 데이터 활용 가능성 탐색)

  • Na, Hye-In;Lee, Jun-Young;Lee, Byeong-Hee;Choi, Kwang-Nam
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.4
    • /
    • pp.14-24
    • /
    • 2020
  • This paper aims to establish an efficient national R&D Application Programming Interface (API) management system for national R&D data-driven management and explore the possibility of using operational data according to the recent global data openness and sharing policy. In accordance with the trend of opening and sharing of national R&D data, we plan to improve management efficiency by analyzing operational data of the national R&D API service. For this purpose, we standardized the parameters for the national R&D APIs that were distributed separately by integrating the individual APIs to build a national R&D API management system. The results of this study revealed that the service call traffic of the national R&D API has shown 554.5% growth in the year as compared to the year 2015 when the measurement started. In addition, this paper also evaluations the possibility of using operational data through data preparation, analysis, and prediction based on service operations management data in the actual operation of national R&D integrated API management system.

Searching for Spatio-Temporal Pattern in EEG Signal with Hypernetwork (하이퍼네트워크를 이용한 EEG 신호의 시공간적 패턴 탐색)

  • Kim, Eun-Sol;Lee, Chung-Yeon;Lee, Ki-Seok Kevin;Lee, Hyun-Min;Kim, Joon-Shik;Zhang, Byoung-Tak
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06c
    • /
    • pp.331-334
    • /
    • 2011
  • 입력 데이터의 공통적인 특징을 찾아내는 방법은 기계 학습 분야의 중요한 분야이다. 일반적으로 입력 데이터의 형태적 패턴을 찾아내는 알고리즘들이 많이 연구되었는데, 최근에는 데이터의 입력 순서 또는 데이터 사이의 시간적 인과 관계와 같이 시간에 연관된 패턴을 찾는 방법이 주목을 받고 있다. 우리는 형태적 혹은 공간적 패턴 탐색에 뛰어난 성능을 보이는 하이퍼네트워크 모델을 확장하여 입력 데이터의 시공간적 패턴을 찾는 방법을 제시한다. 하이퍼네트워크는 두 개 이상의 변수를 하나의 엣지로 연결하여 문제공간을 탐색하는 모델로, 시간과 공간의 변수를 동시에 고려하여 데이터의 특성을 찾아내는 데에 적합하다. 이를 확인하기 위하여 사람의 EEG 신호를 분석하였는데, 시각적인 정보를 처리할 때와 언어적 정보를 처리할 때의 특징적인 패턴들을 찾았다.

A Researh for Consumer Dissatisfaction and Institutional Improvement of The Overseas Direct Purchase using Exploratory Data Analysis (탐색적 자료 분석(EDA) 기법을 활용한 온라인 해외직접구매에 대한 소비자 불만족 및 제도 개선 방안 연구)

  • Park, Seongwoo;Kang, Juyoung
    • The Journal of Bigdata
    • /
    • v.5 no.1
    • /
    • pp.41-54
    • /
    • 2020
  • With the recent expansion of Internet channels and the development of financial technology and information and communication technology, direct overseas purchases have expanded. Although direct overseas purchases dominate consumers in terms of price and scarcity by providing relatively low-priced products and products that are difficult to obtain in Korea, there is a higher chance of consumer dissatisfaction in terms of delivery, product, A/S and refund than domestic purchases. Therefore, this study analyzed consumer dissatisfaction caused by active overseas direct purchase and studied ways to improve problems with overseas direct purchase. As a research method, Several statistical data were collected from the Korea Consumer Agency(KCA), the Korea Customs Service(KCS) and the Korea International Trade Association(KITA) and analyzed using the Exploratory Data Analysis Technique (EDA). The analysis confirmed that consumers were not well aware of information about direct overseas purchases and that the type or degree of consumer complaints varied depending on the type of purchase. Therefore, this study suggests a direction for the revitalization of overseas direct purchases by using EDA to identify the overall status of overseas direct purchases and consumer dissatisfaction and to improve them.

A Study on the Scholarly Information and Data Requirements of Researchers for Data-Driven Research and Development (데이터 기반 R&D 지원을 위한 연구자의 학술정보 및 데이터 요구 분석 연구)

  • Seok-Hyoung Lee;Kangsandajung Lee;Jayhoon Kim;Hyejin Lee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.58 no.1
    • /
    • pp.255-283
    • /
    • 2024
  • In this study, as a preliminary research to effectively support data-driven R&D of researchers, we analyzed the academic information and data requirements for researchers to discover new types of academic information and datasets, and to propose directions for academic information services. To achieve the research objectives, we conducted an exploratory case study involving five researchers and administered an online survey among ScienceON users to glean insights into data-driven R&D behaviors and information/data requirements. As a result, researchers relatively referred to academic papers, datasets and software information from academic papers or conference materials. Moreover, the methods and pathways for acquiring data, as well as the types of data, varied across different subject areas. Researchers often faced challenges in data-driven R&D due to difficulties in locating and accessing necessary datasets or software such as learning models. Therefore it has been analyzed that for future support of data-driven R&D, there is a need to systematically construct datasets by subject. Additionally, it is considered necessary to extract and summarize dataset and related software information in conjunction with academic papers.

A Comparison of the Search Based Testing Algorithm with Metrics (메트릭에 따른 탐색 기반 테스팅 알고리즘 비교)

  • Choi, HyunJae;Chae, HeungSeok
    • Journal of KIISE
    • /
    • v.43 no.4
    • /
    • pp.480-488
    • /
    • 2016
  • Search-Based Software Testing (SBST) is an effective technique for test data generation on large domain size. Although the performance of SBST seems to be affected by the structural characteristics of Software Under Test (SUT), studies for the comparison of SBST techniques considering structural characteristics are rare. In addition to the comparison study for SBST, we analyzed the best algorithm with different structural characteristics of SUT. For the generalization of experimental results, we automatically generated 19,800 SUTs by combining four metrics, which are expected to affect the performance of SBST. According to the experiment results, Genetic algorithm showed the best performance for SUTs with high complexity and test data evaluation with count ${\leq}20,000$. On the other hand, the genetic simulated annealing and the simulated annealing showed relatively better performance for SUTs with high complexity and test data evaluation with count ${\geq}50,000$. Genetic simulated annealing, simulated annealing and hill climbing showed better performance for SUTs with low complexity.

A Sliding Window Technique for Open Data Mining over Data Streams (개방 데이터 마이닝에 효율적인 이동 윈도우 기법)

  • Chang Joong-Hyuk;Lee Won-Suk
    • The KIPS Transactions:PartD
    • /
    • v.12D no.3 s.99
    • /
    • pp.335-344
    • /
    • 2005
  • Recently open data mining methods focusing on a data stream that is a massive unbounded sequence of data elements continuously generated at a rapid rate are proposed actively. Knowledge embedded in a data stream is likely to be changed over time. Therefore, identifying the recent change of the knowledge quickly can provide valuable information for the analysis of the data stream. This paper proposes a sliding window technique for finding recently frequent itemsets, which is applied efficiently in open data mining. In the proposed technique, its memory usage is kept in a small space by delayed-insertion and pruning operations, and its mining result can be found in a short time since the data elements within its target range are not traversed repeatedly. Moreover, the proposed technique focused in the recent data elements, so that it can catch out the recent change of the data stream.

Analyzing and Visualizing the Intellectual Structure of Data Science (데이터사이언스 연구의 지적 구조 분석 및 시각화)

  • Park, Hyoungjoo
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.7
    • /
    • pp.18-29
    • /
    • 2022
  • The purpose of this exploratory study is to examine the intellectual structure of data science. For this purpose, this research examined a total of 17,997 bibliographies on data science indexed in Web of Science(WoS) of Clarivate Analytics from 2012 to 2021. This research applied methods such as descriptive analysis, citation analysis, co-author network analysis, co-occurrence network analysis, bibliographic coupling analysis, and co-citation analysis. This research contributes to finding the research directions of future data science topics.

Exploration of Types and Context of Errors in the Weather Data Analysis Process (기상 데이터 분석 과정에서 나타나는 오류의 유형과 맥락 탐색)

  • Seok-Young Hong
    • Journal of the Korean Society of Earth Science Education
    • /
    • v.17 no.2
    • /
    • pp.153-167
    • /
    • 2024
  • This study explored the errors and context occurred during high school students' data analysis processes. For the study, 222 data inquiry reports produced by 74 students from 'A' High School were collected and explored the detailed error types in the data analysis processes such as data collection and preprocessing, data representation, and data interpretation. The results of study found that in the data interpretation process, students had a somewhat insufficient understanding of seasonal variations and periodic patterns about weather elements. And, various types of errors were identified in the data representation process, such as basic unit in graphs, legend settings, trend lines. The causes of these errors are the feature of authoring tools, misconceptions related to weather elements, and cognitive biases, etc. Based on the study's results, educational implications for big data education, a significant topic in future science education, were derived. And related follow-up studies were suggested.

An Instrument for Measuring Take-out Food Safety Perception (테이크아웃 음식의 안전에 대한 고객인식도 측정을 위한 척도에 관한 연구)

  • Kim, Hak-Seon
    • Culinary science and hospitality research
    • /
    • v.18 no.2
    • /
    • pp.82-90
    • /
    • 2012
  • This study was conducted to evaluate a take-out food safety perception instrument that could be used by foodservice establishments. A total of 324 responses was collected via online survey, and 299 responses (92.3%) were used for the statistical analysis. Data was randomly split into two groups. Exploratory Factor Analysis (EFA) was performed on the first split-half sample (n=150) to identify a factor structure using standard principal component analysis. EFA revealed three dimensions, titled "Consumer food safety perception," "Take-out food handling," and "Elements impacting on purchase decisions." Confirmatory Factor Analysis (CFA) was performed on the remaining half sample (n=149) using Structural Equation Modeling (SEM). CFA revealed acceptable absolute model fits for three dimensions and excellent comparative model fits for the instrument. These findings propose standardized measures that can be useful in assessing the take-out food safety perception.

  • PDF

Exploratory research based on big data for Improving the revisit rate of foreign tourists and invigorating consumption (외국인 관광객 재방문율 향상과 소비 활성화를 위한 빅데이터 기반의 탐색적 연구)

  • An, Sung-Hyun;Park, Seong-Taek
    • Journal of Industrial Convergence
    • /
    • v.18 no.6
    • /
    • pp.19-25
    • /
    • 2020
  • Big data analytics are indispensable today in various industries and public sectors. Therefore, in this study, we will utilize big data analysis to search for improvement plans for domestic tourism services using the LDA analysis method. In particular, we have tried an exploratory approach that can improve tourist satisfaction, which can improve revisit and service, especially in Seoul, which has the largest number of foreign tourists. In this study, we collected and analyzed statistical data of Seoul City and Korea Tourism Organization and Internet information such as SNS via R. And we utilized text mining methods including LDA. As a result of the analysis, one of the purposes of visiting South Korea by foreigners was gastronomic tourism. We will try to derive measures to improve the quality of services centered on gastronomic tourism.