• Title/Summary/Keyword: Big data collection

Implementation of marine static data collection and DB storage algorithms (해양 정적 데이터 수집 및 DB 저장 알고리즘 구현)

  • Seung-Hwan Choi;Gi-Jo Park;Ki-Sook Chung;Woo-Sug Jung;Kyung-Seok Kim
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.23 no.2 / pp.95-101 / 2023
  • Globally, the use and management of marine spatial information is growing in importance, and analysis of such data is emerging as a major driving force for R&D. In Korea, collecting marine data from the past to the present and extracting its value is expected to play an important role in the country's future scientific development. In particular, marine static data constitutes a very large database, and because collection is costly and requires high-level observation techniques, the collected data must be stored without loss. In addition, the Disaster Safety Intelligence Convergence Center's "Marine Digital Twin Establishment and Utilization-Based Technology Research" task requires the collection and analysis of marine data, so this paper surveys the current status of static marine data and presents a series of algorithms that collect the data and store it in a database.
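
The abstract does not describe the storage algorithm itself, so the following is only a minimal sketch of a "store without loss or duplication" step. The table name `marine_static`, its columns, and the use of SQLite are all assumptions made for illustration, not the authors' design.

```python
import sqlite3


def store_records(conn, records):
    """Insert (station, observed_at, depth_m) rows and return the row count.

    Hypothetical schema: the composite primary key makes repeated
    collection runs idempotent, so re-fetched rows are not duplicated.
    """
    conn.execute(
        """CREATE TABLE IF NOT EXISTS marine_static (
               station     TEXT NOT NULL,
               observed_at TEXT NOT NULL,
               depth_m     REAL NOT NULL,
               PRIMARY KEY (station, observed_at)
           )"""
    )
    # The connection context manager commits on success, rolls back on error.
    with conn:
        conn.executemany(
            "INSERT OR IGNORE INTO marine_static VALUES (?, ?, ?)", records
        )
    return conn.execute("SELECT COUNT(*) FROM marine_static").fetchone()[0]
```

Calling `store_records` twice with the same rows leaves the table unchanged, which is one simple way to make a periodic collection job safe to re-run.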

A Study on Environmental Factor Recommendation Technology based on Deep Learning for Digital Agriculture (디지털 농업을 위한 딥러닝 기반의 환경 인자 추천 기술 연구)

  • Han-Jin Cho
    • Smart Media Journal / v.12 no.5 / pp.65-72 / 2023
  • Smart farming means creating new value in various agriculture-related fields, including not only agricultural production but also distribution and consumption, through the convergence of agriculture and ICT. In Korea, rental smart farms are being created to spread smart agriculture, and a smart farm big data platform has been established to promote data collection and utilization. The country is also pushing for the digital transformation of agricultural product distribution from production areas to consumption areas, for example by expanding smart APCs, operating online exchanges, and digitizing wholesale market transaction information. Although agricultural data is thus generated from various sources, each with its own characteristics, it is used only in services built on statistics and standardized data. This is because data collection is distributed across production, distribution, and consumption, and it is difficult to collect and process various types of data from various sources. Therefore, in this paper, we analyze the current state of domestic agricultural data collection and sharing for digital agriculture and propose a data collection and linkage method for artificial intelligence services. Using the data from the proposed method, we then propose a deep learning-based environmental factor recommendation method.

The Effect of Data 3 on the Utilization of Medical Big Data for Early Detection of Dementia (데이터 3법이 치매 조기 예측을 위한 의료 빅데이터 활용에 미치는 영향 연구)

  • Kim, Hyejin
    • Journal of Digital Convergence / v.18 no.5 / pp.305-315 / 2020
  • As the incidence and prevalence of dementia increase with the aging population, so does the social burden, which calls for special emphasis on early diagnosis. Efforts are being made to prevent dementia and detect it early, but with current diagnostic measures these efforts appear futile. As a solution, it is crucial to integrate and standardize healthcare big data and to analyze each index. To increase the use of large databases, the Korean National Assembly passed the Data 3 Act, which focuses on open access to and sharing of databases, but follow-up legislation is needed for safer utilization. In this study, we identified a number of foreign policies through a review of prior research on the topic, leading to specific enforcement ordinances tailored to the Data 3 Act for safe access to and utilization of databases. We also aimed to establish a secure process for data collection and disposal, as well as governance at the national level, to ensure the safe utilization of healthcare big data.

A Study on Perception of Educational Big Data Utilization and Current State of Data Utilization of Officials of the Provicial Office of Education (교육청 공무원의 데이터 활용실태 및 교육 빅데이터 활용에 관한 인식 연구 - A도교육청을 중심으로)

  • Shin, Jong-Ho
    • Journal of Digital Convergence / v.18 no.9 / pp.39-47 / 2020
  • This study investigated the actual state of data utilization and the perception of big data utilization among officials of a provincial Office of Education in order to derive implications for establishing big data utilization strategies. An online survey of 440 people was conducted. The results showed that the types and sources of data used for work varied, and that data collection and refining were the most difficult parts. The infrastructure for data utilization was insufficient and was identified as the most necessary factor. The purpose of big data utilization was related to the establishment of the educational policy agenda.

Case Study on Big Data Sampling Population Collection Method Errors in Service Business (서비스 비즈니스의 빅데이터 모집단 산정방식 오류에 관한 사례연구)

  • Ahn, Jinho;Lee, Jeungsun
    • Journal of Service Research and Studies / v.10 no.2 / pp.1-15 / 2020
  • As big data has become more important socially and economically in recent years, many problems have arisen from its indiscriminate application. Big data is valuable because it can reveal meaningful information hidden within the data. In particular, to predict customer behavior patterns and experiences, structured data extracted from Customer Relationship Management (CRM) systems or unstructured data extracted from social network services (SNS) may be defined as the population for interpretation, and many errors can occur in the process. However, those errors are usually overlooked. Apart from the data analysis techniques themselves, some data that should be considered in the analysis are not included in the population, so no meaningful patterns appear. Therefore, this study presents the measurement and interpretation of the data generated when the cause of error in the population setting is a strong relationship and interaction between people, or between a person and an object. In other words, by comparing various cases of big data application, it shows that when the relationship and interaction are strong, it is important to include in the population data collected from the perspective of user experience and ethnography; from this comparison the meaning is derived and the best direction is suggested.

A study on the ordering of PIM family similarity measures without marginal probability (주변 확률을 고려하지 않는 확률적 흥미도 측도 계열 유사성 측도의 서열화)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society / v.26 no.2 / pp.367-376 / 2015
  • Today, big data has become a hot keyword; it may be defined as a collection of data sets so huge and complex that it becomes difficult to process by traditional methods. Clustering identifies the information in a big database by assigning a set of objects to clusters so that the objects in the same cluster are more similar to each other than to those in other clusters. The similarity measures used in cluster analysis may be classified into various types depending on the nature of the data. In this paper, we computed upper and lower limits for similarity measures based on probabilistic interestingness measures without marginal probability, such as the Yule I and II, Michael, Digby, Baulieu, and Dispersion measures, and we compared these measures using real data and a simulation experiment. According to Warrens (2008), coefficients with the same quantities in the numerator and denominator that are bounded and close to each other in the ordering are likely to be more similar. Thus, results on bounds provide a means of classifying various measures. Also, knowing which coefficients are similar provides insight into the stability of a given algorithm.
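
As a concrete illustration of bounded similarity coefficients of this kind, the sketch below computes Yule's Q and Yule's Y from the four cell counts of a 2x2 co-occurrence table. It assumes that "Yule I and II" in the abstract refer to these two classical coefficients; the abstract itself does not give the formulas, and the function names are illustrative.

```python
import math


def yule_q(a, b, c, d):
    """Yule's Q = (ad - bc) / (ad + bc); ranges over [-1, 1]."""
    den = a * d + b * c
    if den == 0:
        raise ValueError("Q is undefined when ad + bc == 0")
    return (a * d - b * c) / den


def yule_y(a, b, c, d):
    """Yule's Y = (sqrt(ad) - sqrt(bc)) / (sqrt(ad) + sqrt(bc)); in [-1, 1]."""
    root_ad, root_bc = math.sqrt(a * d), math.sqrt(b * c)
    if root_ad + root_bc == 0:
        raise ValueError("Y is undefined when ad and bc are both 0")
    return (root_ad - root_bc) / (root_ad + root_bc)
```

For the table a=9, b=1, c=1, d=1, Q is 0.8 and Y is 0.5. Both share the sign of ad - bc and |Y| never exceeds |Q|, which is the kind of ordering relation that bounds on such measures make explicit.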

A Prediction System for Server Performance Management (서버 성능 관리를 위한 장애 예측 시스템)

  • Lim, Bock-Chool;Kim, Soon-Gohn
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.11 no.6 / pp.684-690 / 2018
  • In a society where big data, the analysis of collected information, is recognized as one of the core technologies, the intelligent evolution of society appears to be oriented toward optimized value creation based on prediction techniques. If we take advantage of big data technologies for the large volume and variety of data generated during system operation, it will be possible to support stable operation and prevent faults and failures. In this paper, we suggest an environment for collecting and analyzing big data, and from the data collected and analyzed through server performance monitoring we derive a time series prediction model for predicting failures. The failure prediction model can help server operators keep IT systems running stably.
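
The abstract does not say which time series model was derived. As a stand-in, the sketch below uses simple exponential smoothing over monitored samples and raises a warning when the one-step-ahead forecast crosses a threshold. The metric (CPU utilization in percent), the threshold, and the smoothing factor `alpha` are all assumptions for illustration.

```python
def exp_smooth_forecast(values, alpha=0.5):
    """One-step-ahead forecast by simple exponential smoothing."""
    if not values:
        raise ValueError("need at least one sample")
    level = values[0]
    for v in values[1:]:
        # Blend the newest observation with the running level.
        level = alpha * v + (1 - alpha) * level
    return level


def failure_warning(cpu_samples, threshold=90.0, alpha=0.5):
    """True when the forecast utilization reaches the (assumed) threshold."""
    return exp_smooth_forecast(cpu_samples, alpha) >= threshold
```

With samples 80, 90, 100 and `alpha=0.5`, the forecast is 92.5, so `failure_warning` returns `True`; a flat series at 50 does not trigger it.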

A Review on the Management of Water Resources Information based on Big Data and Cloud Computing (빅 데이터와 클라우드 컴퓨팅 기반의 수자원 정보 관리 방안에 관한 검토)

  • Kim, Yonsoo;Kang, Narae;Jung, Jaewon;Kim, Hung Soo
    • Journal of Wetlands Research / v.18 no.1 / pp.100-112 / 2016
  • Recently, the direction of water resources policy has been changing from conventional planning for water use and flood control to sustainable water resources management that improves the quality of life. As a result, water resources information, including its collection, management, and supply, is becoming an important concern for policy decision making. Until now, structured data have been analyzed according to the purpose of providing information on water resources. However, the recent trend is toward big data and cloud computing, which can create new value by linking unstructured data with structured data, and the management of water resources information is changing accordingly. In line with this paradigm change in information management, this study suggests an application of big data and cloud computing in the water resources field for the efficient management and use of water. We examined the current state and direction of policy related to water resources information in Korea and another country. Then we connected volume, velocity, and variety, the three basic components of big data, with veracity and value, which have been added more recently. We also discussed rapid and flexible countermeasures, via cloud computing, to changes in consumers and to the growth of big data related to water resources. In the future, the management of water resources information should move in a direction that enhances the value (Value) of water resources information through big data and cloud computing, based on the amount of data (Volume), the speed of data processing (Velocity), and the number of types of data (Variety). It should also enhance that value through the fusion of water with other areas and through the production of accurate information (Veracity) required for water management, disaster prevention, and the protection of life and property.

Analysis of Smart Factory Research Trends Based on Big Data Analysis (빅데이터 분석을 활용한 스마트팩토리 연구 동향 분석)

  • Lee, Eun-Ji;Cho, Chul-Ho
    • Journal of Korean Society for Quality Management / v.49 no.4 / pp.551-567 / 2021
  • Purpose: The purpose of this paper is to present implications by analyzing research trends on smart factories through text analysis and visual analysis (comprehensive, by field, and by year), which are big data analyses, using data collected from previous studies on smart factories. Methods: To collect the analysis data, deep learning was used in the integrated search on the Academic Research Information Service (www.riss.kr) with "SMART FACTORY" and "Smart Factory" as search terms; the titles and Korean abstracts were scraped from the retrieved papers and organized in Excel. In the final step, the 739 resulting papers were analyzed with the R x64 4.0.2 program and RStudio, using text mining, one of the big data analysis techniques, and a word cloud for visualization. Results: Smart factory research was sluggish from 2005 to 2014 but then increased rapidly through 2019. Analysis by field shows that smart factories were studied in the order of engineering, social science, and complex science. In the early stages, most research was in 'engineering', and it later expanded to 'social science'. In particular, since 2015, smart factories have been studied in various disciplines such as 'complex studies'. Overall, in the keyword analysis, keywords such as 'technology', 'data', and 'analysis' appeared most often, with some differences by field and year. Conclusion: Government and expert support for smart factories should be strengthened, and research on technology-based strategies is needed. In the future, various approaches to smart factories are necessary. Research that takes the environment or energy into account is expected to yield broader implications.
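
The keyword-frequency step behind a word cloud can be sketched as follows. The study itself used R and RStudio; this Python version, its tiny stopword list, and the sample strings are illustrative assumptions only and do not reproduce the authors' pipeline.

```python
import re
from collections import Counter

# Minimal, hypothetical stopword list; a real analysis would use a fuller one.
STOPWORDS = {"the", "of", "and", "a", "in", "for", "on", "to", "is"}


def keyword_counts(abstracts, top_n=3):
    """Return the top_n (keyword, count) pairs across all abstracts."""
    counter = Counter()
    for text in abstracts:
        tokens = re.findall(r"[a-z]+", text.lower())
        counter.update(t for t in tokens if t not in STOPWORDS)
    return counter.most_common(top_n)


docs = [
    "Smart factory data analysis",
    "Data analysis of factory technology",
    "Data for smart factory",
]
top_two = dict(keyword_counts(docs, top_n=2))
```

Here `top_two` holds the two most frequent terms with their counts; the same counts could then be passed to any word cloud renderer.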

IP-Based Heterogeneous Network Interface Gateway for IoT Big Data Collection (IoT 빅데이터 수집을 위한 IP기반 이기종 네트워크 인터페이스 연동 게이트웨이)

  • Kang, Jiheon
    • Journal of the Korea Institute of Information and Communication Engineering / v.23 no.2 / pp.173-178 / 2019
  • Recently, the types and amount of data generated, collected, and measured in IoT applications such as smart homes, security, and factories have been increasing. The technologies for IoT services include sensor devices to measure the desired data, embedded software to control the devices (for example, signal processing), wireless network protocols to transmit and receive the measured data, and big data and AI-based analysis. In this paper, we focus on developing a gateway for interfacing the heterogeneous sensor network protocols used in various IoT devices and propose a heterogeneous network interface IoT gateway. We used OpenWrt-based wireless routers and a 6LoWPAN stack for IP-based communication via BLE and IEEE 802.15.4 adapters. We developed software that converts Z-Wave and LoRa packets into IP packets using our Python-based middleware. We expect the IoT gateway to be used as an effective device for collecting IoT big data.
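
The general idea of converting a non-IP sensor frame into an IP packet can be sketched as below. The 5-byte frame layout (node id, sensor type, signed reading) is a made-up format for illustration and is not the Z-Wave or LoRa wire format; the function names and the choice of JSON over UDP are likewise assumptions rather than the paper's middleware.

```python
import json
import socket
import struct

# Hypothetical frame: big-endian uint16 node id, uint8 sensor type, int16 value.
FRAME_FORMAT = ">HBh"


def frame_to_datagram(frame):
    """Decode the assumed binary frame and re-encode it as a JSON payload."""
    node, sensor, value = struct.unpack(FRAME_FORMAT, frame)
    payload = {"node": node, "sensor": sensor, "value": value}
    return json.dumps(payload, sort_keys=True).encode("utf-8")


def forward(frame, addr):
    """Send the converted payload to a collector at addr over UDP/IPv4."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(frame_to_datagram(frame), addr)
```

A gateway loop would read frames from the radio adapter and call `forward(frame, (collector_host, collector_port))` for each one, where the collector address is whatever big data ingestion endpoint is in use.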