• Title/Summary/Keyword: Big data model

Search Result 1,373, Processing Time 0.029 seconds

A Model of Strawberry Pest Recognition using Artificial Intelligence Learning

  • Guangzhi Zhao
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.2
    • /
    • pp.133-143
    • /
    • 2023
  • In this study, we propose a big data set of strawberry pests collected directly for diagnosis model learning and an automatic pest diagnosis model architecture based on deep learning. First, a big data set related to strawberry pests, which did not exist anywhere before, was directly collected from the web. A total of more than 12,000 image data was directly collected and classified, and this data was used to train a deep learning model. Second, the deep-learning-based automatic pest diagnosis module is a module that classifies what kind of pest or disease corresponds to when a user inputs a desired picture. In particular, we propose a model architecture that can optimally classify pests based on a convolutional neural network among deep learning models. Through this, farmers can easily identify diseases and pests without professional knowledge, and can respond quickly accordingly.

Artificial Intelligence for the Fourth Industrial Revolution

  • Jeong, Young-Sik;Park, Jong Hyuk
    • Journal of Information Processing Systems
    • /
    • v.14 no.6
    • /
    • pp.1301-1306
    • /
    • 2018
  • Artificial intelligence is one of the key technologies of the Fourth Industrial Revolution. This paper introduces the diverse kinds of approaches to subjects that tackle diverse kinds of research fields such as model-based MS approach, deep neural network model, image edge detection approach, cross-layer optimization model, LSSVM approach, screen design approach, CPU-GPU hybrid approach and so on. The research on Superintelligence and superconnection for IoT and big data is also described such as 'superintelligence-based systems and infrastructures', 'superconnection-based IoT and big data systems', 'analysis of IoT-based data and big data', 'infrastructure design for IoT and big data', 'artificial intelligence applications', and 'superconnection-based IoT devices'.

Convergence Study on Big Data Competency Reference Model (빅데이터 직무능력 참조모형에 관한 융합적 연구)

  • Noh, Kyoo-Sung;Park, Seong Taek;Park, Kyung-Hye
    • Journal of Digital Convergence
    • /
    • v.13 no.3
    • /
    • pp.55-63
    • /
    • 2015
  • South Korean Government confirmed the creation of competency-centered society as a key national issue and announced development and utilization plan of NCS(National Competency Standards) On May 21, 2013. As a part of the government's plans, they had been developed NCS about 833 jobs by 2014. But Big Data related job, as an emerging job, cannot be seen as a reliable form of job yet. As, at the major industrialized countries and the domestic, education and job competency models of knowledge and skills to take advantage of various types of Big Data have coming, it is a situation that is certainly not settled and more or less in confusion. In this study, for the purpose to present the Big Data Competency reference model for companies and organizations to effectively leverage Big Data, we have presented this reference model and summarized competency elements units such as 20 knowledges and 15 skills of Big Data competency.

Scalable Prediction Models for Airbnb Listing in Spark Big Data Cluster using GPU-accelerated RAPIDS

  • Muralidharan, Samyuktha;Yadav, Savita;Huh, Jungwoo;Lee, Sanghoon;Woo, Jongwook
    • Journal of information and communication convergence engineering
    • /
    • v.20 no.2
    • /
    • pp.96-102
    • /
    • 2022
  • We aim to build predictive models for Airbnb's prices using a GPU-accelerated RAPIDS in a big data cluster. The Airbnb Listings datasets are used for the predictive analysis. Several machine-learning algorithms have been adopted to build models that predict the price of Airbnb listings. We compare the results of traditional and big data approaches to machine learning for price prediction and discuss the performance of the models. We built big data models using Databricks Spark Cluster, a distributed parallel computing system. Furthermore, we implemented models using multiple GPUs using RAPIDS in the spark cluster. The model was developed using the XGBoost algorithm, whereas other models were developed using traditional central processing unit (CPU)-based algorithms. This study compared all models in terms of accuracy metrics and computing time. We observed that the XGBoost model with RAPIDS using GPUs had the highest accuracy and computing time.

Big Data Based Urban Transportation Analysis for Smart Cities - Machine Learning Based Traffic Prediction by Using Urban Environment Data - (도시 빅데이터를 활용한 스마트시티의 교통 예측 모델 - 환경 데이터와의 상관관계 기계 학습을 통한 예측 모델의 구축 및 검증 -)

  • Jang, Sun-Young;Shin, Dong-Youn
    • Journal of KIBIM
    • /
    • v.8 no.3
    • /
    • pp.12-19
    • /
    • 2018
  • The research aims to find implications of machine learning and urban big data as a way to construct the flexible transportation network system of smart city by responding the urban context changes. This research deals with a problem that existing a bus headway model is difficult to respond urban situations in real-time. Therefore, utilizing the urban big data and machine learning prototyping tool in weathers, traffics, and bus statues, this research presents a flexible headway model to predict bus delay and analyze the result. The prototyping model is composed by real-time data of buses. The data is gathered through public data portals and real time Application Program Interface (API) by the government. These data are fundamental resources to organize interval pattern models of bus operations as traffic environment factors (road speeds, station conditions, weathers, and bus information of operating in real-time). The prototyping model is implemented by the machine learning tool (RapidMiner Studio) and conducted several tests for bus delays prediction according to specific circumstances. As a result, possibilities of transportation system are discussed for promoting the urban efficiency and the citizens' convenience by responding to urban conditions.

Applying Service Quality to Big Data Quality (빅데이터 품질 확장을 위한 서비스 품질 연구)

  • Park, Jooseok;Kim, Seunghyun;Ryu, Hocheol;Lee, Zoonky;Lee, Jangho;Lee, Junyong
    • The Journal of Bigdata
    • /
    • v.2 no.2
    • /
    • pp.87-93
    • /
    • 2017
  • The research on data quality has been performed for a long time. However, the research focused on structured data. With the recent digital revolution or the fourth industrial revolution, quality control of big data is becoming more important. In this paper, we analyze and classify big data quality types through previous research. The types of big data quality can be classified into value, data structure, process, value chain, and maturity model. Based on these comparative studies, this paper proposes a new standard, service quality of big data.

  • PDF

A Study on the Success Model for the Establishment of Big Data System in Public Institutions (공공기관 빅데이터 시스템 구축을 위한 성공모형에 관한 연구)

  • Lee, Gwang-Su;Kwon, Jungin
    • Journal of Digital Convergence
    • /
    • v.20 no.1
    • /
    • pp.129-139
    • /
    • 2022
  • This study aims to identify which factors affect successful big data system construction, identify the relationship between the factors, and identify the success model and success factors necessary for public institutions to build big data systems. Therefore, the preceding and related studies related to this study were reviewed, and success factors for the establishment of a big data system were derived based on this. As a research method, a survey was conducted on users of institutions that have established or planned to build a big data system, and a structural equation (AMOS) was conducted to verify the impact relationship between success factors. As a result of the analysis, organizational support factors, development support factors, user support factors, information quality, service quality, system quality, use, and net benefit were derived as success factors for building big data systems, and a success model was presented. This can be seen as significant and academic contributions in that it is the first study of the success model for building an information system reflecting big data characteristics, and it is expected that this study will be used as basic data for building a big data system in public institutions in the future.

Research on Big Data Integration Method

  • Kim, Jee-Hyun;Cho, Young-Im
    • Journal of the Korea Society of Computer and Information
    • /
    • v.22 no.1
    • /
    • pp.49-56
    • /
    • 2017
  • In this paper we propose the approach for big data integration so as to analyze, visualize and predict the future of the trend of the market, and that is to get the integration data model using the R language which is the future of the statistics and the Hadoop which is a parallel processing for the data. As four approaching methods using R and Hadoop, ff package in R, R and Streaming as Hadoop utility, and Rhipe and RHadoop as R and Hadoop interface packages are used, and the strength and weakness of four methods are described and analyzed, so Rhipe and RHadoop are proposed as a complete set of data integration model. The integration of R, which is popular for processing statistical algorithm and Hadoop contains Distributed File System and resource management platform and can implement the MapReduce programming model gives us a new environment where in R code can be written and deployed in Hadoop without any data movement. This model allows us to predictive analysis with high performance and deep understand over the big data.

Analysis of Social Welfare Effects of Onion Observation Using Big Data (빅데이터를 활용한 양파 관측의 사회적 후생효과 분석)

  • Joo, Jae-Chang;Moon, Ji-Hye
    • Korean Journal of Organic Agriculture
    • /
    • v.29 no.3
    • /
    • pp.317-332
    • /
    • 2021
  • This study estimated the predictive onion yield through Stepwise regression of big data and weather variables by onion growing season. The economic feasibility of onion observations using big data was analyzed using estimated predictive data. The social welfare effect was estimated through the model of Harberger's triangle using onion yield prediction with big data and it without big data. Predicted yield using big data showed a deviation of -9.0% to 4.2%. As a result of estimating the social welfare effect, the average annual value was 23.3 billion won. The average annual value of social welfare effects if big data was not used was measured at 22.4 billion won. Therefore, it was estimated that the difference between the social welfare effect when the prediction using big data was used and when it was not was about 950 million won. When these results are applied to items other than onion items, the effect will be greater. It is judged that it can be used as basic data to prove the justification of the agricultural observation project. However, since the simple Harberger's triangle theory has the limitation of oversimplifying reality, it is necessary to evaluate the economic value through various methods such as measuring the effect of agricultural observation under a more realistic rational expectation hypothesis in future studies.

Shared Distributed Big-Data Processing Platform Model: a Study (대용량 분산처리 플랫폼 공유 모델 연구)

  • Jeong, Hwanjin;Kang, Taeho;Kim, GyuSeok;Shin, YoungHo;Jeong, Jinkyu
    • KIISE Transactions on Computing Practices
    • /
    • v.22 no.11
    • /
    • pp.601-613
    • /
    • 2016
  • With the increasing need for big data processing, building a shared big data processing platform is important to minimize time and monetary costs. In shared big data processing, multitenancy is a major requirement that needs to be addressed, in order to provide a single isolated personal big data platform for each user, but to share the underlying hardware is shared among users to increase hardware utilization. In this paper, we explore two well-known shared big data processing platform models. One is to use a native Hadoop cluster, and the other is to build a virtual Hadoop cluster for each user. For each model we verified whether it is sufficient to support multi-tenancy. We also present a method to complement unsupported multi-tenancy features in a native Hadoop cluster model. Lastly we built prototype platforms and compared the performance of both models.