• Title/Summary/Keyword: 빅 데이터 패턴 분석

Search Result 195, Processing Time 0.037 seconds

Structuring of unstructured big data and visual interpretation (부산지역 교통관련 기사를 이용한 비정형 빅데이터의 정형화와 시각적 해석)

  • Lee, Kyeongjun;Noh, Yunhwan;Yoon, Sanggyeong;Cho, Youngseuk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1431-1438
    • /
    • 2014
  • We analyzed the articles from "Kukje Shinmun" and "Busan Ilbo", which are two local newpapers of Busan Metropolitan City. The articles cover from January 1, 2013 to December 31, 2013. Meaningful pattern inherent in 2889 articles of which the title includes "Busan" and "Traffic" and related data was analyzed. Textmining method, which is a part of datamining, was used for the social network analysis (SNA). HDFS and MapReduce (from Hadoop ecosystem), which is open-source framework based on JAVA, were used with Linux environment (Uubntu-12.04LTS) for the construction of unstructured data and the storage, process and the analysis of big data. We implemented new algorithm that shows better visualization compared with the default one from R package, by providing the color and thickness based on the weight from each node and line connecting the nodes.

A Survey on Deep Learning-based Analysis for Education Data (빅데이터와 AI를 활용한 교육용 자료의 분석에 대한 조사)

  • Lho, Young-uhg
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.240-243
    • /
    • 2021
  • Recently, there have been research results of applying Big data and AI technologies to the evaluation and individual learning for education. It is information technology innovations that collect dynamic and complex data, including student personal records, physiological data, learning logs and activities, learning outcomes and outcomes from social media, MOOCs, intelligent tutoring systems, LMSs, sensors, and mobile devices. In addition, e-learning was generated a large amount of learning data in the COVID-19 environment. It is expected that learning analysis and AI technology will be applied to extract meaningful patterns and discover knowledge from this data. On the learner's perspective, it is necessary to identify student learning and emotional behavior patterns and profiles, improve evaluation and evaluation methods, predict individual student learning outcomes or dropout, and research on adaptive systems for personalized support. This study aims to contribute to research in the field of education by researching and classifying machine learning technologies used in anomaly detection and recommendation systems for educational data.

  • PDF

Emotion Prediction of Paragraph using Big Data Analysis (빅데이터 분석을 이용한 문단 내의 감정 예측)

  • Kim, Jin-su
    • Journal of Digital Convergence
    • /
    • v.14 no.11
    • /
    • pp.267-273
    • /
    • 2016
  • Creation and Sharing of information which is structured data as well as various unstructured data. makes progress actively through the spread of mobile. Recently, Big Data extracts the semantic information from SNS and data mining is one of the big data technique. Especially, the general emotion analysis that expresses the collective intelligence of the masses is utilized using large and a variety of materials. In this paper, we propose the emotion prediction system architecture which extracts the significant keywords from social network paragraphs using n-gram and Korean morphological analyzer, and predicts the emotion using SVM and these extracted emotion features. The proposed system showed 82.25% more improved recall rate in average than previous systems and it will help extract the semantic keyword using morphological analysis.

A Study on Condition Analysis of Revised Project Level of Gravity Port facility using Big Data (빅데이터 분석을 통한 중력식 항만시설 수정프로젝트 레벨의 상태변화 특성 분석)

  • Na, Yong Hyoun;Park, Mi Yeon;Jang, Shinwoo
    • Journal of the Society of Disaster Information
    • /
    • v.17 no.2
    • /
    • pp.254-265
    • /
    • 2021
  • Purpose: Inspection and diagnosis on the performance and safety through domestic port facilities have been conducted for over 20 years. However, the long-term development strategies and directions for facility renewal and performance improvement using the diagnosis history and results are not working in realistically. In particular, in the case of port structures with a long service life, there are many problems in terms of safety and functionality due to increasing of the large-sized ships, of port use frequency, and the effects of natural disasters due to climate change. Method: In this study, the maintenance history data of the gravity type quay in element level were collected, defined as big data, and a predictive approximation model was derived to estimate the pattern of deterioration and aging of the facility of project level based on the data. In particular, we compared and proposed models suitable for the use of big data by examining the validity of the state-based deterioration pattern and deterioration approximation model generated through machine learning algorithms of GP and SGP techniques. Result: As a result of reviewing the suitability of the proposed technique, it was considered that the RMSE and R2 in GP technique were 0.9854 and 0.0721, and the SGP technique was 0.7246 and 0.2518. Conclusion: This research through machine learning techniques is expected to play an important role in decision-making on investment in port facilities in the future if port facility data collection is continuously performed in the future.

A Study for Electronic Trading Business System Using Big Data (빅데이터를 활용한 전자무역시스템에 대한 연구)

  • Lee, Cheol-Woong;Cho, Sung-Woo;Cho, Sae-Hong;Hwang, Dae-Hoon
    • Journal of Digital Contents Society
    • /
    • v.14 no.4
    • /
    • pp.573-580
    • /
    • 2013
  • With the growth of the smart-devices and information & communication technology, information society has developed and information can be produced, spread and consumed at much faster pace easily. Hence, individuals can utilize wireless communication and smart-devices to create, share and consume information at anytime and anywhere. The growth of technology has allowed the large-scale transfer and sharing of image, sound and video data; it changed the users' data consumption pattern that was mainly consisted of the text. Therefore, the amount of data that an individual consumes increased significantly. The importance of finding and analyzing practical and necessary data among huge amount of data has arisen. In this study, the current status of Big Data is researched and analyzed and the method to utilize Big Data in the electronic trading field is suggested.

Case Study on Big Data Sampling Population Collection Method Errors in Service Business (서비스 비즈니스의 빅데이터 모집단 산정방식 오류에 관한 사례연구)

  • Ahn, Jinho;Lee, Jeungsun
    • Journal of Service Research and Studies
    • /
    • v.10 no.2
    • /
    • pp.1-15
    • /
    • 2020
  • As big data become more important socially and economically in recent years, many problems have been derived from the indiscriminate application of big data. Big data are valuable because it can figure out the meaning of informative information hidden within the data. In particular, to predict customer behavior patterns and experiences, structured data that were extracted from Customer Relationship Management (CRM) or unstructured data that were extracted from Social Network Service(SNS) can be defined as a population to interpret the data, during which many errors can occur. However, those errors are usually overlooked. In addition to data analysis techniques, some data, which should be considered in the analysis, are not included in the population and thus do not show any meaningful patterns. Therefore, this study presents the measurement and interpretation of the data generated when the cause of error in the population setting is strong relationship and interaction between people or a person and an object. In other words, it will be shown that if the relationship and interaction are strong, it is important to include data collected from the perspective of user experience and ethnography in the population by comparing various cases of big data application, through which the meaning will be derived and the best direction will be suggested.

Big Data based Diet Analysis and Relevant Product Recommendation Online-mall API (빅 데이터 기반의 식습관 분석 및 관련 상품 추천 온라인 몰 API)

  • Jang, Soe-Un;Kim, Moon-Hyun;Na, Ji-Hyun;Hong, Jang-Eui
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2019.10a
    • /
    • pp.1129-1132
    • /
    • 2019
  • 최근 현대인들은 식습관이 불규칙하고 서구화되면서, 건강상의 많은 문제를 겪고 있다. 이와 더불어 1인 가구의 증가와 간단한 구매 방법 등으로 인해 온라인 몰 사용자가 늘어나고 있다. 본 프로젝트는 이러한 추세를 바탕으로, 사용자가 자주 사용하는 온라인 몰에 축적된 데이터를 기반으로 사용자의 식습관을 분석한다. 뿐만 아니라, 이를 바탕으로 구매 패턴을 분석하여 사용자의 영양 상태를 개선시킬 수 있는 상품 추천 서비스를 제공한다. 사용자는 자주 사용하는 온라인 쇼핑몰에서 상품 구매를 함과 동시에 구매한 상품에 대해 시각화된 영양소 분석 결과와 구매 패턴 분석 결과를 제공받을 수 있다. 본 논문에서는 개발한 API를 통해 사용자는 부족한 영양소를 쉽게 파악하여 효율적으로 건강관리를 할 수 있게 된다. 더 나아가, 자신의 구매 패턴을 파악할 수 있게 되어 현명한 소비 습관을 만드는 데에 기여할 수 있다.

Transaction Pattern Discrimination of Malicious Supply Chain using Tariff-Structured Big Data (관세 정형 빅데이터를 활용한 우범공급망 거래패턴 선별)

  • Kim, Seongchan;Song, Sa-Kwang;Cho, Minhee;Shin, Su-Hyun
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.2
    • /
    • pp.121-129
    • /
    • 2021
  • In this study, we try to minimize the tariff risk by constructing a hazardous cargo screening model by applying Association Rule Mining, one of the data mining techniques. For this, the risk level between supply chains is calculated using the Apriori Algorithm, which is an association analysis algorithm, using the big data of the import declaration form of the Korea Customs Service(KCS). We perform data preprocessing and association rule mining to generate a model to be used in screening the supply chain. In the preprocessing process, we extract the attributes required for rule generation from the import declaration data after the error removing process. Then, we generate the rules by using the extracted attributes as inputs to the Apriori algorithm. The generated association rule model is loaded in the KCS screening system. When the import declaration which should be checked is received, the screening system refers to the model and returns the confidence value based on the supply chain information on the import declaration data. The result will be used to determine whether to check the import case. The 5-fold cross-validation of 16.6% precision and 33.8% recall showed that import declaration data for 2 years and 6 months were divided into learning data and test data. This is a result that is about 3.4 times higher in precision and 1.5 times higher in recall than frequency-based methods. This confirms that the proposed method is an effective way to reduce tariff risks.

A review of artificial intelligence based demand forecasting techniques (인공지능 기반 수요예측 기법의 리뷰)

  • Jeong, Hyerin;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.6
    • /
    • pp.795-835
    • /
    • 2019
  • Big data has been generated in various fields. Many companies have now tried to make profits by building a system capable of analyzing big data based on artificial intelligence (AI) techniques. Integrating AI technology has made analyzing and utilizing vast amounts of data increasingly valuable. In particular, demand forecasting with maximum accuracy is critical to government and business management in various fields such as finance, procurement, production and marketing. In this case, it is important to apply an appropriate model that considers the demand pattern for each field. It is possible to analyze complex patterns of real data that can also be enlarged by a traditional time series model or regression model. However, choosing the right model among the various models is difficult without prior knowledge. Many studies based on AI techniques such as machine learning and deep learning have been proven to overcome these problems. In addition, demand forecasting through the analysis of stereotyped data and unstructured data of images or texts has also shown high accuracy. This paper introduces important areas where demand forecasts are relatively active as well as introduces machine learning and deep learning techniques that consider the characteristics of each field.

A Study on the Establishment of the IDS Using Machine Learning (머신 러닝을 활용한 IDS 구축 방안 연구)

  • Kang, Hyun-Sun
    • Journal of Software Assessment and Valuation
    • /
    • v.15 no.2
    • /
    • pp.121-128
    • /
    • 2019
  • Computing systems have various vulnerabilities to cyber attacks. In particular, various cyber attacks that are intelligent in the information society have caused serious social problems and economic losses. Traditional security systems are based on misuse-based technology, which requires the continuous updating of new attack patterns and the real-time analysis of vast amounts of data generated by numerous security devices in order to accurately detect. However, traditional security systems are unable to respond through detection and analysis in real time, which can delay the recognition of intrusions and cause a lot of damage. Therefore, there is a need for a new security system that can quickly detect, analyze, and predict the ever-increasing cyber security threats based on machine learning and big data analysis models. In this paper, we present a IDS model that combines machine learning and big data technology.