• Title/Summary/Keyword: 빅데이터 기법

Search Result 790, Processing Time 0.038 seconds

Keyword Data Analysis Using Bayesian Conjugate Prior Distribution (베이지안 공액 사전분포를 이용한 키워드 데이터 분석)

  • Jun, Sunghae
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.6
    • /
    • pp.1-8
    • /
    • 2020
  • The use of text data in big data analytics has been increased. So, much research on methods for text data analysis has been performed. In this paper, we study Bayesian learning based on conjugate prior for analyzing keyword data extracted from text big data. Bayesian statistics provides learning process for updating parameters when new data is added to existing data. This is an efficient process in big data environment, because a large amount of data is created and added over time in big data platform. In order to show the performance and applicability of proposed method, we carry out a case study by analyzing the keyword data from real patent document data.

Cost-Effective MapReduce Processing in the Cloud (클라우드 환경에서의 비용 효율적인 맵리듀스 처리)

  • Ryu, Wooseok
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.114-115
    • /
    • 2018
  • This paper studies a mechanism for cost-effective analysis of big data in the cloud environment. Recently, as a storage of electronic medical records can be managed outside the hospital, there is a growing demand for cloud-based big data analysis in small-and-medium hospitals. This paper firstly analyze the Amazon Elastic MapReduce which is a popular cloud framework for big data analysis, and proposes a cost model for analyzing big data using Amazon EMR with less cost. Using the proposed model, the user can construct a cost-effective computing cluster, which maximize the effectiveness of the analysis per operational cost.

  • PDF

A Study on the Development of the Use Index of Closed School Facilities Using Big Data -Focused on Text-Mining Techniques- (빅데이터를 활용한 폐교시설의 지표 개발에 관한 연구 -텍스트마이닝 기법을 중심으로-)

  • Kim, Jae-Young;Lee, Jong-Kuk
    • The Journal of Sustainable Design and Educational Environment Research
    • /
    • v.18 no.2
    • /
    • pp.1-11
    • /
    • 2019
  • The purpose of this study is to make objective decisions in the use of closed schools through the development of utilization indicators for the efficient use of closed schools, which is expected to increase continuously. The research phase was largely carried out by drawing preliminary indicators for use in closed schools, drawing final indicators using big data, and quantifying indicators, and finally objectifying them through quantification. The institution intends to apply and verify the facility based on future indicators. This study has implications for the application of big data analysis methods that have not been attempted in planning and research for the use of closed school facilities to date.

A Study of Data Mining Techniques for CEP (CEP를 위한 데이터 마이닝 기법 연구)

  • Kang, Donghyun;Hwang, Buhyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2012.04a
    • /
    • pp.1116-1117
    • /
    • 2012
  • 최근에 이슈가 되고 있는 빅 데이터 처리 방법중의 하나로 CEP가 있다. 그러나 CEP는 사전에 정의된 질의에 해당되는 이벤트만을 선별하여 패턴 매칭 등의 기능을 수행하므로, 새로이 발견되는 이벤트를 찾는데 제약이 있다. 또한 실시간으로 생산되는 빅 데이터에 기초한 다양한 패턴 탐사에 한계를 노출하고 있다. 이 논문에서는, CEP 환경에서 빅 데이터 사이에 존재하는 다양한 이벤트와 패턴 탐사를 위한 실시간 데이터 마이닝 기법을 제안한다. 제안 방법은 CEP 엔진을 위한 고급의 패턴 매칭을 개발하고, CEP를 위한 실시간 데이터 마이닝 기법을 개발한다. 마지막으로, 기존의 CQL을 확장하여 개발한다. 이라한 방법을 통하여 기존의 CEP의 기능적인 한계를 극복할 수 있다.

Identify research trends through big data analysis method for autonomous driving car (자율주행자동차의 빅데이터 분석을 통한 연구 동향 파악)

  • Namkoong, Helly;Kang, SunJoon;Won, YooHyung;Park, SungWok
    • Proceedings of the Korea Technology Innovation Society Conference
    • /
    • 2017.11a
    • /
    • pp.459-468
    • /
    • 2017
  • 본 논문에서는 자율주행자동차와 관련한 주제어를 선정하여 KCI 등재 논문의 서론 자료를 수집하고, 이에 빅데이터 분석 기법을 적용하였다. 이를 토대로 자율주행자동차와 관련된 다양한 이슈 분석을 통해 자율주행자동차의 연구 동향을 파악할 수 있으며, 추가적인 연구가 필요한 분야에 대해 알 수 있다. 제4차 산업혁명의 영향으로 등장한 다양한 기술들의 활용이라고 볼 수 있는 자율주행자동차는 2025년 상용화 될 가능성이 높다. 자율주행자동차의 상용화를 위해 지속적인 연구와 논의가 필요하지만, 과거부터 등재된 자율주행자동차 관련 KCI 논문 빅데이터 분석을 통해 기술들 간의 군집 방식과 주제어의 밀집도, 네트워킹 형성 방식 등에 대해 파악할 수 있다. 이처럼 논문 데이터 분석을 통해 향후 정부출연(연), 혹은 기업체에서 더욱 발전시켜야 할 부분에 대해 인지하고 정부 차원의 과제 지원과 연구를 통해 자율 주행자동차 상용화를 촉진시킬 수 있을 것이라고 예상한다.

  • PDF

Anomaly Detection Technique of Log Data Using Hadoop Ecosystem (하둡 에코시스템을 활용한 로그 데이터의 이상 탐지 기법)

  • Son, Siwoon;Gil, Myeong-Seon;Moon, Yang-Sae
    • KIISE Transactions on Computing Practices
    • /
    • v.23 no.2
    • /
    • pp.128-133
    • /
    • 2017
  • In recent years, the number of systems for the analysis of large volumes of data is increasing. Hadoop, a representative big data system, stores and processes the large data in the distributed environment of multiple servers, where system-resource management is very important. The authors attempted to detect anomalies from the rapid changing of the log data that are collected from the multiple servers using simple but efficient anomaly-detection techniques. Accordingly, an Apache Hive storage architecture was designed to store the log data that were collected from the multiple servers in the Hadoop ecosystem. Also, three anomaly-detection techniques were designed based on the moving-average and 3-sigma concepts. It was finally confirmed that all three of the techniques detected the abnormal intervals correctly, while the weighted anomaly-detection technique is more precise than the basic techniques. These results show an excellent approach for the detection of log-data anomalies with the use of simple techniques in the Hadoop ecosystem.

Design and Implementation of HDFS data encryption scheme using ARIA algorithms on Hadoop (하둡 상에서 ARIA 알고리즘을 이용한 HDFS 데이터 암호화 기법의 설계 및 구현)

  • Song, Youngho;Shin, YoungSung;Yoon, Min;Jang, Miyoung;Chang, Jae-Woo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.10a
    • /
    • pp.613-616
    • /
    • 2015
  • 최근 스마트폰 기기의 보급 및 소셜 서비스 산업의 고도화로 인해, 빅데이터가 등장하였다. 한편 빅데이터에서 효율적으로 정보를 분석하는 대표적인 플랫폼으로 하둡이 존재한다. 하둡은 클러스터 환경에 기반한 우수한 확장성, 장애 복구 기능 및 사용자가 기능을 정의할 수 있는 맵리듀스 프레임워크 등을 지원한다. 아울러 하둡은 개인정보나 위치 데이터 등의 민감한 정보를 보호하기 위해 Kerberos를 통한 사용자 인증 기법을 제공하고, HDFS 압축 코덱을 활용한 AES 코덱 기반 데이터 암호화를 지원하고 있다. 그러나 하둡 기반 소프트웨어를 사용하고 있는 국내 기관 및 기업은 국내 ARIA 데이터 암호화를 적용하지 못하고 있다. 이를 해결하기 위해 본 논문에서는 하둡을 기반으로 ARIA 암호화를 지원하는 HDFS 데이터 암호화 기법을 제안한다.

Big data Cloud Service for Manufacturing Process Analysis (제조 공정 분석을 위한 빅데이터 클라우드 서비스)

  • Lee, Yong-Hyeok;Song, Min-Seok;Ha, Seung-Jin;Baek, Tae-Hyun;Son, Sook-Young
    • The Journal of Bigdata
    • /
    • v.1 no.1
    • /
    • pp.41-51
    • /
    • 2016
  • Big data is an emerging issue as large data which was impossible to be processed in the past is possible to be handled with the development of information and communication technology. Manufacturing is the most promising field that big data is applied such that there are abundant data available. It is important to improve an efficiency of manufacturing process for quality control and production efficiency because the processes from production design, sales, productions and so on are mixed intricately. This study proposes big data cloud service for manufacturing analysis using a big data technology and a process mining technique. It is expected for manufacturing corporations to improve a manufacturing process and reduced the cost by applying the proposed service. The service provides various analyses including manufacturing analysis and manufacturing duration analysis. Big data cloud service has been implemented and it has been validated by conducting a case study.

  • PDF

A Study On The Economic Value Of Firm's Big Data Technologies Introduction Using Real Option Approach - Based On YUYU Pharmaceuticals Case - (실물옵션 기법을 이용한 기업의 빅데이터 기술 도입의 경제적 가치 분석 - 유유제약 사례를 중심으로 -)

  • Jang, Hyuk Soo;Lee, Bong Gyou
    • Journal of Internet Computing and Services
    • /
    • v.15 no.6
    • /
    • pp.15-26
    • /
    • 2014
  • This study focus on a economic value of the Big Data technologies by real options model using big data technology company's stock price to determine the price of the economic value of incremental assessed value. For estimating stochastic process of company's stock price by big data technology to extract the incremental shares, Generalized Moments Method (GMM) are used. Option value for Black-Scholes partial differential equation was derived, in which finite difference numerical methods to obtain the Big Data technology was introduced to estimate the economic value. As a result, a option value of big data technology investment is 38.5 billion under assumption which investment cost is 50 million won and time value is a about 1 million, respectively. Thus, introduction of big data technology to create a substantial effect on corporate profits, is valuable and there are an effects on the additional time value. Sensitivity analysis of lower underlying asset value appear decreased options value and the lower investment cost showed increased options value. A volatility are not sensitive on the option value due to the big data technological characteristics which are low stock volatility and introduction periods.

A Study on Condition Analysis of Revised Project Level of Gravity Port facility using Big Data (빅데이터 분석을 통한 중력식 항만시설 수정프로젝트 레벨의 상태변화 특성 분석)

  • Na, Yong Hyoun;Park, Mi Yeon;Jang, Shinwoo
    • Journal of the Society of Disaster Information
    • /
    • v.17 no.2
    • /
    • pp.254-265
    • /
    • 2021
  • Purpose: Inspection and diagnosis on the performance and safety through domestic port facilities have been conducted for over 20 years. However, the long-term development strategies and directions for facility renewal and performance improvement using the diagnosis history and results are not working in realistically. In particular, in the case of port structures with a long service life, there are many problems in terms of safety and functionality due to increasing of the large-sized ships, of port use frequency, and the effects of natural disasters due to climate change. Method: In this study, the maintenance history data of the gravity type quay in element level were collected, defined as big data, and a predictive approximation model was derived to estimate the pattern of deterioration and aging of the facility of project level based on the data. In particular, we compared and proposed models suitable for the use of big data by examining the validity of the state-based deterioration pattern and deterioration approximation model generated through machine learning algorithms of GP and SGP techniques. Result: As a result of reviewing the suitability of the proposed technique, it was considered that the RMSE and R2 in GP technique were 0.9854 and 0.0721, and the SGP technique was 0.7246 and 0.2518. Conclusion: This research through machine learning techniques is expected to play an important role in decision-making on investment in port facilities in the future if port facility data collection is continuously performed in the future.