• Title/Abstract/Keywords: Big-data Software

441 search results, processing time 0.023 seconds

Analysis of Traffic Card Big Data by Hadoop and Sequential Mining Technique

  • 김우생;김용훈;박희성;박진규
    • Journal of Information Technology Applications and Management / Vol. 24, No. 4 / pp.187-196 / 2017
  • It is urgent to prepare countermeasures for the traffic congestion problems of Korea's metropolitan area, where central functions such as economic, social, cultural, and educational functions are excessively concentrated. Most users of public transportation in metropolitan areas including Seoul use traffic cards. If various information is extracted from the traffic big data produced by these cards, it can provide basic data for transport policies, land use, or facility plans. Therefore, in this study, we extract valuable information such as subway passengers' frequent travel patterns from the traffic big data provided by the Seoul Metropolitan Government Big Data Campus. For this, we use Hadoop to preprocess the big data and store it in a Mongo database so that it can be analyzed with a sequential pattern data mining technique. Since we analyze actual big data, namely the traffic card data provided by the Seoul Metropolitan Government Big Data Campus, the results can serve as important reference data when the Seoul government plans metropolitan traffic policies.
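
As a rough illustration of what mining frequent travel patterns can mean, the sketch below counts ordered station subsequences (gaps allowed) and keeps those that appear in at least `min_support` card histories. It is a brute-force toy in plain Python, not the Hadoop and MongoDB pipeline or the specific mining algorithm of the paper; the function name, the station sequences, and the threshold are all assumptions made for the example.

```python
from collections import Counter
from itertools import combinations


def frequent_sequences(sequences, length, min_support):
    """Count ordered subsequences (gaps allowed) of a fixed length.

    Support is the number of input sequences that contain the pattern
    at least once. Brute force: only practical for short sequences.
    """
    support = Counter()
    for seq in sequences:
        # combinations() preserves the original order of the stations;
        # the set ensures each pattern is counted once per sequence.
        support.update(set(combinations(seq, length)))
    return {p: c for p, c in support.items() if c >= min_support}


# Hypothetical station sequences, one per card (not data from the paper).
trips = [
    ["Gangnam", "Sadang", "Seoul Station"],
    ["Gangnam", "Seoul Station"],
    ["Sadang", "Gangnam", "Seoul Station"],
]
patterns = frequent_sequences(trips, length=2, min_support=2)
print(patterns)
```

At real data volumes, a pattern-growth or candidate-generation miner running on preprocessed records would replace the exhaustive enumeration used here.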

Adaptive Boundary Correction based Particle Swarm Optimization for Activity Recognition

  • 허성욱;권용진;강규창;배창석
    • 한국정보처리학회:학술대회논문집 / 한국정보처리학회 2012 Fall Conference / pp.1166-1169 / 2012
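  • This paper proposes an algorithm for user activity recognition that addresses a problem in the boundary-based data classification of the conventional PSO (Particle Swarm Optimization) algorithm, caused by the data collection environment, by applying a correction based on vector length comparison. The conventional PSO algorithm creates boundaries from the minimum and maximum values of the data and uses them to classify the data. However, when PSO is used for activity recognition, some activities fall outside the boundaries depending on the environment in which they were collected and therefore cannot be classified. To address this, the vector length between each out-of-boundary data point and the data representing each activity is computed, and the point is classified by comparing the minimum length. In experiments, the improved method outperformed the conventional PSO method on average by 1% for sitting, 7% for walking, and 7% for standing.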
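
The correction step described in the abstract can be sketched as a two-stage classifier: try the min/max boundaries first, and fall back to the nearest representative vector when a sample lies outside every boundary. The sketch below leaves out the PSO search itself and assumes that the per-class mean serves as the "representative" data and that Euclidean distance is the vector length; neither choice is confirmed by the abstract, and the sample values are invented.

```python
import math


def build_model(samples_by_label):
    """Store per-feature min/max bounds and a representative vector per label."""
    model = {}
    for label, samples in samples_by_label.items():
        columns = list(zip(*samples))
        model[label] = {
            "low": [min(c) for c in columns],
            "high": [max(c) for c in columns],
            # Assumption: the class mean stands in for the representative data.
            "rep": [sum(c) / len(c) for c in columns],
        }
    return model


def classify(model, x):
    # Stage 1: boundary check using the min/max box of each activity.
    for label, m in model.items():
        if all(lo <= v <= hi for v, lo, hi in zip(x, m["low"], m["high"])):
            return label
    # Stage 2: correction. Pick the activity whose representative is closest.
    return min(model, key=lambda label: math.dist(x, model[label]["rep"]))


# Hypothetical two-feature sensor samples (not data from the paper).
train = {
    "sit": [(0.0, 1.0), (0.2, 1.2)],
    "walk": [(2.0, 3.0), (2.4, 3.4)],
}
model = build_model(train)
print(classify(model, (0.1, 1.1)))  # inside the "sit" box
print(classify(model, (1.9, 2.9)))  # outside every box, resolved by distance
```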

A Study on AI basic statistics Education for Non-majors

  • 유진아
    • 통합자연과학논문집 / Vol. 14, No. 4 / pp.176-182 / 2021
  • We live in the age of artificial intelligence, and big data and artificial intelligence education is no longer just for majors; non-majors are also required to be able to handle these technologies. Software and artificial intelligence education for non-majors is not just general education: it develops talent who can understand and use these technologies, and the quality of that education is increasingly important. Through such education, we can nurture creative talent who can create and use new value by fusing computing technology with various fields. Since 2015, many universities have been running software-oriented and AI-oriented college programs to foster software-oriented human resources. However, it is not easy to provide AI basic statistics education for big data analysis to non-majors. Therefore, we present a big data education model for non-majors so that big data analysis can be directly applied.

A study on the success factors of Big Data through an analysis of introduction effect of Big Data

  • 정영기;석명건;김창재
    • 디지털융복합연구 / Vol. 12, No. 11 / pp.241-248 / 2014
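  • Advances in information technology and rapid progress in underlying hardware technology have broadened the use of data and ushered in a new paradigm, the big data era. Big data technology and the outcomes of its use are steadily growing, and companies, recognizing the importance of data, are increasingly moving to exploit it. This study was conducted to identify the factors behind the active adoption and use of big data technology in companies and to verify their importance. Predictability, manageability, supportability, and competitiveness were selected as the big data characteristic factors in the research model. Verification based on a survey of practitioners at companies with big data experience, together with statistical analysis, showed manageability to be the most important success factor. The results of this study can provide information that enables more efficient use of big data by giving companies adopting it a more objective understanding of its characteristics and the resulting considerations.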

A Keyword-Based Big Data Analysis for Individualized Health Activity: Focusing on Methodological Approach

  • 김한별;배근표;허준호
    • 한국정보처리학회:학술대회논문집 / 한국정보처리학회 2017 Spring Conference / pp.540-543 / 2017
  • The emergence of big data, used across the 21st-century global digital economy, will make it possible to solve some of the major issues in our society and economy. One of the main areas where big data can be quite useful is medicine and health. IT is already used extensively in this area and is expected to expand its field of application further. However, there is still room for improvement in the use of big data, because it is difficult to search the unstructured data it contains and to collect statistics on them, which limits wider application. The results obtained from big data can vary depending on the data collection and analysis methods, and some may be positive and others negative, so it is essential that big data be handled appropriately for its purpose. Therefore, in this study a big data set was constructed by applying a crawling technique for data mining and was analyzed with R. The data were also visualized for easier recognition, which was effective in developing an individualized health plan from different angles.

A Study on Deep Learning Model-based Object Classification for Big Data Environment

  • Kim, Jeong-Sig;Kim, Jinhong
    • 한국소프트웨어감정평가학회 논문지 / Vol. 17, No. 1 / pp.59-66 / 2021
  • Recently, conceptual information models have been changing fast as a result of individual tendencies, sociocultural factors, new circumstances, and societal shifts within the big data environment. Although data keeps growing, now is the time to commit to developing renewable, invaluable information for social/live commerce. Because we face various data problems that are hard to resolve, we propose deep learning prediction model-based object classification for social commerce in a big data environment. There is accordingly an increased need for a social commerce platform capable of handling high volumes of multiple items from users, and responding to rapid changes in users through deep learning is highly significant. This paper aims to promptly meet these needs and to support widespread growth in the big data environment.

A Big-Data Trajectory Combination Method for Navigations using Collected Trajectory Data

  • 구광민;이태호;박희민
    • 한국멀티미디어학회논문지 / Vol. 19, No. 2 / pp.386-395 / 2016
  • In trajectory-based navigation systems, a huge amount of trajectory data is needed for efficient route exploration. However, it would be very hard to collect trajectories for all possible start and destination combinations. To provide a practical solution to this problem, we suggest a method that combines collected GPS trajectory data into additional generated trajectories with new start and destination combinations, without road information. We present a trajectory combination algorithm and its implementation in the Scala programming language on the Spark platform for big data processing. The experimental results show that the proposed method can effectively expand the collected trajectories into more than three hundred times as many valid trajectory paths.
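
One plausible way to combine two trajectories without road information is to splice them where they pass close to each other, which yields a path from the start of the first to the destination of the second. The sketch below is a single-machine Python illustration of that idea, not the paper's Scala/Spark algorithm; the function name `combine`, the planar coordinates, and the `max_gap` threshold are assumptions made for the example.

```python
import math


def combine(traj_a, traj_b, max_gap):
    """Splice the head of traj_a onto the tail of traj_b.

    The splice happens at the first pair of points that lie within
    max_gap of each other. Returns None if the trajectories never meet.
    Points are treated as planar (x, y) pairs, not latitude/longitude.
    """
    for i, p in enumerate(traj_a):
        for j, q in enumerate(traj_b):
            if math.dist(p, q) <= max_gap:
                return traj_a[: i + 1] + traj_b[j + 1:]
    return None


# Hypothetical trajectories: one heading east, one heading south.
east = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
south = [(1.0, 1.0), (1.0, 0.05), (1.0, -1.0)]

merged = combine(east, south, max_gap=0.1)
print(merged)
```

The nested loop compares every pair of points, so a large collection would need spatial indexing or partitioning before pairwise matching becomes feasible.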

New Medical Image Fusion Approach with Coding Based on SCD in Wireless Sensor Network

  • Zhang, De-gan;Wang, Xiang;Song, Xiao-dong
    • Journal of Electrical Engineering and Technology / Vol. 10, No. 6 / pp.2384-2392 / 2015
  • The technical development and practical application of big data for health is a hot topic under the banner of big data, and big-data medical image fusion is one of the key problems. This paper proposes a new fusion approach with coding based on the Spherical Coordinate Domain (SCD) in a Wireless Sensor Network (WSN) for big-data medical images. In this approach, the three high-frequency coefficients in the wavelet domain of a medical image are pre-processed, which can reduce the redundancy ratio of big-data medical images. First, the high-frequency coefficients are transformed to the spherical coordinate domain to reduce the correlation within the same scale. Then, a multi-scale model product (MSMP) is used to control the shrinkage function so that small wavelet coefficients and some noise are removed. The high-frequency parts in the spherical coordinate domain are coded by an improved SPIHT algorithm. Finally, the image is fused and reconstructed based on the multi-scale edges of the medical image. Experimental results indicate that the novel approach is effective and very useful for transmitting big-data medical images, especially in a wireless environment.
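
The first step above, moving the three high-frequency wavelet coefficients at one position into spherical coordinates, can be illustrated with the standard Cartesian-to-spherical conversion. The sketch assumes the horizontal, vertical, and diagonal detail coefficients are used as the x, y, and z components; the abstract does not state the axis assignment or angle convention, so both are assumptions, and the sample values are invented.

```python
import math


def to_spherical(h, v, d):
    """Map (horizontal, vertical, diagonal) coefficients to (r, theta, phi)."""
    r = math.sqrt(h * h + v * v + d * d)
    if r == 0.0:
        return 0.0, 0.0, 0.0
    # Clamp guards against tiny floating-point overshoot outside [-1, 1].
    theta = math.acos(max(-1.0, min(1.0, d / r)))  # polar angle from the d axis
    phi = math.atan2(v, h)                         # azimuth in the h-v plane
    return r, theta, phi


def from_spherical(r, theta, phi):
    """Inverse transform back to (horizontal, vertical, diagonal)."""
    return (
        r * math.sin(theta) * math.cos(phi),
        r * math.sin(theta) * math.sin(phi),
        r * math.cos(theta),
    )


# Hypothetical coefficient triple at one pixel position.
r, theta, phi = to_spherical(3.0, -4.0, 12.0)
restored = from_spherical(r, theta, phi)
print(r, restored)
```

In this representation the radius carries the joint magnitude of the three sub-bands, so a single shrinkage decision on `r` affects all three coefficients together.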

Model for Quality Assessment of Data Analytics Software in Manufacturing-Based IIoT Environments

  • 최종석;신용태
    • 한국정보전자통신기술학회논문지 / Vol. 14, No. 4 / pp.292-299 / 2021
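  • With advances in IT, data mining software built on manufacturing-based IIoT environments is steadily increasing. However, because the software of manufacturing companies must run big data processing and data mining over large volumes of data, it is difficult to evaluate its quality in the same way as general software. In addition, in manufacturing environments where heterogeneous equipment and software coexist, it is particularly difficult to judge the quality of the software in use by applying existing quality characteristics. This paper investigates the characteristics of manufacturing environments and develops a software quality evaluation model suited to them in order to carry out evaluations.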

Analyzing Operation Deviation in the Deasphalting Process Using Multivariate Statistics Analysis Method

  • Park, Joo-Hwang;Kim, Jong-Soo;Kim, Tai-Suk
    • 한국멀티미디어학회논문지 / Vol. 17, No. 7 / pp.858-865 / 2014
  • In systems such as an MES (Manufacturing Execution System), various sensors collect data in real time and save it as big data to monitor the process. If big data mining is performed in a distributed computing system, the whole process can be improved. In this paper, a system to analyze the cause of operation deviation was built using big data collected from the deasphalting process at two different plants. By applying multivariate statistical analysis to the big data collected through the MES, the main cause of operation deviation was analyzed, and we present this analysis as an example. As a result of regression analysis using the forward stepwise method, a regression equation was found that can explain a 52% increase in performance compared to the existing model. The suggested method can replace the existing manual analysis of the petrochemical process, which risks being subjective depending on the tester, with an objective analysis method based on numbers and statistics.
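
Forward stepwise regression, mentioned above, starts from an intercept-only model and repeatedly adds the variable that most reduces the residual sum of squares (RSS). The sketch below shows that loop in dependency-free Python. The stopping rule (`min_gain`, a minimum RSS reduction) is a simplification chosen for the example, since the abstract does not give the paper's entry criterion, and the variable names and values are invented rather than taken from the MES data.

```python
def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def rss(columns, y):
    """Residual sum of squares of a least-squares fit with an intercept."""
    design = [[1.0] * len(y)] + columns  # stored column by column
    gram = [[sum(p * q for p, q in zip(c1, c2)) for c2 in design] for c1 in design]
    rhs = [sum(p * q for p, q in zip(c, y)) for c in design]
    beta = solve(gram, rhs)  # normal equations; assumes non-collinear columns
    fitted = [sum(b * c[i] for b, c in zip(beta, design)) for i in range(len(y))]
    return sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))


def forward_stepwise(features, y, min_gain):
    """Greedily add the feature that lowers RSS most; stop when gain is small."""
    selected, best = [], rss([], y)
    while len(selected) < len(features):
        base = [features[s] for s in selected]
        candidates = {name: rss(base + [col], y)
                      for name, col in features.items() if name not in selected}
        name = min(candidates, key=candidates.get)
        if best - candidates[name] < min_gain:
            break
        selected.append(name)
        best = candidates[name]
    return selected


# Hypothetical process variables; y depends only on x1 and x2.
features = {
    "x1": [1, 2, 3, 4, 5, 6],
    "x2": [1, 0, 1, 0, 0, 1],
    "x3": [3, 1, 2, 5, 1, 4],
}
y = [2 * a + 3 * b for a, b in zip(features["x1"], features["x2"])]
chosen = forward_stepwise(features, y, min_gain=1.0)
print(chosen)
```

Statistical packages usually decide entry with an F-test or an information criterion rather than a raw RSS threshold; the threshold keeps this example short.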