• Title/Summary/Keyword: Big-data Software

Analysis of Traffic Card Big Data by Hadoop and Sequential Mining Technique (하둡과 순차패턴 마이닝 기술을 통한 교통카드 빅데이터 분석)

  • Kim, Woosaeng;Kim, Yong Hoon;Park, Hee-Sung;Park, Jin-Kyu
    • Journal of Information Technology Applications and Management / v.24 no.4 / pp.187-196 / 2017
  • Countermeasures are urgently needed for traffic congestion in Korea's metropolitan area, where central economic, social, cultural, and educational functions are excessively concentrated. Most users of public transportation in metropolitan areas, including Seoul, use traffic cards. Information extracted from the big data these cards produce can provide basic data for transport policies, land use, and facility plans. Therefore, in this study, we extract valuable information, such as subway passengers' frequent travel patterns, from the traffic big data provided by the Seoul Metropolitan Government Big Data Campus. To do so, we use Hadoop to preprocess the big data and store it in a Mongo database, and then analyze it with a sequential pattern mining technique. Because we analyze actual traffic card data provided by the Seoul Metropolitan Government Big Data Campus, the results can serve as important reference data when the Seoul government plans metropolitan traffic policies.
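
As a rough illustration of the sequential pattern step described in this abstract, the sketch below counts how many trips contain each contiguous station sequence and keeps those meeting a minimum support. It is not the authors' implementation: the function name `frequent_sequences`, the trip representation (a list of station labels per trip), and the restriction to contiguous sequences are all assumptions made for the example.

```python
from collections import Counter


def frequent_sequences(trips, length, min_support):
    """Return contiguous station sequences of the given length that
    appear in at least `min_support` trips (hypothetical helper)."""
    support = Counter()
    for trip in trips:
        # Use a set so a sequence repeated inside one trip counts once.
        seen = {
            tuple(trip[i:i + length])
            for i in range(len(trip) - length + 1)
        }
        support.update(seen)
    return {seq: n for seq, n in support.items() if n >= min_support}


# Made-up trips over placeholder stations A, B, C.
trips = [
    ["A", "B", "C"],
    ["A", "B"],
    ["B", "C"],
    ["A", "B", "A", "B"],
]
patterns = frequent_sequences(trips, length=2, min_support=2)
```

In a Hadoop setting, the per-trip counting would naturally map to the map phase and the support summation to the reduce phase, though the abstract does not state how the authors split the work.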

Adaptive Boundary Correction based Particle Swarm Optimization for Activity Recognition (사용자 행동인식을 위한 적응적 경계 보정기반 Particle Swarm Optimization 알고리즘)

  • Heo, Seonguk;Kwon, Yongjin;Kang, Kyuchang;Bae, Changseok
    • Proceedings of the Korea Information Processing Society Conference / 2012.11a / pp.1166-1169 / 2012
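  • This paper proposes an algorithm for user activity recognition that addresses a weakness of boundary-based data classification in the conventional PSO (Particle Swarm Optimization) algorithm, a weakness caused by the data collection environment, by applying a correction based on comparing vector lengths. The conventional PSO algorithm generates boundaries from the minimum and maximum values of the data and uses them to classify the data. However, when PSO is used for activity recognition, some activity data falls outside the boundaries, depending on the environment in which it was collected, and cannot be classified. To remedy this, the vector length between each out-of-boundary sample and the data representing each activity is computed, and the sample is assigned to the activity with the minimum length. In experiments, the improved method outperformed the conventional PSO method on average by 1% for sitting, 7% for walking, and 7% for standing.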
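
The correction step in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the PSO search that tunes the boundaries is omitted, boundaries are taken directly from the per-dimension minimum and maximum of the training vectors, the "representative" of each activity is assumed to be the class mean, and the names `fit_boundaries` and `classify` are hypothetical.

```python
import math


def fit_boundaries(samples_by_label):
    """Compute per-label (mins, maxs, mean) from training vectors."""
    model = {}
    for label, vectors in samples_by_label.items():
        dims = list(zip(*vectors))  # transpose into per-dimension columns
        mins = [min(d) for d in dims]
        maxs = [max(d) for d in dims]
        mean = [sum(d) / len(d) for d in dims]  # assumed representative
        model[label] = (mins, maxs, mean)
    return model


def classify(model, x):
    """Classify x by boundary test, falling back to the nearest representative."""
    # Step 1: conventional boundary test (min <= value <= max on every axis).
    for label, (mins, maxs, _) in model.items():
        if all(lo <= v <= hi for lo, v, hi in zip(mins, x, maxs)):
            return label
    # Step 2: correction for out-of-boundary samples, using the shortest
    # vector length (Euclidean distance) to each representative.
    return min(model, key=lambda label: math.dist(x, model[label][2]))


# Made-up two-dimensional feature vectors for two activities.
training = {
    "sit": [(0.0, 0.1), (0.2, 0.3)],
    "walk": [(1.0, 1.2), (1.4, 1.6)],
}
model = fit_boundaries(training)
```

A sample such as `(1.7, 1.8)` lies outside both boundaries, so only the second step can label it; without the correction it would remain unclassified.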

A Study on AI basic statistics Education for Non-majors (비전공자를 위한 AI기초통계 교육의 고찰)

  • Yoo, Jin-Ah
    • Journal of Integrative Natural Science / v.14 no.4 / pp.176-182 / 2021
  • We live in the age of artificial intelligence, and big data and artificial intelligence education is no longer just for majors; non-majors are also required to be able to handle these technologies. Software and artificial intelligence education for non-majors is not just general education: it creates talent who can understand and utilize these technologies, and the quality of that education is increasingly important. Through such education, we can nurture creative talent who can create and use new value by fusing computing technology with various fields. Since 2015, many universities have been running software-oriented and AI-oriented college programs to foster software-oriented human resources. However, it is not easy to provide AI basic statistics education for big data analysis to non-majors. Therefore, we present a big data education model for non-majors so that big data analysis can be directly applied.

A study on the success factors of Big Data through an analysis of introduction effect of Big Data (빅데이터 도입 효과 분석을 통한 빅데이터 성공요인에 관한 연구)

  • Jung, Young-Ki;Suk, Myung-Gun;Kim, Chang-Jae
    • Journal of Digital Convergence / v.12 no.11 / pp.241-248 / 2014
  • Rapid developments in information technology and hardware infrastructure have expanded the range of data usage and ushered in the new paradigm of the Big Data era. As Big Data technology and its performance steadily improve, enterprises have recognized the importance of data, and efforts to take advantage of Big Data have become active. This study selects the factors that drive the active adoption and utilization of Big Data technology in enterprises and verifies their importance. The Big Data characteristic factors selected in the study are predictability, manageability, affordability, competitiveness, creativity, responsiveness, and supportability. A survey and statistical analysis of enterprise practitioners with Big Data experience verified that manageability influenced the introduction of Big Data.

A Keyword-Based Big Data Analysis for Individualized Health Activity: Focusing on Methodological Approach

  • Kim, Han-Byul;Bae, Geun-Pyo;Huh, Jun-Ho
    • Proceedings of the Korea Information Processing Society Conference / 2017.04a / pp.540-543 / 2017
  • Big Data, now emerging across the 21st-century global digital economy, will make it possible to solve some of the major issues in our society and economy. One of the main areas where Big Data can be useful is medicine and health. IT is already used extensively in this area and is expected to expand its field of application further. However, there is still room for improvement in the use of Big Data, because it is difficult to search the unstructured data it contains and to collect statistics on it. This limits wider application of Big Data. Results from Big Data can vary depending on the data collection and analysis method, and some can be positive or negative, so it is essential that Big Data be handled appropriately for its purpose. Therefore, in this study, a Big Data set was constructed by applying a crawling technique for data mining and analyzed with R. The data was also visualized for easier recognition, which was effective in developing an individualized health plan from different angles.

A Study on Deep Learning Model-based Object Classification for Big Data Environment

  • Kim, Jeong-Sig;Kim, Jinhong
    • Journal of Software Assessment and Valuation / v.17 no.1 / pp.59-66 / 2021
  • Conceptual information models have recently been changing fast, driven by individual tendencies, social and cultural factors, new circumstances, and societal shifts within the big data environment. Although data keeps growing, now is the time to commit to developing renewable, invaluable information for social and live commerce. Because various kinds of data remain intractable, we propose object classification based on a deep learning prediction model for social commerce in a big data environment. There is a growing need for social commerce platforms capable of handling high volumes of multiple items from users, so responding to rapid changes in users through deep learning is highly significant. The goal of this paper is to meet these needs promptly amid the widespread growth of the big data environment.

A Big-Data Trajectory Combination Method for Navigations using Collected Trajectory Data (수집된 경로데이터를 사용하는 내비게이션을 위한 대용량 경로조합 방법)

  • Koo, Kwang Min;Lee, Taeho;Park, Heemin
    • Journal of Korea Multimedia Society / v.19 no.2 / pp.386-395 / 2016
  • In trajectory-based navigation systems, a huge amount of trajectory data is needed for efficient route exploration. However, it would be very hard to collect trajectories for all possible start and destination combinations. As a practical solution to this problem, we suggest a method that combines collected GPS trajectory data into additional generated trajectories with new start and destination combinations, without using road information. We present a trajectory combination algorithm and its implementation in the Scala programming language on the Spark platform for big data processing. The experimental results showed that the proposed method can effectively expand the collected trajectories into more than three hundred times as many valid trajectory paths.
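
The abstract does not spell out the combination algorithm, so the following is only one plausible reading, written in Python rather than the authors' Scala/Spark: two trajectories that pass through a common point are joined there to produce a route with a new start and destination pair. The function name `combine` and the assumption that GPS points have already been snapped to hashable grid cells are hypothetical.

```python
def combine(a, b):
    """Join trajectory a to trajectory b at their first shared point.

    Returns the start of a up to the shared point followed by the rest
    of b from that point, or None when the trajectories never meet.
    """
    # Remember the earliest position of every point in b.
    index_in_b = {}
    for j, point in enumerate(b):
        index_in_b.setdefault(point, j)
    # Walk along a until it touches b, then switch to b.
    for i, point in enumerate(a):
        if point in index_in_b:
            return a[:i] + b[index_in_b[point]:]
    return None


# Made-up grid-cell trajectories that cross at (2, 0).
east = [(0, 0), (1, 0), (2, 0), (3, 0)]
south = [(2, 2), (2, 1), (2, 0), (2, -1)]
joined = combine(east, south)
```

Here `joined` starts where `east` starts and ends where `south` ends, a start and destination pair that neither collected trajectory covered on its own.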

New Medical Image Fusion Approach with Coding Based on SCD in Wireless Sensor Network

  • Zhang, De-gan;Wang, Xiang;Song, Xiao-dong
    • Journal of Electrical Engineering and Technology / v.10 no.6 / pp.2384-2392 / 2015
  • The technical development and practical application of big data for health is a hot topic under the banner of big data, and big-data medical image fusion is one of its key problems. This paper proposes a new fusion approach with coding based on the Spherical Coordinate Domain (SCD) in a Wireless Sensor Network (WSN) for big-data medical images. In this approach, the three high-frequency coefficients in the wavelet domain of a medical image are pre-processed, which reduces the redundancy ratio of big-data medical images. First, the high-frequency coefficients are transformed to the spherical coordinate domain to reduce the correlation within the same scale. Then, a multi-scale model product (MSMP) is used to control the shrinkage function so that small wavelet coefficients and some noise are removed. The high-frequency parts in the spherical coordinate domain are coded by an improved SPIHT algorithm. Finally, the images are fused and reconstructed based on the multi-scale edges of the medical image. Experimental results indicate that the novel approach is effective and very useful for transmitting big-data medical images, especially in a wireless environment.
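
To make the first pre-processing step concrete, the sketch below maps the three high-frequency wavelet coefficients at one position (horizontal, vertical, diagonal) to spherical coordinates and back. It assumes the standard physics convention for the angles, which the abstract does not specify, and it covers only the coordinate transform, not the MSMP shrinkage or SPIHT coding. The function names are hypothetical.

```python
import math


def to_spherical(h, v, d):
    """Map (h, v, d) detail coefficients to (r, theta, phi)."""
    r = math.sqrt(h * h + v * v + d * d)
    if r == 0.0:
        return 0.0, 0.0, 0.0
    # Clamp guards against tiny floating-point overshoot outside [-1, 1].
    theta = math.acos(max(-1.0, min(1.0, d / r)))  # polar angle from d axis
    phi = math.atan2(v, h)                         # azimuth in the h-v plane
    return r, theta, phi


def from_spherical(r, theta, phi):
    """Inverse transform back to (h, v, d)."""
    return (
        r * math.sin(theta) * math.cos(phi),
        r * math.sin(theta) * math.sin(phi),
        r * math.cos(theta),
    )


r, theta, phi = to_spherical(3.0, 4.0, 12.0)
restored = from_spherical(r, theta, phi)
```

After the transform, the magnitude of all three sub-bands is carried by the single radius `r`, which is one way to see why shrinkage can then act on one value per position instead of three correlated ones.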

Model for Quality Assessment of Data Analytics Software in Manufacturing-Based IIoT Environments (제조 기반 IIoT 환경에서 데이터 분석 소프트웨어의 품질 평가를 위한 모델)

  • Choi, Jongseok;Shin, Yongtae
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.14 no.4 / pp.292-299 / 2021
  • With the development of IT, data mining software for manufacturing-based IIoT environments is increasingly common. However, because software at a manufacturing company must process large amounts of data through big data and data mining techniques, it is difficult to evaluate its quality in the same way as general software. In addition, in a manufacturing environment where heterogeneous equipment and software are mixed, it is difficult to judge the quality of the software in use by applying existing quality characteristics. Therefore, this paper investigates the characteristics of the manufacturing environment, and develops and evaluates a software quality evaluation model suited to it.

Analyzing Operation Deviation in the Deasphalting Process Using Multivariate Statistics Analysis Method

  • Park, Joo-Hwang;Kim, Jong-Soo;Kim, Tai-Suk
    • Journal of Korea Multimedia Society / v.17 no.7 / pp.858-865 / 2014
  • In systems such as MES (Manufacturing Execution System), various sensors collect data in real time and store it as big data to monitor the process. Mining this big data in a distributed computing system can improve the whole process. In this paper, a system to analyze the cause of operation deviation was built using big data collected from the deasphalting process at two different plants. By applying multivariate statistical analysis to the big data collected through the MES, the main cause of operation deviation was analyzed. Forward stepwise regression analysis yielded a regression equation that can explain a 52% increase in performance compared to the existing model. The suggested method can replace the manual analysis used in the existing petrochemical process, which risks being subjective depending on the tester, with an objective analysis method based on numbers and statistics.