• Title/Summary/Keyword: Data-based analysis

Search Result 30,989, Processing Time 0.057 seconds

Big Data Platform Based on Hadoop and Application to Weight Estimation of FPSO Topside

  • Kim, Seong-Hoon;Roh, Myung-Il;Kim, Ki-Su;Oh, Min-Jae
    • Journal of Advanced Research in Ocean Engineering
    • /
    • v.3 no.1
    • /
    • pp.32-40
    • /
    • 2017
  • Recently, the amount of data to be processed and the complexity thereof have been increasing due to the development of information and communication technology, and industry's interest in such big data is increasing day by day. In the shipbuilding and offshore industry also, there is growing interest in the effective utilization of data, since various and vast amounts of data are being generated in the process of design, production, and operation. In order to effectively utilize big data in the shipbuilding and offshore industry, it is necessary to store and process large amounts of data. In this study, it was considered efficient to apply Hadoop and R, which are mostly used in big data related research. Hadoop is a framework for storing and processing big data. It provides the Hadoop Distributed File System (HDFS) for storing big data, and the MapReduce function for processing. Meanwhile, R provides various data analysis techniques through the language and environment for statistical calculation and graphics. While Hadoop makes it is easy to handle big data, it is difficult to finely process data; and although R has advanced analysis capability, it is difficult to use to process large data. This study proposes a big data platform based on Hadoop for applications in the shipbuilding and offshore industry. The proposed platform includes the existing data of the shipyard, and makes it possible to manage and process the data. To check the applicability of the platform, it is applied to estimate the weights of offshore structure topsides. In this study, we store data of existing FPSOs in Hadoop-based Hortonworks Data Platform (HDP), and perform regression analysis using RHadoop. We evaluate the effectiveness of large data processing by RHadoop by comparing the results of regression analysis and the processing time, with the results of using the conventional weight estimation program.

Automated ground penetrating radar B-scan detection enhanced by data augmentation techniques

  • Donghwi Kim;Jihoon Kim;Heejung Youn
    • Geomechanics and Engineering
    • /
    • v.38 no.1
    • /
    • pp.29-44
    • /
    • 2024
  • This research investigates the effectiveness of data augmentation techniques in the automated analysis of B-scan images from ground-penetrating radar (GPR) using deep learning. In spite of the growing interest in automating GPR data analysis and advancements in deep learning for image classification and object detection, many deep learning-based GPR data analysis studies have been limited by the availability of large, diverse GPR datasets. Data augmentation techniques are widely used in deep learning to improve model performance. In this study, we applied four data augmentation techniques (geometric transformation, color-space transformation, noise injection, and applying kernel filter) to the GPR datasets obtained from a testbed. A deep learning model for GPR data analysis was developed using three models (Faster R-CNN ResNet, SSD ResNet, and EfficientDet) based on transfer learning. It was found that data augmentation significantly enhances model performance across all cases, with the mAP and AR for the Faster R-CNN ResNet model increasing by approximately 4%, achieving a maximum mAP (Intersection over Union = 0.5:1.0) of 87.5% and maximum AR of 90.5%. These results highlight the importance of data augmentation in improving the robustness and accuracy of deep learning models for GPR B-scan analysis. The enhanced detection capabilities achieved through these techniques contribute to more reliable subsurface investigations in geotechnical engineering.

Information Visualization for the Manufacturing Process Optimization Based on Design of Experiment and Data Analysis (실험계획법과 데이터 분석 기반의 제조공정 최적화를 위한 정보 시각화)

  • Kim, Jae Chun;Jin, Seon A;Park, Young Hee;Noh, Seong Yeo;Lee, Hyun Dong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.9
    • /
    • pp.393-402
    • /
    • 2015
  • Data visualization technology helps people easily understand various data and its analysis result, so usefulness of it is expected in the real industrial manufacturing sites. The large amount of data which is occurred at the manufacturing sites is able to fulfill very important roll to improve the manufacturing process. In this paper, we propose an information visualization for the manufacturing process optimization based on design of experimental and data analysis. The manufacturing process may be improved and be reduced cause of faulty by providing the easy-process analysis to understand the operation site through the information visualization of data analysis result.

A case study on the application of process abnormal detection process using big data in smart factory (Smart Factory Big Data를 활용한 공정 이상 탐지 프로세스 적용 사례 연구)

  • Nam, Hyunwoo
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.1
    • /
    • pp.99-114
    • /
    • 2021
  • With the Fourth Industrial Revolution based on new technology, the semiconductor manufacturing industry researches various analysis methods such as detecting process abnormalities and predicting yield based on equipment sensor data generated in the manufacturing process. The semiconductor manufacturing process consists of hundreds of processes and thousands of measurement processes associated with them, each of which has properties that cannot be defined by chemical or physical equations. In the individual measurement process, the actual measurement ratio does not exceed 0.1% to 5% of the target product, and it cannot be kept constant for each measurement point. For this reason, efforts are being made to determine whether to manage by using equipment sensor data that can indirectly determine the normal state of each step of the process. In this study, the Functional Data Analysis (FDA) was proposed to define a process abnormality detection process based on equipment sensor data and compensate for the disadvantages of the currently applied statistics-based diagnosis method. Anomaly detection accuracy was compared using machine learning on actual field case data, and its effectiveness was verified.

A Study on the Strategy of the Use of Big Data for Cost Estimating in Construction Management Firms based on the SWOT Analysis (SWOT분석을 통한 CM사 견적업무 빅데이터 활용전략에 관한 연구)

  • Kim, Hyeon Jin;Kim, Han Soo
    • Korean Journal of Construction Engineering and Management
    • /
    • v.23 no.2
    • /
    • pp.54-64
    • /
    • 2022
  • Since the interest in big data is growing exponentially, various types of research and development in the field of big data have been conducted in the construction industry. Among various application areas, cost estimating can be a topic where the use of big data provides positive benefits. In order for firms to make efficient use of big data for estimating tasks, they need to establish a strategy based on the multifaceted analysis of internal and external environments. The objective of the study is to develop and propose a strategy of the use of big data for construction management(CM) firms' cost estimating tasks based on the SWOT analysis. Through the combined efforts of literature review, questionnaire survey, interviews and the SWOT analysis, the study suggests that CM firms need to maintain the current level of the receptive culture for the use of big data and expand incrementally information resources. It also proposes that they need to reinforce the weak areas including big data experts and practice infrastructure for improving the big data-based cost estimating.

A Hybrid Approach Based on Multi-Criteria Satisfaction Analysis (MUSA) and a Network Data Envelopment Analysis (NDEA) to Evaluate Efficiency of Customer Services in Bank Branches

  • Khalili-Damghani, Kaveh;Taghavi-Fard, Mohammad;Karbaschi, Kiaras
    • Industrial Engineering and Management Systems
    • /
    • v.14 no.4
    • /
    • pp.347-371
    • /
    • 2015
  • A hybrid procedure based on multi-Criteria Satisfaction Analysis (MUSA) and a Network Data Envelopment Analysis (NDEA) is proposed to evaluate the relative efficiency of customer services in bank branches. First, a three-stage process including sub-processes such as customer expectations, customer satisfaction, and customer loyalty, is defined to model the banking customer services. Then, fulfillment of customer expectations, customer loyalty level, and the customer satisfaction degree are measured and quantified through a multi-dimensional questionnaire based on customers' perceptions analysis and MUSA method, respectively. The customer services scores and the other criteria such as mean of employee evaluation score, operation costs, assets, deposits, loans, number of accounts are considered in network three-stage DEA model. The proposed NDEA model is formed based on multipliers perspective, output-oriented, and constant return to scale assumptions. The proposed NDEA model quantifies and assesses the total efficiency of main process and assigns the efficiency to customer expectations, customer satisfactions, and customer loyalties sub-processes in bank branches. The whole procedure is applied on 30 bank branches in IRAN. The proposed approach can be used in other organizations such as airports, airline agencies, urban transportation systems, railway organizations, chain stores, chain restaurants, public libraries, and entertainment centers.

Development of Web-Based Wind Data Analysis System for HeMOSU-1 (웹기반 해모수-1 풍황자료 분석 시스템 개발)

  • Ryu, Ki-Wahn;Park, Kun-Sung;Lee, Jong-Hwa;Oh, Soo-Yun;Kim, Ji-Young;Park, Myoung-Ho
    • Journal of Wind Energy
    • /
    • v.4 no.1
    • /
    • pp.60-67
    • /
    • 2013
  • A web-based program was developed for analyzing weather and structure data from the HeMOSU-1 offshore meteorological mast installed by the KEPCO Research Institute, and 35 km west-southwestward away from Gyeokpo located in Jeonbuk province. All of the measured data are obtained through the data transmitter and the server systems equipped on the HeMOSU-1 and the aerodynamic laboratory in Chonbuk National University respectively. The dualised server system consists of two servers, one is for logging the 1 second based raw data with 10 minute averaged values, and the other is for managing web page with processed weather data. Daily or weekly 10-min averaged data can be provided based on the input date by users. Processed weather data such as wind rose, Weibull distribution, diurnal distribution, turbulence intensity according to wind speed, wind energy density, and so forth are visualized through the web page which would be both useful and informative for developing the wind farm or designing a wind blade for the wind farm nearby southwest sea around the Korean Peninsula. The URL for this web page is http://www.hemosu.org/.

A Study on the Analysis of Spatial Characteristics with Respect to Regional Mobility Using Clustering Technique Based on Origin-Destination Mobility Data (기종점 모빌리티 데이터 기반 클러스터링 기법을 활용한 지역 모빌리티의 공간적 특성 분석 연구)

  • Donghoun Lee;Yongjun Ahn
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.22 no.1
    • /
    • pp.219-232
    • /
    • 2023
  • Mobility services need to change according to the regional characteristics of the target service area. Accordingly, analysis of mobility patterns and characteristics based on Origin-Destination (OD) data that reflect travel behaviors in the target service area is required. However, since conventional methods construct the OD data obtained from the administrative district-based zone system, it is hard to ensure spatial homogeneity. Hence, there are limitations in analyzing the inherent travel patterns of each mobility service, particularly for new mobility service like Demand Responsive Transit (DRT). Unlike the conventional approach, this study applies a data-driven clustering technique to conduct spatial analyses on OD travel patterns of regional mobility services based on reconstructed OD data derived from re-aggregation for original OD distributions. Based on the reconstructed OD data that contains information on the inherent feature vectors of the original OD data, the proposed method enables analysis of the spatial characteristics of regional mobility services, including public transit bus, taxi and DRT.

The Design and Implementation of Web-Based Integrated Genome Analysis Tools (웹 기반 통합 유전체 분석 시스템의 설계 및 구현)

  • 최범순;이경희;권해룡;조완섭;이충세;김영창
    • Journal of Korea Multimedia Society
    • /
    • v.7 no.3
    • /
    • pp.408-417
    • /
    • 2004
  • Genome analysis process requires several steps of various software analysis tools. We propose WGAT(Web-based Genome Analysis Tool), which combines several tools for gene analysis and provides a graphic user interface for users. Software tools related to gene analysis are based on Linux or Unix oriented program, which is difficult to install and use for biologists. Furthermore, files generated from gene analysis frequently require manual transformation for next step input file. Web-based tools which are recently developed process orily one sequence at a time. So it needs many repetitive processes to analyze large size data file. WGAT is developed to support Web-based genome analysis for easy use as well as fast service for users. Whole genome data analysis can be done by running WGAT on Linux server and giving sequence data files with various options. Therefore many steps of the analysis can be done automatically by the system. Simulation shows that WGAT method gives 20 times faster analysis when sequence segment is one thousand.

  • PDF

A Development Method of Framework for Collecting, Extracting, and Classifying Social Contents

  • Cho, Eun-Sook
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.1
    • /
    • pp.163-170
    • /
    • 2021
  • As a big data is being used in various industries, big data market is expanding from hardware to infrastructure software to service software. Especially it is expanding into a huge platform market that provides applications for holistic and intuitive visualizations such as big data meaning interpretation understandability, and analysis results. Demand for big data extraction and analysis using social media such as SNS is very active not only for companies but also for individuals. However despite such high demand for the collection and analysis of social media data for user trend analysis and marketing, there is a lack of research to address the difficulty of dynamic interlocking and the complexity of building and operating software platforms due to the heterogeneity of various social media service interfaces. In this paper, we propose a method for developing a framework to operate the process from collection to extraction and classification of social media data. The proposed framework solves the problem of heterogeneous social media data collection channels through adapter patterns, and improves the accuracy of social topic extraction and classification through semantic association-based extraction techniques and topic association-based classification techniques.