• Title/Summary/Keyword: Data Comparison

Search Result 12,566, Processing Time 0.038 seconds

Performance Comparison and Analysis of Container-based Host Operating Systems for sending and receiving High-capacity data on Server Systems

  • Kim, Sungho;Kwon, Oeon;Kim, Jung Han;Byeon, JiHyeon;Hwang, Sang-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.7
    • /
    • pp.65-73
    • /
    • 2022
  • Recently, as the Windows system supports the Windows subsystem for Linux (WSL), various researchers have studied to apply a docker container on various systems such as server systems, workstation system and so on. However, in various existing researchers, there is a lack of performance-related indicators to apply the system to each operating system (linux system and windows system). In this paper, we compared a performance comparison and analysis of container-based host operating systems. We configured experimental environments of operating systems for microsoft windows systems and linux systems based on a docker container support. In experimental results, the containers of linux systems reduced the average data latency of dataset 1-6 by 3.9%, 62.16%, 1552.38%, 7.27%, 60.83%, and 1567.2%, compared to the containers on microsoft windows systems.

Comparison of Methods for Reducing the Dimension of Compositional Data with Zero Values

  • Song, Taeg-Youn;Choi, Byung-Jin
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.4
    • /
    • pp.559-569
    • /
    • 2012
  • Compositional data consist of compositions that are non-negative vectors of proportions with the unit-sum constraint. In disciplines such as petrology and archaeometry, it is fundamental to statistically analyze this type of data. Aitchison (1983) introduced a log-contrast principal component analysis that involves logratio transformed data, as a dimension-reduction technique to understand and interpret the structure of compositional data. However, the analysis is not usable when zero values are present in the data. In this paper, we introduce 4 possible methods to reduce the dimension of compositional data with zero values. Two real data sets are analyzed using the methods and the obtained results are compared.

Improving the Performance of Threshold Bootstrap for Simulation Output Analysis (시뮬레이션 출력분석을 위한 임계값 부트스트랩의 성능개선)

  • Kim, Yun-Bae
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.23 no.4
    • /
    • pp.755-767
    • /
    • 1997
  • Analyzing autocorrelated data set is still an open problem. Developing on easy and efficient method for severe positive correlated data set, which is common in simulation output, is vital for the simulation society. Bootstrap is on easy and powerful tool for constructing non-parametric inferential procedures in modern statistical data analysis. Conventional bootstrap algorithm requires iid assumption in the original data set. Proper choice of resampling units for generating replicates has much to do with the structure of the original data set, iid data or autocorrelated. In this paper, a new bootstrap resampling scheme is proposed to analyze the autocorrelated data set : the Threshold Bootstrap. A thorough literature search of bootstrap method focusing on the case of autocorrelated data set is also provided. Theoretical foundations of Threshold Bootstrap is studied and compared with other leading bootstrap sampling techniques for autocorrelated data sets. The performance of TB is reported using M/M/1 queueing model, else the comparison of other resampling techniques of ARMA data set is also reported.

  • PDF

A Study on the de-identification of Personal Information of Hotel Users (호텔 이용 고객의 개인정보 비식별화 방안에 관한 연구)

  • Kim, Taekyung
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.12 no.4
    • /
    • pp.51-58
    • /
    • 2016
  • In the area of hotel and tourism sector, various research are analyzed using big data. Big data is being generated by any digital devices around us all the times. All the digital process and social media exchange produces the big data. In this paper, we analyzed the de-identification method of big data to use the personal information of hotel guests. Through the analysis of these big data, hotel can provide differentiated and diverse services to hotel guests and can improve the service and support the marketing of hotels. If the hotel wants to use the information of the guest, the private data should be de-identified. There are several de-identification methods of personal information such as pseudonymisation, aggregation, data reduction, data suppression and data masking. Using the comparison of these methods, the pseudonymisation is discriminated to the suitable methods for the analysis of information for the hotel guest. Also, among the pseudonymisation methods, the t-closeness was analyzed to the secure and efficient method for the de-identification of personal information in hotel.

Twostep Clustering of Environmental Indicator Survey Data

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.1-11
    • /
    • 2006
  • Data mining technique is used to find hidden knowledge by massive data, unexpectedly pattern, relation to new rule. The methods of data mining are decision tree, association rules, clustering, neural network and so on. Clustering is the process of grouping the data into clusters so that objects within a cluster have high similarity in comparison to one another. It has been widely used in many applications, such that pattern analysis or recognition, data analysis, image processing, market research on off-line or on-line and so on. We analyze Gyeongnam social indicator survey data by 2001 using twostep clustering technique for environment information. The twostep clustering is classified as a partitional clustering method. We can apply these twostep clustering outputs to environmental preservation and improvement.

  • PDF

Comparison of Neural Network Techniques for Text Data Analysis

  • Kim, Munhee;Kang, Kee-Hoon
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.2
    • /
    • pp.231-238
    • /
    • 2020
  • Generally, sequential data refers to data having continuity. Text data, which is a representative type of unstructured data, is also sequential data in that it is necessary to know the meaning of the preceding word in order to know the meaning of the following word or context. So far, many techniques for analyzing sequential data such as text data have been proposed. In this paper, four methods of 1d-CNN, LSTM, BiLSTM, and C-LSTM are introduced, focusing on neural network techniques. In addition, by using this, IMDb movie review data was classified into two classes to compare the performance of the techniques in terms of accuracy and analysis time.

Filtering Correction Method and Performance Comparison for Time Series Data

  • Baek, Jongwoo;Choi, Jiyoung;Jung, Hoekyung
    • Journal of information and communication convergence engineering
    • /
    • v.20 no.2
    • /
    • pp.125-130
    • /
    • 2022
  • In modern society, as many data are used for research or commercial purposes, the value of data is gradually increasing. In related fields, research is being actively conducted to collect valuable data, but it is difficult to collect proper data because the value of collection is determined according to the performance of existing sensors. To solve this problem, a method to effectively reduce noise has been proposed, but there is a point in which performance is degraded due to damage caused by noise. In this paper, a device capable of collecting time series data was designed to correct such data noise, and a correction technique was performed by giving an error value based on the representatively collected ultrafine dust data, and then comparing before and after Compare performance. For the correction method, Kalman, LPF, Savitzky-Golay, and Moving Average filter were used. Savitzky-Golay filter and Moving Average Filter showed excellent correction rate as an experiment. Through this, the performance of the sensor can be supplemented and it is expected that data can be effectively collected.

Voice Analysis of Chronic & Daily Voice Burden in Professionals (직업적인 음성과사용자들의 음성 부담에 대한 평가)

  • 남순열;김준모;박형욱;이석우;박혜성;김상윤;유승주
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.12 no.1
    • /
    • pp.17-21
    • /
    • 2001
  • Aims of study : The purpose of this study is to measure the chronic and daily voice burden of the professionals in their actual working places. These will be a valuable guideline for preventing and controlling the voice production of professionals. Material and method : Our study was selected to the 10 female telephone operators in the Asan Medical Center, ages ranging from 22 to 38 years old. The symptoms and acoustic analysis of both telephone operators and the controls were evaluated before and after their working. The symptoms were evaluated with questionaires, and the acoustic analysis was measured by using CSL (computerized speech laboratory) system. Results : The symptoms of the professional voice abusers are same as those symptoms in laryngeal fatigue. The acoustic analysis before their working were significantly increased in jitter and shimmer, in comparison with the data of the control. This shows that the experimental group is exposed to the chronic burden of voice production. The jitter, shimmer, and NHR after their working are significantly increased in comparison with the data of the acoustic analysis before their working. This also shows that the experimental group is exposed to the daily burden of voice production. Conclusion : The acoustic analysis of the professional voice overusers has objectively measured that there are chronic and daily overloading to the voice of operators, and these will be a valuable data for preventing and controlling the professionals that abuse their voice.

  • PDF

A Development of Sound Quality Index of an Intake and Exhaust System for High Quality Improvement of Luxury Vehicles (차량 고급감 향상을 위한 흡배기계 음질지수 개발)

  • Lee, Jong-Kyu;Cho, Teock-Hyeong;Seo, Dae-Won;Lim, Yun-Soo;Won, Kwang-Min
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.22 no.3
    • /
    • pp.234-243
    • /
    • 2012
  • In this paper, a sound quality indices for the evaluation of vehicle intake and exhaust noise were developed through a correlation analysis of objective measurement data and subjective evaluation data. At first, intake and exhaust orifice noise were measured at the wide-open throttle sweep condition. And then, acoustic transfer function between intake orifice noise and interior noise at the steady state condition was measured. Also, acoustic transfer function for exhaust system was measured as the same method. Simultaneously, subjective evaluation was carried out by the paired comparison and semantic differential method by 27 engineers. Next, the correlation analysis between the psycho-acoustic parameters derived from the measured data and the subjective evaluation was performed. The most critical factor was determined and the corresponding sound quality index for the intake and exhaust noise was obtained from the multiple factor regression method. At last, the effectiveness of the proposed index was investigated.

Development, Demonstration and Validation of the Deep Space Orbit Determination Software Using Lunar Prospector Tracking Data

  • Lee, Eunji;Kim, Youngkwang;Kim, Minsik;Park, Sang-Young
    • Journal of Astronomy and Space Sciences
    • /
    • v.34 no.3
    • /
    • pp.213-223
    • /
    • 2017
  • The deep space orbit determination software (DSODS) is a part of a flight dynamic subsystem (FDS) for the Korean Pathfinder Lunar Orbiter (KPLO), a lunar exploration mission expected to launch after 2018. The DSODS consists of several sub modules, of which the orbit determination (OD) module employs a weighted least squares algorithm for estimating the parameters related to the motion and the tracking system of the spacecraft, and subroutines for performance improvement and detailed analysis of the orbit solution. In this research, DSODS is demonstrated and validated at lunar orbit at an altitude of 100 km using actual Lunar Prospector tracking data. A set of a priori states are generated, and the robustness of DSODS to the a priori error is confirmed by the NASA planetary data system (PDS) orbit solutions. Furthermore, the accuracy of the orbit solutions is determined by solution comparison and overlap analysis as about tens of meters. Through these analyses, the ability of the DSODS to provide proper orbit solutions for the KPLO are proved.