• 제목/요약/키워드: Log data

Search Result 2,131, Processing Time 0.026 seconds

Design and Application of Metadata Schema in Datawebhouse System (데이터웹하우스 시스템에서 메타데이터 스키마의 설계 및 활용)

  • Park, Jong-Mo;Cho, Kyung-San
    • The KIPS Transactions:PartD
    • /
    • v.14D no.6
    • /
    • pp.701-706
    • /
    • 2007
  • Datawebhouse consists of both web log analysis used for customer management and datawarehouse used for decision support. However, datawebhouse needs complex operations for management in order to transform and integrate data from heterogeneous data sources and distributed systems. We propose a metadata schema in order to enable data integration and data management which are essential in datawebhouse environments. We show that our proposed schema supports datawebhouse development and enables integrated asset management of business information. With ETL metadata for web log extract, we can improve the data processing time of web log.

Performance Analysis of M-ary Optical Communication over Log-Normal Fading Channels for CubeSat Platforms

  • Lim, Hyung-Chul;Yu, Sung-Yeol;Sung, Ki-Pyoung;Park, Jong Uk;Choi, Chul-Sung;Choi, Mansoo
    • Journal of Astronomy and Space Sciences
    • /
    • v.37 no.4
    • /
    • pp.219-228
    • /
    • 2020
  • A CubeSat platform has become a popular choice due to inexpensive commercial off-the-shelf (COTS) components and low launch cost. However, it requires more power-efficient and higher-data rate downlink capability for space applications related to remote sensing. In addition, the platform is limited by the size, weight and power (SWaP) constraints as well as the regulatory issue of licensing the radio frequency (RF) spectrum. The requirements and limitations have put optical communications on promising alternatives to RF communications for a CubeSat platform, owing to the power efficiency and high data rate as well as the license free spectrum. In this study, we analyzed the performance of optical downlink communications compatible with CubeSat platforms in terms of data rate, bit error rate (BER) and outage probability. Mathematical models of BER and outage probability were derived based on not only the log-normal model of atmospheric turbulence but also a transmitter with a finite extinction ratio. Given the fixed slot width, the optimal guard time and modulation orders were chosen to achieve the target data rate. And the two performance metrics, BER and outage data rate, were analyzed and discussed with respect to beam divergence angle, scintillation index and zenith angle.

Analysis of Pathogenic Microorganism's Contamination on Cultivation Environment of Strawberry and Tomato in Korea

  • Oh, Soh-Young;Nam, Ki-Woong;Kim, Won-Il;Lee, Mun Haeng;Yoon, Deok-Hoon
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.47 no.6
    • /
    • pp.510-517
    • /
    • 2014
  • The purpose of this study was to analyze microbial hazards for cultivation environments and personal hygiene of strawberry and tomato farms at the growth and harvesting stage. Samples were collected from thirty strawberry farms and forty tomato farms located in Korea and tested for Staphylococcus aureus and Bacillus cereus. To investigate the change in the distribution of the S. aureus and B. cereus, a total of 4,284 samples including air born, soil or medium, mulching film, harvest basket, groves and irrigation water etc. were collected from eight strawberry farms and nine tomato farms for one year. As a result, total S. aureus and B. cereus in all samples were detected. Among the total bacteria of strawberry farms, S. aureus (glove: $0{\sim}2.1Log\;CFU/100cm^2$, harvest basket: $0{\sim}3.0Log\;CFU/100cm^2$, soil or culture media: 0~4.1 Log CFU/g, mulching film: $0{\sim}3.8Log\;CFU/100cm^2$), B. cereus (glove: $0{\sim}2.8Log\;CFU/100cm^2$, harvest basket: $0{\sim}4.8Log\;CFU/100cm^2$, soil or culture media: 0~5.3 Log CFU/g, mulching film: $0{\sim}4.5Log\;CFU/100cm^2$) were detected in all samples. The total bacteria of tomato farms, S. aureus (glove: $0{\sim}4.0Log\;CFU/100cm^2$, harvest basket: $0{\sim}5.0Log\;CFU/100cm^2$, soil or culture media: 0~6.1 Log CFU/g, mulching film: $0{\sim}4.0Log\;CFU/100cm^2$), B. cereus (glove: $0{\sim}4.0Log\;CFU/100cm^2$, harvest basket: $0{\sim}4.3Log\;CFU/100cm^2$, soil or culture media: 0~5.9 Log CFU/g, mulching film: $0{\sim}4.7Log\;CFU/100cm^2$) were detected in all samples. The contamination of S. aureus and B. cereus were detected in soil, mulching film and harvest basket from planting until harvest to processing, with the highest count recorded from the soil. But S. aureus and B. cereus were not detected in irrigation water samples. The incidence of S. aureus and B. cereus in hydroponics culture farm were less than those in soil culture. The amount of S. aureus and B. cereus detected in strawberry and tomato farms were less than the minimum amount required to produce a toxin that induces food poisoning. In this way, the degree of contamination of food poisoning bacteria was lower in the production environment of the Korea strawberry and tomato, but problems can be caused by post-harvest management method. These results will be used as fundamental data to create a manual for sanitary agricultural environment management, and post-harvest management should be performed to reduce the contamination of hazardous microorganisms.

A Personal Memex System Using Uniform Representation of the Data from Various Devices (다양한 기기로부터의 데이터 단일 표현을 통한 개인 미멕스 시스템)

  • Min, Young-Kun;Lee, Bog-Ju
    • The KIPS Transactions:PartB
    • /
    • v.16B no.4
    • /
    • pp.309-318
    • /
    • 2009
  • The researches on the system that automatically records and retrieves one's everyday life is relatively actively worked recently. These systems, called personal memex or life log, usually entail dedicated devices such as SenseCam in MyLifeBits project. This research paid attention to the digital devices such as mobile phones, credit cards, and digital camera that people use everyday. The system enables a person to store everyday life systematically that are saved in the devices or the deviced-related web pages (e.g., phone records in the cellular phone company) and to refer this quickly later. The data collection agent in the proposed system, called MyMemex, collects the personal life log "web data" using the web services that the web sites provide and stores the web data into the server. The "file data" stored in the off-line digital devices are also loaded into the server. Each of the file data or web data is viewed as a memex event that can be described by 4W1H form. The different types of data in different services are transformed into the memex event data in 4W1H form. The memex event ontology is used in this transform. Users can sign in to the web server of this service to view their life logs in the chronological manner. Users can also search the life logs using keywords. Moreover, the life logs can be viewed as a diary or story style by converting the memex events to sentences. The related memex events are grouped to be displayed as an "episode" by a heuristic identification method. A result with high accuracy has been obtained by the experiment for the episode identification using the real life log data of one of the authors.

Modeling of Rate-of-Occurrence-of-Failure According to the Failure Data Type of Water Distribution Cast Iron Pipes and Estimation of Optimal Replacement Time Using the Modified Time Scale (상수도 주철 배수관로의 파손자료 유형에 따른 파손율 모형화와 수정된 시간척도를 이용한 최적교체시기의 산정)

  • Park, Su-Wan;Jun, Hwan-Don;Kim, Jung-Wook
    • Journal of Korea Water Resources Association
    • /
    • v.40 no.1 s.174
    • /
    • pp.39-50
    • /
    • 2007
  • This paper presents applications of the log-linear ROCOF(rate-of-occurrence-of-failure) and the Weibull ROCOF to model the failure rate of individual cast iron pipes in a water distribution system and provides a method of estimating the economically optimal replacement time of the pipes using the 'modified time-scale'. The performance of the two ROCOFs is examined using the maximized log-likelihood estimates of the ROCOFs for the two types of failure data: 'failure-time data' and 'failure-number data'. The optimal replacement time equations for the two models are developed by applying the 'modified time-scale' to ensure the numerical convergence of the estimated values of the model parameters. The methodology is applied to the case study water distribution cast iron pipes and it is found that the log-linear ROCOF has better modeling capability than the Weibull ROCOF when the 'failure-time data' is used. Furthermore, the 'failure-time data' is determined to be more appropriate for both ROCOFs compared to the 'failure-number data' in terms of the ROCOF modeling performances for the water mains under study, implying that recording each failure time results in better modeling of the failure rate than recording failure numbers in some time intervals.

The Comparative Study of NHPP Software Reliability Model Based on Log and Exponential Power Intensity Function (로그 및 지수파우어 강도함수를 이용한 NHPP 소프트웨어 무한고장 신뢰도 모형에 관한 비교연구)

  • Yang, Tae-Jin
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.8 no.6
    • /
    • pp.445-452
    • /
    • 2015
  • Software reliability in the software development process is an important issue. Software process improvement helps in finishing with reliable software product. Infinite failure NHPP software reliability models presented in the literature exhibit either constant, monotonic increasing or monotonic decreasing failure occurrence rates per fault. In this paper, proposes the reliability model with log and power intensity function (log linear, log power and exponential power), which made out efficiency application for software reliability. Algorithm to estimate the parameters used to maximum likelihood estimator and bisection method, model selection based on mean square error (MSE) and coefficient of determination($R^2$), for the sake of efficient model, was employed. Analysis of failure, using real data set for the sake of proposing log and power intensity function, was employed. This analysis of failure data compared with log and power intensity function. In order to insurance for the reliability of data, Laplace trend test was employed. In this study, the log type model is also efficient in terms of reliability because it (the coefficient of determination is 70% or more) in the field of the conventional model can be used as an alternative could be confirmed. From this paper, software developers have to consider the growth model by prior knowledge of the software to identify failure modes which can be able to help.

Comparison of Methods for Reducing the Dimension of Compositional Data with Zero Values

  • Song, Taeg-Youn;Choi, Byung-Jin
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.4
    • /
    • pp.559-569
    • /
    • 2012
  • Compositional data consist of compositions that are non-negative vectors of proportions with the unit-sum constraint. In disciplines such as petrology and archaeometry, it is fundamental to statistically analyze this type of data. Aitchison (1983) introduced a log-contrast principal component analysis that involves logratio transformed data, as a dimension-reduction technique to understand and interpret the structure of compositional data. However, the analysis is not usable when zero values are present in the data. In this paper, we introduce 4 possible methods to reduce the dimension of compositional data with zero values. Two real data sets are analyzed using the methods and the obtained results are compared.

Hazard Analysis of Staphylococcus aureus in Ready-to-Eat Sandwiches (즉석섭취 샌드위치류의 황색포도상구균에 대한 위해분석)

  • Park, Hae-Jung;Bae, Hyun-Joo
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.36 no.7
    • /
    • pp.938-943
    • /
    • 2007
  • This study investigated the hazard analysis of ready-to-eat sandwiches sold in various establishments. Sandwich samples were collected from convenience stores, discount stores, sandwich chain stores, bakery shops, fast-food chain stores, and food service operations located in Daegu and Gyeongbuk. Out of 174 samples, 18 (10.3%) contained coagulase positive staphylococci with counts ranging from 0.30 to 4.08 log CFU/g. There was significant seasonal difference in Staphylococcus aureus isolation; the average count in summer (3.24 log CFU/g) was 3 times higher than that of winter (1.10 log CFU/g) (P<0.001). According to the microbiological guidelines of PHLS for ready-to-eat foods, 95.4% of the samples were acceptable. As a result of enterotoxin producing experimental data ($35^{\circ}C$, pH 5.8, NaCl 0.5%), enterotoxin was not produced in a sandwich until Staphylococcus aureus increased to a level greater than 4.95 log CFU/g. This microbiological hazard analysis data could be applied to future studies on quantitative risk assessment of ready-to-eat foods.

A Consistency Control of Method for Spatial Data Cached in Mobile Clients (모바일 클라이언트에 캐쉬된 공간 데이터의 일관성 제어 기법)

  • 안경환;차지태;홍봉희
    • Journal of KIISE:Databases
    • /
    • v.31 no.3
    • /
    • pp.274-286
    • /
    • 2004
  • In mobile client-server environments, mobile clients usually are disconnected with their server because of high cost of wireless communication and keep their own local copies to provide efficient updating the cached map. The update of the server database leads to invalidation of the cached map in the client side. To solve the issues of invalidation of the cached map, it is not efficient to resend part of the updated server database to clients whenever the updating of the server database occurs. This paper proposes a log-based update propagation method to propagate the server's update into its relevant clients by using only update logs. Too many logs increasingly accumulate as the sever database is updated several times. The sequential search of the relevant log data for a specific client is time-consuming. Sending of unnecessary logs should be avoided for reducing the overhead of communication.'re solve these problems, we first define unnecessary logs and then suggest log reduction methods to avoid or cancel creating unnecessary logs. The update log index is used for quickly retrieving relevant logs.

Sparse Data Cleaning using Multiple Imputations

  • Jun, Sung-Hae;Lee, Seung-Joo;Oh, Kyung-Whan
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.4 no.1
    • /
    • pp.119-124
    • /
    • 2004
  • Real data as web log file tend to be incomplete. But we have to find useful knowledge from these for optimal decision. In web log data, many useful things which are hyperlink information and web usages of connected users may be found. The size of web data is too huge to use for effective knowledge discovery. To make matters worse, they are very sparse. We overcome this sparse problem using Markov Chain Monte Carlo method as multiple imputations. This missing value imputation changes spare web data to complete. Our study may be a useful tool for discovering knowledge from data set with sparseness. The more sparseness of data in increased, the better performance of MCMC imputation is good. We verified our work by experiments using UCI machine learning repository data.