• Title/Summary/Keyword: Data Model

Search Result 46,834, Processing Time 0.062 seconds

A Proportional Odds Mixed - Effects Model for Ordinal Data

  • Choi, Jae-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.2
    • /
    • pp.471-479
    • /
    • 2007
  • This paper discusses about how to build up mixed-effects model for analysing ordinal response data by using cumulative logits. Random factors are assumed to be coming from the designed sampling scheme for choosing observational units. Since the observed responses of individuals are ordinal, a proportional odds model with two random effects is suggested. Estimation procedure for the unknown parameters in a suggested model is also discussed by an illustrated example.

  • PDF

A Cumulative Logit Mixed Model for Ordered Response Data

  • Choi, Jae-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.123-130
    • /
    • 2006
  • This paper discusses about how to build up a mixed-effects model using cumulative logits when some factors are fixed and others are random. Location effects are considered as random effects by choosing them randomly from a population of locations. Estimation procedure for the unknown parameters in a suggested model is also discussed by an illustrated example.

  • PDF

Relationships Between the Characteristics of the Business Data Set and Forecasting Accuracy of Prediction models (시계열 데이터의 성격과 예측 모델의 예측력에 관한 연구)

  • 이원하;최종욱
    • Journal of Intelligence and Information Systems
    • /
    • v.4 no.1
    • /
    • pp.133-147
    • /
    • 1998
  • Recently, many researchers have been involved in finding deterministic equations which can accurately predict future event, based on chaotic theory, or fractal theory. The theory says that some events which seem very random but internally deterministic can be accurately predicted by fractal equations. In contrast to the conventional methods, such as AR model, MA, model, or ARIMA model, the fractal equation attempts to discover a deterministic order inherent in time series data set. In discovering deterministic order, researchers have found that neural networks are much more effective than the conventional statistical models. Even though prediction accuracy of the network can be different depending on the topological structure and modification of the algorithms, many researchers asserted that the neural network systems outperforms other systems, because of non-linear behaviour of the network models, mechanisms of massive parallel processing, generalization capability based on adaptive learning. However, recent survey shows that prediction accuracy of the forecasting models can be determined by the model structure and data structures. In the experiments based on actual economic data sets, it was found that the prediction accuracy of the neural network model is similar to the performance level of the conventional forecasting model. Especially, for the data set which is deterministically chaotic, the AR model, a conventional statistical model, was not significantly different from the MLP model, a neural network model. This result shows that the forecasting model. This result shows that the forecasting model a, pp.opriate to a prediction task should be selected based on characteristics of the time series data set. Analysis of the characteristics of the data set was performed by fractal analysis, measurement of Hurst index, and measurement of Lyapunov exponents. As a conclusion, a significant difference was not found in forecasting future events for the time series data which is deterministically chaotic, between a conventional forecasting model and a typical neural network model.

  • PDF

Schema Class Inheritance Model for Research Data Management and Search (연구데이터 관리 및 검색을 위한 스키마 클래스 상속 모델)

  • Kim, Suntae
    • Journal of the Korean Society for information Management
    • /
    • v.31 no.2
    • /
    • pp.41-56
    • /
    • 2014
  • The necessity of the raw data management and reuse is issued by diffusion of the recognition that research data is a national asset. In this paper, a metadata design model by schema class inheritance and a metadata integrated search model by schema objects are suggested for a structural management of the data. A data architecture in which an schema object has an 1 : 1 relation to the data collection was designed. A suggested model was testified by creation of a virtual schema class and objects which inherit the schema class. It showed the possibility of implement systematically. A suggested model can be used to manage the data which are produced by government agencies because schema inheritance and integrated search model present way to overcome the weak points of the 'Top-dow Hierarchy model' which is being used to design the metadata schema.

Conceptual Data Modeling: Entity-Relationship Models as Thinging Machines

  • Al-Fedaghi, Sabah
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.9
    • /
    • pp.247-260
    • /
    • 2021
  • Data modeling is a process of developing a model to design and develop a data system that supports an organization's various business processes. A conceptual data model represents a technology-independent specification of structure of data to be stored within a database. The model aims to provide richer expressiveness and incorporate a set of semantics to (a) support the design, control, and integrity parts of the data stored in data management structures and (b) coordinate the viewing of connections and ideas on a database. The described structure of the data is often represented in an entity–relationship (ER) model, which was one of the first data-modeling techniques and is likely to continue to be a popular way of characterizing entity classes, attributes, and relationships. This paper attempts to examine the basic ER modeling notions in order to analyze the concepts to which they refer as well as ways to represent them. In such a mission, we apply a new modeling methodology (thinging machine; TM) to ER in terms of its fundamental building constructs, representation entities, relationships, and attributes. The goal of this venture is to further the understanding of data models and enrich their semantics. Three specific contributions to modeling in this context are incorporated: (a) using the TM model's five generic actions to inject processing in the ER structure; (b) relating the single ontological element of TM modeling (i.e., a thing/machine or thimac) to ER entities and relationships; and (c) proposing a high-level integrated, extended ER model that includes structural and time-oriented notions (e.g., events or behavior).

Simulation combined transfer learning model for missing data recovery of nonstationary wind speed

  • Qiushuang Lin;Xuming Bao;Ying Lei;Chunxiang Li
    • Wind and Structures
    • /
    • v.37 no.5
    • /
    • pp.383-397
    • /
    • 2023
  • In the Structural Health Monitoring (SHM) system of civil engineering, data missing inevitably occurs during the data acquisition and transmission process, which brings great difficulties to data analysis and poses challenges to structural health monitoring. In this paper, Convolution Neural Network (CNN) is used to recover the nonstationary wind speed data missing randomly at sampling points. Given the technical constraints and financial implications, field monitoring data samples are often insufficient to train a deep learning model for the task at hand. Thus, simulation combined transfer learning strategy is proposed to address issues of overfitting and instability of the deep learning model caused by the paucity of training samples. According to a portion of target data samples, a substantial quantity of simulated data consistent with the characteristics of target data can be obtained by nonstationary wind-field simulation and are subsequently deployed for training an auxiliary CNN model. Afterwards, parameters of the pretrained auxiliary model are transferred to the target model as initial parameters, greatly enhancing training efficiency for the target task. Simulation synergy strategy effectively promotes the accuracy and stability of the target model to a great extent. Finally, the structural dynamic response analysis verifies the efficiency of the simulation synergy strategy.

Bayesian Outlier Detection in Regression Model

  • Younshik Chung;Kim, Hyungsoon
    • Journal of the Korean Statistical Society
    • /
    • v.28 no.3
    • /
    • pp.311-324
    • /
    • 1999
  • The problem of 'outliers', observations which look suspicious in some way, has long been one of the most concern in the statistical structure to experimenters and data analysts. We propose a model for an outlier problem and also analyze it in linear regression model using a Bayesian approach. Then we use the mean-shift model and SSVS(George and McCulloch, 1993)'s idea which is based on the data augmentation method. The advantage of proposed method is to find a subset of data which is most suspicious in the given model by the posterior probability. The MCMC method(Gibbs sampler) can be used to overcome the complicated Bayesian computation. Finally, a proposed method is applied to a simulated data and a real data.

  • PDF

Evaluation of the Tank Model Optimized Parameter for Watershed Modeling (유역 유출량 추정을 위한 TANK 모형의 매개변수 최적화에 따른 적용성 평가)

  • Kim, Kye Ung;Song, Jung Hun;Ahn, Jihyun;Park, Jihoon;Jun, Sang Min;Song, Inhong;Kang, Moon Seong
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.56 no.4
    • /
    • pp.9-19
    • /
    • 2014
  • The objective of this study was to evaluate of the Tank model in simulating runoff discharge from rural watershed in comparison to the SWAT (Soil and Water Assessment Tool) model. The model parameters of SWAT was calibrated by the shuffled complex evolution-university Arizona (SCE-UA) method while Tank model was calibrated by genetic algorithm (GA) and validated. Four dam watersheds were selected as the study areas. Hydrological data of the Water Management Information System (WAMIS) and geological data were used as an input data for the model simulation. Runoff data were used for the model calibration and validation. The determination coefficient ($R^2$), root mean square error (RMSE), Nash-Sutcliffe efficiency index (NSE) were used to evaluate the model performances. The result indicated that both SWAT model and Tank model simulated runoff reasonably during calibration and validation period. For annual runoff, the Tank model tended to overestimate, especially for small runoff (< 0.2 mm) whereas SWAT model underestimate runoff as compared to observed data. The statistics indicated that the Tank model simulated runoff more accurately than the SWAT model. Therefore the Tank model could be a good tool for runoff simulation considering its ease of use.

Performance Evaluation of a Dynamic Inverse Model with EnergyPlus Model Simulation for Building Cooling Loads (건물냉방부하에 대한 동적 인버스 모델링기법의 EnergyPlus 건물모델 적용을 통한 성능평가)

  • Lee, Kyoung-Ho;Braun, James E.
    • Korean Journal of Air-Conditioning and Refrigeration Engineering
    • /
    • v.20 no.3
    • /
    • pp.205-212
    • /
    • 2008
  • This paper describes the application of an inverse building model to a calibrated forward building model using EnergyPlus program. Typically, inverse models are trained using measured data. However, in this study, an inverse building model was trained using data generated by an EnergyPlus model for an actual office building. The EnergyPlus model was calibrated using field data for the building. A training data set for a month of July was generated from the EnergyPlus model to train the inverse model. Cooling load prediction of the trained inverse model was tested using another data set from the EnergyPlus model for a month of August. Predicted cooling loads showed good agreement with cooling loads from the EnergyPlus model with root-mean square errors of 4.11%. In addition, different control strategies with dynamic cooling setpoint variation were simulated using the inverse model. Peak cooling loads and daily cooling loads were compared for the dynamic simulation.

Comparative Analysis of 3D Spatial Data Models (3차원 공간정보 데이터 모델 비교 분석)

  • Park, Se-Ho;Lee, Ji-Yeong
    • Spatial Information Research
    • /
    • v.17 no.3
    • /
    • pp.277-285
    • /
    • 2009
  • Each system should have a suitable data model about their purpose for efficiently managing, analyzing, and manipulating data. And the usable range of application is determined by the data model, and suitable data models are being developed for each application. In GIS, diversity spatial data model is being developed too. The accuracy and update of the spatial data would be important for applying efficient application as well as the data modeling is important as constructing the spatial data structure. Therefore, the purposes of this research are to 1)compare domestic spatial data models with oversea spatial data models about their geometry model, topology model and visualizing method of 3D spatial data 2)to compare the features of the data model by analyzing each data structures. We 3)compare and analyze features of each spatial data models via the quantitative analysis of each spatial data models.

  • PDF