• Title/Summary/Keyword: common data model

Research on Metadata Model of Data Warehouse

  • Zeng, Zhi-Yong;Yu, Jian-Kun
    • Proceedings of the Korea Society for Industrial Systems Conference / 2007.02a / pp.72-79 / 2007
  • OMG's Common Warehouse Metamodel (CWM) is now a single standard for data warehouse metadata. CWM makes the interchange of metadata between data warehousing tools and metadata repositories easy and convenient. In this paper, we present the origin, importance and architecture of CWM, and offer an application case.

A credit classification method based on generalized additive models using factor scores of mixtures of common factor analyzers (공통요인분석자혼합모형의 요인점수를 이용한 일반화가법모형 기반 신용평가)

  • Lim, Su-Yeol;Baek, Jang-Sun
    • Journal of the Korean Data and Information Science Society / v.23 no.2 / pp.235-245 / 2012
  • Logistic discrimination is a useful statistical technique for quantitative analysis in the financial service industry. In particular, it is easy to implement and achieves a good classification rate. The generalized additive model is useful for credit scoring because it shares the advantages of logistic discrimination and can also account for nonlinear effects of the explanatory variables. However, it may need too many additive terms when the number of explanatory variables is very large, and there may be dependencies among the variables. Mixtures of factor analyzers can be used for dimension reduction of high-dimensional features. This study proposes using the low-dimensional factor scores of mixtures of factor analyzers as new features in the generalized additive model. Its application is demonstrated in the classification of some real credit scoring data. A comparison of the correct classification rates of competing techniques shows the superiority of the generalized additive model using factor scores.

Design of an Information Integration System based on XML Schema : DataBlender (XML Schema 기반 정보 통합 시스템 설계 : DataBlender)

  • 이미영;김명준;이규철
    • The Journal of the Korea Contents Association / v.2 no.2 / pp.36-41 / 2002
  • In a mediator-based information integration system that integrates information distributed across various data sources under a common data model, much research has addressed common data models that preserve the semantics of the integrated data, and the resolution of conflicts caused by integrating various data models. This paper proposes the DataBlender system, which minimizes semantic loss and provides easy conflict resolution using XML technologies such as XML Schema and XQuery. Furthermore, it provides an integrated query facility usable in mobile environments, and can thus serve as a foundation for the next Internet business environment.

Application of CE-QUAL-W2 [v3.2] to Andong Reservoir: Part I: Simulations of Hydro-thermal Dynamics, Dissolved Oxygen and Density Current

  • Bhattarai, Prasid Ram;Kim, Yoon-Hee;Heo, Woo-Myoung
    • Korean Journal of Ecology and Environment / v.41 no.2 / pp.247-263 / 2008
  • A two-dimensional (2D) reservoir hydrodynamics and water quality model, CE-QUAL-W2, is employed to simulate the hydrothermal behavior and density current regime in Andong Reservoir. Observed data used for model forcing and calibration include surface water level, water temperature, dissolved oxygen and suspended solids concentration. The model was calibrated to the year 2003 and verified with a continuous run from 2000 to 2004. Without major adjustments, the model accurately simulated surface water levels, including large storm events. Deep-water reservoirs located in the Asian monsoon region, such as Andong Reservoir, begin to stratify in summer and overturn in fall. This mixing pattern, as well as the descending thermocline, the onset and duration of stratification, and the timing of turnover, was well reproduced by the Andong model. The temperature field and distinct thermocline are simulated to within $2^{\circ}C$ of observed data. The model performed well in simulating not only the dissolved oxygen profiles but also the metalimnetic dissolved oxygen minima, a commonly occurring phenomenon in deep reservoirs of temperate regions. The Root Mean Square Error (RMSE) values of model calibration for surface water elevation, temperature and dissolved oxygen were 0.0095 m, $1.82^{\circ}C$, and $1.13\;mg\;L^{-1}$, respectively. The turbid storm runoff during the summer monsoon formed an intermediate layer about 15 m thick, which moved along the metalimnion until it was finally discharged from the dam. This mode of density current transport, a common characteristic of various other large reservoirs in the Asian summer monsoon region, was well tracked by the model.

Automation System for Sharing CDM Data (CDM 데이터 공유를 위한 자동화 시스템)

  • Jeong, Chae-Eun;Kang, Yunhee;Park, Young B.
    • Journal of Platform Technology / v.8 no.3 / pp.3-9 / 2020
  • As the need to share data for research purposes in the medical field grows, the use of the Common Data Model (CDM) is increasing. However, when CDM data are shared, access is not controlled and personal information in the data is not protected. To solve this problem, this paper controls access to CDM data by using an encryption method in a blockchain network, and records information about the CDM data to enable tracking. In addition, IPFS is used to share large amounts of CDM data, and Celery is used to automate the sharing process. In other words, we propose a multi-channel automation system in which the information required for CDM data sharing is shared through a trust-based technology, a distributed file system, and a message queue for automation. The system aims to solve the problems of access control and personal information protection that arise in the process of sharing CDM data.

A Study on a Model for Using and Preserving Scientific Data (과학데이터 보존 및 활용모델에 관한 연구)

  • Kim, Sun-Tae;Hahn, Sun-Hwa;Lee, Tae-Young;Kim, Yong
    • Journal of the Korean BIBLIA Society for library and Information Science / v.21 no.4 / pp.81-93 / 2010
  • This study suggests a model for the preservation and circulation of scientific data as records. National trends regarding scientific data in the U.S., the U.K., Australia, and Europe were analyzed. Advanced foreign programs for the management of scientific data were surveyed and analyzed: DataCite, WDS, PANGAEA, Dataverse, BSRN, DLESE, GCMD and SEDIS. Common implications were derived from each program. Based on the results of this analysis, the study proposes a model for the preservation and circulation of scientific data.

USING TRMM SATELLITE C BAND DATA TO RETRIEVE SOIL MOISTURE ON THE TIBETAN PLATEAU

  • Chang Tzu-Yin;Liou Yuei-An
    • Proceedings of the KSRS Conference / 2005.10a / pp.737-740 / 2005
  • Soil moisture, through its dominance in the exchange of energy and moisture between the land and atmosphere, plays a crucial role in influencing atmospheric circulation. To characterize this role, it is commonly agreed that knowledge of land surface processes and the development of remote sensing techniques are important scientific issues. This research uses TRMM satellite C band (10.65 GHz) data to retrieve soil moisture on the Tibetan Plateau in Mainland China. Two retrieval schemes are implemented: the τ-ω model and the R model. The latter is developed based on a land surface process and radiobrightness (R) model for bare soil and vegetated terrain. Compared with the in situ ground measurements, the soil moisture retrieved from the R model and from the τ-ω model with vegetation information appears more accurate than that derived from the bare soil model. The soil moisture contents retrieved from the two inversion models, the R model and the τ-ω model, have a similar trend, but the former appears superior in terms of correlation coefficient and bias when compared with in situ data. In the future, we will apply the R model with the TRMM 10.65 GHz brightness temperature to monitor long-term soil moisture variation over the Tibetan Plateau.

3-D Information Model for High-speed Railway Infrastructures (고속철도시설물을 위한 3차원정보모델)

  • Shim, Chang-Su;Kim, Deok-Won;Youn, Nu-Ri
    • Proceedings of the Computational Structural Engineering Institute Conference / 2008.04a / pp.241-246 / 2008
  • The design of a high-speed railway line requires collaboration among heterogeneous application systems and engineers with different backgrounds. Object-based 3D models with metadata can serve as a shared information model for effective collaborative design. In this paper, a railway infrastructure information model (RIIM) is proposed to enable integrated and interoperable work throughout the life cycle of railway infrastructures, from planning to maintenance. To develop the model, object-based 3D models were built for a 10 km section of the Korean high-speed railway lines. The model has three basic information layers, for designers, contractors and the owner, respectively. Prestressed concrete box girders are the most common superstructure of bridges. The design information layer has metadata on requirements, design codes, geometry, analysis and so on. The construction layer has data on drawings, real data for materials and products, schedules and so on. The maintenance layer for the owner has the final geometry, material data, products and their suppliers and so on. This information has its own data architecture, which is derived from concepts similar to the product breakdown structure (PBS) and work breakdown structure (WBS). The constructed RIIM for the infrastructures of the high-speed railway was successfully applied to various areas such as design check, structural analysis, automated estimation, construction simulation, virtual viewing, and digital mock-up. The integrated information model can realize a virtual construction system for railway lines and dramatically increase the productivity of the whole engineering process.

Multivariate Procedure for Variable Selection and Classification of High Dimensional Heterogeneous Data

  • Mehmood, Tahir;Rasheed, Zahid
    • Communications for Statistical Applications and Methods / v.22 no.6 / pp.575-587 / 2015
  • Developments in data collection techniques result in high dimensional data sets, where discrimination is an important and commonly encountered problem that is crucial to resolve when the high dimensional data are heterogeneous (non-common variance covariance structure for classes). An example is the classification of microbial habitat preferences based on codon/bi-codon usage. Habitat preference is important to study for evolutionary genetic relationships and may help industry produce specific enzymes. Most classification procedures assume homogeneity (common variance covariance structure for all classes), which is not guaranteed in most high dimensional data sets. We introduce regularized elimination in partial least squares coupled with QDA (rePLS-QDA) for parsimonious variable selection and classification of high dimensional heterogeneous data sets. It is based on the recently introduced regularized elimination for variable selection in partial least squares (rePLS) and on the heterogeneous classification procedure quadratic discriminant analysis (QDA). The proposed and existing methods are compared on a simulated data set; in addition, the proposed procedure is applied to classify microbial habitat preferences by their codon/bi-codon usage. Five bacterial habitats (Aquatic, Host Associated, Multiple, Specialized and Terrestrial) are modeled. The classification accuracy for each habitat is satisfactory and ranges from 89.1% to 100% on test data. Interesting codon/bi-codon usages, and their mutual interactions that influence the respective habitat preferences, are identified. The proposed method also produced results that concur with known biological characteristics, which will help researchers better understand the divergence of species.

Authorization Model with Provisions and Obligations in XML

  • Kim Suhee;Park Jongjin
    • Proceedings of the IEEK Conference / summer / pp.355-360 / 2004
  • With the growing acceptance of XML technologies, XML will be the most common tool for all data manipulation and data transmission. Meeting security requirements for privacy, confidentiality and integrity is essential in order to move business online, and it is important for security to be integrated with XML solutions. Many policies require certain conditions to be satisfied and actions to be performed before or after a decision is made. A binary yes/no decision on an access request is not enough for many applications. These issues were addressed and formalized as provisions and obligations by Betti et al. In this paper, we propose an authorization model with provisions and obligations in XML. We introduce a formal definition of authorization policy and the issues involving obligations discussed by Betti et al. We use the formal model as a basis to develop an authorization model in XML. We develop DTDs in XML for the main components, such as the authorization request, authorization policy and authorization decision. We plan to develop an authorization system using the proposed model.