• Title/Abstract/Keywords: data heterogeneity


A Study on the Method for Solving Data Heterogeneity in the Integrated Information System (통합 정보시스템에서의 데이터 이질성 해결 방안에 관한 연구)

  • 박성진;박성공;박화규
    • 한국IT서비스학회지
    • /
    • Vol. 7, No. 4
    • /
    • pp.87-99
    • /
    • 2008
  • As telecommunication technologies have evolved, enhanced information services and integrated information systems have been introduced that can manage a variety of information from heterogeneous systems. The major obstacle for integrated information systems is integrating the heterogeneous databases within them, and the heterogeneity problems can be classified into structural and data heterogeneity. However, previous research has mainly focused on solving structural heterogeneity. This paper identifies the data heterogeneity problems in multi-database schema integration and proposes a new method for solving them. We analyze semantic equivalence in data values based on functional dependencies and primary and candidate keys, and present a procedural solution to data heterogeneity built on the concepts of attribute equivalence, integration key, and conceptual integration table.
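
The abstract above relies on functional dependencies and candidate keys to reason about equivalent data values. The sketch below is not the paper's procedure; it only illustrates, under assumptions, how one might test whether a functional dependency or a key holds in sample rows. The function names `holds_fd` and `is_key` and the sample attributes are hypothetical.

```python
def holds_fd(rows, lhs, rhs):
    """Return True if the functional dependency lhs -> rhs holds in rows.

    rows is a list of dicts; lhs and rhs are lists of attribute names.
    The dependency holds when equal lhs values always imply equal rhs values.
    """
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        val = tuple(row[a] for a in rhs)
        # setdefault stores the first rhs value seen for this lhs value.
        if seen.setdefault(key, val) != val:
            return False
    return True


def is_key(rows, attrs):
    """Return True if attrs functionally determine every attribute in rows.

    This checks only uniqueness on the sample data, not minimality,
    so it is a necessary condition for a candidate key, not a proof.
    """
    if not rows:
        return True
    all_attrs = sorted(rows[0].keys())
    return holds_fd(rows, attrs, all_attrs)


# Hypothetical sample table.
employees = [
    {"emp_id": 1, "dept": "A", "dept_name": "Sales"},
    {"emp_id": 2, "dept": "A", "dept_name": "Sales"},
    {"emp_id": 3, "dept": "B", "dept_name": "Support"},
]
```

With this sample, `dept -> dept_name` holds and `emp_id` is unique, while `dept` alone is not a key. A check on sample rows can only refute a dependency; confirming one requires knowledge of the schema's intended semantics.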

Data Exchange between Cadastre and Physical Planning by Database Coupling

  • Kim, Kam-Rae;Choi, Won-Jun
    • 한국측량학회지
    • /
    • Vol. 25, No. 1
    • /
    • pp.69-75
    • /
    • 2007
  • Information in the physical planning field shows the socio-economic potential of land resources, while cadastral data show the physical and legal realities of the land. The two domains both deal with land information but view it differently. The cadastre has to evolve into a multi-purpose one that provides value-added information and supports a wide spectrum of decision makers by combining its own information with other spatial and non-spatial databases. In this context, the demand for data exchange between the two domains is growing, but this cannot be done without resolving the heterogeneity between the two information applications. Each discipline sees reality within its own scope, which means each has a unique way of abstracting real-world phenomena into a database. The heterogeneity problem emerges when a GIS is established autonomously and independently. It causes considerable communication difficulties, since heterogeneous representations give each database its own data semantics. Semantic heterogeneity creates an obstacle to data exchange but, at the same time, can be a key to solving the problem. Therefore, the study focuses on facilitating data sharing between the fields of cadastre and physical planning by resolving the semantic heterogeneity. The core task is developing a mechanism that converts cadastral data into information for physical planning by DB coupling techniques.

A Message Conversion System based on MDR for Resolving Metadata Heterogeneity (메타데이타 이질성 해결을 위한 MDR 기반의 메시지 변환 시스템)

  • 김진관;김중일;정동원;백두권
    • 한국정보과학회논문지:데이타베이스
    • /
    • Vol. 31, No. 3
    • /
    • pp.232-242
    • /
    • 2004
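  • Metadata is data about data that improves sharing and exchange by clearly describing the meaning and representation of data. However, metadata created in diverse ways has given rise to another problem: inconsistency among metadata. Recently, the metadata gateway approach has been actively studied as a way to resolve metadata inconsistency. However, existing systems implemented with the metadata gateway approach depend on the metadata schema, so maintaining them as the metadata changes takes considerable time and cost. To address this shortcoming of the existing metadata gateway approach, this paper proposes a message conversion system that applies the concept of separating the mapping information of heterogeneous metadata from the mapping rules. The proposed system applies ISO/IEC 11179 to manage standardized data elements dynamically and, by providing a standard for data elements created in the future, offers a function that can fundamentally resolve the occurrence of additional metadata inconsistencies.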

A spatial heterogeneity mixed model with skew-elliptical distributions

  • Farzammehr, Mohadeseh Alsadat;McLachlan, Geoffrey J.
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 29, No. 3
    • /
    • pp.373-391
    • /
    • 2022
  • The distribution of observations in most econometric studies with spatial heterogeneity is skewed. Usually, a single transformation of the data is used to approximate normality, and the transformed data are modeled under a normal assumption. This assumption is, however, not always appropriate because panel data often exhibit non-normal characteristics. In this work, the normality assumption is relaxed in spatial mixed models, allowing for spatial heterogeneity. An inference procedure based on Bayesian mixed modeling is carried out with a multivariate skew-elliptical distribution, which includes the skew-t, skew-normal, Student-t, and normal distributions as special cases. The methodology is illustrated through a simulation study and, following the empirical literature, we fit our models to non-life insurance consumption observed between 1998 and 2002 across a spatial panel of 103 Italian provinces in order to identify its determinants. Analysis of the posterior distributions of some parameters and a comparison of various model comparison criteria indicate that the proposed model is superior to conventional ones.

Inference for heterogeneity of treatment effect in multi-center clinical trial

  • Ha, Il-Do
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 22, No. 3
    • /
    • pp.605-612
    • /
    • 2011
  • In multi-center randomized clinical trials, the treatment effect may vary across centers. It is thus important to investigate the heterogeneity in treatment effect between centers. For this, uncorrelated random-effect models, which assume independence between random-effect terms, have often been used, but this may be a strong assumption. In this paper we propose a correlated frailty modelling approach for investigating such heterogeneity using the hierarchical-likelihood method when the outcome is time-to-event. In particular, we show how to construct a proper prediction interval for frailty, which explores graphically the potential heterogeneity for a treatment-by-center interaction term. The proposed method is illustrated via numerical studies based on data from the design of a multi-center clinical trial.

Firm Heterogeneity and Location Choice: The Case of South Korean Manufacturing Multinationals

  • Han, Jae-Joon;Lee, Hongshik;Lee, Insu
    • East Asian Economic Review
    • /
    • Vol. 16, No. 4
    • /
    • pp.315-331
    • /
    • 2012
  • Previous studies of location choice have focused on country-level data more than firm-level data and been more concerned with host countries' distinctive features than with firm heterogeneity. Therefore, they do not answer the question of who will go where in terms of location choice. To analyze the role of firm heterogeneity in determining location choice, we develop a theoretical model and analyze data on 3,644 Korean manufacturing multinationals operating in 87 countries between 1982 and 2006. The results of our conditional logit analysis indicate that not only host country characteristics but also firm heterogeneous factors such as productivity, labor intensity, and size have considerable influence on the decision of where to locate FDI.


Identification of ERBB pathway-activated cells in triple-negative breast cancer

  • Cho, Soo Young
    • Genomics & Informatics
    • /
    • Vol. 17, No. 1
    • /
    • pp.3.1-3.4
    • /
    • 2019
  • Intratumor heterogeneity within a single tumor mass is one of the hallmarks of malignancy and has been reported in various tumor types. The molecular characterization of intratumor heterogeneity in breast cancer is a significant challenge for effective treatment. Using single-cell RNA sequencing (RNA-seq) data from a public resource, an ERBB pathway-activated triple-negative cell population was identified. The differential expression of three subtyping marker genes (ERBB2, ESR1, and PGR) was not changed in the bulk RNA-seq data, but the single-cell transcriptomes showed intratumor heterogeneity. This result shows that ERBB signaling is activated via an indirect route and that the molecular subtype is changed at the single-cell level. Our data propose a different view on breast cancer subtypes, clarifying much confusion in this field and contributing to precision medicine.

The Design of Data Grid Wrapper for Integrated Retrieve based on XMDR (XMDR 기반의 통합 검색을 위한 데이터 그리드 Wrapper 설계)

  • 황치곤;정계동;최영근
    • 한국정보통신학회논문지
    • /
    • Vol. 12, No. 5
    • /
    • pp.921-929
    • /
    • 2008
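  • Recently, much research has been conducted on resolving data heterogeneity as an approach to data integration. The system we propose consists of an XMDR wrapper and an XMDR repository. The XMDR wrapper generates an interface based on the standard information of the XMDR and resolves the heterogeneity of existing systems by converting between global XMDR queries and local queries using the mapping information between the standard information and the local schemas. The XMDR repository consists of the XMDR, which manages the mapping information between the standard information and the local systems, and a Proxy DB, which stores the results of executed queries. Users work through a single, identical interface, and because the XMDR wrapper not only resolves schema heterogeneity using the XMDR's meta semantic ontology but also accounts for heterogeneity in the meaning of values through the instance semantic ontology, users do not need to run duplicate queries. Accordingly, this paper proposes a data grid wrapper that resolves this data heterogeneity and enables efficient data integration.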

Beta-Meta: a meta-analysis application considering heterogeneity among genome-wide association studies

  • Gyungbu Kim;Yoonsuk Lee;Jeong Ho Park;Dongmin Kim;Wonseok Lee
    • Genomics & Informatics
    • /
    • Vol. 20, No. 4
    • /
    • pp.49.1-49.7
    • /
    • 2022
  • Many packages for meta-analysis of genome-wide association studies (GWAS) have been developed to discover genetic variants. Although variations across studies must be considered, few currently accessible packages estimate between-study heterogeneity. Thus, we propose a Python-based application called Beta-Meta, which can easily process a meta-analysis by automatically selecting between a fixed-effects and a random-effects model based on heterogeneity. Beta-Meta implements flexible input data manipulation to allow multiple meta-analyses of different genotype-phenotype associations in a single process. It provides a step-by-step meta-analysis of GWAS for each association in the following order: heterogeneity test, two different calculations of an effect size and a p-value based on heterogeneity, and the Benjamini-Hochberg p-value adjustment. These methods enable users to validate the results of individual studies with greater statistical power and better estimation precision. We elaborate on these and illustrate them with examples from several studies of infertility-related disorders.
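
The three steps named in the abstract above (heterogeneity test, model-dependent pooling, Benjamini-Hochberg adjustment) can be sketched with standard formulas. This is not Beta-Meta's code or API: the function names are hypothetical, the random-effects branch uses the DerSimonian-Laird estimator as one common choice, and the I² cut-off of 50% used to pick the model is an assumption made here for illustration.

```python
import math


def pool_effects(betas, ses, i2_threshold=50.0):
    """Pool per-study effect sizes by inverse-variance weighting.

    i2_threshold is a hypothetical rule: above it, a random-effects
    (DerSimonian-Laird) estimate is returned instead of a fixed-effects one.
    """
    w = [1.0 / se ** 2 for se in ses]
    sw = sum(w)
    fixed = sum(wi * b for wi, b in zip(w, betas)) / sw
    # Cochran's Q and the I^2 statistic (percent of variation due to heterogeneity).
    q = sum(wi * (b - fixed) ** 2 for wi, b in zip(w, betas))
    df = len(betas) - 1
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    if i2 <= i2_threshold:
        model, beta, se = "fixed", fixed, math.sqrt(1.0 / sw)
    else:
        # DerSimonian-Laird estimate of the between-study variance tau^2.
        c = sw - sum(wi ** 2 for wi in w) / sw
        tau2 = max(0.0, (q - df) / c)
        w_re = [1.0 / (s ** 2 + tau2) for s in ses]
        swr = sum(w_re)
        beta = sum(wi * b for wi, b in zip(w_re, betas)) / swr
        se = math.sqrt(1.0 / swr)
        model = "random"
    # Two-sided p-value from the standard normal distribution.
    p = math.erfc(abs(beta / se) / math.sqrt(2.0))
    return {"model": model, "beta": beta, "se": se, "q": q, "i2": i2, "p": p}


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

A call such as `pool_effects([0.0, 1.0], [0.1, 0.1])` would take the random-effects branch because the two estimates disagree strongly relative to their standard errors. A p-value on Cochran's Q is an equally common selection criterion; it is omitted here only to keep the sketch free of third-party dependencies.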

Genetic heterogeneity of liver cancer stem cells

  • Minjeong Kim;Kwang-Woo Jo;Hyojin Kim;Myoung-Eun Han;Sae-Ock Oh
    • Anatomy and Cell Biology
    • /
    • Vol. 56, No. 1
    • /
    • pp.94-108
    • /
    • 2023
  • Cancer cell heterogeneity is a serious problem in the control of tumor progression because it can cause chemoresistance and metastasis. Heterogeneity can be generated by various mechanisms, including genetic evolution of cancer cells, cancer stem cells (CSCs), and niche heterogeneity. Because the genetic heterogeneity of CSCs has been poorly characterized, the genetic mutation status of CSCs was examined using Exome-Seq and RNA-Seq data of liver cancer. Here we show that different surface markers for liver cancer stem cells (LCSCs) showed a unique propensity for genetic mutations. Cluster of differentiation 133 (CD133)-positive cells showed frequent mutations in the IRF2, BAP1, and ERBB3 genes. However, leucine-rich repeat-containing G protein-coupled receptor 5-positive cells showed frequent mutations in the CTNNB1, RELN, and ROBO1 genes. In addition, some genetic mutations were frequently observed irrespective of the surface markers for LCSCs. BAP1 mutations were frequently observed in CD133-, CD24-, CD13-, CD90-, epithelial cell adhesion molecule-, or keratin 19-positive LCSCs. ASXL2, ERBB3, IRF2, TLX3, CPS1, and NFATC2 mutations were observed in more than three types of LCSCs, suggesting common mechanisms for the development of these LCSCs. The present study describes genetic heterogeneity that depends on the surface markers for LCSCs. The genetic heterogeneity of LCSCs should be considered in the development of LCSC-targeting therapeutics.