• Title/Summary/Keyword: data integration


Standard-based Integration of Heterogeneous Large-scale DNA Microarray Data for Improving Reusability

  • Jung, Yong;Seo, Hwa-Jeong;Park, Yu-Rang;Kim, Ji-Hun;Bien, Sang Jay;Kim, Ju-Han
    • Genomics & Informatics / v.9 no.1 / pp.19-27 / 2011
  • Gene Expression Omnibus (GEO) holds the largest collection of gene-expression microarray data, and that collection has grown exponentially. Microarray data in GEO have been generated in many different formats and often lack standardized annotation and documentation. It is hard to know whether preprocessing has been applied to a dataset and, if so, in what way. Standard-based integration of heterogeneous data formats and metadata is necessary for comprehensive data query, analysis and mining. We attempted to integrate the heterogeneous microarray data in GEO based on the Minimum Information About a Microarray Experiment (MIAME) standard. We unified the data fields of the GEO Data table and mapped the attributes of GEO metadata into MIAME elements. We also separated non-preprocessed raw datasets from processed and other datasets using a two-step classification method. Most of the procedures were developed as semi-automated algorithms with some degree of text mining. We localized 2,967 Platforms, 4,867 Series and 103,590 Samples covering 279 organisms, integrated them into a standard-based relational schema, and developed a comprehensive query interface for extracting them. Our tool, GEOQuest, is available at http://www.snubi.org/software/GEOQuest/.
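
The field-unification step described in this abstract can be pictured with a minimal sketch. It assumes that unification amounts to renaming source-specific column names to one shared name. The alias table, the unified field names, and the sample rows below are all illustrative assumptions, not the actual GEOQuest schema or algorithm.

```python
# Hypothetical alias table: source-specific column name -> unified field name.
# These names are assumptions for illustration, not the GEOQuest schema.
FIELD_ALIASES = {
    "id_ref": "probe_id",
    "probe": "probe_id",
    "value": "expression_value",
    "signal": "expression_value",
}


def unify_fields(row):
    """Rename known columns to unified names; return unknown columns separately."""
    unified, unmapped = {}, {}
    for name, value in row.items():
        key = name.strip().lower()  # column names differ in case and spacing
        if key in FIELD_ALIASES:
            unified[FIELD_ALIASES[key]] = value
        else:
            unmapped[name] = value  # left for manual or text-mining review
    return unified, unmapped


# Two made-up rows that use different column conventions for the same content.
row_a = {"ID_REF": "1007_s_at", "VALUE": "8.21"}
row_b = {"Probe": "1007_s_at", "Signal": "295.4", "Flag": "P"}

unified_a, unmapped_a = unify_fields(row_a)
unified_b, unmapped_b = unify_fields(row_b)
```

Returning the unmapped columns rather than dropping them fits a semi-automated workflow, in which unmatched names are passed on to a curator or a later text-mining step.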

Comparative Analysis of Building Models to Develop a Generic Indoor Feature Model

  • Kim, Misun;Choi, Hyun-Sang;Lee, Jiyeong
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.39 no.5 / pp.297-311 / 2021
  • Around the world, there is increasing interest in Digital Twin cities. Although geospatial data is critical for building a digital twin city, currently established spatial data cannot be used directly for its implementation. Integration of geospatial data is vital in order to construct and simulate the virtual space. Existing studies on data integration have focused on data transformation. The conversion method is fundamental and convenient, but the information loss during conversion remains a limitation. Standardization of the data model is therefore an approach that solves the integration problem while avoiding the limitations of conversion. However, standardization of indoor space data models is still insufficient compared to 3D building and city models. Therefore, in this study, we present a comparative analysis of data models commonly used in indoor space modeling as a basis for establishing a generic indoor space feature model. By comparing five models, IFC (Industry Foundation Classes), CityGML (City Geographic Markup Language), AIIM (ArcGIS Indoors Information Model), IMDF (Indoor Mapping Data Format), and OmniClass, we identify the essential elements for modeling indoor space and the feature classes commonly included in the models. The proposed generic model can serve as a basis for developing further indoor feature models by specifying the minimum required structure and feature classes.

Inference of Genetic Regulatory Modules Using ChIP-on-chip and mRNA Expression Data

  • Cho, Hye-Young;Lee, Do-Heon
    • Bioinformatics and Biosystems / v.2 no.2 / pp.62-65 / 2007
  • We present a data integration strategy for inferring genetic regulatory modules. First, we construct all possible combinations of regulators of genes using chromatin immunoprecipitation (ChIP)-chip data. Second, a hierarchical clustering method is employed to analyze mRNA expression profiles. Third, an integration method is applied to both data sets. Finally, we construct a genetic regulatory module that is involved in ribosomal protein synthesis.
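
As a rough illustration of combining the two data types, the sketch below enumerates regulator combinations from binding data and keeps a combination only when the genes it binds have mutually correlated expression profiles. It swaps in a simple pairwise-correlation threshold for the hierarchical clustering named in the abstract. The function names, the threshold, and the toy gene and regulator labels are assumptions, not the authors' procedure.

```python
from itertools import combinations


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def candidate_modules(binding, expression, min_corr=0.8, max_regulators=2):
    """binding: gene -> set of bound regulators; expression: gene -> profile list."""
    regulators = sorted(set().union(*binding.values()))
    modules = {}
    for size in range(1, max_regulators + 1):
        for combo in combinations(regulators, size):
            # Genes bound by every regulator in this combination.
            genes = sorted(g for g, regs in binding.items() if set(combo) <= regs)
            # Stand-in for clustering: require all gene pairs to be co-expressed.
            if len(genes) >= 2 and all(
                pearson(expression[a], expression[b]) >= min_corr
                for a, b in combinations(genes, 2)
            ):
                modules[combo] = genes
    return modules


# Toy inputs with hypothetical labels.
binding = {"g1": {"R1", "R2"}, "g2": {"R1", "R2"}, "g3": {"R1"}}
expression = {"g1": [1, 2, 3, 4], "g2": [2, 4, 6, 8], "g3": [4, 3, 2, 1]}
modules = candidate_modules(binding, expression)
```

In this toy input, regulator R1 alone is rejected because g3 is anti-correlated with the other two genes, while the combinations that include R2 are kept.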


A Study on Maturity Model of Information Integration System (정보연계 시스템의 성숙도 모델에 관한 연구)

  • Ha, Hyodong;Lee, Ook
    • Journal of the Korea Academia-Industrial cooperation Society / v.20 no.8 / pp.570-578 / 2019
  • In this era of big data, a variety of government organizations are trying to create new added value via Information Integration. Therefore, several projects related to government agencies' information sharing have activated system connection/integration. The risk factors of system operation, however, have increased as the volume of Information Integration Systems grows. Interference in information sharing is predicted to affect the operation of the agencies, and the issue will grow even worse, with massive impact on civil society, when agency operation is interrupted by system failures in infrastructure, software, data quality, or security. Diverse studies related to the maintenance of Information Systems have been conducted, but there is currently no evaluation framework for the operational system of Information Integration between government agencies. In this respect, this study distinguishes the Information System components (Data, IT, People, and Process), systematizes them with Plan-Do-See, and finally presents a maturity model for Information Integration. Nine derived processes were analyzed through interviews and questionnaires with Information Integration System officials, and maturity stages are suggested by applying CMMI. This model allows diagnosis of the maturity level of an Information Integration System, and is expected to be utilized as a resource for improving organizational processes.

Design and Integration of a Dual Redundancy Air Data System for Unmanned Air Vehicles (무인항공기 이중화 대기자료시스템 설계 및 통합 연구)

  • Won, Dae-Yeon;Yun, Seonghun;Lee, Hongju;Hong, Jin-Sung;Hwang, Sun-Yu;Lim, Heung-Sik;Kim, Taekyeum
    • Journal of the Korea Institute of Military Science and Technology / v.23 no.6 / pp.639-649 / 2020
  • Air data systems measure airspeed, pressure altitude, angle of attack and angle of sideslip. These measurements are essential for operating flight control laws to ensure safe flights. Since the loss or corruption of air data measurements is considered catastrophic, a high level of operational reliability needs to be achieved for air data systems. In the case of unmanned air vehicles, failure of any air data sensor is more critical due to the absence of onboard pilot decision aid. This paper presents the design of a dual redundancy air data system and the integration process for an unmanned air vehicle. The proposed dual-redundant architecture is based on two independent air data probes and redundancy management by central processing in two independent flight control computers. Starting from unit testing of a single air data sensor, details are provided of the system-level tests used to meet overall requirements. Test results from system integration demonstrate the efficiency of the proposed process.
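
To make the idea of redundancy management over two independent probes concrete, here is a minimal sketch of one common pattern: cross-compare the two channels, average them when they agree, and fall back to a single channel when one reports itself invalid. This is not the logic of the paper. The `Reading` type, the `tolerance` parameter, and the status labels are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Reading:
    value: float  # e.g. airspeed from one probe
    valid: bool   # hypothetical self-test status reported by the probe


def select_measurement(a: Reading, b: Reading, tolerance: float) -> Tuple[Optional[float], str]:
    """Return (selected value or None, status label) for a dual-channel measurement."""
    if a.valid and b.valid:
        if abs(a.value - b.value) <= tolerance:
            return (a.value + b.value) / 2, "ok"
        # With only two channels, a disagreement cannot be resolved by voting.
        return None, "miscompare"
    if a.valid:
        return a.value, "single_channel"
    if b.valid:
        return b.value, "single_channel"
    return None, "failed"


agree = select_measurement(Reading(50.0, True), Reading(51.0, True), tolerance=2.0)
disagree = select_measurement(Reading(50.0, True), Reading(60.0, True), tolerance=2.0)
degraded = select_measurement(Reading(50.0, False), Reading(51.0, True), tolerance=2.0)
```

The miscompare branch shows a basic limit of dual redundancy: two channels can detect a disagreement but cannot by themselves tell which one is wrong, so some further source of information is needed to isolate the fault.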

Integration of Heterogeneous Protein Databases Based on RDF(S) Models (RDF(S) 모델에 기반한 다양한 형태의 단백질 데이타베이스 통합)

  • Lee, Kang-Pyo;Yoo, Sang-Won;Kim, Hyoung-Joo
    • Journal of KIISE:Databases / v.35 no.2 / pp.132-142 / 2008
  • In the biological domain, there exist a variety of protein analysis databases, each of which attaches its own meaning to the same target protein. If we integrate these scattered heterogeneous data efficiently, we can obtain useful information that cannot be found in any single original source. Reflecting the characteristics of biological data, each data source has its own syntax and semantics. If we describe these data through RDF(S) models, one of the Semantic Web standards, we can achieve not only syntactic but also semantic integration. In this paper, we propose a new concept of an integration layer based on an RDF unified schema. As a conceptual model, we construct a unified schema focusing on protein information; as a representational model, we propose a technique for the wrappers to aggregate the necessary information from the relevant sources and dynamically generate RDF instances. Two example queries show that our integration layer succeeds in processing integrated requests from users and displaying the appropriate results.
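
The wrapper idea can be sketched as follows: each source keeps its own field names, and a per-source mapping turns its records into RDF statements under one shared vocabulary. The example emits N-Triples lines as plain strings. The `example.org` namespaces, the property names, and the sample records are placeholders chosen for illustration, not the unified schema proposed in the paper.

```python
BASE = "http://example.org/protein/"   # placeholder resource namespace
SCHEMA = "http://example.org/schema#"  # placeholder unified-schema namespace


def escape_literal(text):
    """Escape backslashes and double quotes for an N-Triples string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def wrap(record, id_field, field_map):
    """Turn one source record into N-Triples lines using a source-specific field map."""
    subject = f"<{BASE}{record[id_field]}>"
    triples = []
    for source_field, unified_property in field_map.items():
        if source_field in record:
            literal = escape_literal(str(record[source_field]))
            triples.append(f'{subject} <{SCHEMA}{unified_property}> "{literal}" .')
    return triples


# Made-up records from two hypothetical sources describing the same protein.
source_a = {"acc": "P12345", "protein_name": "Example protein"}
source_b = {"id": "P12345", "len": 430}

triples = (
    wrap(source_a, "acc", {"protein_name": "name"})
    + wrap(source_b, "id", {"len": "sequenceLength"})
)
```

Because both wrappers mint the same subject URI for the shared accession, statements from the two sources attach to one resource, which is what allows a single query to span both.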

Multi-view learning review: understanding methods and their application (멀티 뷰 기법 리뷰: 이해와 응용)

  • Bae, Kang Il;Lee, Yung Seop;Lim, Changwon
    • The Korean Journal of Applied Statistics / v.32 no.1 / pp.41-68 / 2019
  • Multi-view learning considers data from various viewpoints and attempts to integrate various information from the data. Multi-view learning has been studied recently and has shown superior performance to a model learned from only a single view. With the introduction of deep learning techniques to the multi-view learning approach, it has shown good results in various fields such as image, text, voice, and video. In this study, we introduce how multi-view learning methods solve various problems faced in human behavior recognition, medical areas, information retrieval and facial expression recognition. In addition, we review the data integration principles of multi-view learning methods by classifying traditional multi-view learning methods into data integration, classifier integration, and representation integration. Finally, we examine how CNN, RNN, RBM, Autoencoder, and GAN, which are commonly used among various deep learning methods, are applied to multi-view learning algorithms. We categorize CNN and RNN-based learning methods as supervised learning, and RBM, Autoencoder, and GAN-based learning methods as unsupervised learning.

A Study on Processor Monitoring for Integration Test of Flight Control Computer equipped with A Modern Processor (최신 프로세서 탑재 비행제어 컴퓨터의 통합시험을 위한 프로세서 모니터링 연구)

  • Lee, Cheol;Kim, Jae-Cheol;Cho, In-Jae
    • Journal of Institute of Control, Robotics and Systems / v.14 no.10 / pp.1081-1087 / 2008
  • This paper describes the limitations of, and solutions for, the existing processor-monitoring concept used in the system integration test of a military supersonic aircraft Flight Control Computer (FLCC) equipped with a modern-architecture processor. The safety-critical FLCC integration test, which requires automatic testing of thousands of test cases and real-time generation of input/output test conditions, depends on a processor-monitoring device called the Processor Interface (PI). The PI, which relies on the FLCC processor's external address and data-bus data, has some limitations due to the multi-fetching capability of modern sophisticated military processors, such as the C6000's VLIW (Very-Long Instruction Word) architecture and the PowerPC's superscalar architecture. Several techniques for overcoming these limitations were developed, and a proper monitoring approach is presented for the system integration test of an FLCC that adopts a modern processor.

An Empirical Examination on the Strategic Use of System Integration Technology in Japanese Manufacture (일본 제조업에 있어서 정보시스템 통합기술의 전략적 활용목적에 관한 실증분석)

  • 이덕주;일본명
    • Journal of Technology Innovation / v.6 no.2 / pp.80-100 / 1998
  • The application of computers and information technologies, which has progressed at a remarkable pace, has paved the way for technological innovation toward the efficient integration of various functions in a manufacturing system. Most manufacturing companies have already initiated significant efforts to integrate information systems and have attained some level of integration. The purpose of this paper is to clarify the strategic direction that manufacturing firms are pursuing through integrating information systems. Using extensive data gathered from Japanese manufacturers, we look for differences in competitive priorities and action programmes between companies with high-level and low-level system integration. Our empirical data reveal that manufacturers with highly integrated systems focus their competitive capabilities on rapid design change and new product introduction, and accordingly place greater emphasis on action programmes related to design, process and quality improvement. In conclusion, highly integrated firms tend to pursue a new product development strategy more intensively, aiming to preempt new markets by exploiting the technological advantages of integrated manufacturing systems.


Conversion of Rain Rate Cumulative Distributions by Multiple Regression Model (다중회기모형에 의한 강우강도 누적분포의 변환)

  • Dung, Luong Ngoc Thuy;Sohn, Won
    • Journal of Satellite, Information and Communications / v.9 no.4 / pp.13-15 / 2014
  • At frequencies above 10 GHz, rain is the dominant propagation phenomenon contributing to satellite link attenuation. The prediction of rain attenuation is based on the point rainfall rate exceeded for 0.01 % of an average year with a one-minute integration time. Most of the available rain data have been measured with a 60-minute integration time, and many researchers have studied how to convert rainfall rate data from various integration times to a one-minute integration time. This paper proposes a new multiple regression model for the conversion, and the proposed schemes show better performance than the existing schemes.