• Title/Summary/Keyword: Large-scale database

Search Result 298, Processing Time 0.032 seconds

Retrieving Protein Domain Encoding DNA Sequences Automatically Through Database Cross-referencing

  • Choi, Yoon-Sup;Yang, Jae-Seong;Ryu, Sung-Ho;Kim, Sang-Uk
    • Bioinformatics and Biosystems
    • /
    • v.1 no.2
    • /
    • pp.95-98
    • /
    • 2006
  • Recent proteomic studies of protein domains require high-throughput and systematic approaches. Since most experiments using protein domains, the modules of protein-protein interactions, require gene cloning, the first experimental step should be retrieving DNA sequences of domain encoding regions from databases. For a large scale proteomic research, however, it is a laborious task to extract a large number of domain sequences manually from several inter-linked databases. We present a new methodology to retrieve DNA sequences of domain encoding regions through automatic database cross-referencing. To extract protein domain encoding regions, it traverses several inter-connected database with validation process. And we applied this method to retrieve all the EGF domain encoding DNA sequences of homo sapiens. This new algorithm was implemented using Python library PAMIE, which enables to cross-reference across distinct databases automatically.

  • PDF

NVST DATA ARCHIVING SYSTEM BASED ON FASTBIT NOSQL DATABASE

  • Liu, Ying-Bo;Wang, Feng;Ji, Kai-Fan;Deng, Hui;Dai, Wei;Liang, Bo
    • Journal of The Korean Astronomical Society
    • /
    • v.47 no.3
    • /
    • pp.115-122
    • /
    • 2014
  • The New Vacuum Solar Telescope (NVST) is a 1-meter vacuum solar telescope that aims to observe the fine structures of active regions on the Sun. The main tasks of the NVST are high resolution imaging and spectral observations, including the measurements of the solar magnetic field. The NVST has been collecting more than 20 million FITS files since it began routine observations in 2012 and produces maximum observational records of 120 thousand files in a day. Given the large amount of files, the effective archiving and retrieval of files becomes a critical and urgent problem. In this study, we implement a new data archiving system for the NVST based on the Fastbit Not Only Structured Query Language (NoSQL) database. Comparing to the relational database (i.e., MySQL; My Structured Query Language), the Fastbit database manifests distinctive advantages on indexing and querying performance. In a large scale database of 40 million records, the multi-field combined query response time of Fastbit database is about 15 times faster and fully meets the requirements of the NVST. Our slestudy brings a new idea for massive astronomical data archiving and would contribute to the design of data management systems for other astronomical telescopes.

A SENSOR DATA PROCESSING SYSTEM FOR LARGE SCALE CONTEXT AWARENESS

  • Choi Byung Kab;Jung Young Jin;Lee Yang Koo;Park Mi;Ryu Keun Ho;Kim Kyung Ok
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.333-336
    • /
    • 2005
  • The advance of wireless telecommunication and observation technologies leads developing sensor and sensor network for serving the context information continuously. Besides, in order to understand and cope with the context awareness based on the sensor network, it is becoming important issue to deal with plentiful data transmitted from various sensors. Therefore, we propose a context awareness system to deal with the plentiful sensor data in a vast area such as the prevention of a forest fire, the warning system for detecting environmental pollution, and the analysis of the traffic information, etc. The proposed system consists of the context acquisition to collect and store various sensor data, the knowledge base to keep context information and context log, the rule manager to process context information depending on user defined rules, and the situation information manager to analysis and recognize the context, etc. The proposed system is implemented for managing renewable energy data management transmitted from a large scale area.

  • PDF

Trace Research on IT Personnel Turnover in Large Scale Sl Projects (대규모 시스템 통합 프로젝트 환경에 있어서 IT인력의 이직 원인에 관한 추적연구)

  • 조남재;장성주
    • The Journal of Information Technology and Database
    • /
    • v.8 no.2
    • /
    • pp.61-69
    • /
    • 2001
  • System Integration Projects, especially in public sector, have a tendency of growing in size and duration. Under such environment the effects of IT personnel turnover becomes a serious problem. In addition a high rate of turnover hinders shill accumulation and competitiveness obtainment. In this research we explored the reasons of turnover by way of tracing IT personnel who have involved in large-scale SI projects and moved or quit before the completion of the projects. Based on previous research we identified three dimensions of variables that affected turnover : task-related dimension, human relations dimension, treatment-related dimension. We used these dimensions as a guideline for the interview and questioning exploration. Implications from the research for improving IT Personnel management are also elaborated.

  • PDF

A FRAMEWORK FOR QUERY PROCESSING OVER HETEROGENEOUS LARGE SCALE SENSOR NETWORKS

  • Lee, Chung-Ho;Kim, Min-Soo;Lee, Yong-Joon
    • Proceedings of the KSRS Conference
    • /
    • 2007.10a
    • /
    • pp.101-104
    • /
    • 2007
  • Efficient Query processing and optimization are critical for reducing network traffic and decreasing latency of query when accessing and manipulating sensor data of large-scale sensor networks. Currently it has been studied in sensor database projects. These works have mainly focused on in-network query processing for sensor networks and assumes homogeneous sensor networks, where each sensor network has same hardware and software configuration. In this paper, we present a framework for efficient query processing over heterogeneous sensor networks. Our proposed framework introduces query processing paradigm considering two heterogeneous characteristics of sensor networks: (1) data dissemination approach such as push, pull, and hybrid; (2) query processing capability of sensor networks if they may support in-network aggregation, spatial, periodic and conditional operators. Additionally, we propose multi-query optimization strategies supporting cross-translation between data acquisition query and data stream query to minimize total cost of multiple queries. It has been implemented in WSN middleware, COSMOS, developed by ETRI.

  • PDF

Construction of Requirement Management Database for Test and Evaluation of High Speed Rollingstock 350 eXperimental (한국형 고속열차 신뢰성 시험평가 요건관리 체계 구축)

  • Lee, Tae-Hyung;Park, Chan-Kyung;Kang, Byung-Mo;Son, Kwang-Soo
    • Proceedings of the KSR Conference
    • /
    • 2006.11b
    • /
    • pp.164-169
    • /
    • 2006
  • A high-speed rail system represents a typical example of large-scale multi-disciplinary systems, consisting of subsystems such as train, electrical hardware, electronics, control, information, communication, civil technology etc. The system design and acquisition data of the large-scale system must be the subject under strict configuration control and management. This paper presents the results from systems engineering application to High Speed Rollingstock 350 eXperimental for management of test and evaluation.

  • PDF

Affine Local Descriptors for Viewpoint Invariant Face Recognition

  • Gao, Yongbin;Lee, Hyo Jong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.04a
    • /
    • pp.781-784
    • /
    • 2014
  • Face recognition under controlled settings, such as limited viewpoint and illumination change, can achieve good performance nowadays. However, real world application for face recognition is still challenging. In this paper, we use Affine SIFT to detect affine invariant local descriptors for face recognition under large viewpoint change. Affine SIFT is an extension of SIFT algorithm. SIFT algorithm is scale and rotation invariant, which is powerful for small viewpoint changes in face recognition, but it fails when large viewpoint change exists. In our scheme, Affine SIFT is used for both gallery face and probe face, which generates a series of different viewpoints using affine transformation. Therefore, Affine SIFT allows viewpoint difference between gallery face and probe face. Experiment results show our framework achieves better recognition accuracy than SIFT algorithm on FERET database.

DEVELOPMENT PROCESS OF INFORMATION FLOW RETRIEVAL SYSTEM FOR LARGE-SCALE CONSTRUCTION PROJECTS

  • Jinho Shin;Hyun-soo Lee ;Moonseo Park;Jung-ho Yu;Jungseok Kim
    • International conference on construction engineering and project management
    • /
    • 2011.02a
    • /
    • pp.556-560
    • /
    • 2011
  • Players of construction projects proceed with each work process by information gathering, modification and communication. Due to the complex and long-span lifecycle projects increased, it became more important to grasp this mechanism for the successful project performance in construction project. Hence, most project information management systems or knowledge management systems equip information retrieval system. There are two logic to infer the meaning of retrieval target; inductive reasoning and deductive reasoning. The former is based on metadata explaining the target and the later is based on relation between data. To infer the information flow, it is necessary to define the correlation between players and work processes. However, most established information retrieval systems are based on index search system and it is not focused on correlation between data but data itself. Thus, this research aims to research on process of information flow retrieval system for large-scale construction projects.

  • PDF

A Multi-Resolution Database Model for Management of Vector Geodata in Vehicle Dynamic Route Guidance System (동적 경로안내시스템에서 벡터 지오데이터의 관리를 위한 다중 해상도 모델)

  • Joo, Yong-Jin;Park, Soo-Hong
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.18 no.4
    • /
    • pp.101-107
    • /
    • 2010
  • The aim of this paper is to come up with a methodology of constructing an efficient model for multiple representations which can manage and reconcile real-time data about large-scale roads in Vector Domain. In other words, we suggested framework based on a bottom-up approach, which is allowed to integrate data from the network of the lowest level sequentially and perform automated matching in order to produce variable-scale map. Finally, we applied designed multi-LoD model to in-vehicle application.

A Database System for High-Throughput Transposon Display Analyses of Rice

  • Inoue, Etsuko;Yoshihiro, Takuya;Kawaji, Hideya;Horibata, Akira;Nakagawa, Masaru
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.15-20
    • /
    • 2005
  • We developed a database system to enable efficient and high-throughput transposon analyses in rice. We grow large-scale mutant series of rice by taking advantage of an active MITE transposon mPing, and apply the transposon display method to them to study correlation between genotypes and phenotypes. But the analytical phase, in which we find mutation spots from waveform data called fragment profiles, involves several problems from a viewpoint of labor amount, data management, and reliability of the result. As a solution, our database system manages all the analytical data throughout the experiments, and provides several functions and well designed web interfaces to perform overall analyses reliably and efficiently.

  • PDF