• Title/Summary/Keyword: Database Structure

Search Result 1,367, Processing Time 0.031 seconds

Predict Protein Secondary Structure based on Emerging Sequence Mining (출현 시퀀스 마이닝 기반의 단백질 2 차 구조 예측)

  • Li, Meijing;Lee, Heon Gyu;Saeed, Khalid E.K.;Shon, Ho Sun;Ryu, Keun Ho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2009.04a
    • /
    • pp.379-382
    • /
    • 2009
  • 최근 단백질 기능 예측을 위한 서열비교와 구조비교 기법들은 정확한 분류가 가능한 반면, 새로운 단백질 기능 분류를 함에 있어서 많은 복잡도가 따른다. 따라서 이 논문에서는 보다 빠른 단백질의 구조 분류 및 예측을 위하여 출현 시퀀스(emerging sequence)를 기반으로 하는 분류기법을 제안하였다. 이 기법에서는 먼저, 출현 시퀀스 마이닝 알고리즘을 이용하여 단백질 서열 데이터로부터 4 가지의 단백질 2 차 구조 출현 시퀀스를 발견하고, SVM을 이용하여 단백질의 출현 시퀀스 속성으로부터 단백질의 2 차 구조를 예측하였다.

Development of educational programs for managing medical information utilizing medical data generation and analysis techniques (의료 데이터 발생과 분석기술을 활용한 의료정보관리 교육용 프로그램 개발)

  • Choi, Joonyoung
    • Journal of Digital Convergence
    • /
    • v.15 no.10
    • /
    • pp.377-386
    • /
    • 2017
  • This study has developed a medical information management educational program that can improve the management ability of medical information. The educational medical information management program was developed for 8mnths uing VB. The database utilized the ACCESS Database, which allows learners to easily understand and understand the structure of the data. The learners enter data in the discharge analysis and the cancer registration program and the incomplete program after analyze the medical records. After entering and saving data, medical information management programs can be used to understand and analyze the structure of the database to generate medical information. The educational programs can improve the ability of learners to manage medical information by extracting the necessary data from the database directly through SQL and creating various medical information. However, although the medical information management program is an educational program, there is no evaluation system for the learners program operation. Accordingly, the next studies should develop the assessment system of the medical information management program for learners evaluation.

Development of an Object-Relational IFC Server

  • Hoon-sig Kang;Ghang Lee
    • International conference on construction engineering and project management
    • /
    • 2009.05a
    • /
    • pp.1346-1351
    • /
    • 2009
  • In this paper we propose a framework for an Object Relational IFC Server (OR-IFC Server). Enormous amounts of information are generated in each project. Today, many BIM systems are developed by various CAD software vendors. Industry Foundation Classes (IFC) developed by International Alliance for Interoperability (IAI) is an open standard data model for exchanging data between the various BIM tools. The IFC provides a foundation for exchanging and sharing of information directly between software applications and define a shared building project model. The IFC model server is a database management system that can keep track of transactions, modifications, and deletions. It plays a role as an information hub for storing and sharing information between various parties involved in construction projects. Users can communicate with each other via the internet and utilize functions implemented in the model server such as partial data import/export, file merge, version control, etc. IFC model servers using relational database systems have been developed. However, they suffered from slow performance and long transaction time due to a complex mapping process between the IFC structure and a relational-database structure because the IFC model schema is defined in the EXPRESS language which is object-favored language. In order to simplify the mapping process, we developed a set of rules to map the IFC model to an object-relational database (ORDB). Once the database has been configured, only those pieces of information that are required for a specific information-exchange scenario are extracted using the pre-defined information delivery manual (IDM). Therefore, file sizes will be reduced when exchanging data, meaning that files can now be effectively exchanged and shared. In this study, the framework of the IFC server using ORDB and IDM and the method to develop it will be examined.

  • PDF

Parallel R-tree Using Multiple Disks (복수의 Disk를 사용하는 병렬형 R-tree)

  • 방갑산;김일민
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10b
    • /
    • pp.114-116
    • /
    • 1998
  • 1차원 이상의 공간 데이터의 효율적인 처리는 현대의 멀티미디어 데이터베이스에 있어서 대단히 중요한 역할을 하고 있다. 공간데이터를 관리하는 공간 index structure는 대부분 serial processing을 위한 구조를 가지고 있다. 많은 application에서 방대한 양의 공간 데이터는 보조기억장치(예: disk)에 저장이 되어 사용이 되고 공간 index structure의 query반응시간을 현저하게 줄일 수 있다. 또한 여러개의 disk를 사용하는 병렬처리는 방대한 양의 공간 데이터를 저장하는데 적당하다. 본 논문에서는 PML-tree라는 병렬형 공간 index structure를 제안한다. PML-tree는 MXR-tree에 비해 높은 공간활용도와 빠른 처리시간을 보임으로써 공간 database를 위한 효율적인 index structure로 사용이 될 것으로 기대된다.

A Data Cleansing Strategy for Improving Data Quality of National R&D Information - Case Study of NTIS (데이터 품질을 고려한 국가R&D정보 데이터베이스의 통합 사례 연구 - NTIS 데이터베이스 통합 사례)

  • Shin, Sung-Ho;Yoon, Young-Jun;Yang, Myung-Suk;Kim, Jin-Man;Shon, Kang-Ryul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.6
    • /
    • pp.119-130
    • /
    • 2011
  • On the point of data quality management, data quality is influenced by quality policy, quality organization, business process, and business rule. Business rules, guide of data manipulation, have effects on data quality directly. In case of building an integration database among distributed databases, defining business rule is more important because data integration needs to consider heterogeneous structure, code, and data standardization. Also data value has various figures depended on data type, unit, and transcription. Finally, database structure and data value problem have to be solved to improve data quality. For handling them, it is needed to draw database integration model and cleanse data in integrated database. NTIS(stands for National science and Technology Information Service) has an aim to serve users who need all information about national R&D by internet, and for that aim, it has a integrated database which has been made with several database sources. We prove that database integration model and data cleansing are needed to build a successful integrated database through NTIS case study.

A Transformation Technique of XML DTD to Relational Database Schema Based On Extracting Common Structure in XML Documents (공통 문서 구조 추출을 통한 XML DTD의 관계형 데이터 베이스 스키마 변환 기법)

  • Ahn, Sung-Eun;Choi, Hwang-Kyu
    • The KIPS Transactions:PartD
    • /
    • v.9D no.6
    • /
    • pp.999-1008
    • /
    • 2002
  • XML is emerging as a standard data format to exchange and to present data on the Web. There are increasing needs to efficiently store and to query XML data. In this paper. we propose a new schema transformation algorithm based on a common structure extracting technique from XML documents. The common structure is shared by all XML documents referenced by DTD and the uncommon structure is ununiformly appeared on all XML documents referenced by DTD. Based on the extracted common and uncommon structures, we transform XML DTD into relational database schema. We conduct a performance evaluation based on the number of the generated tables, the size of the record, query processing time and the number of joins on the query. The performance of our algorithm is compared with the existing algorithms, then in most cates, our algorithm is better than the existing ones with respect to the number of the generated tables and appearance of NULL values in the tables.

A Protein Structure Comparison System based on PSAML (PSAML을 이용한 단백질 구조 비고 시스템)

  • Kim Jin-Hong;Ahn Geon-Tae;Byun Sang-Hee;Lee Su-Hyun;Lee Myung-Joon
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.11 no.2
    • /
    • pp.133-148
    • /
    • 2005
  • Since understanding of similarities and differences among protein structures is very important for the study of the relationship between structure and function, many protein structure comparison systems have been developed. Hut, unfortunately, these systems introduce their own protein data derived from the PDB(Protein Data Bank), which are needed in their algorithms for comparing protein structures. In addition, according to the rapid increase in the size of PDB, these systems require much more computation to search for common substructures in their databases. In this paper, we introduce a protein structure comparison system named WS4E(A Web-Based Searching Substructures of Secondary Structure Elements) based on a PSAML database which stores PSAML documents using the eXist open XML DBMS. PSAML(Protein Structure Abstraction Markup Language) is an XML representation of protein data, describing a protein structure as the secondary structures of the protein and their relationships. Using the PSAML database, the WS4E provides web services searching for common substructures among proteins represented in PSAML. In addition, to reduce the number of candidate protein structures to be compared in the PSAML database, we used topology strings which contain the spatial information of secondary structures in a protein.

Document Structure Understanding on Subjects Registration Table

  • Ito, Yuichi;Ohno, Masanaga;Tsuruoka, Shinji;Yoshikawa, Tomohiro;Tsuyoshi, Shinogi
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09a
    • /
    • pp.571-574
    • /
    • 2003
  • This research is aimed to automate the generating process of the database from paper based table forms like this work. The registration table has so complicate table structures, ana in this research we used the registration tables as an example of general table structure understanding. We propose a table structure understanding system for some table types, and it has some steps. The first step is that the document images on paper are read from the image scanner. The second step is that a document image segments into some tables. In the third step, the character strings is extracted using image processing technology and the property of the character strings is determined. And the structured database is generated automatically. The proposed system consists of two systems. "Master document generation system" is used for the table form definition, and it doesn′t include the handwritten characters. "Structure analysis system for complete d table" is used for the written form, and it analyzes the table form filled in the handwritten character. We implemented the system using MS Visual C++ on Windows, and it can get the correct extraction rate 98% among 51 registration tables written by the different students.

  • PDF

Structuring of BOM and Routings for CIM System In Make to Order Environments -Application of CIM System for Ship Production- (수주생산 환경에서의 CIM 시스템을 위한 BOM과 라우팅의 구조화 -조선산업 사례 중심-)

  • Hwang, Sung-Ryong;Kim, Jae-Gyun
    • IE interfaces
    • /
    • v.15 no.1
    • /
    • pp.26-39
    • /
    • 2002
  • Two key data areas of the integrated production database in computer-integrated manufacturing (CIM) systems are the product structure in the forms of bills of material(BOM) and the process structure in the forms of routings. The great majority of existing information systems regard the BOM and routing as two separate data entities, possibly with some degree of cross-referencing. This paper proposes new information structure called the bills of material and routings(BMR) that logically integrates the BOM and routings for the CIM systems in ship production. The characteristics of ship production are described as: 1) make-to-order production type, 2) combined manufacturing principles (workshop production and construction site production), 3) significant overlapping of design, planning and manufacturing, 4) very long order throughput time, 5) complex product structure and production process. The proposed BMR systematically manages ail parts and operations data needed ship production considering characteristics of ship production. Also, the BMR situated on the integrated production database more efficiently supports interface between engineering and production functions, and integrates a wide variety of functions within production such as production planning, process planning, operation scheduling, material planning, costing etc., and simplifies information flow between sub-systems in CIM systems.

A fragmentation database of soyasaponins by liquid chromatography with a photodiode array detector and tandem mass spectrometry

  • Son, Haereon;Mukaiyama, Kyosuke;Omizu, Yohei;Tsukamoto, Chigen
    • Analytical Science and Technology
    • /
    • v.34 no.4
    • /
    • pp.172-179
    • /
    • 2021
  • Oleanane-type triterpenoids exist as secondary metabolites in various plants. In particular, soyasaponin, an oleanane-type triterpenoid, is abundant in the hypocotyl of soybean, one of the most widely cultivated crops in the world. Depending on their chemical structure, soyasaponins are categorized as group A saponins or group DDMP (2,3-dihydro-2,5-dihydroxy-6-methyl-4H-pyran-4-one) saponins. The different soyasaponin chemical structures present different health functionalities and taste characteristics. However, conventional phenotype screening of soybean requires a substantial amount of time for functionality of soyasaponins. Therefore, we attempted to use liquid chromatography with a photodiode array detector and tandem mass spectrometry (LC-PDA/MS/MS) for accurately predicting the phenotype and chemical structure of soyasaponins in the hypocotyl of five common soybean natural mutants. In this method, the aglycones (soyasapogenol A [SS-A] and soyasapogenol B [SS-B]) were detected after acid hydrolysis. These results indicated that the base peak and fragmentation differ depending on the chemical structure of soyasaponin with aglycone. Thus, a fragmentation database can help predict the chemical structure of soyasaponins in soyfoods and plants.