• Title/Summary/Keyword: Shared Database

OryzaGP: rice gene and protein dataset for named-entity recognition

  • Larmande, Pierre;Do, Huy;Wang, Yue
    • Genomics & Informatics / v.17 no.2 / pp.17.1-17.3 / 2019
  • Text mining has become an important research method in biology; its original purpose was to extract biological entities, such as genes, proteins, and phenotypic traits, in order to extend knowledge from scientific papers. However, few thorough studies on text mining and application development have been performed for plant molecular biology data, especially for rice, resulting in a lack of datasets available for named-entity recognition tasks for this species. Because few benchmarks are available for rice, we faced various difficulties in exploiting advanced machine learning methods for accurate analysis of the rice literature. To evaluate several approaches to automatically extracting information from gene/protein entities, we built a new dataset for rice as a benchmark. The dataset is composed of titles and abstracts extracted from scientific papers focusing on rice and downloaded from PubMed. During the 5th Biomedical Linked Annotation Hackathon, a portion of the dataset was uploaded to PubAnnotation for sharing. Our ultimate goal is to use the dataset to offer a shared task on rice gene/protein name recognition through the BioNLP Open Shared Tasks framework, to facilitate an open comparison and evaluation of different approaches to the task.

CR Technology and Activation Plan for White Space Utilization (화이트 스페이스 활용을 위한 무선환경 인지 기술 및 활성화 방안)

  • Yoo, Sung-Jin;Kang, Kyu-Min;Jung, Hoiyoon;Park, SeungKeun
    • The Journal of Korean Institute of Communications and Information Sciences / v.39B no.11 / pp.779-789 / 2014
  • Cognitive radio (CR) technology based on a geo-location database access approach and/or a wideband spectrum sensing approach is vital for recognizing available frequency bands in white spaces (WSs) and for using shared spectrum efficiently. This paper presents a new structure for implementing the TVWS database access protocol based on the Internet Engineering Task Force (IETF) Protocol to Access WS database (PAWS). A wideband compressive spectrum sensing (WCSS) scheme using a modulated wideband converter is also proposed for TVWS utilization. The developed database access protocol technology, which is adopted in both the TV band device (TVBD) and the TVWS database, operates well in the TV frequency bands. The proposed WCSS shows stable false alarm probability irrespective of noise variance estimation error and provides signal detection probabilities greater than 95%. This paper also investigates the Federal Communications Commission (FCC) regulatory requirements for TVWS databases as well as European Telecommunications Standards Institute (ETSI) policy related to TVWS databases. A standardized protocol for interoperability among multiple TVBDs and TVWS databases, which is currently being prepared in the IETF, is also discussed.

A comparison of five sets of overlapping and non-overlapping sliding windows for semen production traits in the Thai multibreed dairy population

  • Mattaneeya Sarakul;Mauricio A. Elzo;Skorn Koonawootrittriron;Thanathip Suwanasopee;Danai Jattawa;Thawee Laodim
    • Animal Bioscience / v.37 no.3 / pp.428-436 / 2024
  • Objective: This study compared five distinct sets of biological pathways and associated genes related to semen volume (VOL), number of sperm (NS), and sperm motility (MOT) in the Thai multibreed dairy population. Methods: The phenotypic data included 13,533 VOL records, 12,773 NS records, and 12,660 MOT records from 131 bulls. The genotypic data consisted of 76,519 imputed and actual single nucleotide polymorphisms (SNPs) from 72 animals. The SNP additive genetic variances for VOL, NS, and MOT were estimated for SNP windows of one SNP (SW1), ten SNPs (SW10), 30 SNPs (SW30), 50 SNPs (SW50), and 100 SNPs (SW100) using a single-step genomic best linear unbiased prediction approach. The fixed effects in the model were contemporary group, ejaculate order, bull age, ambient temperature, and heterosis. The random effects accounted for animal additive genetic effects, permanent environment effects, and residual. The SNPs explaining at least 0.001% of the additive genetic variance in SW1, 0.01% in SW10, 0.03% in SW30, 0.05% in SW50, and 0.1% in SW100 were selected for gene identification through the NCBI database. The pathway analysis used genes associated with the identified SNP windows. Results: Comparison of overlapping and non-overlapping SNP windows revealed notable differences among the identified pathways and genes associated with the studied traits. Overlapping windows consistently yielded a larger number of shared biological pathways and genes than non-overlapping windows. In particular, overlapping SW30 and SW50 identified the largest number of shared pathways and genes in the Thai multibreed dairy population. Conclusion: This study yielded valuable insights into the genetic architecture of VOL, NS, and MOT. It also highlighted the importance of assessing overlapping and non-overlapping SNP windows of various sizes for their effectiveness in identifying shared pathways and genes influencing multiple traits.
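
The difference between overlapping and non-overlapping SNP windows can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the choice to drop a trailing partial window, and the use of a simple sum of per-SNP variance percentages as the window statistic are all assumptions made for illustration. The thresholds in the abstract do scale with window size (0.001% per SNP times the number of SNPs in the window), which is what `threshold_pct` would carry.

```python
from typing import List, Sequence, Tuple


def snp_windows(n_snps: int, size: int, overlapping: bool) -> List[Tuple[int, int]]:
    """Return (start, end) index pairs, end exclusive, for windows of `size` SNPs.

    Overlapping windows advance one SNP at a time; non-overlapping windows
    advance by a full window. A trailing partial window is dropped
    (an assumption of this sketch).
    """
    step = 1 if overlapping else size
    return [(s, s + size) for s in range(0, n_snps - size + 1, step)]


def select_windows(
    var_pct: Sequence[float], size: int, threshold_pct: float, overlapping: bool
) -> List[Tuple[int, int, float]]:
    """Keep windows whose summed variance percentage reaches the threshold.

    `var_pct[i]` is a hypothetical per-SNP percentage of additive genetic
    variance; summing it per window is an illustrative choice.
    """
    selected = []
    for start, end in snp_windows(len(var_pct), size, overlapping):
        total = sum(var_pct[start:end])
        if total >= threshold_pct:
            selected.append((start, end, total))
    return selected
```

With the same data and threshold, the overlapping scheme evaluates many more candidate windows, so a signal that straddles the boundary between two non-overlapping windows can still be captured by an overlapping one.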

A Study on the Inter-constructive Design Dictionary through the Internet. (인터넷을 통한 상호구축적 디자인 용어사전의 연구)

  • 김태균
    • Archives of design research / v.14 no.4 / pp.25-33 / 2001
  • With increasing access to the internet, the number of designers who rely on it for design information is on the rise. A common dictionary of design terminology therefore needs to be formed and shared among designers, and the internet is a very useful medium for doing so. However, because related terminology grows rapidly through interaction among designers, setting up and providing such information unilaterally on the internet falls far short of taking full advantage of the internet's features. This indicates that providing data on the internet, rather than via traditional books, requires in-depth study of the process of establishing the database structure and an appropriate interface design. This study therefore presents a design-terms database model that harnesses the internet's ability to build up information spontaneously through user interaction, departing from a model that conveys information unilaterally. It summarizes and analyzes various models and suggests a classification system in accordance with users' learning cognition. Problems with existing dictionaries of design terminology are identified, and new methods addressing them are explored. In short, this report proposes a user-oriented, inter-constructive database model that emphasizes a high level of openness and interactivity by enabling changes to text in cyberspace and encouraging users to participate in making the design dictionary.

An Online Terminology Dictionary of Traditional Korean Medicine (온라인 한의학 용어 사전 시스템 구축)

  • Kim, Sang-Kyun;Jang, Hyun-Chul;Yea, Sang-Jun;Kim, Chul;Song, Mi-Young
    • Korean Journal of Oriental Medicine / v.18 no.1 / pp.45-52 / 2012
  • Objectives: Our study aims to provide a collaborative Internet terminology dictionary like Wikipedia, in which about 30,000 concept terminologies of traditional Korean medicine (TKM) are shared and TKM experts can edit them. Methods: The concept terminologies were collected and refined over three years with the terminology management system, custom-made software built on the Oracle database, in which each terminology is divided and normalized into one or more tables. Wikipedia runs on MediaWiki, free and open-source wiki software built on the MySQL database. The database schema of our terminology management system differs from that of MediaWiki, so MediaWiki cannot be used as our terminology dictionary. We therefore propose a way to share and edit TKM terminologies through a wiki-like user interface. Results: We devised a new terminology dictionary system for searching and editing terminology on top of the database of the terminology management system. The online terminology dictionary of TKM has a user interface and functions similar to those of Wikipedia to support collaborative work. Conclusions: Wikipedia runs on MediaWiki, which can be downloaded and used freely under the GNU General Public License. However, problems arise when MediaWiki is used on top of a legacy system. These problems should be considered when other wiki projects start.

One-Snapshot Algorithm for Secure Transaction Management in Electronic Stock Trading Systems (전자 주식 매매 시스템에서의 보안 트랜잭션 관리를 위한 단일 스냅샷 알고리즘)

  • 김남규;문송천;손용락
    • Journal of KIISE:Databases / v.30 no.2 / pp.209-224 / 2003
  • The recent development of electronic commerce has expanded the use of Electronic Stock Trading Systems (ESTS). In ESTS, information with various sensitivity levels is shared by multiple users with different clearance levels. It is therefore necessary to use Multilevel Secure Database Management Systems (MLS/DBMSs) to control the concurrent execution of multiple transactions. In ESTS, not only analytical OLAP transactions but also mission-critical OLTP transactions are executed concurrently, which makes it difficult to adapt traditional secure transaction management schemes to ESTS environments. In this paper, we propose the Secure One Snapshot (SOS) protocol, devised for secure transaction management in ESTS. By maintaining one additional snapshot alongside the working database, SOS blocks covert channels efficiently, allows various real-time transaction management schemes to be adapted with ease, and reduces the length of the waiting queue managed to maintain data freshness by exploiting less strict correctness criteria. We introduce the SOS protocol process with some examples and then analyze the correctness of the devised protocol.

Optimistic Concurrency Control for Secure Real-Time Database Systems (실시간 보안 데이타베이스 시스템을 위한 낙관적 동시성 제어 기법)

  • Kim, Dae-Ho;Jeong, Byeong-Soo;Lee, Sung-Young
    • Journal of KIISE:Databases / v.27 no.1 / pp.42-52 / 2000
  • In many real-time applications in which the system maintains sensitive information shared by multiple users with different security levels, security is another important requirement. A secure real-time database system must satisfy not only logical data consistency but also the timing constraints and security requirements associated with transactions. Even though optimistic concurrency control outperforms locking-based methods in firm real-time database systems, where late transactions are immediately discarded, most existing secure real-time concurrency control methods are based on locking. In this paper, we propose a new optimistic concurrency control protocol for secure real-time database systems and compare its performance characteristics with those of a locking-based method under varying workloads. The results show that the proposed OCC protocol performs well when there are many data conflicts.
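
For readers unfamiliar with optimistic concurrency control, the general read, validate, write pattern can be sketched as below. This is a generic, single-threaded illustration using per-key version numbers, not the protocol proposed in the paper: it has no security levels and no deadlines, and the `Store` and `Transaction` classes are hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Tuple


class Store:
    """In-memory key-value store that keeps a version counter per key."""

    def __init__(self) -> None:
        self._data: Dict[str, Tuple[Any, int]] = {}  # key -> (value, version)

    def read(self, key: str) -> Tuple[Any, int]:
        # Unknown keys read as (None, version 0).
        return self._data.get(key, (None, 0))

    def commit(self, txn: "Transaction") -> bool:
        # Validation phase: every key the transaction read must still be
        # at the version it observed; otherwise another commit intervened.
        for key, seen_version in txn.read_set.items():
            if self.read(key)[1] != seen_version:
                return False  # conflict: the caller restarts or discards
        # Write phase: install buffered writes and bump their versions.
        for key, value in txn.write_set.items():
            _, version = self.read(key)
            self._data[key] = (value, version + 1)
        return True


@dataclass
class Transaction:
    store: Store
    read_set: Dict[str, int] = field(default_factory=dict)
    write_set: Dict[str, Any] = field(default_factory=dict)

    def read(self, key: str) -> Any:
        if key in self.write_set:  # read your own buffered write
            return self.write_set[key]
        value, version = self.store.read(key)
        self.read_set.setdefault(key, version)  # remember first version seen
        return value

    def write(self, key: str, value: Any) -> None:
        self.write_set[key] = value  # buffered until commit

    def commit(self) -> bool:
        return self.store.commit(self)
```

No locks are held while a transaction runs; conflicts are detected only at commit time. In a firm real-time setting a transaction that fails validation may be discarded rather than restarted if its deadline has passed. A concurrent implementation would also need to make the validate and write phases atomic, which this sketch omits.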

Implementation of Temporal Relationship Macros for History Management in SDE (SDE에서 이력 관리를 위한 시간관계 매크로의 구현)

  • Lee, Jong-Yeon;Ryu, Geun-Ho
    • Journal of KIISE:Computing Practices and Letters / v.5 no.5 / pp.553-563 / 1999
  • The Spatial Database Engine (SDE™) developed by Environmental Systems Research Institute, Inc. is a spatial database that employs a client-server architecture together with a set of software services to perform efficient spatial operations and to manage large, shared geographic data sets. It currently supports a wide variety of spatial search methods and dynamically determined spatial relationships. Spatial objects in the world can be changed by either non-spatial or spatial operations. Conventional geographical information systems (GISs), however, did not manage historical information because they handle snapshot images of spatial objects in the world. In this paper, we propose a spatio-temporal data model and an algorithm for a temporal relationship macro that can manage and retrieve the historical information of spatial objects. The proposed spatio-temporal data model and its operations can be used as a software tool for history management of time-varying objects in a database without any change.

Geographic Information Database for Facilitating Regional Development (지역개발 활성화를 위한 지리정보 DB 연구)

  • Kim, Hang-Jib;Choi, Bong-Moon
    • Journal of the Korean Association of Geographic Information Studies / v.5 no.2 / pp.69-80 / 2002
  • GIS is essential to regional development and management in the information age. However, the use of GIS in Korea still remains at the elementary level of automated mapping and facility management. In this paper, we suggest practical principles for improving the efficiency of regional development work using a GIS DB. To fulfill its role as a planning support tool, a GIS DB must have rich geographic content. A metadata DB, a user-friendly application interface, and data compatibility between the public and private sectors must be built into the GIS DB. Geographic information should also be shared between the public and private sectors.

Join Operation of Parallel Database System with Large Main Memory (대용량 메모리를 가진 병렬 데이터베이스 시스템의 조인 연산)

  • Park, Young-Kyu
    • Journal of the Korea Society of Computer and Information / v.12 no.3 / pp.51-58 / 2007
  • The shared-nothing multiprocessor architecture has advantages in scalability and has been adopted in many multiprocessor database systems. However, if the data are not uniformly distributed across the processors, the load becomes unbalanced and overall system performance deteriorates. This is the data skew problem, which usually occurs when processing parallel hash joins. Balancing the load before performing the join resolves this problem efficiently and improves overall system performance. In this paper, we present an algorithm that takes advantage of very large memory to reduce disk access overhead during load balancing and to solve the data skew problem efficiently. We also present an analytical model of the new algorithm and the results of a performance study comparing it with other algorithms in handling data skew.
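
The data skew problem, and the idea of balancing load before the join, can be illustrated with a small Python sketch. It is not the algorithm from the paper: it shows one common generic approach, hashing keys into more buckets than processors and then assigning buckets greedily (largest first) to the least-loaded processor. The function names and the integer join keys are assumptions for illustration.

```python
from typing import Dict, Iterable, List, Tuple


def bucket_counts(keys: Iterable[int], n_buckets: int) -> List[int]:
    """Count tuples per hash bucket (integer keys, modulo hashing)."""
    counts = [0] * n_buckets
    for key in keys:
        counts[key % n_buckets] += 1
    return counts


def naive_loads(counts: List[int], n_procs: int) -> List[int]:
    """Load per processor when bucket b is simply sent to processor b % n_procs."""
    loads = [0] * n_procs
    for bucket, count in enumerate(counts):
        loads[bucket % n_procs] += count
    return loads


def balanced_assignment(
    counts: List[int], n_procs: int
) -> Tuple[Dict[int, int], List[int]]:
    """Greedy balancing: place the largest bucket first on the least-loaded processor.

    Returns (bucket -> processor mapping, load per processor).
    """
    loads = [0] * n_procs
    owner: Dict[int, int] = {}
    for bucket in sorted(range(len(counts)), key=lambda b: counts[b], reverse=True):
        proc = loads.index(min(loads))
        owner[bucket] = proc
        loads[proc] += counts[bucket]
    return owner, loads
```

With a skewed key distribution the naive assignment overloads one processor, while the greedy assignment evens out the load. A single bucket larger than the average load cannot be fixed this way and would need to be split, which this sketch does not attempt.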
