• Title/Summary/Keyword: large database

Search Result 1,454, Processing Time 0.028 seconds

A Study on the Design and the Construction of a Korean Speech DB for Common Use (공동이용을 위한 음성DB의 설계 및 구축에 관한 연구)

  • Kim, Bong-Wan;Kim, Jong-Jin;Kim, Sun-Tae;Lee, Yong-Ju
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.4
    • /
    • pp.35-41
    • /
    • 1997
  • Speech database is an indispensable part of speech research. Speech database is necessary to use in speech research and development processes, and to evaluate performances of various speech-processing systems. To use speech database for common purpose, it is necessary to design utterance list that has all the possible phonetical events in minimal number of words, and is independent of tasks. To meet those restrictions this paper extracts PBW set from large text corpus. Speech database that was constructed using PBW set for utterance list and its properties are described in this paper.

  • PDF

Design and Implementation of Indexing and Query Languages for an Efficient Retrieval of SGML Documents (SGML 문서의 효율적인 검색을 위한 색인 및 질의 언어의 설계 및 구현)

  • Lee, Bong-Sin;Lee, Gyeong-Ho;Go, Seung-Gyu;Choe, Yun-Cheol
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.11
    • /
    • pp.2911-2921
    • /
    • 1999
  • We present new methods for an efficient retrieval of SGML documents. We define IDDL (index database description language) which is able to describe various information such as meta data, an indexing range, and the creation and manipulation of a database. In addition, we design IDQL (index database query language) that can deal with querying meta data as well as logical structure. Especially, the retrieval system based on IDDL and IDQL has been developed and implemented, and has been experimented on large number of documents. Experimental result shows that the proposed method provides the dynamic creation of an index database and a convenient retrieval environment.

  • PDF

Mining Association Rules in Multiple Databases using Links (복수 데이터베이스에서 링크를 이용한 연관 규칙 탐사)

  • Bae, Jin-Uk;Sin, Hyo-Seop;Lee, Seok-Ho
    • Journal of KIISE:Software and Applications
    • /
    • v.26 no.8
    • /
    • pp.939-954
    • /
    • 1999
  • 데이타마이닝 분야에서는 대용량의 트랜잭션 데이타베이스와 같은 하나의 데이타베이스로부터 연관 규칙을 찾는 연구가 많이 수행되어왔다. 그러나, 창고형 할인매장이나 백화점 같이 고객 카드를 이용하는 판매점의 등장으로, 단지 트랜잭션에 대한 분석 뿐만이 아니라, 트랜잭션과 고객과의 관계에 대한 분석 또한 요구되고 있다. 즉, 두 개의 데이타베이스로부터 연관 규칙을 찾는 연구가 필요하다. 이 논문에서는 두 데이타베이스 사이에 링크를 생성하여 연관 항목집합을 찾는 알고리즘을 제안한다. 실험 결과, 링크를 이용한 알고리즘은 고객 데이타베이스가 메모리에 거주가능한 크기라면 시간에 따른 분석에 유용함을 보여주었다.Abstract There have been a lot of researches of mining association rules from one database such as transaction database until now. But as the large discount store using customer card emerges, the analysis is not only required about transactions, but also about the relation between transactions and customer data. That is, it is required to search association rules from two databases. This paper proposes an efficient algorithm constructing links from one database to the other. Our experiments show the algorithm using link is useful for temporal analysis of memory-resident customer database.

The Development of a Advanced Information Management System for PSA (확률론적 안전성 평가를 위한 정보 관리 시스템 개발)

  • Kim Seung-Hwan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.6 s.38
    • /
    • pp.337-344
    • /
    • 2005
  • In order to Perform a PSA. it requires a large number of data for various fields. Therefore, the effective management of the data is essential to perform and review a PSA and to maintain the quality of a PSA. Korea Atomic Energy Research Institute (KAERI) is developing a PSA information management system (AIMS: Advanced Information Management System for PSA) which enhances the accessibility to PSA information for all PSA related activities. The AIMS is a database system that stores all references and links to the information used for the PSA analysis. The AIMS consists of a database, information browsing modules and a PSA model manager. This Paper describes how we implemented such a database centered application in the view of two areas, database design and data (document) service.

  • PDF

AN ABSTRACTION MODEL FOR IN-SITU SENSOR DATA USING SENSORML

  • Lee Yang Koo;Jung Young Jin;Park Mi;Kim Hak Cheol;Lee Chung Ho;Ryu Keun Ho
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.337-340
    • /
    • 2005
  • Context-awareness techniques in ubiquitous computing environment provide various services to users who need to get information via the analysis of collected information from sensors in a spatial area. Context-awareness has been increased in ubiquitous computing and is applied to many different applications such as disaster management system, intelligent robot system, transportation management system, shopping management system, and digital home service. Many researches have recently focused on services that provide the appropriate information, which are collected from Internet by different kinds of sensors, to users according to context of their surrounding environment. In this paper, we propose an abstraction model to manage the large-scale contextual information and their metadata which are collected from different kinds of in-situ sensors in a spatial area and are presented them on the web. This model is composed of the modules expressing functional elements of sensors using sensorML(Sensor Model Language) based on XML language and the modules managing contextual information, which is transmitted from the sensors.

  • PDF

Parallel Deductive Database Systems for Personal Communication Service (개인 통신 서비스(PCS)의 지능망을 위한 병렬 연역 데이터 베이스 시스템)

  • 이원석
    • The Proceeding of the Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.4 no.1
    • /
    • pp.28-37
    • /
    • 1993
  • The future Personal Communication Service(PCS) is the total personal information communication service to satisfy the importance of information in a highly information - oriented society. Compared with the conventional communication method, it is necessary to provide considerable intelligence to the commu- nication network in order to implement the PCS which guarantees the mobility of each individual. Consequen- tly, the PCS needs a fast large database system which stores the information of each individual. To achieve this goal, this paper presents the parallel implementation of deductive database systems as a main - memory database system.

  • PDF

Implementation of Database MiddleWare for Geospatial Data Processing (공간정보 처리를 위한 데이터베이스 미들웨어 구현)

  • Lee, Jin-Kyu;Khan, Ki-Ho;Kim, Tae-Hi;Lee, Hyung-Jin;Lee, Young-Woo
    • Proceedings of the Korean Society of Surveying, Geodesy, Photogrammetry, and Cartography Conference
    • /
    • 2009.04a
    • /
    • pp.145-150
    • /
    • 2009
  • In this paper, we introduce about the implementation of geospatial database middleware for construction of next generation digital map. This middleware supports global transactions based on the distributed and heterogeneous data store circumstance, geospatial data processing which does not follow standard, global schema management, object management based UFID and large scale data process. In addition, this supports creating, managing, calling for user defined procedures that can help making business logic of middleware user. User defined procedure interface is designed using CORBA and RMI protocol. In this paper, we describe the implementation of distributed object oriented database middleware which use variable data store.

  • PDF

RDB-based Automatic Knowledge Acquisition and Forward Inference Mechanism for Self-Evolving Expert Systems

  • Kim, Jin-Sung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.6
    • /
    • pp.743-748
    • /
    • 2003
  • In this research, we propose a mechanism to develop an inference engine and expert systems based on relational database (RDB) and SQL (structured query language). Generally, former researchers had tried to develop an expert systems based on text-oriented knowledge base and backward/forward (chaining) inference engine. In these researches, however, the speed of inference was remained as a tackling point in the development of agile expert systems. Especially, the forward inference needs more times than backward inference. In addition, the size of knowledge base, complicate knowledge expression method, expansibility of knowledge base, and hierarchies among rules are the critical limitations to develop an expert system. To overcome the limitations in speed of inference and expansibility of knowledge base, we proposed a relational database-oriented knowledge base and forward inference engine. Therefore, our proposed mechanism could manipulate the huge size of knowledge base efficiently. and inference with the large scaled knowledge base in a short time. To this purpose, we designed and developed an SQL-based forward inference engine using relational database. In the implementation process, we also developed a prototype expert system and presented a real-world validation data set collected from medical diagnosis field.

A Study on Transcriptome Analysis Using de novo RNA-sequencing to Compare Ginseng Roots Cultivated in Different Environments

  • Yang, Byung Wook
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • 2018.04a
    • /
    • pp.5-5
    • /
    • 2018
  • Ginseng (Panax ginseng C.A. Meyer), one of the most widely used medicinal plants in traditional oriental medicine, is used for the treatment of various diseases. It has been classified according to its cultivation environment, such as field cultivated ginseng (FCG) and mountain cultivated ginseng (MCG). However, little is known about differences in gene expression in ginseng roots between field cultivated and mountain cultivated ginseng. In order to investigate the whole transcriptome landscape of ginseng, we employed High-Throughput sequencing technologies using the Illumina HiSeqTM2500 system, and generated a large amount of sequenced transcriptome from ginseng roots. Approximately 77 million and 87 million high-quality reads were produced in the FCG and MCG roots transcriptome analyses, respectively, and we obtained 256,032 assembled unigenes with an average length of 1,171 bp by de novo assembly methods. Functional annotations of the unigenes were performed using sequence similarity comparisons against the following databases: the non-redundant nucleotide database, the InterPro domains database, the Gene Ontology Consortium database, and the Kyoto Encyclopedia of Genes and Genomes pathway database. A total of 4,207 unigenes were assigned to specific metabolic pathways, and all of the known enzymes involved in starch and sucrose metabolism pathways were also identified in the KEGG library. This study indicated that alpha-glucan phosphorylase 1, putative pectinesterase/pectinesterase inhibitor 17, beta-amylase, and alpha-glucan phosphorylase isozyme H might be important factors involved in starch and sucrose metabolism between FCG and MCG in different environments.

  • PDF

A Comparison of Performance Between MSSQL Server and MongoDB for Telco Subscriber Data Management (통신 가입자 데이터 관리를 위한 MSSQL Server와 NoSQL MongoDB의 성능 비교)

  • Nichie, Aaron;Koo, Heung-Seo
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.65 no.3
    • /
    • pp.469-476
    • /
    • 2016
  • Relational Database Management Systems have become de facto database model among most developers and users since the inception of Data Science. From IoT devices, sensors, social media and other sources, data is generated in structured, semi-structured and unstructured formats, in huge volumes, thereby the difficulty of data management greatly increases. Organizations that collect large amounts of data are increasingly turning to non relational databases - NoSQL databases. In this paper, through experiments with real field data, we demonstrate that MongoDB, a document-based NoSQL database, is a better alternative for building a Telco Subscriber Data Management System which hitherto is mainly built with Relational Database Management Systems. We compare the existing system in various phases of data flow with our proposed system powered by MongoDB. We show how various workloads at some phases of the existing system were either completely removed or significantly simplified on the new system. Based on experiment results, using MongoDB for managing telco subscriber data turned out to offer performance better than the existing system built with MSSQL Server.