Search | Korea Science

Accelerating Keyword Search Processing over XML Documents using Document-level Ranking (문서 단위 순위화를 통한 XML 문서에 대한 키워드 검색 성능 향상)

Lee, Hyung-Dong; Kim, Hyoung-Joo
- Journal of KIISE:Databases / v.33 no.5 / pp.538-550 / 2006
XML keyword search lets users retrieve information without knowing the structure of the documents, and it returns specific, useful document fragments instead of whole documents. Element-level query processing makes this possible, but its computational overhead increases significantly as the number of documents grows. In this paper, we present a document-level ranking scheme over XML documents that predicts the results of element-level processing in order to reduce processing cost. To this end, we propose the notion of 'keyword proximity' for the document ranking process: the correlation of keywords in a document, computed from the path information of their occurrence nodes and the resemblance of those paths, which affects the results of element-level query processing. With this document-centric view, processing time can be reduced by using a ranked document list or by filtering out low-scoring documents. Our experimental evaluation shows that document-level processing with a ranked document list is effective and improves performance through early termination of top-k queries.
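The early-termination idea in this abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes, purely for illustration, that each document-level score is an upper bound on the score of any element in that document, and the names `top_k_elements` and `score_elements` are hypothetical.

```python
import heapq


def top_k_elements(ranked_docs, score_elements, k):
    """Return the k best-scoring elements and the number of documents opened.

    ranked_docs: list of (doc_id, doc_score), sorted by doc_score descending.
    score_elements: callable doc_id -> iterable of (element_id, element_score).
    Assumption (not from the paper): doc_score is an upper bound on every
    element score inside that document.
    """
    heap = []  # min-heap of (element_score, element_id), at most k entries
    processed = 0
    for doc_id, doc_score in ranked_docs:
        # Early termination: no remaining document can beat the k-th result.
        if len(heap) == k and heap[0][0] >= doc_score:
            break
        processed += 1
        for element_id, score in score_elements(doc_id):
            if len(heap) < k:
                heapq.heappush(heap, (score, element_id))
            elif score > heap[0][0]:
                heapq.heapreplace(heap, (score, element_id))
    return sorted(heap, reverse=True), processed
```

Because the documents are visited in ranked order, the loop can stop as soon as the k-th best element found so far scores at least as high as the next document's bound, so low-ranked documents are never opened.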

Design and Implementation of a HTMLtoVoiceXML Converter (HTMLtoVoiceXML 변환기의 설계 및 구현)

Choi, Hoon-il; Jang, Young-Gun
- Journal of KIISE:Computing Practices and Letters / v.7 no.6 / pp.559-569 / 2001
Advances in voice and VoiceXML technologies have made it possible to access web content from mobile terminals and telephones. Nevertheless, because most web content today is written in HTML, it cannot be accessed directly with this new technology. Accessing such content by voice therefore requires converting web documents from HTML to VoiceXML, and manual conversion costs additional time and money. In this paper, we design and implement an HTMLtoVoiceXML converter, present its system structure, and propose a practical algorithm for identifying meaningful groups of content fragments to solve this problem. To test the performance and validity of the converter, we applied it to more than 400 web pages from Korean web sites; it works well except on pages that contain invalid HTML code.

An Efficient Scheme of Encapsulation Method to Avoid Fragmentation Degradation During TVA Metadata Delivery (TVA 메타데이터 전송과정에서 단편화에 의한 성능 감소를 회피하기 위한 효율적인 캡슐화 방식)

Oh, Bong-Jin; Park, Jong-Youl; Kim, Sang-Hyung; Yoo, Kwan-Jong
- The Journal of Korean Institute of Communications and Information Sciences / v.37 no.7C / pp.627-636 / 2012
Recently, XML has been used to describe details of services and content in various fields such as IPTV and digital broadcast services because of its high readability and extensibility. In particular, TV-Anytime's schema and delivery protocol have been adopted as basic standards for these services and extended to include their own private functions. However, XML describes documents in a text-based form, which produces larger documents than traditional methods. Therefore, many encoding algorithms, such as EXI, BiM, GZIP, and Fast Infoset, have been proposed to reduce the size of XML documents. Although these algorithms compress XML documents efficiently, they cannot avoid fragmentation degradation during the encapsulation step. This paper proposes an efficient encapsulation scheme for TV-Anytime that uses common string tables to avoid the degradation of the encoding effect caused by fragmentation.
https://doi.org/10.7840/KICS.2012.37.7C.627

Building a Biological Genomic Database Management System in Laboratory Level (실험실 레벨의 유전체 생물학 데이터베이스 관리시스템 구축)

차효성; 정광수; 박성희; 류근호
- Proceedings of the Korean Information Science Society Conference / 2004.04b / pp.28-30 / 2004
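Most biology laboratories store the consensus sequences obtained by assembling the sequence fragments produced in sequencing experiments as sequence experiment files. Storing sequence data in such files makes it difficult to manage the data consistently and with integrity, because users can modify sequence information arbitrarily and sequence information can be duplicated. In addition, a variety of biological analyses over heterogeneous data and formats are required. This paper therefore aims to implement a sequence information management system that edits, stores, and retrieves sequence information and converts sequence file formats, in order to store and manage genome and protein sequence data generated by sequencing. When sequences are stored, the system uses active database triggers to create and detect sequence versions, which improves system performance. For sequence analysis, conversion between heterogeneous sequence formats is performed by representing sequences and related information in XML and applying the mapping between formats with XSL, the style language of XML. Because this approach is less affected by changes to the original source, it is more efficient than format conversion based on parsers between heterogeneous formats.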

An Efficient Technique for Evaluating Queries with Multiple Regular Path Expressions (다중 정규 경로 질의 처리를 위한 효율적 기법)

Chung, Tae-Sun; Kim, Hyoung-Joo
- Journal of KIISE:Databases / v.28 no.3 / pp.449-457 / 2001
As XML has become an emerging standard for information exchange on the World Wide Web, extracting information from XML viewed as a database model has gained attention in the database community. XML queries are based on regular path queries, which find objects reachable by paths matching given regular expressions. To answer many kinds of user queries, it is necessary to evaluate queries that have multiple regular path expressions. However, previous work on query rewriting and query optimization in the framework of semistructured data has dealt with only a single regular expression. For queries that have multiple regular expressions, we suggest a two-phase optimization technique: (1) rewriting the query using views by finding mappings from the view's body to the query's body, and (2) evaluating each conjunct of the rewritten query and combining the results. We show that our rewriting algorithm is sound and that our query evaluation technique is more efficient than previous work on optimizing semistructured queries.
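As a rough illustration of what a regular path query computes, the sketch below naively evaluates each regular path expression over a small edge-labeled graph and combines the conjuncts. It is not the rewriting or optimization technique of the paper. It assumes an acyclic graph, treats "combining" as plain set intersection, and the names `reachable` and `evaluate_conjunction` are hypothetical.

```python
import re


def reachable(graph, root, pattern):
    """Return nodes whose label path from root fully matches the pattern.

    graph: dict mapping node -> list of (edge_label, child_node).
    Assumption: the graph is acyclic, so the traversal terminates.
    """
    regex = re.compile(pattern)
    found = set()
    stack = [(root, "")]
    while stack:
        node, path = stack.pop()
        for label, child in graph.get(node, []):
            child_path = f"{path}/{label}"
            if regex.fullmatch(child_path):
                found.add(child)
            stack.append((child, child_path))
    return found


def evaluate_conjunction(graph, root, patterns):
    """Evaluate each regular path expression, then intersect the results."""
    results = [reachable(graph, root, p) for p in patterns]
    return set.intersection(*results) if results else set()
```

For example, the patterns `/(book|article)/author` and `/book/.*` together select only the objects that are authors and are also reachable through a `book` edge.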

Implementation of 3D Fashion Show Animation Based on Web using Similar Modeling (근사 모델링을 이용한 웹 기반 3D 패션쇼 애니메이션 구현)

Cho Jin-Ei; Yang Ok-Yul; Nam Myung-Woo; Lee Yong-Ju; Jung Sung-Tea; Joung Suck-Tea; Lee Gang-Il
- Journal of the Korea Academia-Industrial cooperation Society / v.7 no.1 / pp.33-38 / 2006
In this paper, we propose a web-based implementation of 3D fashion show animation, which is needed to build a 3D fashion shopping mall. The 3D model, created with the 3D MAX program, was exported in XML format and displayed on the web using an ActiveX control based on the D3D library. To animate a fashion show in which the 3D model wears the various clothes selected by a shopping mall customer, we design the parts of standard clothes in a hierarchical structure and assemble them. We then map various textures onto them and animate the 3D model on the web in clothes similar to those the customer selected.

Implementation of an Information Management System for Nucleotide Sequences based on BSML using Active Trigger Rules (BSML 기반 능동 트리거 규칙을 이용한 염기서열정보관리시스템의 구현)

Park Sung Hee; Jung Kwang Su; Ryu Keun Ho
- Journal of KIISE:Databases / v.32 no.1 / pp.24-42 / 2005
Biological data, including genome sequences, are heterogeneous and diverse. Although the need for genome sequence management systems that reflect these biological characteristics has been raised, most current biological databases provide only limited functionality as repositories for biological data. This paper therefore describes a management system for nucleotide sequences at the level of a biological laboratory. It supports format transformation, editing, storage, and retrieval of nucleotide sequences collected from public databases, and it also handles sequences produced by experiments. It uses BSML, which is based on XML, as a common format in order to extract data fields and convert between heterogeneous sequence formats. To manage sequences and their changes, a version management system for the original DNA is required so that newly appearing changed sequences can be detected and trigger a database update. Our experimental results show that applying active trigger rules to manage sequence changes makes it possible to store those changes in the database automatically.

An Efficient ROLAP Cube Generation Scheme (효율적인 ROLAP 큐브 생성 방법)

Kim, Myung; Song, Ji-Sook
- Journal of KIISE:Databases / v.29 no.2 / pp.99-109 / 2002
ROLAP (Relational Online Analytical Processing) is a process and methodology for multidimensional data analysis that is essential for extracting desired data and deriving value-added information from an enterprise data warehouse. To speed up query processing, most ROLAP systems pre-compute summary tables. This process is called 'cube generation', and it mostly involves intensive table sorting. The work in (1) showed that it is much faster to generate ROLAP summary tables indirectly using a MOLAP (multidimensional OLAP) cube generation algorithm. In this paper, we present such an indirect ROLAP cube generation algorithm that is fast and scalable. High memory utilization is achieved by slicing the input fact table along one or more dimensions before generating summary tables. High speed is achieved by producing each summary table from its smallest parent. We show the efficiency of our algorithm through experiments.
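The "smallest parent" idea mentioned in this abstract can be sketched as follows. This is only an in-memory illustration of that one heuristic with a SUM measure, not the authors' algorithm: it omits the MOLAP-based indirect generation and the fact-table slicing, and the names `aggregate` and `build_cube` are hypothetical.

```python
from itertools import combinations


def aggregate(table, parent_dims, child_dims):
    """Roll a summary table up from parent_dims to child_dims by summing."""
    keep = [parent_dims.index(d) for d in child_dims]
    out = {}
    for key, total in table.items():
        child_key = tuple(key[i] for i in keep)
        out[child_key] = out.get(child_key, 0) + total
    return out


def build_cube(fact_rows, dims):
    """Compute every group-by, each from its smallest already-built parent.

    fact_rows: iterable of (tuple of dimension values, numeric measure).
    dims: tuple of dimension names, in the same order as the value tuples.
    Returns (cube, parent_of): the summary tables keyed by dimension tuple,
    and the parent chosen for each summary table.
    """
    base = {}
    for key, measure in fact_rows:
        base[key] = base.get(key, 0) + measure
    cube = {tuple(dims): base}
    parent_of = {}
    for size in range(len(dims) - 1, -1, -1):
        for child in combinations(dims, size):
            # Candidate parents have exactly one more dimension than the child.
            parents = [p for p in cube
                       if len(p) == size + 1 and set(child) <= set(p)]
            # Smallest parent = the candidate summary table with the fewest rows.
            parent = min(parents, key=lambda p: len(cube[p]))
            cube[child] = aggregate(cube[parent], parent, child)
            parent_of[child] = parent
    return cube, parent_of
```

Aggregating from the parent with the fewest rows means each summary table scans as little data as possible, which is the reason the heuristic speeds up cube generation.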
