Search | Korea Science

Clustering Technique Using a Node and Level of XML tree (XML 트리의 노드와 레벨을 사용한 군집화 방법)

Kim, Woosaeng
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.17 no.3
- /
- pp.649-655
- /
- 2013
Recently, researches are studied in developing efficient techniques for accessing, querying, and managing XML documents which are frequently used in the Internet. In this paper, we propose a new method to cluster XML documents efficiently. An element and an inclusion relationship of a XML document corresponds to a node and a level of the corresponding tree, respectively. Therefore, when two XML documents are similar then their nodes' names and levels of the corresponding trees are also similar. In this paper, we cluster XML documents by using nodes' names and levels of the corresponding tree as a feature of a document. The experiment shows that our proposed method has a good performance.
https://doi.org/10.6109/jkiice.2013.17.3.649 인용 PDF KSCI

Document Clustering Method using Coherence of Cluster and Non-negative Matrix Factorization (비음수 행렬 분해와 군집의 응집도를 이용한 문서군집)

Kim, Chul-Won;Park, Sun
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.13 no.12
- /
- pp.2603-2608
- /
- 2009
Document clustering is an important method for document analysis and is used in many different information retrieval applications. This paper proposes a new document clustering model using the clustering method based NMF(non-negative matrix factorization) and refinement of documents in cluster by using coherence of cluster. The proposed method can improve the quality of document clustering because the re-assigned documents in cluster by using coherence of cluster based similarity between documents, the semantic feature matrix and the semantic variable matrix, which is used in document clustering, can represent an inherent structure of document set more well. The experimental results demonstrate appling the proposed method to document clustering methods achieves better performance than documents clustering methods.
https://doi.org/10.6109/JKIICE.2009.13.12.2603 인용 PDF KSCI

Similarity Measure and Clustering Technique for XML Documents by a Parent-Child Matrix (부모-자식 행렬을 사용한 XML 문서 유사도 측정과 군집 기법)

Lee, Yun-Gu;Kim, Woosaeng
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.19 no.7
- /
- pp.1599-1607
- /
- 2015
Recently, researches have been developing efficient techniques for accessing, querying, and managing XML documents which are frequently used in the Internet. In this paper, we propose a parent-child matrix to cluster XML documents efficiently. A parent-child matrix analyzes both the content and structural features of an XML document. Each cell of a parent-child matrix has either the value of a node in an XML tree or the value of a child node, where a parent-child relationship exists in the XML tree. Then, the similarity between two XML documents can be measured by the similarity between two corresponding parent-child matrices. The experiment shows that our proposed method has good performance.
https://doi.org/10.6109/jkiice.2015.19.7.1599 인용 PDF KSCI KPUBS HTML

Analysis of Singapore's BIM tender documents for the development of infrastructure BIM guidelines in Korea (국외 BIM 발주지침 분석을 통한 국내 토목 분야 BIM 가이드라인 개발 방향 제시에 관한 연구 - 싱가폴 토목 사업 과업지시서를 중심으로-)

Koo, Bon-Sang;Ok, Hyun;Yu, Young-Su;Jung, Rae-Kyu
- Journal of KIBIM
- /
- v.8 no.2
- /
- pp.19-28
- /
- 2018
Recent increase in the interest and adoption of BIM for infrastructure projects has created a need for formal BIM guidelines in the civil engineering domain. Currently a BIM guideline has been developed in Korea exclusively for the road sector. However, the guideline has gaps in the specification of how BIM models should be generated, managed and applied for maximum effect in projects. This study reviewed the guidelines and tender documents of Singapore to determine potential improvements to adopt in Korea. Results showed that Korea's guideline should focus more on process integration as to stipulating BIM deliverables, encourage a common data environment, clearly distinguish between compulsory and selective BIM applications, and require data and models that can be leveraged in the operation phase of the facility.
https://doi.org/10.13161/kibim.2018.8.2.019 인용 PDF KSCI

Ranking Decision Method of Retrieved Documents Using User Profile from Searching Engine (검색 엔진에서 사용자 프로파일을 이용한 문서 순위결정 방법)

Kim Yong-Ho;Kim Hyeong-Gyun
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.10 no.9
- /
- pp.1590-1595
- /
- 2006
This paper proposes a technique of user oriented document ranking using user refile to provide more satisfied results which reflect preference of specific users. User profile is constructed to represent his or her preference. User pfofile consists of 'term array' and 'preference vector' according to the interest field of one. And the User profile for a particular person is updated by 'user access', 'latent relaeon', 'User Profile' proposed in this paper. The latent structures of documents in same domain are analysed by singular value decomposition(SVD). Then, the rank of documents is determined by comparison of user profile with analyzed document on the basis of relevance.
PDF KSCI

A Study on Effective Internet Data Extraction through Layout Detection

Sun Bok-Keun;Han Kwang-Rok
- International Journal of Contents
- /
- v.1 no.2
- /
- pp.5-9
- /
- 2005
Currently most Internet documents including data are made based on predefined templates, but templates are usually formed only for main data and are not helpful for information retrieval against indexes, advertisements, header data etc. Templates in such forms are not appropriate when Internet documents are used as data for information retrieval. In order to process Internet documents in various areas of information retrieval, it is necessary to detect additional information such as advertisements and page indexes. Thus this study proposes a method of detecting the layout of Web pages by identifying the characteristics and structure of block tags that affect the layout of Web pages and calculating distances between Web pages. This method is purposed to reduce the cost of Web document automatic processing and improve processing efficiency by providing information about the structure of Web pages using templates through applying the method to information retrieval such as data extraction.
PDF

A Novel Technique of Topic Detection for On-line Text Documents: A Topic Tree-based Approach (온라인 텍스트문서의 계층적 트리 기반 주제탐색 기법)

Xuan, Man;Kim, Han-Joon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2012.11a
- /
- pp.396-399
- /
- 2012
Topic detection is a problem of discovering the topics of online publishing documents. For topic detection, it is important to extract correct topic words and to show the topical words easily to understand. We consider a topic tree-based approach to more effectively and more briefly show the result of topic detection for online text documents. In this paper, to achieve the topic tree-based topic detection, we propose a new term weighting method, called CTF-CDF-IDF, which is simple yet effective. Moreover, we have modified a conventional clustering method, which we call incremental k-medoids algorithm. Our experimental results with Reuters-21578 and Google news collections show that the proposed method is very useful for topic detection.
https://doi.org/10.3745/PKIPS.y2012m11a.396 인용 PDF

A View from the Bottom: Project-Oriented Risk Mining Approach for Overseas Construction Projects

Lee, JeeHee;Son, JeongWook;Yi, June-Seong
- International conference on construction engineering and project management
- /
- 2015.10a
- /
- pp.97-100
- /
- 2015
Analysis of construction tender documents in overseas projects is a very important issue from a risk management point of view. Unfortunately, majority of construction firms are biased by winning contracts without in-depth analysis of tender documents. As a result, many contractors have incurred loss in overseas projects. Although a lot of risk analysis techniques have been introduced, most of them focus project's external unexpected risks such as country conditions and owner's financial standing. However, because those external risks are difficult to control and take preemptive action, we need to concentrate on project inherent risks. Based on this premise, this paper proposes a project-oriented risk mining approach which could detect and extract project risk factors automatically before they are materialized and assess them. This study presents a methodology regarding how to extract potential risks which exist in owner's project requirements and project tender documents using state of the art data analysis method such as text mining, data mining, and information visualization. The project-oriented risk mining approach is expected to effectively reflect project characteristics to the project risk management and could provide construction firms with valuable business intelligence.
PDF

The Managing Records for ISO 9000 Compliance in Engineering Corporation (ISO 9000 요건하에서 엔지니어링업체의 기록관리시스템 고찰)

이상복
- Proceedings of the Korean Society for Information Management Conference
- /
- 1998.08a
- /
- pp.115-118
- /
- 1998
This article introduces definition and theoretical background of the managing records for ISO 9000 compliance, especially, quality record management and describes the method of establishing efficient system for the control of quality records in engineering corporation. To establish the best control system of quality records, the organization must not only understand ISO Code requirements for quality record completely but also identify the documents to be controlled as a quality records correctly. This will provide the guidance which need to establish the system for quality control to the organization which produces documents in accordance with ISO Code requirements.
PDF

Design of Templating System for Web Publication (웹 출판을 위한 템플릿 시스템의 설계)

Abdallah, Hisham;Koo, Heung-Seo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2002.11c
- /
- pp.1777-1780
- /
- 2002
This paper presents a well-designed templating system for CMS web Publication using XML/XSL technology. The primary motivation is the need of Web CMS to separate content from layout and logic. Our system provides GUI XSLT editor (x-editor) to create and modify XSLT stylesheet documents easily. These documents are used to add "layout" and "look and feel" information to XML document which contains content and functionality. The modified XML document is processed by XML-template engine to produce dynamic or static web sites.
PDF

Search Result 1,074, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)