• Title/Summary/Keyword: document structure

Partitioning and Merging an Index for Efficient XML Keyword Search (효율적 XML키워드 검색을 인덱스 분할 및 합병)

  • Kim, Sung-Jin;Lee, Hyung-Dong;Kim, Hyoung-Joo
    • Journal of KIISE:Databases / v.33 no.7 / pp.754-765 / 2006
  • In XML keyword search, a search result is defined as the set of smallest elements (i.e., least common ancestors) that contain all query keywords, and the unit of indexing is an XML element rather than a document. Under the conventional index structure, every least common ancestor produced by combining elements that each contain a query keyword is treated as part of the search result. In this paper, to avoid unnecessary computation of least common ancestors and to reduce query processing time, we describe how to construct an index composed of several partitions and how to produce a search result by merging those partitions when necessary. When the search result is restricted to least common ancestors whose depths are greater than a given minimum depth, search systems using the proposed partitioned index can reduce query processing time by considering only combinations of elements that belong to the same partition. Even when the minimum depth is not given or is unknown, search systems can still obtain a search result with the partitioned index, in the same query processing time as with a non-partitioned index. Our experiments were conducted on XML documents provided by the DBLP site and INEX2003, and the partitioned index substantially reduced query processing time when the minimum depth was given.
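
    The partitioning idea in this abstract can be illustrated with a minimal Python sketch. It assumes that elements are identified by Dewey-style labels (tuples of child positions from the root), which the abstract does not state, and it treats the minimum depth as inclusive. The function names `lca`, `partition`, and `lcas_with_min_depth` are hypothetical, and the sketch enumerates pairwise least common ancestors for two keywords only; it does not reproduce the paper's index layout or its merging step.

```python
from collections import defaultdict
from itertools import product


def lca(a, b):
    """Least common ancestor of two Dewey labels = their longest common prefix."""
    prefix = []
    for x, y in zip(a, b):
        if x != y:
            break
        prefix.append(x)
    return tuple(prefix)


def partition(postings, min_depth):
    """Group labels by their ancestor at depth min_depth (assumed partition key)."""
    parts = defaultdict(list)
    for label in postings:
        if len(label) >= min_depth:  # shallower elements cannot qualify
            parts[label[:min_depth]].append(label)
    return parts


def lcas_with_min_depth(postings_a, postings_b, min_depth):
    """LCAs of depth >= min_depth, combining only elements in the same partition."""
    parts_a = partition(postings_a, min_depth)
    parts_b = partition(postings_b, min_depth)
    result = set()
    for key in parts_a.keys() & parts_b.keys():
        for a, b in product(parts_a[key], parts_b[key]):
            result.add(lca(a, b))
    return result


# Elements containing keyword A and keyword B, respectively (made-up labels).
keyword_a = [(1, 1, 2), (1, 2, 1)]
keyword_b = [(1, 1, 3), (1, 3, 1)]
answers = lcas_with_min_depth(keyword_a, keyword_b, min_depth=2)
```

    Two labels share an ancestor at depth d exactly when their first d components agree, so combinations across different partitions can be skipped without losing any qualifying result.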

The Design and Implementation of The Amendment Statement Automatic Generated System for Attached Tables in Legislation (법령 내 별표 서식에 대한 개정지시문 자동 생성 시스템의 설계 및 구현)

  • Cho, Sung Soo;Jo, Dae Woong;Kim, Myung Ho
    • Journal of the Korea Society of Computer and Information / v.19 no.4 / pp.111-122 / 2014
  • Unlike ordinary documents, legislation consists of social norms that have a large direct or indirect impact on social, corporate, and personal matters. It also changes constantly over time through enactment, amendment, and repeal. An automatic amendment statement generation system is used to promulgate these changes. However, the existing system can generate amendment statements only for the body text of a law, by comparing and analyzing the current legislation and the amended legislation. Actual legislation also contains attached tables with a complex tabular structure in addition to simple body text. In this paper, we add attached-table processing to the existing automatic amendment statement generation system, which could not handle attached tables. To process attached tables, we analyzed the grammar used to generate amendment statements and the table structure of attached tables in legislation. We also propose a method for comparing the tables within attached tables. As a result, amendment statements can be generated automatically for legislative documents in various forms.

Job Characteristics of Care Workers in Elderly Care Voucher Service as a Quality Element (사회서비스 품질 요소로서 제공인력의 근무특성 : 노인돌보미 바우처 사업을 중심으로)

  • Choi, Eun-Young
    • Korea journal of population studies / v.33 no.3 / pp.101-121 / 2010
  • The purpose of this study is to examine the job characteristics of care workers in the elderly care voucher service, emphasizing a social service quality management approach. The study sample comprised 233 randomly selected centers that dispatched care staff to clients' homes. Descriptive analyses were performed to examine the unique aspects of the relationship-based labor of care staff, and logistic regression analyses were performed to investigate the association between service quality structure and human rights violations against staff. As the first empirical study focusing on staff-side service quality factors, this study found that human rights violations against staff were mainly influenced by the center's record-keeping and document management capacity, risk protection under insurance, compliance with standard contract procedures, and regular supervision. These results suggest that particular policy attention should be given to basic protection for care workers and to setting the boundaries of their core activities, as well as to client-centered rights, both to prevent human rights violations and to improve overall social service quality.

Analysis of Volatile Organic Compounds Produced from Incineration of Papers at 600°C (600°C에서 제지류 소각시 발생하는 휘발성 유기화합물 농도분석 연구)

  • 이병규;조정범
    • Journal of Environmental Science International / v.11 no.10 / pp.1109-1116 / 2002
  • This study analyzed concentrations of volatile organic compounds (VOCs) produced from the incineration of papers at 600°C. The papers used in this study included A4 papers (new, printed with ink-jet, printed with carbon), newspapers (printed with bean oil, printed with a general newspaper ink), packaging box, document envelope, single-use paper cup, and cosmetic tissue. Papers were heated from room temperature up to 600°C while air was supplied to the inside of the electric furnace, and they were then oxidized for 80 minutes at 600°C with the same air supply. VOCs emitted from the incineration process were sampled using an air sampling pump and bags for 160 minutes, and the components and concentrations of the VOCs were then analyzed by GC-MS. The most prominent chemical structure among the VOCs identified from incineration of the papers was furans, followed by aromatics and aliphatic alkenes. About 40% of the identified VOCs contained double bonds within their molecular structure, which gives them a relatively high ground-level ozone formation potential. In addition, some suspected carcinogens such as benzene, dichloromethane, and chloroform were identified.

A Study on the Elaboration of Request for Proposal of Localization Parts using AHP method (AHP 기법을 적용한 부품국산화 제안요청서 정교화 연구)

  • Song, Hyeong-Min
    • Journal of the Korea Academia-Industrial cooperation Society / v.21 no.1 / pp.35-44 / 2020
  • The purpose of this study is to elaborate the request for proposal (RFP) for the core-parts localization development support project carried out by the Defense Agency for Technology and Quality. The RFP is the most important document throughout a parts localization project, covering project announcement and developer selection, design and testing of the developed product, final evaluation, and standardization. However, if the RFP is not firmly established at the beginning of the project, the risk of business failure increases because of frequent changes made for various reasons. In this study, we recognized the need to elaborate the RFP and applied the AHP method for quantitative elaboration. Eight RFP requirements related to the mechanical/electrical performance of localized development products, and three elaboration methods for each requirement, were arranged in a hierarchical structure, and each weight was calculated by applying the 5-point-scale AHP method. The AHP survey was conducted with 20 developers participating in the parts localization project, and the consistency ratio of the survey results was less than 0.1. The elaboration method with the highest calculated weight is identified, and the analysis results and future research directions for the elaboration methods are presented.
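
    The weight calculation and the consistency check mentioned in this abstract can be sketched in Python as follows. This is a generic AHP illustration, not the authors' procedure: it assumes the row geometric-mean approximation for the priority weights and Saaty's commonly tabulated random index values, and the 3x3 pairwise comparison matrix is made-up data standing in for three elaboration methods. The names `ahp_weights` and `consistency_ratio` are hypothetical.

```python
import math

# Saaty's random consistency index (RI) for matrix sizes 3..9.
RANDOM_INDEX = {3: 0.58, 4: 0.90, 5: 1.12, 6: 1.24, 7: 1.32, 8: 1.41, 9: 1.45}


def ahp_weights(matrix):
    """Approximate priority weights by normalizing the row geometric means."""
    n = len(matrix)
    geo_means = [math.prod(row) ** (1.0 / n) for row in matrix]
    total = sum(geo_means)
    return [g / total for g in geo_means]


def consistency_ratio(matrix, weights):
    """CR = CI / RI, with CI = (lambda_max - n) / (n - 1)."""
    n = len(matrix)
    weighted = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
    # Estimate the principal eigenvalue as the mean of (A w)_i / w_i.
    lambda_max = sum(weighted[i] / weights[i] for i in range(n)) / n
    ci = (lambda_max - n) / (n - 1)
    return ci / RANDOM_INDEX[n]


# Hypothetical pairwise comparisons of three elaboration methods.
comparisons = [
    [1.0, 3.0, 5.0],
    [1.0 / 3.0, 1.0, 3.0],
    [1.0 / 5.0, 1.0 / 3.0, 1.0],
]
weights = ahp_weights(comparisons)
cr = consistency_ratio(comparisons, weights)
best_method = max(range(len(weights)), key=lambda i: weights[i])
```

    A consistency ratio below 0.1 is the conventional threshold for accepting a set of pairwise judgments, which matches the criterion reported in the abstract.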

A Study on the Landscape Structure and Meaning of Eight Scenic Views of Yeongsa-jeong Pavilion through the Painting and Poem (<영사정팔경도(永思亭八景圖)>와 팔영시로 본 영사정팔경의 경관구조와 의미)

  • Rho, Jae-Hyun;Son, Hee-Kyung;Kim, Hong-Kyun
    • Journal of the Korean Institute of Traditional Landscape Architecture / v.35 no.2 / pp.58-68 / 2017
  • This study analyzed and interpreted the landscape structure and meaning of Yeongsajeongpalkyung (永思亭八景) as it appears in Yeongsajeongpalyeongsi (永思亭八詠詩) by Cheonggye (靑溪) Yang, Dae-bak (梁大樸, 1544~1592), through document studies, analysis and interpretation of the poems and the painting, and site investigation. The conclusions are as follows. Yeongsajeong and its nearby lands are the area of "Yeongsa", where the builder, Ahn, Jeon(安?, 1518~1571), worshipped toward the graves of his ancestors, and Yeongsajeongpalkyung overlooks a family burial ground in Namwon, centering on Yeongsajeong and places such as Yocheon, Geumseokgyo, and Cheonggyedong, along with the Sunjagang River and Mount Jiri, which are the footholds and key points of advantageous scenic views in Namwon. Unlike general Jeongjapalkyung, Yeongsajeongpalkyung shows a panoramic bird's-eye structure overlooking the landscape and scenery of the Yocheon area and the Sunjagang River in addition to Yeongsajeong, while also showing a transition of location, a multi-view structure, and time. Traces of visual unity with Sosangpalkyung of China can be seen in many places in Yeongsajeongpalkyung, which seems to be a transitional feature of Palgyeong poetry in the mid-Joseon dynasty, pursuing harmony with the local landscape of the Namwon area. The 'Changsongchwijuk(蒼松翠竹)' appearing in the first and second scenic views of Palgyeong and Yeongsajeongpalyeong can be understood as an incarnation of Yang, Dae-bak, the author of Palyeongsi, or of Ahn, Jeon, the builder of Yeongsajeong. On the other hand, as a result of interpreting the yin-yang features of the poetic diction and picture elements appearing in the subtitles of Yeongsajeongpalyeong, Palyeongsi seems mostly full of yin-like elements and Palgyeongdo.
Moreover, a comparison of the acts expressed in and the acts described in Yeongsajeongpalyeong shows that there is almost no common ground between the two media except for Soongangmowoo, the third scenic view. The formal similarity between the two media can therefore be acknowledged, but it is difficult to discover any substantive 'integrity of poetry and painting'.

PIRS : Personalized Information Retrieval System using Adaptive User Profiling and Real-time Filtering for Search Results (적응형 사용자 프로파일기법과 검색 결과에 대한 실시간 필터링을 이용한 개인화 정보검색 시스템)

  • Jeon, Ho-Cheol;Choi, Joong-Min
    • Journal of Intelligence and Information Systems / v.16 no.4 / pp.21-41 / 2010
  • This paper proposes a system that serves users with appropriate search results through real-time filtering, and implements a personalized information retrieval system (PIRS) based on adaptive user profiling that uses users' implicit feedback, in order to address the problem that existing search systems such as Google or MSN do not satisfy users' varied personal search needs. One reason existing search systems find it hard to satisfy these needs is that users' search intentions are uncertain and therefore difficult to recognize. The uncertainty of search intentions means that users may want different search results for the same query. For example, when a user enters the query "java", the user may want results about Java the programming language, Java coffee, or the Indonesian island of Java. In other words, this uncertainty is due to the ambiguity of search queries. Moreover, the fewer words a query contains, the greater this uncertainty. Real-time filtering of search results returns only those results that belong to the user-selected domain for a given query. Although it looks similar to a general directory search, it differs in that the search is executed over all web documents rather than sites, and each document in the search results is classified into the given domain in real time. By applying information filtering that uses real-time directory classification of search results to personalization, the number of results delivered to users is effectively reduced, and satisfaction with the results is improved. In this paper, a user preference profile has a hierarchical structure and consists of domains, used queries, and selected documents.
Because the hierarchical structure of the user preference profile can reflect the context in which users performed a search, it can deal with the uncertainty of user intentions: for the same query, the intention may differ according to context such as time or place. Furthermore, this structure can more effectively track a user's web document search behavior for each domain and recognize changes in user intentions in a timely manner. The IP address of each device was used to identify each user, and the user preference profile is continuously updated based on observed user behavior on search results. We also measured user satisfaction with search results by observing user behavior on the selected search result. Our proposed system automatically recognizes user preferences by using implicit feedback from users, such as the time spent on the selected search result and the exit condition from the page, and dynamically updates their preferences. Whenever a user performs a search, our system looks up the user preference profile for the given IP address; if the file does not exist, a new user preference profile is created on the server, otherwise the file is updated with the transmitted information. If the file does not exist on the server, the system provides Google's results to users, and the reflection value is increased or decreased whenever the user searches. We carried out experiments to evaluate the performance of the adaptive user preference profile technique and real-time filtering, and the results are satisfactory. According to our experimental results, participants were satisfied with an average of 4.7 documents in the top-10 search list when using the adaptive user preference profile technique with real-time filtering, and this result shows that our method outperforms Google's by 23.2%.
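
The hierarchical profile described in this abstract (domains, then used queries, then selected documents, keyed by IP address) can be pictured with a small Python sketch. Everything concrete here is an assumption made for illustration: the class and function names, the 30-second dwell-time threshold, and the rule of adding or subtracting a fixed step are hypothetical and are not taken from the paper, which does not specify how its reflection value is computed.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class QueryRecord:
    # Selected document URL -> accumulated preference score (hypothetical).
    documents: dict = field(default_factory=lambda: defaultdict(float))


@dataclass
class DomainRecord:
    # Query string -> record of documents selected for that query.
    queries: dict = field(default_factory=lambda: defaultdict(QueryRecord))


@dataclass
class UserProfile:
    # Domain name -> queries issued within that domain.
    domains: dict = field(default_factory=lambda: defaultdict(DomainRecord))


def record_feedback(profiles, ip, domain, query, url, dwell_seconds,
                    threshold=30.0, step=1.0):
    """Create the profile for this IP if missing, then update it from dwell time.

    The threshold and step are illustrative assumptions: a long stay raises
    the document's score, a short stay lowers it.
    """
    profile = profiles.setdefault(ip, UserProfile())
    docs = profile.domains[domain].queries[query].documents
    docs[url] += step if dwell_seconds >= threshold else -step
    return docs[url]


profiles = {}
first = record_feedback(profiles, "10.0.0.1", "programming", "java",
                        "http://example.com/java", dwell_seconds=45)
second = record_feedback(profiles, "10.0.0.1", "programming", "java",
                         "http://example.com/java", dwell_seconds=5)
```

Keeping the domain level above the query level lets the same query string ("java") carry separate document scores in different domains, which is the ambiguity the abstract sets out to handle.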

IPC Multi-label Classification based on Functional Characteristics of Fields in Patent Documents (특허문서 필드의 기능적 특성을 활용한 IPC 다중 레이블 분류)

  • Lim, Sora;Kwon, YongJin
    • Journal of Internet Computing and Services / v.18 no.1 / pp.77-88 / 2017
  • Recently, with the advent of a knowledge-based society in which information and knowledge create value, patents, the representative form of intellectual property, have become important, and the number of patents keeps growing. Thus, patents need to be classified appropriately according to the technological topic of the invention in order to use the vast amount of patent information effectively. The IPC (International Patent Classification) is widely used for this purpose. Automatic IPC classification has been studied using data mining and machine learning algorithms to improve the current IPC classification task, in which patent documents are categorized by hand. However, most previous studies have focused on applying various existing machine learning methods to patent documents rather than considering the characteristics of the data or the structure of patent documents. In this paper, therefore, we propose using two structural fields, technical field and background, which are considered to affect patent classification; the two fields are selected based on the characteristics of patent documents and the role of the structural fields. We also construct a multi-label classification model to reflect the fact that a patent document can have multiple IPCs. Furthermore, we propose a method to classify patent documents at the IPC subclass level, comprising 630 categories, in order to investigate the possibility of applying the IPC multi-label classification model in practice. The effect of the structural fields of patent documents is examined using 564,793 patents registered in Korea, and a precision of 87.2% is obtained when using title, abstract, claims, technical field, and background. From this result, we verify that the technical field and background play an important role in improving the precision of IPC multi-label classification at the IPC subclass level.
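
    As a rough illustration of the setup in this abstract, the Python sketch below builds a classifier input from selected patent fields and scores multi-label predictions. The field keys, the function names `build_input` and `micro_precision`, and the choice of micro-averaged precision are assumptions; the abstract does not say how its precision figure is averaged, and no classifier is trained here.

```python
# Field names follow the abstract; the dictionary keys themselves are assumed.
SELECTED_FIELDS = ("title", "abstract", "claims", "technical_field", "background")


def build_input(patent, fields=SELECTED_FIELDS):
    """Concatenate the chosen structural fields into one text for a classifier."""
    return " ".join(patent.get(name, "") for name in fields)


def micro_precision(predicted, actual):
    """Share of predicted IPC labels that are correct, pooled over all patents."""
    correct = sum(len(p & a) for p, a in zip(predicted, actual))
    total = sum(len(p) for p in predicted)
    return correct / total if total else 0.0


# Made-up example: each patent may carry several IPC subclass labels.
patent = {"title": "Packet router", "technical_field": "Network switching"}
text = build_input(patent)
predicted = [{"G06F", "H04L"}, {"A61K"}]
actual = [{"G06F"}, {"A61K", "A61P"}]
score = micro_precision(predicted, actual)
```

    Representing each patent's labels as a set makes the multi-label nature explicit: a prediction can be partly right, and precision counts only the labels that were actually predicted.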

X-tree Diff: An Efficient Change Detection Algorithm for Tree-structured Data (X-tree Diff: 트리 기반 데이터를 위한 효율적인 변화 탐지 알고리즘)

  • Lee, Suk-Kyoon;Kim, Dong-Ah
    • The KIPS Transactions:PartC / v.10C no.6 / pp.683-694 / 2003
  • We present X-tree Diff, a change detection algorithm for tree-structured data. Our work is motivated by the need to monitor a massive volume of web documents and detect suspicious changes, called defacement attacks, on web sites. In this context, our algorithm must be very efficient in speed and memory use. X-tree Diff uses a special ordered labeled tree, the X-tree, to represent XML/HTML documents. X-tree nodes have a special field, tMD, which stores a 128-bit hash value representing the structure and data of a subtree, so that identical subtrees from the old and new versions can be matched. During this process, X-tree Diff uses the Rule of Delaying Ambiguous Matchings, which means that it performs exact matching only where a node in the old version has a one-to-one correspondence with a node in the new version, delaying all the others. This drastically reduces the possibility of wrong matchings. X-tree Diff propagates such exact matchings upwards in Step 2, and obtains more matchings downwards from the roots in Step 3. In Step 4, the nodes to be inserted or deleted are decided. We also show that X-tree Diff runs in O(n) time, where n is the number of nodes in the X-trees, in the worst case as well as in the average case. This result is even better than that of the BULD Diff algorithm, which is O(n log(n)) in the worst case. We tested X-tree Diff on real data, about 11,000 home pages from about 20 web sites, instead of synthetic documents manipulated for experimentation. Currently, the X-tree Diff algorithm is used in a commercial hacking detection system called WIDS (Web-Document Intrusion Detection System), which finds changes that have occurred in registered web sites and reports suspicious changes to users.
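
    The first matching stage described in this abstract can be sketched in Python. This is a simplified illustration under stated assumptions, not the published algorithm: MD5 is used only because it yields a 128-bit digest (the abstract does not name the hash function), the `Node` class and function names are hypothetical, and the later steps (upward propagation, downward matching, and deciding insertions and deletions) are omitted.

```python
import hashlib
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass(eq=False)
class Node:
    label: str
    text: str = ""
    children: list = field(default_factory=list)
    digest: bytes = b""  # plays the role of the tMD field in the abstract


def compute_digest(node):
    """Hash a node's label, text, and ordered child digests (bottom-up)."""
    h = hashlib.md5()  # 128-bit digest; the choice of MD5 is an assumption
    h.update(node.label.encode("utf-8") + b"\x00" + node.text.encode("utf-8") + b"\x00")
    for child in node.children:
        h.update(compute_digest(child))
    node.digest = h.digest()
    return node.digest


def index_by_digest(root):
    """Map each subtree digest to the nodes that carry it."""
    table = defaultdict(list)
    stack = [root]
    while stack:
        node = stack.pop()
        table[node.digest].append(node)
        stack.extend(node.children)
    return table


def unambiguous_matches(old_root, new_root):
    """Match subtrees only when a digest occurs exactly once in each version."""
    compute_digest(old_root)
    compute_digest(new_root)
    old_table = index_by_digest(old_root)
    new_table = index_by_digest(new_root)
    matches = []
    for digest, old_nodes in old_table.items():
        new_nodes = new_table.get(digest, [])
        if len(old_nodes) == 1 and len(new_nodes) == 1:
            matches.append((old_nodes[0], new_nodes[0]))
        # Otherwise the match is ambiguous (or absent) and is left for later.
    return matches


old = Node("html", children=[Node("p", "a"), Node("p", "b"), Node("p", "b")])
new = Node("html", children=[Node("p", "a"), Node("p", "c"), Node("p", "b")])
matches = unambiguous_matches(old, new)
```

    In this example the two identical `p` nodes with text "b" in the old version are not matched at this stage, because their digest does not identify a single node; deferring such cases is the point of the rule named in the abstract.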

A study on characteristics of palace wallpaper in the Joseon Dynasty - Focusing on Gyeongbokgung Palace, Changdeokgung Palace and Chilgung Palace - (조선시대 궁궐 도배지 특성 연구 - 경복궁, 창덕궁, 칠궁을 중심으로 -)

  • KIM Jiwon;KIM Jisun;KIM, Myoungnam;JEONG Seonhwa
    • Korean Journal of Heritage: History & Science / v.56 no.1 / pp.80-97 / 2023
  • Using wallpaper specimens taken from Gyeongbokgung Palace, Changdeokgung Palace, and Chilgung Palace, which have been preserved from the late Joseon Dynasty to the present, this study set out to determine the types and characteristics of the paper used as wallpaper by the Joseon royal family. First, through archival research, we confirmed the features of paper hanging in the palaces from old literature on the wallpaper used by the royal family. Second, we conducted a field survey of royal palaces whose construction period was relatively clear, sampled specimens, and analyzed the first layer of wallpaper directly attached to the wall structure. As a result, we confirmed that the main raw material was hanji, which the royal family used as wallpaper, and, by analyzing the blue-colored paper, identified the types of substances (dyes and pigments) used to produce a blue color in spaces that required formality. Based on the results of the analysis, we checked the documents against the existing wallpaper by comparing the old literature related to wallpaper records of the Joseon Dynasty palaces. We also built a database for the restoration of cultural properties, for use when conserving wallpaper in the royal palaces. We examined the changes in wallpaper types by century and by place of use by extracting wallpaper-related content recorded in 36 cases of Uigwe from the 17th to 20th centuries. As a result, it was found that the names used for document paper and wallpaper did not differ, and thus document paper and wallpaper were used without distinction during the Joseon Dynasty.
Although the types of wallpaper differed by period, it was confirmed that the foundation of wallpaper continued until the late Joseon Dynasty with Baekji(white hanji), Hubaekji(thick white paper), jeojuji(common hanji used to write documents), chojuji(hanji used as a draft for writing documents) and Gakjang(a wide and thick hanji used as a pad). Through fiber identification based on the morphological characteristics of fibers and the normal color reaction(KS M ISO 9184-4: Graff "C" staining test) for the first layer of paper directly attached to the palace wall, the main materials of the hanji used by the royal family were confirmed, and the raw materials used to make hanji in palace buildings were determined according to construction period. Also, by analyzing the coloring materials of the blue decorative paper with an optical microscope, ultraviolet-visible spectroscopic analysis(UV-Vis), and X-ray diffraction analysis(XRD), we determined the types of dyes and pigments in the blue decorative paper used in palace spaces that required formality, and identified that the raw materials used to produce the blue color were natural indigo, lazurite and cobalt blue.