• Title/Summary/Keyword: Knowledge retrieval

Hypermedia, Multimedia and Hypertext: Definitions and Overview (하이퍼미디어.멀티미디어.하이퍼텍스트: 정의(定義)와 개관(槪觀))

  • Kim, Ji-Hee
    • Journal of Information Management / v.25 no.1 / pp.24-46 / 1994
  • This paper discusses definitions of hypermedia, multimedia and hypertext. Hypertext is the grouping of relevant information in the form of nodes, which are connected through links. In hypertext the nodes contain text or graphics. Multimedia is the combination of different media types, for example sound, animation, text, graphics and video, for the presentation of information by computer. Hypermedia can be viewed as an extension of hypertext and multimedia. It is based on the hypertext concept of using nodes and links to structure the information in the system; in this case the nodes consist of all the different data types mentioned in the multimedia definition above. The 'node-and-link' concept is used to organise the information in hypermedia systems, and the 'book' metaphor is an example of the way these systems are implemented. This concept is explained, and a few advantages and disadvantages of hypermedia systems are discussed. A new approach to the development of hypermedia systems, the knowledge-based approach, is then examined. Joel Peing-Ling Loo proposed this approach because he considered it the most effective way of handling this kind of technology. In this approach a semantic-based hypermedia model is developed to address the restrictions in information presentation, authoring, maintenance and retrieval. The knowledge-based presentation of information includes the use of conventional data structures. These data structures make use of frames (objects), slots and the inheritance theory that is also used in expert systems. Relations develop between the different objects as these objects are included in the database. Relations can also exist between frames by means of attributes that belong to the frames.
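
The frame, slot and inheritance structure described in this abstract can be illustrated with a minimal sketch. It is not the model from the paper: the `Frame` class, the slot names (`media`, `links`, `related_to`) and the example nodes are all hypothetical, chosen only to show slot lookup through inheritance and a relation expressed as an attribute.

```python
class Frame:
    """A frame (object) with named slots and an optional parent frame."""

    def __init__(self, name, parent=None, **slots):
        self.name = name
        self.parent = parent
        self.slots = dict(slots)

    def get(self, slot):
        """Look up a slot locally, then walk up the parent chain (inheritance)."""
        frame = self
        while frame is not None:
            if slot in frame.slots:
                return frame.slots[slot]
            frame = frame.parent
        raise KeyError(slot)


# Hypothetical hypermedia nodes: a generic node and a specialised video node.
node = Frame("Node", media="text", links=())
video_node = Frame("VideoNode", parent=node, media="video")

# A relation between frames expressed through an attribute (slot) value.
chapter = Frame("Chapter", parent=node, related_to=video_node)
```

Here `video_node.get("media")` returns its own value, while `video_node.get("links")` falls back to the value inherited from `node`.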

Design and Implementation of Thesaurus System for Geological Terms (지질용어 시소러스 시스템의 설계 및 구축)

  • Hwang, Jaehong;Chi, KwangHoon;Han, JongGyu;Yeon, Young Kwang;Ryu, Keun Ho
    • Journal of the Korean Association of Geographic Information Studies / v.10 no.2 / pp.23-35 / 2007
  • With the development of semantic web technologies in the information retrieval area, the need for thesauri has recently been increasing, along with that for internet lexicons. A thesaurus is the combination of a classification and a lexicon: it is a topic map of a knowledge structure that expresses relations among concepts (terms) involved in human knowledge activities such as learning and research, using formally organized and controlled index terms to clarify the context of superordinate and subordinate concepts. However, although thesauri are regarded as essential tools for controlling and standardizing terms and for searching and processing information efficiently, there is no Korean thesaurus for geology. Building a thesaurus requires standardized and well-defined guidelines, which enable efficient information management and help users find correct information easily and conveniently. The present study aimed to build a thesaurus system with terms used in geology. First, we surveyed related work on standardizing geological terms in Korea and other countries. Second, we defined geological topics in 15 areas and prepared a (draft) classification system for each topic. Third, based on the geological thesaurus classification system, we created the specification of the geological thesaurus. Lastly, we designed and implemented an internet-based geological thesaurus system using the specification.

A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model (키워드 자동 생성에 대한 새로운 접근법: 역 벡터공간모델을 이용한 키워드 할당 방법)

  • Cho, Won-Chin;Rho, Sang-Kyu;Yun, Ji-Young Agnes;Park, Jin-Soo
    • Asia Pacific Journal of Information Systems / v.21 no.1 / pp.103-122 / 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful. For this reason, some on-line documents are accompanied by a list of keywords specified by the authors in an effort to guide users by facilitating the filtering process. In this way, a set of keywords is often considered a condensed version of the whole document and therefore plays an important role in document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could also benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, implementation is the obstacle: manually assigning keywords to all documents is a daunting or even impractical task, because it is extremely tedious and time-consuming and requires a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are two main approaches to achieving this aim: keyword assignment and keyword extraction. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given vocabulary, and the aim is to match its terms to the texts. In other words, the keyword assignment approach seeks to select the words from a controlled vocabulary that best describe a document.
Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. In the latter approach, on the other hand, the aim is to extract keywords according to their relevance in the text, without a prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted using supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document as positive or negative examples. Several systems, such as Extractor and Kea, were developed using the keyword extraction approach. The most indicative words in a document are selected as keywords for that document; as a result, keyword extraction is limited to terms that appear in the document and cannot generate implicit keywords that are not included in it. According to Turney's experimental results, about 64% to 90% of the keywords assigned by authors can be found in the full text of an article. Conversely, 10% to 36% of author-assigned keywords do not appear in the article and therefore cannot be generated by keyword extraction algorithms. Our preliminary experiment also shows that 37% of the keywords assigned by authors are not included in the full text. This is why we decided to adopt the keyword assignment approach. In this paper, we propose a new approach to automatic keyword assignment, namely IVSM (Inverse Vector Space Model). The model is based on the vector space model, a conventional information retrieval model that represents documents and queries as vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets.
The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: an IVSM system for a Web-based community service and a stand-alone IVSM system. The first is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers and has been tested on a number of academic papers, including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. According to our experiment, the precision of IVSM applied to the Web-based community service and to academic journals was 0.75 and 0.71, respectively. Both systems perform much better than baseline systems that generate keywords based on simple probability. IVSM also shows performance comparable to Extractor, a representative keyword extraction system developed by Turney. As the number of electronic documents increases, we expect that the IVSM proposed in this paper can be applied to many electronic documents in Web-based communities and digital libraries.
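
The five steps listed in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how a keyword set is represented, so the sketch assumes each candidate keyword carries a weighted term vector, and the tokenizer, the toy `KEYWORD_SETS` data and the function names are all hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-letters (a stand-in for real preprocessing)."""
    return re.findall(r"[a-z]+", text.lower())


def norm(vec):
    """Euclidean length of a sparse vector stored as a dict (steps 1 and 3)."""
    return math.sqrt(sum(w * w for w in vec.values()))


def cosine(a, b):
    """Cosine similarity between two sparse vectors (step 4)."""
    denom = norm(a) * norm(b)
    if denom == 0:
        return 0.0
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    return dot / denom


def assign_keywords(document, keyword_sets, top_n=2):
    """Return up to top_n keywords whose vectors are most similar to the document."""
    doc_vec = Counter(tokenize(document))  # step 2: parse into term frequencies
    scores = {kw: cosine(vec, doc_vec) for kw, vec in keyword_sets.items()}
    ranked = sorted(scores, key=scores.get, reverse=True)
    return [kw for kw in ranked[:top_n] if scores[kw] > 0]  # step 5


# Hypothetical data: candidate keyword -> weighted term vector.
KEYWORD_SETS = {
    "logistics": {"port": 2.0, "shipping": 3.0, "cargo": 1.0},
    "information retrieval": {"query": 2.0, "document": 3.0, "index": 1.0},
    "health": {"diet": 2.0, "exercise": 2.0},
}
```

Calling `assign_keywords("Shipping cargo through the port", KEYWORD_SETS)` selects "logistics" even though that word never occurs in the text, which is the kind of implicit keyword that the assignment approach allows and pure extraction does not.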

A Curricular Study on AI & ES in Library and Information Science (문헌정보학에서의 인공지능과 전문가시스템 교육과정 연구)

  • Koo Bon-Young;Park Mi-Young
    • Journal of the Korean Society for Library and Information Science / v.32 no.2 / pp.211-232 / 1998
  • The purpose of this study is to specify the Library and Information Science curriculum content needed to train information professionals to cope with changes in technology and systems. Recognizing the need for Artificial Intelligence and Expert Systems (AI and ES) created by the changing environment of recent information technology, the study also aims to gather fundamental data and to determine which AI and ES content should be introduced into Library and Information Science, and how. The results are summarized as follows. 1. Given the rapid change in advanced information technology and computer applications, the most important skills are, in order of importance, finding available network sources, indexing online databases, analyzing and designing information systems, and applying computers. 2. Among AI and ES content, the most important areas of training for Library and Information Science are database handling, thesauri, natural language processing, and knowledge representation. 3. Library and information science professors recognize that more Library and Information Science students need to be educated in artificial intelligence and expert systems. 4. It is increasingly recognized that, in the coming years, artificial intelligence and expert systems will help information professionals in reference service, cataloging, classification, information retrieval, and document delivery. 5. Reflecting this growing recognition among library and information science professors, courses on AI and ES are expected to be introduced into library and information science curricula nationwide, in order of importance (see 1. above).

A Study on the Indexing System Using a Controlled Vocabulary and Natural Language in the Secondary Legal Information Full-Text Databases : an Evaluation and Comparison of Retrieval Effectiveness (2차 법률정보 전문데이터베이스에 있어서 통제어 색인시스템과 자연어 색인시스템의 검색효율 평가에 관한 연구)

  • Roh Jeong-Ran
    • Journal of the Korean Society for Library and Information Science / v.32 no.4 / pp.69-86 / 1998
  • The purpose of this study is to develop an indexing algorithm for secondary legal information based on the characteristics of legal information, to compare an indexing system using controlled vocabulary with an indexing system using natural language in secondary legal information full-text databases, and to demonstrate the propriety and superiority of the indexing system using controlled vocabulary. The results are as follows: 1) The indexing system using controlled vocabulary in the secondary legal information full-text databases is more effective than the indexing system using natural language in terms of the recall rate, the precision rate, the distribution of propriety, and the ability to find the unique proper records that the indexing system using natural language fails to find. 2) The indexing system that adds more words to the controlled vocabulary does not show better recall or precision than the indexing system using controlled vocabulary alone. 3) The indexing system using word-added controlled vocabulary with an extra weight does not show better recall or precision than the indexing system using word-added controlled vocabulary without an extra weight. This study indicates that, rather than approaching the information system in a purely linguistic, statistical or structural way, it is necessary to build into the system the characteristic information that information experts recognize, that is, the experiential and inherent knowledge that only human beings have, and that doing so can yield a more essential and intelligent information system.

Self-Efficacy as a Predictor of Self-Care in Persons with Diabetes Mellitus: Meta-Analysis

  • Lee, Hyang-Yeon
    • Journal of Korean Academy of Nursing / v.29 no.5 / pp.1087-1102 / 1999
  • Diabetes mellitus, a universal and prevalent chronic disease, is projected to be one of the most formidable worldwide health problems in the 21st century. For those living with diabetes, there is a need for self-care skills to manage a complex medical regimen. Self-efficacy, which refers to one's belief in his/her capability to monitor and perform the daily activities required to manage diabetes, has been found to be related to self-care. The concept of self-efficacy comes from social cognitive theory, which maintains that cognitive mechanisms mediate the performance of behavior. The literature cites several research studies which show a strong relationship between self-efficacy and self-care behavior. Meta-analysis is a technique that enables systematic review and quantitative integration of the results from multiple primary studies that are relevant to a particular research question. Therefore, this study used meta-analysis to quantitatively integrate the results of independent research studies and obtain numerical estimates of the overall effect of self-efficacy on self-care behaviors in diabetic patients. The research proceeded in three stages: 1) literature search and retrieval of studies in which self-efficacy was related to self-care, 2) coding, and 3) calculation of mean effect size and data analysis. Seventeen studies met the research criteria, which included a study population of adults with diabetes, measures of self-care, and measures of self-efficacy as a predictive variable. Effect sizes were computed with DSTAT, a statistical computer program specifically designed for meta-analysis. To determine the effect of self-efficacy on self-care practice, homogeneity tests were conducted.
Pooled effect size estimates, computed to determine the best subvariable among the composite variables, metabolic control variables, and components of self-efficacy and self-care, indicated that the effect of the self-efficacy composite on the self-care composite was moderate to large. The weighted mean effect size of the self-efficacy composite on the self-care composite was +.76, with a confidence interval from +.66 to +.86 and 1,545 subjects. Overall, the weighted mean effect sizes in this meta-analysis ranged from +.70 to +1.81, which indicates a large effect. However, since the reliabilities of the instruments in the primary studies were low or not stated, caution must be applied in accepting the results from these effect sizes unconditionally. Meta-analysis is a useful tool for clarifying the status of knowledge development and guiding decision making about future research, and this study confirmed that there is a relationship between self-efficacy and self-care in patients with diabetes. It thus provides support for nurses to promote self-efficacy in their patients. While most of the studies included in this meta-analysis used social cognitive theory as a framework, some used Fishbein & Ajzen's attitude model as a model for active self-care. Future research is needed to define the concept of self-care more fully and to determine what makes patients feel competent in their self-care activities. The results of this study showed that self-efficacy can promote self-care. Future research with experimental designs is needed to determine nursing interventions that will increase self-efficacy.
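
A weighted mean effect size with a confidence interval, as reported in this abstract, is commonly obtained by weighting each study's effect size by the inverse of its variance. The sketch below shows that fixed-effect calculation only as an illustration; it is an assumption that it resembles what DSTAT does, the function name is hypothetical, and the numbers in the usage note are made up rather than taken from the seventeen studies.

```python
import math


def weighted_mean_effect(effects, variances, z=1.96):
    """Inverse-variance weighted mean effect size with a normal-theory CI.

    effects:   per-study effect sizes (e.g. standardized mean differences)
    variances: per-study sampling variances of those effect sizes
    """
    weights = [1.0 / v for v in variances]
    total = sum(weights)
    mean = sum(w * d for w, d in zip(weights, effects)) / total
    se = math.sqrt(1.0 / total)  # standard error of the pooled estimate
    return mean, mean - z * se, mean + z * se
```

For two hypothetical studies with effects 0.5 and 1.0 and equal variances of 0.1, `weighted_mean_effect([0.5, 1.0], [0.1, 0.1])` gives a pooled mean of 0.75; studies with smaller variances (typically larger samples) pull the pooled value toward their own estimates.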

A study on the Job Analysis and Curriculum Development of Technical Information Searcher with DACUM (기술정보검색사의 직무분석 및 교육과정 개발에 관한 연구)

  • Noh, Dong-Jo
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.15 no.1 / pp.177-191 / 2004
  • With the shift from an industrial society to a knowledge-based society, prompt acquisition, organization, and analysis of technical information at a variety of industrial organizations are becoming more important than before. Education for professionals in the acquisition and management of technical information should be carried out systematically and connected with in-service training. The purpose of this study is to develop a curriculum for information professionals from an analysis of the tasks of technical information searchers using the DACUM method. The results of this study are as follows. First, the tasks of professional technical information searchers are divided into 6 categories, which are further divided into 40 sub-categories. Second, selection of information sources is the most important task in education. Last, major educational areas should include planning and development of databases, practice of OA applied programs, practice of PC communications, analysis of trends information, classification and practice of Internet, practice of interview, information architecture, information retrieval, understanding and practice of information sources, patents management, and planning and development of home pages.

Association Between the Pre-mir-218 Polymorphism and Cancer Risk in the Chinese Population: a Meta-Analysis

  • Gao, Yue;Liu, Yan;Liu, Ge-Li;Ran, Long-Ke;Zeng, Fan;Wu, Jia-Yan;Song, Fang-Zhou
    • Asian Pacific Journal of Cancer Prevention / v.15 no.6 / pp.2517-2522 / 2014
  • Background: Several recent studies have explored associations between the pre-mir-218 polymorphism (rs11134527) and cancer risk. However, published data are still inconclusive. To obtain a more precise estimate of the relationship in the Chinese population, we carried out a meta-analysis for the first time. Materials and Methods: Through retrieval from the PubMed, Medline, Embase, Web of Science databases, China National Knowledge Infrastructure and the Chinese BioMedical Literature Database, a total of four studies were analyzed with 3,561 cases and 3,628 controls for SNP pre-mir-218 rs11134527. We calculated odds ratios (ORs) and 95% confidence intervals (95%CIs) to explore the strength of associations. Results: The results showed that the rs11134527 polymorphism was associated with decreased cancer risk in the GG versus AA and GG versus AA+AG models tested (GG vs AA: OR=0.82, 95%CI: 0.71-0.94; GG vs AA+AG: OR=0.84, 95%CI: 0.74-0.96), and significantly decreased cervical cancer risk was observed in the GG versus AA and GG versus AA+AG models (GG vs AA: OR=0.79, 95%CI: 0.66-0.94; GG vs AA+AG: OR=0.80, 95%CI: 0.68-0.94). However, no significant association between the rs11134527 polymorphism and hepatocellular carcinoma risk was observed in any of the comparison models tested (AG vs AA: OR=0.94, 95%CI: 0.79-1.11; GG vs AA: OR=0.88, 95%CI: 0.70-1.10; GG+AG vs AA: OR=0.92, 95%CI: 0.79-1.08; GG vs AA+AG: OR=0.91, 95%CI: 0.75-1.11). Conclusion: The findings suggest that the pre-miR-218 rs11134527 polymorphism may have some relation to cancer development in the Chinese population. However, well-designed studies with larger sample sizes and more detailed data are needed to confirm these conclusions.
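
For readers unfamiliar with the statistics quoted in this abstract, the sketch below shows how an odds ratio and its 95% confidence interval are typically computed for a single 2x2 table using the log-odds (Woolf) method. It does not reproduce the pooled estimates of the meta-analysis, which combine several studies; the function name and the counts in the usage note are hypothetical.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval for a 2x2 table.

    a = cases with the genotype,    b = controls with the genotype,
    c = cases without the genotype, d = controls without the genotype.
    All four counts must be positive.
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of log(OR)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper
```

With made-up counts, `odds_ratio_ci(10, 20, 20, 10)` yields an odds ratio of 0.25 with an interval that lies entirely below 1. An interval that excludes 1, such as the 0.71-0.94 reported above, indicates a significant association at the 5% level, whereas one that spans 1, such as 0.79-1.11, does not.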

Analyzing the status of theoretical framework by subfields in library and information science research articles (문헌정보학 연구논문의 이론체계 현황분석 연구)

  • Kim, Sung-Jin;Jeong, Dong-Youl
    • Journal of the Korean Society for Information Management / v.23 no.2 / pp.21-37 / 2006
  • Based upon the assumption that theory building and theory use are intertwined in constructing a cohesive body of knowledge in the field, this study attempts to identify the state of the theoretical framework by examining the number and quality of theoretical articles by subfield. A theoretical article is characterized as an incident in which the author contributes to the development or the use of theory in his/her own paper. Theoretical incidents were identified by a content analysis of 1,661 articles in four LIS journals from 1984 to 2003. The findings suggest that four subfields, namely information seeking/use, information retrieval, library management, and scholarly communication, made great contributions to both theory building and theory use. Also, two research areas, bibliometrics and professionals, are very likely to be theoretical. Further, an analysis of the names of the theories used by subfield could give insight into how the theoretical frameworks of the subfields are related.

WordNet-Based Category Utility Approach for Author Name Disambiguation (저자명 모호성 해결을 위한 개념망 기반 카테고리 유틸리티)

  • Kim, Je-Min;Park, Young-Tack
    • The KIPS Transactions: Part B / v.16B no.3 / pp.225-232 / 2009
  • Author name disambiguation is essential for improving the performance of document indexing, retrieval, and web search. Author name disambiguation resolves the conflict that arises when multiple authors share the same name label. This paper introduces a novel approach which exploits ontologies and WordNet-based category utility for author name disambiguation. Our method utilizes author knowledge in the form of a populated ontology that uses various types of properties: titles, abstracts and co-authors of papers, and authors' affiliations. The author ontology has been constructed semi-automatically in the artificial intelligence and semantic web areas using OWL API and heuristics. Author name disambiguation determines the correct author from various candidate authors in the populated author ontology. Candidate authors are evaluated using the proposed WordNet-based category utility to resolve the ambiguity. Category utility is a tradeoff between intra-class similarity and inter-class dissimilarity of author instances, where author instances are described in terms of attribute-value pairs. WordNet-based category utility has been proposed to exploit concept information in WordNet for semantic analysis for disambiguation. Experiments using the WordNet-based category utility increase the number of disambiguations by about 10% compared with plain category utility, and raise the overall accuracy to around 98%.
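
The plain category utility that this abstract takes as its baseline can be sketched in its standard form for nominal attribute-value pairs: for each cluster, the gain in expected correct attribute guesses over the whole data set, weighted by cluster size and averaged over clusters. The WordNet-based extension proposed in the paper is not reproduced here, and the function name, the `affil` attribute and the example instances are hypothetical.

```python
from collections import Counter


def category_utility(clusters):
    """Standard category utility for a partition of instances.

    clusters: list of clusters; each cluster is a list of instances,
              and each instance is a dict mapping attribute -> value.
    """
    instances = [inst for cluster in clusters for inst in cluster]
    n = len(instances)

    def squared_prob_sum(items):
        # Sum over attribute-value pairs of P(attribute = value)^2 within items.
        counts = Counter((a, v) for inst in items for a, v in inst.items())
        return sum((c / len(items)) ** 2 for c in counts.values())

    baseline = squared_prob_sum(instances)
    gain = sum(
        len(cluster) / n * (squared_prob_sum(cluster) - baseline)
        for cluster in clusters
        if cluster
    )
    return gain / len(clusters)


# Hypothetical author instances described by a single attribute.
separated = [[{"affil": "A"}, {"affil": "A"}], [{"affil": "B"}, {"affil": "B"}]]
mixed = [[{"affil": "A"}, {"affil": "B"}], [{"affil": "A"}, {"affil": "B"}]]
```

A partition that groups instances with matching attribute values (`separated`) scores higher than one that mixes them (`mixed`), which is the intra-class similarity versus inter-class dissimilarity tradeoff the abstract describes.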