• Title/Summary/Keyword: subject indexing

Search Result 68, Processing Time 0.023 seconds

Semantic and syntactic relationships of indexing languages (색인언어의 어의적 관계 및 구문적 관계)

  • 윤구호
    • Journal of Korean Library and Information Science Society
    • /
    • v.22
    • /
    • pp.1-26
    • /
    • 1995
  • Indexes, especially subject indexes, are major tools for information retrieval. To enhance the retrieval effectiveness of subject indexes, the semantic and syntactic relationships of indexing languages are very important elements. This paper examines the afore-mentioned relationships, based on purely the syntax and semantics of Korean language. The outlines of this study are as follows: 1. The characteristics and usages of controlled vocabularies, particularly subject headings lists and thesaury, are reviewed. 2. The semantic relationships, such as equivalence, hierarchical and associative relationships, are defined, and their categories are investigated in detail. Accordingly, the usages of 'See' and 'See also' references are suggested circumstantially. 3. The syntactic relationships are also examined. Particularly, for the syntactic relationships of multiword indexing terms, two kinds of subject entry formats are compared. Since it is more rational for subject headings organized by the principle of context-dependency, the two-fine entry format is recommended for subject indexes. 4. Computerized production techniques of 'See' and 'See also' reference for the semantic relationships of indexing terms are presented. 5. Computerized production techniques of subject indexes representing the syntactic relationships of indexing terms are also presented.

  • PDF

Automatic indexing as a subject analysis technique (주제분석기법으로서의 자동색인)

  • 이영자
    • Journal of Korean Library and Information Science Society
    • /
    • v.12
    • /
    • pp.61-96
    • /
    • 1985
  • The human subject analysis of a document has some critical problems. The method results in the inconsistency in analysis process and the contradiction of two objects of the subject analysis (one is the identification of the content for the retrieval of specific items and the other is to identify the content for the grouping of related materials). Since the subject analysis by mechanized has been recognized to be the possible way to aggregate the problems of manual analysis, various a n.0, pproaches of automatic indexing have been studied and experimented. This study is to examine the automatic indexing as one of the promising subject analysis techniques by statistical, syntactical and semantic a n.0, pproaches. In conclusion, the reasonable a n.0, pplication time of the automatic indexing should be made a decision based on the through investigation on the cost verse effectiveness, and automatic indexing system should be developed in the close relationship with the on-line search which is a good retrieval system for information explosion society. From now on, since the machine-readable document-text will be envisaged to be more and more available due to the rapid development of computer technology, the more substantial research on the automatic indexing will be also possible, which can bring about the increasing of practical automatic indexing systems.

  • PDF

Theory and practice of alphabetical subject indexing (주제색인의 이론과 실제)

  • 윤구호
    • Journal of Korean Library and Information Science Society
    • /
    • v.10
    • /
    • pp.95-131
    • /
    • 1983
  • Index is a systematic guide to items contained in, or concepts derived from, a collection, Thus, it is represented as a paired set of index terms (t) and documents (D) : I= {(t,D) vertical bar t .mem. V, D .mem. W), where V is index vocabulary and W is document collection. Indexing is the process of analysing the informational content of records of knowledge and expressing the informational content in the language of the indexing system. It involves: 1) Selecting indexable concepts in a document; and 2) expressing these concepts in the language of the indexing system (as index entries): and an ordered list. Indexing process involves technical, semantic and syntactic problems. Technical problems are related to the accuracy of indexing, which is primarily governed by the indexer's ability of analysing subject, identifying indexable concepts, and coding. The proper levels of indexing exhaustivity, and index language specificity are also significant factors affecting the quality of index. Semantic problems are related to the choice of index terms and the form in which they should be used. Equivalent, hierarchical and affinitive/associative relationships of index terms are involved. Syntactic problems are largely related to the coordination of index terms. This process of coordination arises from the need to be able to search for the intersection of two or more classes defined by terms denoting distinct concepts. Finally, most valuable aspects of alphabetical subject indexing theories and practices are derived from those of Cutter, Kaiser, Ranganathan, Coates, Lynch and Austin, and discussed in details.

  • PDF

Function of the Korean String Indexing System for the Subject Catalog (주제목록을 위한 한국용어열색인 시스템의 기능)

  • Yoon Kooho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.15
    • /
    • pp.225-266
    • /
    • 1988
  • Various theories and techniques for the subject catalog have been developed since Charles Ammi Cutter first tried to formulate rules for the construction of subject headings in 1876. However, they do not seem to be appropriate to Korean language because the syntax and semantics of Korean language are different from those of English and other European languages. This study therefore attempts to develop a new Korean subject indexing system, namely Korean String Indexing System(KOSIS), in order to increase the use of subject catalogs. For this purpose, advantages and disadvantages between the classed subject catalog nd the alphabetical subject catalog, which are typical subject ca-alogs in libraries, are investigated, and most of remarkable subject indexing systems, in particular the PRECIS developed by the British National Bibliography, are reviewed and analysed. KOSIS is a string indexing based on purely the syntax and semantics of Korean language, even though considerable principles of PRECIS are applied to it. The outlines of KOSIS are as follows: 1) KOSIS is based on the fundamentals of natural language and an ingenious conjunction of human indexing skills and computer capabilities. 2) KOSIS is. 3 string indexing based on the 'principle of context-dependency.' A string of terms organized accoding to his principle shows remarkable affinity with certain patterns of words in ordinary discourse. From that point onward, natural language rather than classificatory terms become the basic model for indexing schemes. 3) KOSIS uses 24 role operators. One or more operators should be allocated to the index string, which is organized manually by the indexer's intellectual work, in order to establish the most explicit syntactic relationship of index terms. 4) Traditionally, a single -line entry format is used in which a subject heading or index entry is presented as a single sequence of words, consisting of the entry terms, plus, in some cases, an extra qualifying term or phrase. But KOSIS employs a two-line entry format which contains three basic positions for the production of index entries. The 'lead' serves as the user's access point, the 'display' contains those terms which are themselves context dependent on the lead, 'qualifier' sets the lead term into its wider context. 5) Each of the KOSIS entries is co-extensive with the initial subject statement prepared by the indexer, since it displays all the subject specificities. Compound terms are always presented in their natural language order. Inverted headings are not produced in KOSIS. Consequently, the precision ratio of information retrieval can be increased. 6) KOSIS uses 5 relational codes for the system of references among semantically related terms. Semantically related terms are handled by a different set of routines, leading to the production of 'See' and 'See also' references. 7) KOSIS was riginally developed for a classified catalog system which requires a subject index, that is an index -which 'trans-lates' subject index, that is, an index which 'translates' subjects expressed in natural language into the appropriate classification numbers. However, KOSIS can also be us d for a dictionary catalog system. Accordingly, KOSIS strings can be manipulated to produce either appropriate subject indexes for a classified catalog system, or acceptable subject headings for a dictionary catalog system. 8) KOSIS is able to maintain a constistency of index entries and cross references by means of a routine identification of the established index strings and reference system. For this purpose, an individual Subject Indicator Number and Reference Indicator Number is allocated to each new index strings and new index terms, respectively. can produce all the index entries, cross references, and authority cards by means of either manual or mechanical methods. Thus, detailed algorithms for the machine-production of various outputs are provided for the institutions which can use computer facilities.

  • PDF

Bibliographical Description and Classification Indexing For Revolutionary Historical Archives in China(2) (중국의 혁명역사기록물의 목록기술과 검색분류(2))

  • Lee, Seung-hwi
    • The Korean Journal of Archival Studies
    • /
    • no.5
    • /
    • pp.209-242
    • /
    • 2002
  • Bibliographical Description for Revolutionary Historical Archives is created to describe records at the item level. It defines descriptive elements, punctuations, formats and methods. Descriptive elements are composed of 20 elements, each of which is either mandatory or optional. Mandatory elements are: repositories codes, documents codes, dates, creators, title, classification codes, and subject vocabularies. Abstracts were previously included in card cataloging and are removed in the computerized system. New elements, such as "uncontrolled vocabularies," "name of places," "personal names," "organizational structures" and "meetings," are added to allow keyword search. Considering that subject vocabulary searches are the most important in computerized systems, however, Guidelines for the Subject Indexing for Revolutionary Historical Archives as well as Subject Headings, as a result from the Guidelines, are created. The most extraordinary features in Chinese archival description are said to be the Guidelines for the Classification Indexing for Revolutionary Historical Archives and Materials as well as the Classification Scheme, both of which are created to allow subject search of records content. It is because Chinese practice of records management distinguishes the classification for arrangement from that for retrieval. Chinese archival description is, therefore, composed of bibliographic description rules, subject headings, and the classification scheme for retrieval.

An experiment in automatic indexing with korean texts : a comparison of syntactico-statistical and manual methods (구문 . 통계적 기법을 이용한 한국어 자동색인에 관한 연구)

  • 서은경
    • Journal of the Korean Society for information Management
    • /
    • v.10 no.1
    • /
    • pp.97-124
    • /
    • 1993
  • This study was undertaken in order to develop practical automatic indexing techniques suitable for Korean natural language texts. It has taken a modest step toward this goal by developing an automatic syntactico-statistical indexing method and evaluating the method by comparing the resutls with manual indexing. For this experimental study, the Korean text database was constructed manually based on 300 abstracts covering business subject. The experimental results showed that the performance of the automatic syntactico-statistical indexing system was comparable to that of other studies which have compared automatic indexing with manual indexing.

  • PDF

주제 색인법의 분석적 고찰(1)

  • 윤구호
    • Journal of the Korean Society for information Management
    • /
    • v.5 no.1
    • /
    • pp.3-30
    • /
    • 1988
  • Various theories and techniques for the subject index have been developed since Charles Ammi Cutter first tried to formulate rules for the construction of subject headings in 1876. However, they do not seem to be appropriate to Korean language since the syntax and semantics of Korean language are different from those of English and other European languages. This study therefore reviews and analysis most notable subject indexing systems as a preliminaries to developing a new Korean subject indexing system.

  • PDF

On The Full-Text Database Retrieval and Indexing Language

  • Chang, Hye-Rhan
    • Journal of the Korean Society for information Management
    • /
    • v.4 no.1
    • /
    • pp.24-46
    • /
    • 1987
  • The recent growth of full-text database operations has brought new opportunities for subject access. The fundamental problem of subject access in the online environment is the indexing language and technology. The purpose of this paper is to identify the characteristics and capabilities of full-text retrieval as compared to traditional bibliographic retrieval. Retrieval performance of indexing languages, full-text systems features achieved so far, and the new role of a controlled vocabulary, are examined. This paper also includes a review of the research on full-text retrieval performance.

  • PDF

A One-Size-Fits-All Indexing Method Does Not Exist: Automatic Selection Based on Meta-Learning

  • Jimeno-Yepes, Antonio;Mork, James G.;Demner-Fushman, Dina;Aronson, Alan R.
    • Journal of Computing Science and Engineering
    • /
    • v.6 no.2
    • /
    • pp.151-160
    • /
    • 2012
  • We present a methodology that automatically selects indexing algorithms for each heading in Medical Subject Headings (MeSH), National Library of Medicine's vocabulary for indexing MEDLINE. While manually comparing indexing methods is manageable with a limited number of MeSH headings, a large number of them make automation of this selection desirable. Results show that this process can be automated, based on previously indexed MEDLINE citations. We find that AdaBoostM1 is better suited to index a group of MeSH hedings named Check Tags, and helps improve the micro F-measure from 0.5385 to 0.7157, and the macro F-measure from 0.4123 to 0.5387 (both p < 0.01).

색인사 연구

  • 박준식
    • Journal of Korean Library and Information Science Society
    • /
    • v.2
    • /
    • pp.23-59
    • /
    • 1975
  • Indexes has not devcloped as an independent branch in library science from the beginning, but it has gradually evolved in a clo~eas sociation with catalog and under the direct influence of the development of publishing pro cesses and of the rapid social changes. Historically, index in the West can be traced back to eariler concordance. On the other hand, index in the Bast does not show a continuous development. It started with book catnlog, but other types of indexing were later 'adopted from the West. Indexing in the West and in the East can be summarized as follows: 1) In the West, Taylor considers Gesner's Pandectae was the first index but the Concordance of the Bible in 1247 was the first true index. Indexing method was first established later in 1545 in Gesner's Partitiones which appeared in three volumes. Classified index appeared after Partitions, but alphabetically ordered index was not developed until th eseventeenth century. The pxiodical index of La France S~auante in 1683 proved -its value, and Poole's An Alphabetical Index in the nineteenth century became the turning point in the development of indexing. After Poole's Index appeared periodical index and book catalog gradually began to be treated separately, and subject index and cross reference were incorporated into indexing. Also dictionary arrangement of the indexed items was adopted in the second half of the nincteenth, century after Charles A. Cutter developed his theory of rules for dictionary catalog and systematic studies of indexing were carried out by many scholars. In the twentieth century, index was mainly developed in the United States of America, especially by Wilson publishing Company. The general trend is to move away from the gcncral index to subject index. Also the ncwspapcr indcx such as The Times I~zdcx is 21 landmark in the history cf indcxing. 2) In China, thcs arc somc cvidcnccs that $Bizgluh(&), $ was the first indcx, but unforlunatcly the book itsclf has not been found as yet.

  • PDF