• Title/Summary/Keyword: Newspaper indexing

Search Result 9, Processing Time 0.021 seconds

A Study on Automatic Indexing System for Newspaper Articles (신문기사(新聞記事) 자동색인(自動索引)에 관한 고찰(考察))

  • Cho, Sun-Hee
    • Journal of Information Management
    • /
    • v.23 no.3
    • /
    • pp.19-44
    • /
    • 1992
  • As most of the domestic newspaper companies are adopting CTS system, the need for automatic indexing system, which can transfer the full-text into a computer, is sharply expanding. In this research, I tried to analyse problems and prospects of the automatic indexing system through various examples and studies conducted by other analysts previously.

  • PDF

Newspaper Thesaurus Construction in Theory and Practice (신문 시소러스 개발의 이론과 실제)

  • Chung Young-Mee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.25
    • /
    • pp.51-82
    • /
    • 1993
  • Effective indexing systems are required to enhance the performance of full-text retrieval systems. The result of the analysis of index terms selected by human indexers without a newspaper thesaurus indicates that controlled indexing language is necessary for effective and consistent indexing of newspaper articles. In this paper, basic principles are established for keyword selection from Korean newspapers and significant problems identified in the process of developing a newspaper thesaurus are discussed in depth.

  • PDF

A study of indexing system based on thesaurus for newspaper database (시소러스를 이용한 신문기사 데이타베이스 색인시스템에 관한 연구)

  • 한상길
    • Journal of the Korean Society for information Management
    • /
    • v.11 no.1
    • /
    • pp.125-144
    • /
    • 1994
  • The Matter of vmbulary control for newspaper database has been studied for a long time. These efforts hadn't made any good achievements until JOINS Thesaurus system developed. The purpx of this paper is to introduce JOINS Thesaurus whch the Jcong-ang Daily News has developed for the first time in Korea. In addtion to that, thls study is corn- the efficiency of Auto-Indexing system with postcontrolled indexlng system for newspaper database on thesaurus.

  • PDF

A Study of automatic indexing based on the linguistic analysis for newspaper articles (언어학적 분석기법에 의한 신문기사 자동색인시스팀 설계에 관한 연구)

  • Seo, Gyeong-Ju;SaGong, Cheol
    • Journal of the Korean Society for information Management
    • /
    • v.8 no.1
    • /
    • pp.78-99
    • /
    • 1991
  • So far, most of Korea's newspapers indexing have been done manually using tesaurus. In recent years, however, the need for automatic indexing system has grown stronger so as for indexers to save time, efforts and money. And some newspapers have started establishing their databases along with introducing electronic newspapers and CTS. This thesis is on establishing and automatic indexing system for the full-text of the Korea Economic Daily's articles, which have been accumulated in its database, KETEL. In my thesis, I suggest methods to create a keyword file, a stopword list, an auxiliary word list and an infected word list by applying linguistic analysis methods to Hangul, taking advantage of the language's morphological peculiarity. Through these studies, I was able to reach four conclusions as follows. First, we can obtain satisfactory keywords by automatic indexing methods that were made through morphological analysis. Second, an indexer can improve the efficiency of indexing work by controlling extracted vocabulary, as syntax analysis and semantic analysis is not complete in Hangul. Third, The keyword file in this system which is made of about 20,000 most-frequently-used newspaper terms can be used in the future in compiling a thesaurus. Finally, the suggested methods to prepare an auxiliary word list and an infected word list can be applicable to designing other automatic systems.

  • PDF

A Study on the Evaluation of Newspaper Thesaurus (신문 시소러스의 평가에 관한 연구 : 신문기사 종합시소러스를 중심으로)

  • 이인애
    • Journal of the Korean Society for information Management
    • /
    • v.12 no.1
    • /
    • pp.99-113
    • /
    • 1995
  • This study evaluates representability and comprehensivity of the Theasurus in theeconomics and industry fields of the "General Thesaurus of Newspaper Articles." The methodsused in the study were, first, indexing of the pages covering economics and industry articlesusing the Thesaurus and second, comparing the Thesaurus terms with the words collectedfrom the newspapers articles and glossaries. The study clarifies the following problems whichmight occur in the construction and use of newspaper thesaurus: specificity of the subjectconcepts, separation of component concept, preference relationship between descriptors andentry terms, the methods of recording of proper nouns and allocation of terms among thesubject areas concern.he subject areas concern.

  • PDF

A Study on the Establishment and Applications of the "News Core Thesaurus" ("뉴스 코어 시소러스"의 구축 및 활용 방안에 관한 연구)

  • Chang, Inho
    • Journal of Korean Library and Information Science Society
    • /
    • v.44 no.3
    • /
    • pp.489-512
    • /
    • 2013
  • This study suggests the establishment and applications of the News core thesaurus for efficient indexing and searching of news information. News core thesaurus was constructed as macrothesauri which can cover all of news subjects and then has microthesauri like politics, economy, society, culture, etc. as its subsets. In this research, News core thesaurus embodied 2,012 descriptors and 74 non-descriptors by SKOS(Simple Knowledge Organization System). It suggests measures that treat only special subjects in detail in weekly newspaper or biweekly newspaper with little information and special subjects, which is not daily newspaper, and use each microthesauri by merging or integrating in huge news archives or portal sites.

What did They Read in the Newspapers?: A New Method of Measuring Readership (독자 중심의 신문 제작과 독자의 실제 열독률)

  • Park, Jae-Yung;Jeon, Sam-Hyoung-Joon
    • Korean journal of communication and information
    • /
    • v.35
    • /
    • pp.211-249
    • /
    • 2006
  • This study investigated how many and which articles readers read in daily newspapers. Distinguished from previous research that measured readers' perception of their reading habits, this study picked up readers who read the newspaper in the morning, showed them every article in the newspaper, and asked them whether they read each article. This method enhanced the accuracy of measuring subjects‘ reading behavior. According to the results, 6.2% of the readers read at least half of the articles of the general section in the newspaper. Fewer readers went over the economic section of the newspaper. It was 4.1% of the readers. There were only 1.1% of readers who did not read any article of the general section, but the rate soared to 26.5% for the economic section. Many newspaper readers did not skip the first five pages of the newspaper, however the highest readership pages were found in the national coverage located in the middle of the general section. On the other hand, few readers read the articles on pages covering culture, international issues, and people. Readership of the top stories showed some unexpected results. The top stories of national coverage located in the middle of the general section were read by more readers than the top stories of the front pages. This study also investigated the difference between young and old readers. The readers of twenties and thirties did not read as many top stories on major pages and editorials as the readers of forties and older.

  • PDF

An Index System using Restrictive Distance (거리 제한을 이용한 색인 시스템)

  • Park, Chan-Ee;Kim, Sang-Bok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.11 no.1 s.39
    • /
    • pp.273-282
    • /
    • 2006
  • In this paper, we propose index method introducing distance concept in word by a method weighting word. This index method is frequent representing an inquiry word and document index and compound noun or more than two adjoin nouns or noun phrase, the farther the distance between these nouns, the fewer selected ratio decreases in index point is the aiming, this choose guide word candidate by existent weight grant method and distance between candidates chose candidate finally in index within 3 sentences. Using in these way I document of 100 kinds of newspaper, scientific treatise, web document and so on, showed the correctness rate resulted of newspaper 92.03% scientific treatise 95% web document 73.33%.

  • PDF

Future and Directions for Research in Full Text Databases (본문 데이타베이스 연구에 관한 고찰과 그 전망)

  • Ro Jung Soon
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.17
    • /
    • pp.49-83
    • /
    • 1989
  • A Full text retrieval system is a natural language document retrieval system in which the full text of all documents in a collection is stored on a computer so that every word in every sentence of every document can be located by the machine. This kind of IR System is recently becoming rapidly available online in the field of legal, newspaper, journal and reference book indexing. Increased research interest has been in this field. In this paper, research on full text databases and retrieval systems are reviewed, directions for research in this field are speculated, questions in the field that need answering are considered, and variables affecting online full text retrieval and various role that variables play in a research study are described. Two obvious research questions in full text retrieval have been how full text retrieval performs and how to improve the retrieval performance of full text databases. Research to improve the retrieval performance has been incorporated with ranking or weighting algorithms based on word occurrences, combined menu-driven and query-driven systems, and improvement of computer architectures and record structure for databases. Recent increase in the number of full text databases with various sizes, forms and subject matters, and recent development in computer architecture artificial intelligence, and videodisc technology promise new direction of its research and scholarly growth. Studies on the interrelationship between every elements of the full text retrieval situation and the relationship between each elements and retrieval performance may give a professional view in theory and practice of full text retrieval.

  • PDF