• Title/Summary/Keyword: 도서 범주화

Search Result 12, Processing Time 0.026 seconds

A Study on Book Categorization in Social Sciences Using kNN Classifiers and Table of Contents Text (목차 정보와 kNN 분류기를 이용한 사회과학 분야 도서 자동 분류에 관한 연구)

  • Lee, Yong-Gu
    • Journal of the Korean Society for information Management
    • /
    • v.37 no.1
    • /
    • pp.1-21
    • /
    • 2020
  • This study applied automatic classification using table of contents (TOC) text for 6,253 social science books from a newly arrived list collected by a university library. The k-nearest neighbors (kNN) algorithm was used as a classifier, and the ten divisions on the second level of the DDC's main class 300 given to books by the library were used as classes (labels). The features used in this study were keywords extracted from titles and TOCs of the books. The TOCs were obtained through the OpenAPI from an Internet bookstore. As a result, it was found that the TOC features were good for improving both classification recall and precision. The TOC was shown to reduce the overfitting problem of imbalanced data with its rich features. Law and education have high topic specificity in the field of social sciences, so the only title features can bring good classification performance in these fields.

A preliminary Study on Text Categorization of Book using Table of Contents and Book Description (목차, 책 소개를 이용한 단행본 문서 범주화에 관한 기초연구)

  • Do, Hyun-Ho;Lee, Yong-Gu
    • Proceedings of the Korean Society for Information Management Conference
    • /
    • 2014.08a
    • /
    • pp.127-130
    • /
    • 2014
  • 이 연구에서는 도서관의 주요 장서에 해당하는 단행본 도서에 대한 자동 분류를 적용가능한지 알아보고자 하였다. 분류자질로 메타데이터인 서명, 목차, 책 소개를 사용하였으며, 다양한 자질 가중치를 적용하여 581건의 단행본 도서를 통해 kNN 분류기의 분류성능을 파악하였다. 실험 결과 이들 메타데이터를 모두 사용하였을 때 가장 좋은 분류성능을 가져왔으며, 실험문헌집단의 규모가 작은 한계가 있지만 로그 TF를 취한 가중치 방법이 좋은 성능을 가져왔다.

  • PDF

A Study of Geographic Information Organization of Internet Resources (인터넷 지리정보 체계화에 대한 연구)

  • Kwak Chul-Wan
    • Journal of Korean Library and Information Science Society
    • /
    • v.37 no.2
    • /
    • pp.255-272
    • /
    • 2006
  • This study was investigated to the geography services and geographical directories through Internet in order to develop a geographical directory. Research method was the examination of different type of geography services through internet and geographical directories on the Internet search engines. The results show principles for a geographical directory construction: use both geographical names and subject names on the initial directory screen, repeated use the names through directory hierarchy, locality use on bottom directory as a geographical name, and fit use of the number of directories.

  • PDF

A Study of Designing the Intelligent Information Retrieval System by Automatic Classification Algorithm (자동분류 알고리즘을 이용한 지능형 정보검색시스템 구축에 관한 연구)

  • Seo, Whee
    • Journal of Korean Library and Information Science Society
    • /
    • v.39 no.4
    • /
    • pp.283-304
    • /
    • 2008
  • This is to develop Intelligent Retrieval System which can automatically present early query's category terms(association terms connected with knowledge structure of relevant terminology) through learning function and it changes searching form automatically and runs it with association terms. For the reason, this theoretical study of Intelligent Automatic Indexing System abstracts expert's index term through learning and clustering algorism about automatic classification, text mining(categorization), and document category representation. It also demonstrates a good capacity in the aspects of expense, time, recall ratio, and precision ratio.

  • PDF

The Study on the Effective Automatic Classification of Internet Document Using the Machine Learning (기계학습을 기반으로 한 인터넷 학술문서의 효과적 자동분류에 관한 연구)

  • 노영희
    • Journal of Korean Library and Information Science Society
    • /
    • v.32 no.3
    • /
    • pp.307-330
    • /
    • 2001
  • This study experimented the performance of categorization methods using the kNN classifier. Most sample based automatic text categorization techniques like the kNN classifier reduces the feature set of the training documents. We sought to find out which percentage reductions in the feature set would result in high performances. In addition, the kNN classifier has to find the k number of training documents most similar to the test documents in the training documents. We sought to verify the most appropriate k value through experiments.

  • PDF

A Study on the Categorization of Reading Strategies for Reading Instruction in School Library (학교도서관 중심의 독서교육을 위한 독서전략 범주화에 관한 연구)

  • Lee, Byeong-Ki
    • Journal of Korean Library and Information Science Society
    • /
    • v.39 no.3
    • /
    • pp.139-159
    • /
    • 2008
  • Much of the current literature on reading instruction supports the idea of teaching students a series of reading strategies instead of isolated reading skills. Reading strategies are plans or methods that can be used or taught to facilitate reading proficiency. In the meantime, the reading instruction program of school library is the reading promotion event has been limited. Therefore, the reading instruction program of school library need to focus reading strategies oriented instruction rather than reading skill. This Study categorizes Reading Strategies that divided into text type, text structure, reading process, cognitive strategies.

  • PDF

A Study on the School Librarian's Awareness of Task Process Using Social Network Services (소셜네트워크서비스의 업무적 활용에 대한 학교도서관 사서의 인식 조사)

  • Byeon, Hoi-Kyun;Cho, Hyun-Yang
    • Journal of Korean Library and Information Science Society
    • /
    • v.45 no.1
    • /
    • pp.27-49
    • /
    • 2014
  • This study aims to examine the awareness of the school librarians for task process using SNS(social network services). So, we have surveyed some of them and interviewed two focus groups by the sequential mixed method. The result showed the using SNS of them because of only one persons' operating system and role's characteristics. They are mostly satisfied with the using SNS, so we analyzed the causes. First, we extracted the 103's results except 4's from some librarians and examined descriptive statistics, frequency analysis and ANOVA. Second, we interviewed 2 focus groups and transcribed their opinions. We got 37's concepts, 22's sub categories, 13's categories from results of the open coding. Third, we synthesized the results of survey and open coding's and suggested the directions of future research of the field.

A Study on the Metadata based on the Semantic Structure of the Korean Studies Research Articles (한국학 연구 논문의 의미 구조 기반 메타데이터 연구)

  • Song, Min-Sun;Ko, Young Man
    • Journal of Korean Library and Information Science Society
    • /
    • v.46 no.3
    • /
    • pp.277-299
    • /
    • 2015
  • The purpose of this study is to build a metadata set based on the semantic structure of the Korean studies research articles. For this purpose, we analyzed the related researches which suggested the semantic structure of the research articles, categorized the concepts of author keywords of the Korean studies research articles, and drew the metadata set of 16 elements from the results of the analysis and the categorization. The significance of this study is that it propose a semantic metadata configuration methodology which can reflect the scholarly sense-making of researchers in Korean studies. Especially, this study is significant because it reflects the keywords which was given by the actual researchers to examine the content characteristics of the Korean studies research articles.

Functions and Characteristics of Public Library Theme Collection: Focusing on the User-centered Classification Perspective (공공도서관 테마 컬렉션의 기능과 특성 - 이용자 중심 분류의 관점에서 -)

  • Baek, Ji-Won
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.52 no.4
    • /
    • pp.51-69
    • /
    • 2018
  • The purpose of this study is to analyze the potential use of the theme collection as a new classification method that reflects the interest of users in terms of classification and categorization. For this purpose, the background of the theme collection was identified based on the discussion of the library resource organization and the introduction of the curation service of bookstore. In addition, based on case analysis, which is building the theme collection, concrete concepts and characteristics of theme collection are derived. Based on the above discussion, the classification and categorization characteristics of public library themes collections were analyzed, and the characteristics and functions as a classification were compared with other categories relatively. Finally, the utility and applicability of the theme collection is presented and it is based on the discussions about the user-centered classification system design of the library in the future.

Enhance Issues of the global competitiveness of Telemedicine Industry in Korea (우리나라 원격의료산업의 글로벌 경쟁력 강화를 위한 정책 과제)

  • Yoon, Young-Han
    • International Commerce and Information Review
    • /
    • v.13 no.3
    • /
    • pp.325-351
    • /
    • 2011
  • This paper is focused on problem in the law and system caused by the infringement of medical information and in the law and system indicate the solution. Interests in the medical service are increasing in internet environment as life quality of the people improves because of development in information and medical technology. The current main issues of the legislative system and the law improvement suggestion for telemedicine activation which is related to the ubiquitous health in which the medicine field and IT technology convergence appearance. In particular, South Korea in the privacy-related legislation should be amended. The reason, Medical information record contains a lot of patient's private secrets. Therefore, if privacy protection is not enough this could cause problem violate a patient's privacy. Thus we need consequently the maintenance of the health medical treatment field to suit a telemedicine environment of a law system. Specifically, this law enacted to protect medical treatment information and the technical security services with confidence and stability against security treats are necessary.

  • PDF