• Title/Summary/Keyword: 서지정보학

Search Result 391, Processing Time 0.02 seconds

Topic Model Augmentation and Extension Method using LDA and BERTopic (LDA와 BERTopic을 이용한 토픽모델링의 증강과 확장 기법 연구)

  • Kim, SeonWook;Yang, Kiduk
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.3
    • /
    • pp.99-132
    • /
    • 2022
  • The purpose of this study is to propose AET (Augmented and Extended Topics), a novel method of synthesizing both LDA and BERTopic results, and to analyze the recently published LIS articles as an experimental approach. To achieve the purpose of this study, 55,442 abstracts from 85 LIS journals within the WoS database, which spans from January 2001 to October 2021, were analyzed. AET first constructs a WORD2VEC-based cosine similarity matrix between LDA and BERTopic results, extracts AT (Augmented Topics) by repeating the matrix reordering and segmentation procedures as long as their semantic relations are still valid, and finally determines ET (Extended Topics) by removing any LDA related residual subtopics from the matrix and ordering the rest of them by F1 (BERTopic topic size rank, Inverse cosine similarity rank). AET, by comparing with the baseline LDA result, shows that AT has effectively concretized the original LDA topic model and ET has discovered new meaningful topics that LDA didn't. When it comes to the qualitative performance evaluation, AT performs better than LDA while ET shows similar performances except in a few cases.

Investigation of Topic Trends in Computer and Information Science by Text Mining Techniques: From the Perspective of Conferences in DBLP (텍스트 마이닝 기법을 이용한 컴퓨터공학 및 정보학 분야 연구동향 조사: DBLP의 학술회의 데이터를 중심으로)

  • Kim, Su Yeon;Song, Sung Jeon;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.32 no.1
    • /
    • pp.135-152
    • /
    • 2015
  • The goal of this paper is to explore the field of Computer and Information Science with the aid of text mining techniques by mining Computer and Information Science related conference data available in DBLP (Digital Bibliography & Library Project). Although studies based on bibliometric analysis are most prevalent in investigating dynamics of a research field, we attempt to understand dynamics of the field by utilizing Latent Dirichlet Allocation (LDA)-based multinomial topic modeling. For this study, we collect 236,170 documents from 353 conferences related to Computer and Information Science in DBLP. We aim to include conferences in the field of Computer and Information Science as broad as possible. We analyze topic modeling results along with datasets collected over the period of 2000 to 2011 including top authors per topic and top conferences per topic. We identify the following four different patterns in topic trends in the field of computer and information science during this period: growing (network related topics), shrinking (AI and data mining related topics), continuing (web, text mining information retrieval and database related topics), and fluctuating pattern (HCI, information system and multimedia system related topics).

A Study on the Method of Scholarly Paper Recommendation Using Multidimensional Metadata Space (다차원 메타데이터 공간을 활용한 학술 문헌 추천기법 연구)

  • Miah Kam;Jee Yeon Lee
    • Journal of the Korean Society for information Management
    • /
    • v.40 no.1
    • /
    • pp.121-148
    • /
    • 2023
  • The purpose of this study is to propose a scholarly paper recommendation system based on metadata attribute similarity with excellent performance. This study suggests a scholarly paper recommendation method that combines techniques from two sub-fields of Library and Information Science, namely metadata use in Information Organization and co-citation analysis, author bibliographic coupling, co-occurrence frequency, and cosine similarity in Bibliometrics. To conduct experiments, a total of 9,643 paper metadata related to "inequality" and "divide" were collected and refined to derive relative coordinate values between author, keyword, and title attributes using cosine similarity. The study then conducted experiments to select weight conditions and dimension numbers that resulted in a good performance. The results were presented and evaluated by users, and based on this, the study conducted discussions centered on the research questions through reference node and recommendation combination characteristic analysis, conjoint analysis, and results from comparative analysis. Overall, the study showed that the performance was excellent when author-related attributes were used alone or in combination with title-related attributes. If the technique proposed in this study is utilized and a wide range of samples are secured, it could help improve the performance of recommendation techniques not only in the field of literature recommendation in information services but also in various other fields in society.

Bibliographical Research on Yeogkwa Bo (역과보(譯科譜)에 대한 서지적 연구)

  • Han Mi-Kyung
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.40 no.2
    • /
    • pp.125-150
    • /
    • 2006
  • Yeogkwa Bo is a biographical source that was re-edited based on the primary sources such as Yeogkwa Bangmok which is the list of the successful applicants in Yeogkwa. 7 Kinds of the existing Yeogkwa Bo was studied and analyzed in bibliographical way. This study proves that the period of available record(of successful applicants' names) ranges from 1807 to 1891, although it has been mentioned before that the period of record covers as far as 1882. As a result of comparison of mentioned family names, family origins, total number of the successful applicants in Yeogkwa, and content of record, Yeog Bo of Dangrih University's shows the most extensive and substantial work, and Yeogkwa Bo of Jangseo Kag's is quite superior to other archives present at home. But both of them show problems such as errors or omission of some records, confusion in spelling and so on. Therefore, the above study implies that there should be process of checking through study of Yeogkwa Bangmok when making reference to Yeogkwa Bo which provides biographical information on family trees and origins as well as information on the individual successful applicants in Yeogkwa.

A Study on the Application of LibraryThing Folksonomy Tags through the Analysis of Elements related with Work (저작관련 요소분석을 통한 폭소노미 태그의 활용 방안에 관한 연구: LibraryThing을 중심으로)

  • Kim, Dong-Suk;Chung, Yeon-Kyoung
    • Journal of the Korean Society for information Management
    • /
    • v.27 no.1
    • /
    • pp.41-60
    • /
    • 2010
  • This study aims to analyze the properties of the tags used in the fiction genre, the structural aspect of the patterns and the contents of the tags by utilizing LibraryThing, where the tags are assigned in work units of FRBR. A comparative analysis was conducted in terms of the level of association between the descriptive terms in bibliography and LCSH terms. The study also examined the sources of the tags not included in the bibliographic descriptions or LCSHs, what aspects of work they represented, and the terms used as tags in relation to the work. By restricting the study to a single genre, a number of tags that reflected the characteristics of fiction (three elements of the fiction which are theme, plot, style and three elements of the fiction composition which are character, event, setting) were extracted. This study finds out the role of the tag making up the taxonomy and proposes a new direction for the tagging system by demonstrating the possibility of using tags as facets in information organization and retrieval.

Progress and Special Features in User Instruction of Korean Academic Libraries (국내 대학도서관 이용자교육의 추이와 특징)

  • Kim, Ryoung-Eun;Lee, Jae-Whoan
    • Journal of Korean Library and Information Science Society
    • /
    • v.48 no.4
    • /
    • pp.153-179
    • /
    • 2017
  • The purpose of this article is to discuss about both historical progress and current situation of user instruction in Korean academic libraries. Emphasis was on identifying and analyzing the special features and limitations of user instruction from the viewpoints of library users as well as those of librarians. To the end, data were collected from three methods: the first tool was such major statistical sources as 'KERIS university library statistics system' and 'Research Reports' by the Korean Private University Library Association. The second was from the results of surveys and interviews with librarians from 20 university libraries. And the third was from the results of surveys with library users in two sample university libraries.

A Study on Romanization Rules and Practices of the International institutions for Korean language materials (한글로마자표기에 대한 국제기관의 규정과 표기의 실제에 관한 연구)

  • Oh, Kyung-Mook
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.4
    • /
    • pp.33-51
    • /
    • 2007
  • The fundamental issue of information retrieval in the Internet-based society is closely interrelated with the characteristics of language selected. The McCune-Reischauer Romanization system is not only considered as the international standard for romanizing Korean language, it is also familiar to the majority of the Korean material users internationally. McCune-Reischauer system is adopted by the ISO, UNGEGN, ALA, LC, British PCGN, BL, and the relevant agencies in Europe, Canada and Australia etc. Encouraging for switching to the new Romanization system(2000) would result in complications among the library's catalogs and online databases, causing confusion for both staffs and readers. This paper analysed that the international efforts and rules for Romanizing Korean language materials and recommended direction for bibliographical issues.

Question Analysis of the Collaborative Digital Reference Service at the National Library of Korea (협동 디지털참고서비스의 질문 분석: 국립중앙 도서관의 '사서에게 물어보세요'를 중심으로)

  • Chang, Hye Rhan;Yi, Kyung Suk
    • Journal of the Korean Society for information Management
    • /
    • v.31 no.4
    • /
    • pp.7-28
    • /
    • 2014
  • This study analyses the questions addressed to the collaborative digital reference service run by the National Library of Korea. The data consist of 661 question entries to the 'Ask a Librarian' service during first 6 months in 2014. Each entry includes average 1.17 questions, and 77.82% of the total questions are real reference in nature. Questions are analyzed by classification division, context of the questioner, desired end product, activities of librarians, and the resources used to respond them. Each category is subdivided and analyzed in detail. Results revealed interesting findings and problems, and suggestions for further endeavor are provided.

Users' Perception on Theses and Dissertation Services (학위논문 이용현황과 활성화 방안에 관한 연구)

  • Shin, Yu-Ri;Chung, Eun-Kyung
    • Journal of Information Management
    • /
    • v.40 no.1
    • /
    • pp.29-46
    • /
    • 2009
  • Theses and Dissertation(TD) have been considered one of valuable scholarly resources, while there have existed some limitations to collect, organize, and provide them. The purpose of this study is to investigate users' perception on six TD services from five institutions and to propose improvement strategies. Six TD services includes National Library, National Assembly Library, RISS, dCollections, NDSL, and Council of Theses and Dissertation Common Use. Based on the survey results from 151 users, the findings of this study identified that National Assembly Library and RISS were preferred by users. In addition, users preferred keyword, full text, department, abstract, table of contents, as well as title and author over other bibliographic information. More importantly, users needs were placed on whether specific TD services provide full text or not. In case that is not possible to provide full text, users have a preference for full text link information. As a result, in order to improve the TD services, service promotion activities, diverse access points, and full text provision are desirable.

A Study on the Service Method of Modern Literature Based on Linked Data (링크드 데이터 기반 근대문학자료의 서비스 방안 연구)

  • Park, Jin-Ho;Kwak, Seung-Jin
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.55 no.2
    • /
    • pp.5-24
    • /
    • 2021
  • This study suggested a plan to convert the modern literary data service of the National Library of Korea into linked data-based services. This is not to simply convert the modern literary data service into linked data, which is the current technological trend. This is to create high-quality source data capable of automated machine processing with continuous connection with various external data and information sources in the long term. To this end, in order to revitalize the service of modern literature and to solve the efficient data linkage with related institutions, various overseas library and bibliographic service cases that adopted linked data were first reviewed to draw implications. In addition, based on the reviewed implications, the plan to reorganize the modern literary service in terms of data management, system management, and user service was described in detail.