DOI QR코드

DOI QR Code

Designing a FRBR Work Grouping Algorithm of Bibliographic Records using a Role Term Dictionary of Authors

저자역할용어사전 구축 및 저작군집화에 관한 연구

  • 윤재혁 (성균관대학교 일반대학원 문헌정보학과) ;
  • 도슬기 (성균관대학교 일반대학원 문헌정보학과) ;
  • 오삼균 (성균관대학교 문과대학 문헌정보학과)
  • Received : 2020.05.26
  • Accepted : 2020.06.22
  • Published : 2020.06.30

Abstract

The purpose of this study is to analyze the issues resulted from the process of grouping KORMARC records using FRBR WORK concept and to suggest a new method. The previous studies did not sufficiently address the criteria or processes for identifying representative authors of records and their derivatives. Therefore, our study focused on devising a method of identifying the representative author when there are multiple contributors in a work. The study developed a method of identifying representative authors using an author role dictionary constructed by extracting role-terms from the statement of responsibility field (245). We also designed another way to group records as a work by calculating similarity measures of authors and titles. The accuracy rate of WORK grouping was the highest when blank spaces, parentheses, and controling processes were removed from titles and the measured similarity rates of authors and titles were higher than 80 percent. This was an experiment study where we developed an author-role dictionary that can be utilized in selecting a representative author and measured the similarity rate of authors and titles in order to achieve effective WORK grouping of KORMARC records. The future study will attempt to devise a way to improve the similarity measure of titles, incorporate FRBR Group 1 entities such as expression, manifestation and item data into the algorithm, and a method of improving the algorithm by utilizing other forms of MARC data that are widely used in Korea.

본 연구는 통합서지용 한국문헌자동화목록(KORMARC)으로 작성된 서지레코드를 FRBR의 저작(Work) 단위로 군집화하는 과정에서 나타난 이슈사항들을 분석하고, 이에 대한 해결방안을 고안하였다. 특히 기존의 연구에서는 대표저작자를 식별하고 처리하는 기준이 명확하게 드러나지 않거나 파생저작 레코드의 대표저작자를 선정하는 방법에 대한 논의가 충분히 이루어지지 않았다. 따라서 본 연구는 저작을 창작하는 데 기여한 사람이 다수일 때 대표저작자를 명확하게 식별하기 위한 방법을 고안하는 데 초점을 맞추었다. 이를 위해 책임표시사항(245) 필드의 책임표시 태그(▼d, ▼e)에서 추출한 역할용어를 토대로 표준화된 저자역할용어사전을 개발하여 대표저작자 판별에 활용하는 방안을 마련하였다. 또한 저자명의 유사도와 표제의 유사도를 각각 계산하여 유사도가 일정 수준 이상인 경우 동일한 저작으로 군집화 하는 방법을 채택하였다. 각각의 유사도를 계산하여 동일 저작을 판단하므로 공백, 관제처리, 괄호제거와 같은 데이터 정제 조건을 조정하여 6가지 패턴에 따른 군집화의 정확도를 비교하였고, 저자명과 표제의 유사도가 모두 80퍼센트 이상일 때의 정확도가 가장 높게 나타났다. 본 연구는 대표저작자 선정을 위한 역할용어사전 개발, 대표저작자와 표제의 유사도를 별도로 측정하여 저작군집화를 시도한 실험연구이며 후속 연구에서는 표제 간 유사도 측정의 정확도를 향상시키는 방안과 FRBR 1그룹의 다른 개체(표현형, 구현형, 개별자료) 수준으로 확대하여 활용하는 방안, 국내에서 사용하고 있는 다른 형태의 MARC 데이터에 적용하는 방안을 고안할 예정이다.

Keywords

References

  1. National Library of Korea (2014). Korean machine readable cataloging format - integrated format for bibliographic data. Retrieved from http://www.nl.go.kr/common/jsp/kormarc_2014/index.html
  2. Kim, S., & Lee, S. (2005). A study on bibliographic relationships of the FRBR model. Journal of Social Science, 16, 25-47.
  3. Kim, J. (2007). An analysis on the work types of Korean books based bibliographical relationship. Journal of Korean Library and Information Science Society, 38(3), 183-200. https://doi.org/10.16981/kliss.38.3.200709.183
  4. Kim, J. (2015). A study on the adoption of the FRBR according to the bibliographic relationships of Five Classics and Four Books. Journal of Korean Library and Information Science Society, 46(2), 317-336. https://doi.org/10.16981/kliss.46.2.201506.317
  5. Kim, J., Lee, S., & Lee, Y. (2015). A study on the development of FRBR algorithm for KORMARC bibliographic record. Journal of Korean Library and Information Science Society, 46(1), 1-23. https://doi.org/10.16981/kliss.46.1.201503.1
  6. Kim, H., Yoo, Y., & Park, S. (2007). An experimental study on the FRBR model adaptation to KORMARC database: Focusing on music materials. Journal of Korean Library and Information Science Society, 38(2), 185-202. https://doi.org/10.16981/kliss.38.2.200706.185
  7. Roh, J. (2008). An application of FRBR model to KORMARC records. Journal of Korean Library and Information Science Society, 39(2), 291-312. https://doi.org/10.16981/kliss.39.2.200806.291
  8. Park, J. (2008). Resource Description and Access (RDA). Korea Research Institute for Library and Information, 40, 1-23. Retrieved from https://wl.nl.go.kr/webzine/publish/krili/200907_02/pdf/policy01_0731.pdf
  9. Lee, M., & Chung, Y. (2008). A study of FRBR implementation to catalog by using work clustering. Journal of the Korean Society for Information Management, 25(3), 65-82. https://doi.org/10.3743/KOSIM.2008.25.3.065
  10. Lee, S., & Lee, H. (2013). A study on the adoption of the FRBR model according to the bibliographic relationships of Korean classical music. Journal of Social Science, 24(2), 399-421. https://doi.org/10.16881/jss.2013.04.24.2.399
  11. Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2009). Introduction to algorithms (3rd ed.). Cambridge, MA: The MIT Press.
  12. Hickey, T. B., & Toves, J. (2009, August). FRBR work-set algorithm: version 2.0. dublin, ohio: OCLC online computer library center, Inc. Retrieved from http://www.oclc.org/research/activities/past/orprojects/frbralgorithm/2009-08.pdf
  13. Lee, H., & Park, Z. (2012). FRBRizing bibliographic records focusing on identifiers and role indicators in the Korean cataloging environment. Cataloging & Classification Quarterly, 50(5-7), 688-704. http://dx.doi.org/10.1080/01639374.2012.681599
  14. Library of Congress (2004). FRBR display tool version 2.0. Retrieved from http://www.loc.gov/marc/marc-functional-analysis/tool.html
  15. Singhal, A. (2001). Modern information retrieval: A brief overview. IEEE Data Eng. Bull., 24(4), 35-43.
  16. Tillett, B. B. (1987). Bibliographic relationships: Toward a conceptual structure of bibliographicinformation used in catalog. Los Angeles: University of California. Quoted in Kim, S., &Lee, S. (2005). A study on bibliographic relationships of the FRBR model. Journal of SocialScience, 16, 25-47.
  17. Tillett, B. B. (2004). What is FRBR? a conceptual model for the bibliographic universe. Retrieved from https://www.loc.gov/cds/downloads/FRBR.PDF