A Study on the Statistical Characteristics for Table of Contents Text of the Books in Social Sciences Field

사회과학 분야 도서의 목차 텍스트에 대한 통계적 특성에 관한 연구

  • Lee, Yong-Gu (Department of Library and Information Science, Keimyung University)
  • Received : 2019.06.24
  • Accepted : 2019.06.28
  • Published : 2019.06.30

Abstract

Recently, the table of contents (TOC) has become increasingly accessible and widely used. This study carried out descriptive statistics and a comparative analysis of TOCs in terms of parts of speech and subject. For this purpose, the study selected social sciences books from the acquisition lists of an academic library, obtained the Dewey class numbers of the target books from the KERIS union catalog, and extracted TOC data from an online bookstore. Morphological analysis was performed on each book's title and TOC, followed by descriptive statistics and frequency analysis. As a result, nouns made up roughly half of the morphemes in both titles and TOCs. TOCs had about 50 times more nouns than titles. Nouns that appeared only in the TOC, and not in the title, are estimated to account for 95.2% of the TOC's total nouns. TOC length also differed depending on the field of social science.

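This study carried out descriptive statistics and a comparative analysis of tables of contents, which have recently become more accessible and more widely used, in terms of parts of speech and subject. To this end, social sciences books were selected from the acquisition lists of a university library; for each book, the DDC class number was obtained from the union catalog and the TOC was extracted from an online bookstore. Titles and TOCs were morphologically analyzed, and descriptive statistics and frequency analysis were performed on the noun-centered vocabulary. The results show that nouns account for roughly half of the morphemes in both titles and TOCs, that TOCs contain about 50 times more nouns than titles, and that 95.2% of the nouns appearing in TOCs are unique to the TOC. TOC length was also found to differ across social science disciplines.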
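The two headline measures in the abstract (the TOC-to-title noun ratio and the share of nouns found only in the TOC) can be illustrated with a minimal Python sketch. It rests on several assumptions: nouns are already extracted (the study used a Korean morphological analyzer, which is not reproduced here), the share is computed over noun tokens rather than distinct nouns (the abstract does not say which), and the function name `toc_only_noun_ratio` and the sample nouns are hypothetical, not data from the study.

```python
from collections import Counter


def toc_only_noun_ratio(title_nouns, toc_nouns):
    """Share of TOC noun tokens whose noun never occurs in the title.

    Assumption: the share is token-based; the study may have counted
    distinct nouns instead.
    """
    title_vocab = set(title_nouns)
    toc_counts = Counter(toc_nouns)
    total = sum(toc_counts.values())
    if total == 0:
        return 0.0  # no TOC nouns, so nothing can be TOC-only
    toc_only = sum(n for noun, n in toc_counts.items() if noun not in title_vocab)
    return toc_only / total


# Hypothetical, pre-extracted nouns for a single book (illustration only).
title_nouns = ["society", "statistics"]
toc_nouns = ["society", "survey", "sampling", "statistics", "regression", "survey"]

ratio = toc_only_noun_ratio(title_nouns, toc_nouns)
length_ratio = len(toc_nouns) / len(title_nouns)

print(f"TOC-only noun share: {ratio:.1%}")
print(f"TOC/title noun count ratio: {length_ratio:.1f}")
```

In this toy case four of the six TOC noun tokens are absent from the title, and the TOC holds three times as many nouns as the title. Applied per book and averaged over a collection, the same two quantities would correspond to the roughly 50-fold ratio and the 95.2% share reported above.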

Keywords

<Figure 1> Morpheme frequency distributions of titles, TOCs, and general text

<Figure 2> Box plots of word-unit (eojeol) and noun frequencies in TOCs by DDC 300 division

<Table 1> Contingency table of union catalog class numbers for the academic library's DDC 300-class books

<Table 2> Contingency table of union catalog DDC 300 division class numbers for the academic library's DDC 300-class books

<Table 3> Statistics from the morphological analysis of titles, TOCs, and general text

<Table 4> Descriptive statistics of words appearing in titles and TOCs

<Table 5> Mean and standard deviation of noun frequencies in titles and TOCs by division

<Table 6> Proportion of nouns appearing only in TOCs
