• Title/Summary/Keyword: Title Classification

Search Result 79, Processing Time 0.03 seconds

A Study on the Improvement of Accessibility to Public Records: Based on the Construction of Subject Thesaurus for Presidential Archives (공공기록에 대한 접근성 제고 방안에 관한 연구 - 대통령기록관 주제시소러스 개발 사례를 중심으로 -)

  • Rieh, Hae-Young;Kwon, Yongchan;Seong, Hyojoo;Yoo, Byonghoo
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.14 no.4
    • /
    • pp.127-151
    • /
    • 2014
  • To search based on the functional classification or provenance is not easy for users, and the key word-based information retrieval presents only simple words matching with the title of the records. The Presidential Archive of Korea developed a subject classification scheme to improve the convenience of searching for various records and came up with a subject thesaurus based on the scheme that utilizes the terms appearing on the title of the records and the terms used by the users who searched the portal or requested information disclosure. This research presents the development process of subject thesaurus. It also presents the utilization methods for records management work and services.

Study on Security Grade Classification of Financial Company Documents (금융기관 문서 보안등급 분류에 관한 연구)

  • Kang, Bu Il;Kim, Seung Joo
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.24 no.6
    • /
    • pp.1319-1328
    • /
    • 2014
  • While the recent advance in network system has made it easier to collect and process personal information, the loss of customers, financial companies and even nations is getting bigger due to the leakage of personal information. Therefore, it is required to take a measure to prevent additional damage from the illegal use of leakaged personal information. Currently, financial companies use access control in accordance with job title or position on general documents as well as important documents including personal information. Therefore, even if a documents is confidential, it is possible for a person of the same job title or position to access the document properly. This paper propose setting up security grade of documents to improve current access control system. It will help preventing the leakage of personal information.

Feature Extraction to Detect Hoax Articles (낚시성 인터넷 신문기사 검출을 위한 특징 추출)

  • Heo, Seong-Wan;Sohn, Kyung-Ah
    • Journal of KIISE
    • /
    • v.43 no.11
    • /
    • pp.1210-1215
    • /
    • 2016
  • Readership of online newspapers has grown with the proliferation of smart devices. However, fierce competition between Internet newspaper companies has resulted in a large increase in the number of hoax articles. Hoax articles are those where the title does not convey the content of the main story, and this gives readers the wrong information about the contents. We note that the hoax articles have certain characteristics, such as unnecessary celebrity quotations, mismatch in the title and content, or incomplete sentences. Based on these, we extract and validate features to identify hoax articles. We build a large-scale training dataset by analyzing text keywords in replies to articles and thus extracted five effective features. We evaluate the performance of the support vector machine classifier on the extracted features, and a 92% accuracy is observed in our validation set. In addition, we also present a selective bigram model to measure the consistency between the title and content, which can be effectively used to analyze short texts in general.

A Study on the Modern Catalog Characteristics of Chosundoseohaeje ("조선도서해제"의 목록적 특성에 관한 연구)

  • 도태현
    • Journal of Korean Library and Information Science Society
    • /
    • v.34 no.2
    • /
    • pp.1-18
    • /
    • 2003
  • Chosundoseohaeje, a book type catalog was published three times under the rule of Japanese imperialism. This catalog has several characteristics of modem catalog as fellows : First, each record of this catalog includes title, volume no., statements of responsibility, printing type, annotated bibliography containing date, author, background, structure, contents of the book, and biographies. Second, this catalog has a subject retrieval system by the four-part classification(Kyung, Sa, Ja, Jib), title retrieval system by Japanese alphabetical index, and authorㆍeditor retrieval system by their family name index or king's name index. Third, this catalog has a system indicating the location of described books by Gyujanggak book numbers.

  • PDF

Developing an Automatic Classification System Based on Colon Classification: with Special Reference to the Books housed in Medical and Agricultural Libraries (콜론분류법에 바탕한 자동분류시스템의 개발에 관한 연구 - 농학 및 의학 전문도서관을 사레로 -)

  • Lee Kyung-Ho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.23
    • /
    • pp.207-261
    • /
    • 1992
  • The purpose of this study is (1) to design and test a database which can be automatically classified, and (2) to generate automatic classification number by processing the keywords in titles using the code combination method of Colon Classification(CC) as well as an automatic recognition of subjects in order to develop an automatic classification system (Auto BC System) based on CC which can be applied to any research library. To conduct this study, 1,510 words in the fields of agricultrue and medicine were selected, analized in terms of [P], [M], [E], [S], [T] employed in CC, and included in a database for classification. For the above-mentioned subject fields, the principle of an automatic classification was specified in order to generate automatic classification codes as well as to perform an automatic subject recognition of the titles included. Whenever necessary, editing, deleting, appending and reindexing of a database can be made in this automatic classification system. Appendix 1 shows the result of the automatic classification of books in the fields of agriculture and medicine. The results of the study are summarized below. 1. The classification number for the title of a book can be automatically generated by using the facet principles of Colon Classification. 2. The automatic subject recognition of a book is achieved by designing a database making use of a globe-principle, and by specifying the subject field for each word. 3. The automatic subject-recognition of input data is achieved by measuring the number of searched words by each subject field. 4. The combination of classification numbers is achieved by flowcharting of classification formular of each subject field. 5. The efficient control of classification numbers is achieved by designing control codes on the database for classification. 6. The automatic classification by means of Auto BC has been proved to be successful in the research library concentrating on a Single field. The general library may have some problem in employing this system. The automatic classification through Auto BC has the following advantages: 1. Speed of the classification process can be improve. 2. The revision or updating of classification schemes can be facilitated. 3. Multiple concepts can be expressed in a single classification code. 4. The consistency of classification can be achieved with the classification formular rather than the classifier's subjective judgement. 5. A user's retrieving process can be made after combining the classification numbers through keywords relating to the material to be searched. 6. The materials can be classified by a librarian without subject backgrounds. 7. The large body of materials can be quickly classified by means of a machine processing. 8. This automatic classification is expected to make a good contribution to design of the total system for library operations. 9. The information flow among libraries can be promoted owing to the use of the same program for the automatic classification.

  • PDF

Automated Modelling of Ontology Schema for Media Classification (미디어 분류를 위한 온톨로지 스키마 자동 생성)

  • Lee, Nam-Gee;Park, Hyun-Kyu;Park, Young-Tack
    • Journal of KIISE
    • /
    • v.44 no.3
    • /
    • pp.287-294
    • /
    • 2017
  • With the personal-media development that has emerged through various means such as UCC and SNS, many media studies have been completed for the purposes of analysis and recognition, thereby improving the object-recognition level. The focus of these studies is a classification of media that is based on a recognition of the corresponding objects, rather than the use of the title, tag, and scripter information. The media-classification task, however, is intensive in terms of the consumption of time and energy because human experts need to model the underlying media ontology. This paper therefore proposes an automated approach for the modeling of the media-classification ontology schema; here, the OWL-DL Axiom that is based on the frequency of the recognized media-based objects is considered, and the automation of the ontology modeling is described. The authors conducted media-classification experiments across 15 YouTube-video categories, and the media-classification accuracy was measured through the application of the automated ontology-modeling approach. The promising experiment results show that 1500 actions were successfully classified from 15 media events with an 86 % accuracy.

A Study of Card Catalog Use in a University Library (대학도서관의 목록이용행태 특성에 관한 연구 - 덕성 여자대학교 도서관을 중심으로 -)

  • Yoo Jae-Ok
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.29
    • /
    • pp.281-304
    • /
    • 1995
  • The purpose of this study is to identify the degree to which the card catalog in a university library serves its users and to provide useful information for the design of conversion from card catalog to online catalog. From August 19th to September 16th 1995, 278 users of Duksung Women's University Library were randomley selected and surveyed in terms of card catalog use, success rate of card searching, and catalog use training received. Major findings are as follows: 1. Taking into considerations the fact that Library users tended to use more heavily oriental card catalog$(61.8\%)$ than western card catalog$(8.3\%)$ or classification card catalog$(26.3\%)$, oriental card catalog should be designed to improve its search function of the catalog. 2. It was found that the university library card catalog was not easy to use by $49.3\%$ of the users of Duksung University Library. 3. One of main reasons why the card catalog is hard to use is that there is no subject card to which users can access for subject searching. Besides, users have difficulties in locating appropriate classification numbers for subjects which users are interested in. 4. When success rate is defined as finding appropriate cards in catalog boxes, the success rate was reported to be $87.0\%$. 5. The major access points of known items which library users utilized mostly were author$(18.3\%)$ and title$(74.5\%)$. 6. In case of translated versions of foreign materials, original author name cards instead of pronounced original name card written in Korean were given to them as access points. $79.9\%$ of library users of Duksung Women's University insisted that both original and pronounced author name writ':en in Korean should be given as access points to foreign authors for the sake of user's convenience. 7. Formal training programs for card catalog use were found not to be sufficient. Small group informal training courses should be offered to users who need to get information for catalog use by library staffs. 8. Considering the trend that orders of access points have been changed from title, author and subject in card catalog to title, keyword, and author in online catalog, the existing card catalog of Duksung Women's University is expected not to meet future users' needs for subject searching unless the funcions of subject searching of catalog is improved.

  • PDF

On the Bibliographies of Chinese Historical Books - Classifying and cataloguing system of six historical bibliographies - (중국의 사지서목에 대하여 -육사예문$\cdot$경적지의 분류 및 편목체재 비교를 중심으로-)

  • Kang Soon-Ae
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.24
    • /
    • pp.289-332
    • /
    • 1993
  • In china, six bibliographies of offical historical books are evaluated at the most important things among the systematically-editing bibliographies. These bibliographies would be usful to study the orign of classical sciences and their development, bibliographic research of Chinese classics, bibliographic judgement on genuine books, titles, authors, volumes. They could be refered to research into graving, correcting, and existence of ancient books. therefore, these bibliographies would be applied to estimation the phase of scientific and cultural development. The study of these bibliographies has been not yet made in Korea. This thesis lays its importance on the background of their appearance, their classification norms, organizing system of their catalogue, and comparison between their difference. 1. Editing and compiling of Chilyak (칠약) by Liu Chin (유흠) and official histories played an important role of entering an apperance of historical book's bibliographies. Chilyak has been lost. However, its classification and compiling system of classical books would be traced by Hansoyemunji(한서예문지) of which basic system is similar to Chilyak. It classified books according to their scientific characteristic. If a few books didn't have their own categories, they were combined by the circles parallel to the books' characteristic. With the books classified under the same scientific characteristic, they were again divided into the scientific schools or structures. It also arranged the same kinds of books according to the chronology. The some books wi th duplicate subjects were classified multiplely by their duplicate subject. 2. Ssu-ma Chon's (사마천) The Historical Records (Saki, 사기) and Pan Ku's (반고) The History of the Former Han Dynasty (Hanso, 한서) has also took effects on appearance of historical books' bibliographies. Covering overall history, Saki was structured by the five parts: The basic annals(본기), the chronological tables (표), the documents (서), the hereditary houses (세가), biographies (열전). The basic annals dealt with kings and courts' affairs according to the chronology. The chronological tables was the records of the annals. The documents described overall the social and cultural systems. The hereditary houses recorded courts' meritorious officials and public figures. The biographies showed exemplars of seventy peoples selected by their social status. Pan Ku(반구)'s The History of the Former Han Dynasty(한서) deserved to be called the prototype for the offical histories after Saki's (사기; The Historical Records) apperance. Although it modelled on Saki, it had set up its own cataloguing system. It was organized by four parts; the basic annals (본기), the chronological tables (표), treatises(지), biographies (열전). The documents in the Hanso(한서) was converted into treatises(지). The hereditary houses and biographies were merged. For the first time, the treatise with The Yemunji could operate function for historical bibliographies. 3. There were six historical bibliographies: Hansoyemunji(한서예문지), Susokyongjeokji (수서경적지), Kudangsokyongjeokji(구당서경적지), Shindangsoyemunji (신당서예문지), Songsayemunji (송사예문지), Myongsayemunji (명사예문지). 1) Modelling on Liu Chin's Chilyak except Chipryak(집략), Hansoyemunji divided the characteristic of the books and documents into six parts: Yukrye(육예), Cheja(제자), Shibu(시부), Pyongsoh(병서), Susul(수술), Pangki(방기). Under six parts, there were thirty eight orders in Hansoyemunji. To its own classification, Hansoyemunji applied the Chilyak's theory of classification that the books or documents were managed according to characteristic of sciences, the difference of schools, the organization of sentences. However the overlapped subjects were deleted and unified into one. The books included into an unsuitable subject were corrected and converted into another. The Hansoyemunji consisted of main preface (Taesoh 대서), minor preface (Sosoh 소서) , the general preface (Chongso 총서). It also recorded the introduction of books and documents, the origin of sciences, the outline of subjects, and the establishment of orders. The books classified by the subject had title, author, and volumes. They were rearranged by titles and the chronological publication year. Sometimes author was the first access point to catalogue the books. If it was necessary for the books to take footnotes, detail notes were formed. The Volume number written consecutively to order and subject could clarify the quantity of books. 2) Refering to Classfication System by Seven Norms (칠분법) and Classification System by Four Norms(사분법), Susokyongjeokji(수서경적지) had accomplished the classification by four norms. In fact, its classification largely imitated Wanhyosoh(완효서)'s Chilrok(칠록), Susokyongjeokji's system of classification consisted of four parts-Kyung(경), Sa(사), Cha(자), Chip(칩). The four parts were divided into 40 orders. Its appendix was again divided into two parts, Buddihism and Taiosm. Under the two parts there were fifteen orders. Totally Susokyongjeokji was made of six parts and fifty five orders. In comparison with Hansoyemunji(한서예문지), it clearly showed the conception of Kyung, Sa, Cha, Chip. Especially it deserved to be paid attention that Hansoyemunji laied history off Chunchu(춘추) and removed history to Sabu(사부). However Chabu(사부) put many contrary subjects such as Cheja(제자), Kiye(기예), Sulsu(술수), Sosol(소설) into the same boundary, which committed errors insufficient theoretical basis. Anothor demerit of Susokyongjeokji was that it dealt with Taiosm scriptures and Buddism scriptures at the appendix because they were considered as quasi-religion. Its compilation of bibliographical facts consisted of main preface(Taesoh 대서), minor preface(Sosoh 소서), general preface (Chongsoh 총서), postscript (Husoh 후서). Its bibliological facts mainly focused on the titles. Its recorded authors' birth date and their position. It wrote the lost and existence of books consecutive to total number of books, which revealed total of the lost books in Su Dynasty. 3) Modelling on the basis of Kokumsorok(고분서록) and Naewaekyongrok(내외경록), Kudangsokyongjeokji(구당서경적지) had four parts and fourty five orders. It was estimated as the important role of establishing basic frame of classification by four norms in classification theory's history. However it had also its own limit. Editing and compling orders of Kudangsokyongjeokji had been not progressively changed. Its orders imitated by and large Susokyongjeokji. In Its system of organizing catalogue, with its minor preface and general preface deleting, Kudangsokyongjeokji by titles after orders sometimes broke out confusion because of unclear boundaries between orders. 4) Shindangsoyemunji(신당서예문지), adding 28,469 books to Kudangsokyongjeokji, recorded 82,384 books which were divided by four parts and fourty four orders. In comparison with Kudangkyongjeokj, Sindangsoyemunji corrected unclear order's norm. It merged the analogical norms four orders (for instance, Kohun 고훈 and Sohakryu 소학류) and seperated the different norms four orders (for example, Hyokyong 효경 and Noneuhryu 논어류, Chamwi 참위 and Kyonghaeryu 경해류, Pyonryon 편년 and Wisaryu 위사류). Recording kings' behaviors and speeches (Kikochuryu 기거주류) in the historical parts induced the concept of specfication category. For the first time, part of Chipbu (집부) set up the order of classification norm for historical and literatural books and documents (Munsaryu 문사류). Its editing and compiling had been more simplified than Kudangsokyongjeokji. Introduction was written at first part of bibliographies. Appendants except bibliographic items such subject, author, title, volume number, total were omitted. 5) Songsayemunji(송사예문지) were edited in the basis of combining Puksong(북송) and Namsong(남송), depending on Sabukuksayemunji(사부국사예문지). Generally Songsayemunji had lost a lot of bibliographical facts of many books. They were duplicated and wrongly classified books because it committed an error of the incorrectly annalistic editing. Particularly Namsong showed more open these defaults. Songsayemunji didin't include the books published since the king Youngchong(영종). Its system of classification was more better controlled. Chamwiryu(참위류) in the part of Kyongbu(경부) was omitted. In the part of history(Sabu 사부), recordings of kings' behaviors and speeches more merged in the annals. Historical abstract documents (Sachoryu 사초류) were seperately arranged. In the part of Chabu(자부), Myongdangkyongmaekryu(명당경맥류) and Euisulryu(의술류) were combined. Ohangryu(오행류) were laied off Shikuryu(시구류). In the part of Chipbu(집부), historical and literatural books (Munsaryu 문사류) were independentely arranged. There were the renamed orders; from Wisa(위사) to Paesa(패사), Chapsa (잡사) to Pyolsa(열사), Chapchonki(잡전기) to Chonki(전기), Ryusoh(류서) to Ryusa(류서). Introduction had only main preface. The books of each subject catalogued by title, the volume number, and author and arranged mainly by authors. Annotations were written consecutively after title and the volume number. In the afternote the number of not-treated books were revealed. Difference from Singdangsohyemunji(신당서예문지) were that the concept and boundary of orders became more clearer. It also wrote the number of books consecutive to main subject. 6) Modelling on Chonkyongdangsomok (경당서목), Myongsayemunji(명사예문지) was compiled in the basis of books and documents published in the Ming Danasty. In classification system, Myongsayemunji partly merged and the seperated some orders for it. It also deleted and renamed some of orders. In case of necessity, combining of orders' norm was occured particulary in the part of Sabu(사부) and Chabu(자부). Therefore these merging of orders norm didn't offer sufficient theretical background. For example, such demerits were seen in the case that historical books edited by annals were combined with offical historical ones which were differently compiled and edited from the former. In the part of Chabu(자부), it broke out another confusion that Pubga(법가), Meongga(명가), Mukga(묵가), Chonghweongka's(종횡가) thoughts were classified in the Chapka(잡가). Scriptures of Taiosim and Buddhism were seperated from each other. There were some deleted books such as Mokrokryu(목록류), Paesaryu(패사류) in the part of history (Sabu 사부) and Chosaryu(초사류) in the part of Chipbu(집부). The some in the each orders had been renamed. Imitating compiling system of Songsayemunji(송사예문지), with reffering to its differ-ence, Myongsayemunji(명사예문지) wrote the review and the change of the books by author. The number of not-treated books didn't appear at the total. It also deleted the total following main subject.

  • PDF

A Study on Book Categorization in Social Sciences Using kNN Classifiers and Table of Contents Text (목차 정보와 kNN 분류기를 이용한 사회과학 분야 도서 자동 분류에 관한 연구)

  • Lee, Yong-Gu
    • Journal of the Korean Society for information Management
    • /
    • v.37 no.1
    • /
    • pp.1-21
    • /
    • 2020
  • This study applied automatic classification using table of contents (TOC) text for 6,253 social science books from a newly arrived list collected by a university library. The k-nearest neighbors (kNN) algorithm was used as a classifier, and the ten divisions on the second level of the DDC's main class 300 given to books by the library were used as classes (labels). The features used in this study were keywords extracted from titles and TOCs of the books. The TOCs were obtained through the OpenAPI from an Internet bookstore. As a result, it was found that the TOC features were good for improving both classification recall and precision. The TOC was shown to reduce the overfitting problem of imbalanced data with its rich features. Law and education have high topic specificity in the field of social sciences, so the only title features can bring good classification performance in these fields.

An Automatic Classification System of Official Documents in Middle Schools Using Term Weighting of Titles (제목의 단어 가중치를 이용한 중등학교 공문서 자동분류시스템)

  • Kang, Hyun-Hee;Jin, Min
    • Journal of The Korean Association of Information Education
    • /
    • v.7 no.2
    • /
    • pp.219-226
    • /
    • 2003
  • It takes a lot of time to classify official documents in schools and educational institutions. In order to reduce the overhead, we propose an automatic document classification method using word information of the titles of documents in this paper. At first, meaningful words are extracted from titles of existing documents and Inverse Document Frequency(IDF) weights of words are calculated against each category. Then we build a word weight dictionary. Documents are automatically classified into the appropriate category of which the sum of weights of words of the title is the highest by using the word weight dictionary. We also evaluate the performance of the proposed method using a real dataset of a middle school.

  • PDF