• 제목/요약/키워드: Consistency for classification

검색결과 133건 처리시간 0.028초

연관도를 계산하는 자동화된 주제 기반 웹 수집기 (An Automated Topic Specific Web Crawler Calculating Degree of Relevance)

  • 서혜성;최영수;최경희;정기현;노상욱
    • 인터넷정보학회논문지
    • /
    • 제7권3호
    • /
    • pp.155-167
    • /
    • 2006
  • 인터넷을 사용하는 사람들에게 그들의 관심사와 부합하는 웹 페이지를 제공하는 것은 매우 중요하다. 이러한 관점에서 본 논문은 각 웹 페이지의 주제와 연관된 정도를 계산하여 웹 페이지 군(cluster)을 형성하며, 단어빈도/문서빈도 엔트로피(entropy) 및 컴파일된 규칙을 이용하여 수집된 웹 페이지를 정제하는 주제 기반 웹 수집기를 제안한다. 실험을 통하여 주제 기반 웹 수집기에 대한 분류의 정확성, 수집의 효율성 및 수집의 일관성을 평가하였다. 첫째, C4.5, 역전패(back propagation) 및 CN2 기계학습 알고리즘으로 컴파일한 규칙을 이용하여 실험한 웹 수집기의 분류 성능은 CN2를 사용한 분류 성능이 가장 우수 하였으며, 둘째, 수집의 효율성을 측정하여 각 범주별로 최적의 주제 연관 정도에 대한 임계값을 도출할 수 있었다. 마지막으로, 제안한 수집기의 수집정도에 대한 일관성을 평가하기 위하여 서로 다른 시작 URL을 사용하여 수집된 웹 페이지들의 중첩정도를 측정하였다. 실험 결과에서 제안한 주제 기반 웹 수집기가 시작 URL에 큰 영향을 받지 않고 상당히 일관적인 수집을 수행함을 알 수 있었다.

  • PDF

KDC 제5판의 주기분석에 관한 연구 (A Study on the Notes Analysis of KDC 5th Edition)

  • 정옥경
    • 한국비블리아학회지
    • /
    • 제22권3호
    • /
    • pp.207-228
    • /
    • 2011
  • 분류표에서 주기는 분류항목과 기호에 대한 유용한 정보를 제공하여 분류의 정확성과 일관성을 향상시켜 준다. KDC에서도 여러 가지 유형의 주기를 사용하고 있으나, 급변하는 학문의 진전과 확대를 따르기에는 주기의 이용이 상당히 미흡하다. 따라서 본 연구의 목적은 KDC에 적합한 주기유형과 개선방안을 제시하는 것이다. 이 목적을 위하여 KDC의 주기유형의 변천과 DDC23판, NDC신정9판, KDC5판의 서문에 제시하고 있는 주기를 분석하고, 각 분류표에서 실제로 사용되고 있는 주기의 유형을 비교 분석하여 KDC5판의 주기의 문제점과 개선방안을 제시하였다.

분류체계 일치를 통한 과학기술정보 상호 교환 방법에 관한 기초 연구 (A Preliminary Study on Interchange of Science and Technology Information through Harmonization of Classification Schemes)

  • 홍성화;서태설
    • 정보관리연구
    • /
    • 제35권3호
    • /
    • pp.109-123
    • /
    • 2004
  • 과학기술정보의 의미적 상호운용성 문제는 빈번하게 발생한다. 잘 만들어진 분류체계는 상이한 데이터베이스 간에 의미상 불일치 없이 정보를 교환하기 위한 도구로 사용될 것이다. 하지만 각 데이터베이스가 취하고 있는 분류체계가 상이함으로 인해서 여전히 현실적인 장벽이 존재한다. 따라서 분류체계간의 일치 및 조화는 매우 시급한 문제이다. 본 논문의 목표는 다른 분류체계('국가과학기술표준분류'와 'KISTI 표준 분류')를 갖는 데이터베이스간의 정보 교환 시에 발생할 수 있는 의미적 불일치를 해결하는 것이다. 이를 위해서 과학기술의 개념적 체계 분석을 수행하였고 다섯가지의 일치/불일치 유형을 사례에 기반하여 분석하였다.

Classified Chemicals in Accordance with the Globally Harmonized System of Classification and Labeling of Chemicals: Comparison of Lists of the European Union, Japan, Malaysia and New Zealand

  • Yazid, Mohd Fadhil H.A.;Ta, Goh Choo;Mokhtar, Mazlin
    • Safety and Health at Work
    • /
    • 제11권2호
    • /
    • pp.152-158
    • /
    • 2020
  • Background: The Globally Harmonized System of Classification and Labeling of Chemicals (GHS) was developed to enhance chemical classification and hazard communication systems worldwide. However, some of the elements such as building blocks and data sources have the potential to cause "disharmony" to the GHS, particularly in its classification results. It is known that some countries have developed their own lists of classified chemicals in accordance with the GHS to "standardize" the classification results within their respective countries. However, the lists of classified chemicals may not be consistent among these countries. Method: In this study, the lists of classified chemicals developed by the European Union, Japan, Malaysia, and New Zealand were selected for comparison of classification results for carcinogenicity, germ cell mutagenicity, and reproductive toxicity. Results: The findings show that only 54%, 66%, and 37% of the classification results for each Carcinogen, Mutagen and Reproductive toxicants hazard classes, respectively are the same among the selected countries. This indicates a "moderate" level of consistency among the classified chemicals lists. Conclusion: By using classification results for the carcinogenicity, germ cell mutagenicity, and reproductive toxicity hazard classes, this study demonstrates the "disharmony" in the classification results among the selected countries. We believe that the findings of this study deserve the attention of the relevant international bodies.

문헌분류법에서의 지역구분에 관한 연구 (A Study on the Structure of Geographical Division in Library Classification System)

  • 남태우;백혜경;이형미;정수진
    • 한국도서관정보학회지
    • /
    • 제39권4호
    • /
    • pp.189-214
    • /
    • 2008
  • 본 연구는 현 KDC 4판의 지역구분체계가 가지는 문제점을 지적하고 이에 대한 개선방안을 마련하는 데 그 목적이 있다. 이를 위해 주요 분류법들을 십진과 비십진으로 나누어 각각의 분류법에서 채택한 지역구분 원칙에 대해 분석하였으며, 아울러 한국, 미국, 일본의 국가기관에서 채택한 지역구분 기준에 대해 조사하였다. 이와 같은 분석 결과를 바탕으로 KDC 4판 한국지역구분표의 개선안을 도출하였다. 또한 국민편의를 위해 마련된 공공기관의 행정구역분류체계와의 연관성 및 일관성 유지 방안과 아울러 행정지리에 의한 구분 이외의 다양한 지리현상을 반영한 추가적인 지역구분기준의 마련 방안을 제시하였다.

  • PDF

Optimal dwelling time prediction for package tour using K-nearest neighbor classification algorithm

  • Aria Bisma Wahyutama;Mintae Hwang
    • ETRI Journal
    • /
    • 제46권3호
    • /
    • pp.473-484
    • /
    • 2024
  • We introduce a machine learning-based web application to help travel agents plan a package tour schedule. K-nearest neighbor (KNN) classification predicts the optimal tourists' dwelling time based on a variety of information to automatically generate a convenient tour schedule. A database collected in collaboration with an established travel agency is fed into the KNN algorithm implemented in the Python language, and the predicted dwelling times are sent to the web application via a RESTful application programming interface provided by the Flask framework. The web application displays a page in which the agents can configure the initial data and predict the optimal dwelling time and automatically update the tour schedule. After conducting a performance evaluation by simulating a scenario on a computer running the Windows operating system, the average response time was 1.762 s, and the prediction consistency was 100% over 100 iterations.

콜론분류법에 바탕한 자동분류시스템의 개발에 관한 연구 - 농학 및 의학 전문도서관을 사레로 - (Developing an Automatic Classification System Based on Colon Classification: with Special Reference to the Books housed in Medical and Agricultural Libraries)

  • 이경호
    • 한국문헌정보학회지
    • /
    • 제23권
    • /
    • pp.207-261
    • /
    • 1992
  • The purpose of this study is (1) to design and test a database which can be automatically classified, and (2) to generate automatic classification number by processing the keywords in titles using the code combination method of Colon Classification(CC) as well as an automatic recognition of subjects in order to develop an automatic classification system (Auto BC System) based on CC which can be applied to any research library. To conduct this study, 1,510 words in the fields of agricultrue and medicine were selected, analized in terms of [P], [M], [E], [S], [T] employed in CC, and included in a database for classification. For the above-mentioned subject fields, the principle of an automatic classification was specified in order to generate automatic classification codes as well as to perform an automatic subject recognition of the titles included. Whenever necessary, editing, deleting, appending and reindexing of a database can be made in this automatic classification system. Appendix 1 shows the result of the automatic classification of books in the fields of agriculture and medicine. The results of the study are summarized below. 1. The classification number for the title of a book can be automatically generated by using the facet principles of Colon Classification. 2. The automatic subject recognition of a book is achieved by designing a database making use of a globe-principle, and by specifying the subject field for each word. 3. The automatic subject-recognition of input data is achieved by measuring the number of searched words by each subject field. 4. The combination of classification numbers is achieved by flowcharting of classification formular of each subject field. 5. The efficient control of classification numbers is achieved by designing control codes on the database for classification. 6. The automatic classification by means of Auto BC has been proved to be successful in the research library concentrating on a Single field. The general library may have some problem in employing this system. The automatic classification through Auto BC has the following advantages: 1. Speed of the classification process can be improve. 2. The revision or updating of classification schemes can be facilitated. 3. Multiple concepts can be expressed in a single classification code. 4. The consistency of classification can be achieved with the classification formular rather than the classifier's subjective judgement. 5. A user's retrieving process can be made after combining the classification numbers through keywords relating to the material to be searched. 6. The materials can be classified by a librarian without subject backgrounds. 7. The large body of materials can be quickly classified by means of a machine processing. 8. This automatic classification is expected to make a good contribution to design of the total system for library operations. 9. The information flow among libraries can be promoted owing to the use of the same program for the automatic classification.

  • PDF

DDC에 있어서 종교류 분류전개상의 제문제 (A Study on the 'Religion Class' of DDC)

  • 변우열
    • 한국문헌정보학회지
    • /
    • 제22권
    • /
    • pp.259-304
    • /
    • 1992
  • This paper examines 'Religion Class' in the scheme of the DDC. The major findings of the study are summerized as follows. 1. The first edition of DDC was published in 1876 in order to classify Amherst College Library collections. In spite of the continuous study and revision of the experts, the frameworks of the DDC systems are still kept unchanged. Only their subdivisions, reflecting those developments in the academic world, are developed and detailed more sophisticatedly. 2. The division of 200 does not function as generalities for all class of religion. Therefore, it is necessary to amend the division of 200 to serve generalities for all the religions of the world. 3. Standard subdivision for the christian religion and for the non-christian religion is different. So, the mnemonic nature has become weakened due to the dual standard subdivisions and the classification number becomes much longer and complicated. Therefore, one standard subdivision for all religions of the world is required. 4. Religion science was organized in late 19 C and developed continuously, but the DDC does not accomodate the religion science as a science. Accodingly, the DDC should be revised recognize religion science as a science not the christian science. 5. The deployment of classification scheme in Dewey's 200 is severely biased. That is to say, 9 division were assigned for christian religion, whereas only 1 division was assigned for non-christian religion. Therefore, an adjustment should be made to allocate subdivisions equally to all religions of the world. 6. General classification order of religion is prehistoric, primitive, ancient, modem and world religion in religion science. But, DDC does not accept this general classification order of religion, sticking to the biased expansion towards christianity. Therefore, DDC must adopt the general classification order of religion in the religion science. 7. Lastly, because of the limitation of decimal notation in DC, DDC does not accomodate new subject equally and classification number becomes longer. Therefore, centesimal expansion is proposed in order to make the classification number short, to enlarge its capacity of inclusion of new subject and to maintain consistency in the scheme.

  • PDF

타이로신키나아제 억제제의 임상적으로 유의한 약물상호작용 정보 일관성 분석 (Evaluation of Information Consistency of Clinically Significant Drug Interactions in Tyrosine Kinase Inhibitors)

  • 안슬기;이주연;아영미
    • 한국임상약학회지
    • /
    • 제30권1호
    • /
    • pp.44-50
    • /
    • 2020
  • Background: Drug-drug interactions (DDIs) in patients using oral anticancer treatment are more common than in those using injectable anticancer agents. In addition, DDIs related to anticancer treatment are known to cause clinically significant outcomes, such as treatment failure and severe toxicity. To prevent these negative outcomes, significant DDIs are monitored and managed using the information provided in drug databases. We aimed to evaluate the consistency of information on clinically significant DDIs for tyrosine kinase inhibitors (TKIs) between representative drug databases. Methods: We selected clinically significant DDIs involving medications that are co-prescribed with TKIs and met the following criteria: the severity level of DDIs was equal or greater than "D" in Lexicomp® or "major" in Micromedex®. We then analyzed the consistency of the severity classification and evidence level between the drug databases. Spearman's correlation coefficient was used to identify the relationship between DDI information in the drug databases. Results: In total, 627 DDI pairs were identified as clinically significant; information on these was provided by Lexicomp® and Micromedex® for 571 and 438 pairs, respectively, and both drug databases provided information on 382 DDI pairs. There was no correlation between the severity and evidence level of DDIs provided in the two databases; Spearman's correlation coefficient for Lexicomp® and Micromedex® was -0.009 (p=0.861) and -0.064 (p=0.209), respectively. Conclusion: To judge the significance of DDIs, healthcare providers should consider that the information on DDIs may be different between drug information databases; hence, clinical factors must be considered concurrently.

Korean Brain Tumor Society Consensus Review for the Practical Recommendations on Glioma Management in Korea

  • Chul-Kee Park;Jong Hee Chang
    • Journal of Korean Neurosurgical Society
    • /
    • 제66권3호
    • /
    • pp.308-315
    • /
    • 2023
  • Recent updates in genomic-integrated glioma classification have caused confusion in current clinical practice, as management protocols and health insurance systems are based on evidence from previous diagnostic classifications. The Korean Brain Tumor Society conducted an electronic questionnaire for society members, asking for their ideas on risk group categorization and preferred treatment for each individual diagnosis listed in the new World Health Organization (WHO) classification of gliomas. Additionally, the current off-label drug use (OLDU) protocols for glioma management approved by the Health Insurance Review and Assessment Service (HIRA) in Korea were investigated. A total of 24 responses were collected from 20 major institutes in Korea. A consensus was reached on the dichotomic definition of risk groups for glioma prognosis, using age, performance status, and extent of resection. In selecting management protocols, there was general consistency in decisions according to the WHO grade and the risk group, regardless of the individual diagnosis. As of December 2022, there were 22 OLDU protocols available for the management of gliomas in Korea. The consensus and available options described in this report will be temporarily helpful until there is an accumulation of evidence for effective management under the new classification system for gliomas.