• Title/Summary/Keyword: Topic-Relevance

Search Result 62, Processing Time 0.03 seconds

A Focused Crawler by Segmentation of Context Information (주변정보 분할을 이용한 주제 중심 웹 문서 수집기)

  • Cho, Chang-Hee;Lee, Nam-Yong;Kang, Jin-Bum;Yang, Jae-Young;Choi, Joong-Min
    • The KIPS Transactions:PartB
    • /
    • v.12B no.6 s.102
    • /
    • pp.697-702
    • /
    • 2005
  • The focused crawler is a topic-driven document-collecting crawler that was suggested as a promising alternative of maintaining up-to-date web document Indices in search engines. A major problem inherent in previous focused crawlers is the liability of missing highly relevant documents that are linked from off-topic documents. This problem mainly originated from the lack of consideration of structural information in a document. Traditional weighting method such as TFIDF employed in document classification can lead to this problem. In order to improve the performance of focused crawlers, this paper proposes a scheme of locality-based document segmentation to determine the relevance of a document to a specific topic. We segment a document into a set of sub-documents using contextual features around the hyperlinks. This information is used to determine whether the crawler would fetch the documents that are linked from hyperlinks in an off-topic document.

A Trend Analysis of Radiological Research in Korea using Topic Modeling (토픽모델링을 이용한 국내 방사선 학술연구 트렌드 분석)

  • Hong, Dong-Hee
    • Journal of the Korean Society of Radiology
    • /
    • v.16 no.3
    • /
    • pp.343-349
    • /
    • 2022
  • We intend to use topic modeling to identify radiation-themed papers published from 1989 to 2022 and analyze the relevance and weight between topics. This study analyzed topics derived from national subjects for 717 papers published until recently in 2022 to contribute to the revitalization of research in the field of radiation. Through text mining, overall research trends on the subject distribution of the study were analyzed, and five topics were derived through topic modeling. First, among the papers to be analyzed, a total of 1,675 words were frequency-analyzed through the preprocessing process of key words in a total of 717 papers centered on keywords. Second, as a result of analyzing topics based on the association of constituent words for five topics, it was found that studies focused on minimizing dose in the range that does not degrade image quality in the fields of radiation, image, CT clinical. In addition, it was found that various studies were mainly conducted in the MRI, and the study of ultrasound in various areas of disease analysis was actively attempted.

A Survey on the Opinion of Teachers about the Content Relevance in the 7th Mathematics Curriculum (제7차 국민공통기본교육과정의 수학과 교육 내용 적정성에 관한 교사 의견 조사 연구)

  • Lee, Dae-Hyun;Yim, Jae-Hoon
    • Journal of the Korean School Mathematics Society
    • /
    • v.8 no.2
    • /
    • pp.223-248
    • /
    • 2005
  • This study is to survey and analyze the opinion of teachers about the relevance of educational content in the 7th mathematics curriculum. For the purpose of this study, we analyze the result of the questionnaire survey which consists in the question about the relevance(Quantity, level, validity) of educational content in the 7th mathematics curriculum. 515 elementary school teachers, 314 middle school teachers, and 323 high school teachers are participated in this survey. 75 percent of elementary school teachers think that the educational quantity must be reduced for the relevance of educational content. So do 50 percent of secondary school teachers. Both of them think that the number of topic must be reduced for the relevance. In special, this study shows that the response rate about the object which is related with interest is very low compared with any other mathematics education objects. So, it is necessary to pay more attention to the object which is related with interest.

  • PDF

Forecasting Open Government Data Demand Using Keyword Network Analysis (키워드 네트워크 분석을 이용한 공공데이터 수요 예측)

  • Lee, Jae-won
    • Informatization Policy
    • /
    • v.27 no.4
    • /
    • pp.24-46
    • /
    • 2020
  • This study proposes a way to timely forecast open government data (OGD) demand(i.e., OGD requests, search queries, etc.) by using keyword network analysis. According to the analysis results, most of the OGD belonging to the high-demand topics are provided by the domestic OGD portal(data.go.kr), while the OGD related to users' actual needs predicted through topic association analysis are rarely provided. This is because, when providing(or selecting) OGD, relevance to OGD topics takes precedence over relevance to users' OGD requests. The proposed keyword network analysis framework is expected to contribute to the establishment of OGD policies for public institutions in the future as it can quickly and easily forecast users' demand based on actual OGD requests.

Rutgers Information Retrieval Evaluation Project on IR Performance on Different Precision Levels (럿거스 정보검색 평가 프로젝트에 관한 연구)

  • Lee, Hyuk-Jin;Belkin Nicholas J.;Krovitz Bob
    • Journal of the Korean Society for information Management
    • /
    • v.23 no.2
    • /
    • pp.97-111
    • /
    • 2006
  • The purpose of this study is to investigate what level of difference in precision would be significantly perceived by a human user of an information retrieval system. Not many researches have been conducted with regards to this issue in information retrieval field. Despite the non-significant results, there were several interesting findings in recognizing different levels of precision rates. The correctness of relevance task had little to do with the taken time for the task. In addition, the strong relationship between the subjects' topic familiarity and rate of correct judgments is one of the most interesting results in this study. It turned out that the subjects have more difficulty in a situation they have to judge between the two lists having more non-relevant documents than in a situation they do between the lists haying more relevant documents. Finally, the serious influence from the first top N documents in a list for relevance judgment task has been confirmed.

3-Step Security Vulnerability Risk Scoring considering CVE Trends (CVE 동향을 반영한 3-Step 보안 취약점 위험도 스코어링)

  • Jihye, Lim;Jaewoo, Lee
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.27 no.1
    • /
    • pp.87-96
    • /
    • 2023
  • As the number of security vulnerabilities increases yearly, security threats continue to occur, and the vulnerability risk is also important. We devise a security threat score calculation reflecting trends to determine the risk of security vulnerabilities. The three stages considered key elements such as attack type, supplier, vulnerability trend, and current attack methods and techniques. First, it reflects the results of checking the relevance of the attack type, supplier, and CVE. Secondly, it considers the characteristics of the topic group and CVE identified through the LDA algorithm by the Jaccard similarity technique. Third, the latest version of the MITER ATT&CK framework attack method, technology trend, and relevance between CVE are considered. We used the data within overseas sites provide reliable security information to review the usability of the proposed final formula CTRS. The scoring formula makes it possible to fast patch and respond to related information by identifying vulnerabilities with high relevance and risk only with some particular phrase.

Dynamic Text Categorizing Method using Text Mining and Association Rule

  • Kim, Young-Wook;Kim, Ki-Hyun;Lee, Hong-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.10
    • /
    • pp.103-109
    • /
    • 2018
  • In this paper, we propose a dynamic document classification method which breaks away from existing document classification method with artificial categorization rules focusing on suppliers and has changing categorization rules according to users' needs or social trends. The core of this dynamic document classification method lies in the fact that it creates classification criteria real-time by using topic modeling techniques without standardized category rules, which does not force users to use unnecessary frames. In addition, it can also search the details through the relevance analysis by calculating the relationship between the words that is difficult to grasp by word frequency alone. Rather than for logical and systematic documents, this method proposed can be used more effectively for situation analysis and retrieving information of unstructured data which do not fit the category of existing classification such as VOC (Voice Of Customer), SNS and customer reviews of Internet shopping malls and it can react to users' needs flexibly. In addition, it has no process of selecting the classification rules by the suppliers and in case there is a misclassification, it requires no manual work, which reduces unnecessary workload.

Construction of Record Retrieval System based on Topic Map (토픽맵 기반의 기록정보 검색시스템 구축에 관한 연구)

  • Kwon, Chang-Ho
    • The Korean Journal of Archival Studies
    • /
    • no.19
    • /
    • pp.57-102
    • /
    • 2009
  • Recently, distribution of record via web and coefficient of utilization are increase. so, Archival information service using website becomes essential part of record center. The main point of archival information service by website is making record information retrieval easy. It has need of matching user's request and representation of record resources correctly to making archival information retrieval easy. Archivist and record manager have used various information representation tools from taxonomy to recent thesaurus, still, the accuracy of information retrieval has not solved. This study constructed record retrieval system based on Topic Map by modeling record resources which focusing on description metadata of the records to improve this problem. The target user of the system is general web users and its range is limited to the president related sources in the National Archives Portal Service. The procedure is as follows; 1) Design an ontology model for archival information service based on topic map which focusing on description metadata of the records. 2) Buildpractical record retrieval system with topic map that received information source list, which extracted from the National Archives Portal Service, by editor. 3) Check and assess features of record retrieval system based on topic map through user interface. Through the practice, relevance navigation to other record sources by semantic inference of description metadata is confirmed. And also, records could be built up as knowledge with result of scattered archival sources.

An Exploration of Korean Discourses on Public Diplomacy

  • Ayhan, Kadir Jun
    • Journal of Contemporary Eastern Asia
    • /
    • v.19 no.1
    • /
    • pp.31-42
    • /
    • 2020
  • There is great confusion over what constitutes public diplomacy (PD), who its actors are, and the relevance of non-state actors. In the Korean context, in addition to the general fuzziness of the concept, linguistic peculiarities of the terms gonggong and gongjung both of which refer to public, waegyo, which is interchangeably used for international affairs, foreign policy and diplomacy, and juche which is simultaneously used for actor and agent, add more layers of confusion. While the term PD in Korea is based almost entirely on Western conceptualization, these linguistic peculiarities prevent fruitful conversations among scholars and practitioners on PD. Against this background, this research note explores and addresses conceptual ambiguities that pertains to PD and the policy discourse on the topic, particularly on non-state PD in Korea. The paper draws on Korean government's PD-related policy documents and Diplomatic White Papers and all relevant academic articles found in Korean-language journals registered in the Korean Citation Index (KCI), which are analysed to gain an understanding of the PD-related policy discourse in Korea.

A book review; "Rare earth elements in human and environmental health; at the crossroads between toxicity and safety"

  • Rim, Kyung-Taek
    • Journal of Applied Biological Chemistry
    • /
    • v.60 no.3
    • /
    • pp.207-211
    • /
    • 2017
  • It is introduced an outstanding book about an important topic in occupational and environmental sciences i.e., the opportunities and challenges that may be connected with increasing the use and distribution of rare earth elements. These chemically similar elements, comprising the lanthanides, scandium, and yttrium, are involved in a number of essential technological applications, and their effects raise a number of human health issues of relevance to the occupational and environmental sciences. The book that I introduced here, "Rare Earth Elements in Human and Environmental Health; At the Crossroads between Toxicity and Safety" edited by Giovanni Pagano (Pan Stanford Publishing Pte. Ltd., Temasek Boulevard, Singapore) represents a break from that situation. It is essential to increase our knowledge about the environmental fate and biological effects of these technologically important metals in order to prevent unforeseen long-term man-made consequences to human health. This book is likely to become an important resource for scientists, engineers, and decision makers who understand the need for sensible exploitation of this resource.